Olive mill waste water (OMWW) is a by-product of olive trituration. In Sfax, its management differs between urban and farming areas. In this paper, we examine this management through a statistical analysis covering the 2005-2006 season, using Principal Component Analysis (PCA) and Hierarchical Classification (HC). Applied to variables drawn from an exhaustive questionnaire covering 274 mills, the analysis found four significant Principal Components (PCs), explaining 67% of the total variance. The coordinates of the 13 active variables retained by PCA were used to create a typology of OMWW management, yielding 7 groups of individuals with shared characteristics and explaining 70% of the total inter-variance. The study showed that OMWW management in farming areas could cause environmental problems because oleifactors there do not have controlled tanks and may discharge OMWW onto soil (causing oil deposits, waterproofing and possible asphyxia) or into the public sewage network (causing corrosion and flow reduction). Transferring mills from urban to farming areas in the form of an agro-industrial complex is therefore needed in the Sfax region.

The olive oil industry is one of the driving sectors of the agricultural economy of the Mediterranean basin. Every year, about 11 million tons of olives and about 1.7 million tons of olive oil are produced, corresponding to 95% of the world’s production [

Olive oil is produced in olive mills either by the discontinuous press method or by the continuous centrifugation method. In recent decades, the continuous centrifugation method has become more widespread. It has many advantages over the discontinuous press method, such as complete automation and better oil quality [^{3}/year [4,5]. Furthermore, OMWW has chemical oxygen demand (COD) values in the range 40 - 220 g·L^{−1} and biochemical oxygen demand (BOD) values in the range 23 - 100 g·L^{−1}, a pollution level 25 - 80 times higher than that of common municipal wastewater [

To avoid these environmental impacts, olive mills were forced to treat or eliminate this waste. Hence, a wide range of systems has been studied for the disposal [7,8] or use of OMWW, such as aerobic [9,10] and anaerobic [11,12] treatments, composting [2,13,14] and direct watering on fields [15-17]. However, these methods present several drawbacks that make their implementation very difficult and very expensive [5,18,19].

In the present study, statistical methods such as Principal Component Analysis (PCA) and Hierarchical Classification were used to characterize OMWW management in Sfax. Variables were collected using a questionnaire covering 274 mills distributed over the study zone. PCA, a fundamental and one of the most popular multivariate statistics based on monitoring methods [

Sfax has been chosen for its outstanding contribution to the olive production in the country, its triturating capacity and its OMWW production.

Sfax is located in the South of Tunisia, at latitude 34˚43' N and longitude 10˚41' E. It is bordered by Mahdia prefecture to the North, the prefectures of Kairouan, Sidi Bouzid and Gafsa to the West, Gabes to the South and the Mediterranean to the East. The Sfax region is made up of 16 administrative units called delegations (

Sfax belongs to the pre-Saharan part of Tunisia and is therefore characterized by an arid to semi-arid Mediterranean climate. These factors explain the important contribution of this sector to the economy of the country.

Olive groves in Sfax cover 312,000 hectares, representing 44% of the total agricultural area and 19% of the national olive-growing area, and count 6.13 million trees. The olive variety in Sfax is 100% Chemlali. 83% of the trees are planted in full stands, with a density of 20 trees/ha. Consequently, Sfax makes a very important contribution to the economy of the country, especially in the olive production sector.

In the 2005-2006 period, there were 405 oil mills. However, only 305 were functional, ensuring the trituration of 12,000 tons per day, which represented 37% of the national capacity, and generating about 9090 tons per day of OMWW.

Data used in this work were collected through an exhaustive survey of the 305 functional mills in Sfax during the 2005-2006 season [

The statistical study was applied to only 274 individuals. The other mills were excluded either because their owners declined to answer the questionnaire or because the mills were newly created and their responses could distort the study.

Multivariate analysis of the variables relative to the mills of Sfax was performed through PCA [20,27]. PCA can be described as a method for projecting a high-dimensional measurement space onto a space with significantly fewer dimensions. The data set was processed by multivariate statistical techniques in order to investigate OMWW management during the 2005-2006 season. The experimental matrix (138 × 274) was analysed by PCA [

The PCA method transforms a larger number of correlated (non-orthogonal) variables into a smaller number of orthogonal variables, which capture the common causes of their variation. It can therefore reduce the dimensionality of a problem by replacing the measured, inter-correlated variables with a smaller number of uncorrelated variables. This is useful for reducing the amount of basic data to be processed [28,29].
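The transformation described above can be illustrated with a minimal sketch. It assumes a normalised PCA (eigen-decomposition of the correlation matrix), which the text does not state explicitly, and it runs on synthetic data rather than on the survey matrix. The function name `pca_correlation` and the toy array are hypothetical.

```python
import numpy as np

def pca_correlation(data):
    """PCA on the correlation matrix (rows = individuals, columns = variables).

    Assumption: variables are centred and scaled before the decomposition;
    the paper does not say which variant of PCA was used.
    """
    X = np.asarray(data, dtype=float)
    # Standardise each variable to zero mean and unit variance
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    # Correlation matrix of the variables
    R = (Z.T @ Z) / Z.shape[0]
    # eigh handles symmetric matrices and returns ascending eigenvalues
    eigvals, eigvecs = np.linalg.eigh(R)
    order = np.argsort(eigvals)[::-1]          # sort axes by decreasing inertia
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    explained = 100.0 * eigvals / eigvals.sum()  # % of variance per axis
    scores = Z @ eigvecs                         # coordinates of the individuals
    return eigvals, eigvecs, explained, scores

# Synthetic example: two strongly correlated variables and one independent one
rng = np.random.default_rng(0)
base = rng.normal(size=(50, 2))
toy = np.column_stack([
    base[:, 0],
    base[:, 0] + 0.1 * rng.normal(size=50),
    base[:, 1],
])
eigvals, eigvecs, explained, scores = pca_correlation(toy)
print(np.round(explained, 2))
```

In this toy case the two correlated variables collapse onto a single axis, so the first component carries most of the variance, while the component scores are mutually uncorrelated.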

Statistical methods were later applied to complete and refine the analysis. They especially included linear correlations and hierarchical classification.

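The correlation coefficient between a variable and a factorial axis explains the position of that variable in the selected factorial plane. It is given by R(x, y) = cov(x, y)/(σ(x)·σ(y)), where cov(x, y) is the covariance of x and y and σ denotes the standard deviation.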
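As a small illustration of the formula above, the following sketch computes the linear correlation coefficient from its definition. The function name `pearson_r` and the sample values are hypothetical and are not taken from the survey data.

```python
import math

def pearson_r(x, y):
    """Linear correlation R(x, y) = cov(x, y) / (sigma(x) * sigma(y))."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Population covariance and standard deviations (same 1/n factor in each)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    sigma_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    sigma_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    if sigma_x == 0 or sigma_y == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return cov / (sigma_x * sigma_y)

# Hypothetical values: y grows with x, z decreases with x
x = [1.0, 2.0, 3.0, 4.0]
y = [2.0, 4.0, 6.0, 8.0]
z = [8.0, 6.0, 4.0, 2.0]
print(pearson_r(x, y), pearson_r(x, z))
```

A value close to +1 or −1 places the variable near the edge of the correlation circle along that axis, while a value near 0 means the axis says little about it.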

Sound interpretation of the output of the statistical analysis depends on the right choice of principal axes. This choice is based on a statistical test: with 13 active variables, the average share of an eigenvalue is 100/13 = 7.69%, so we retain the first four axes, each of which has an inertia above this percentage (7.69%) [
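The selection rule can be checked with a short sketch. It uses only the four inertia percentages reported in this paper for PC1 to PC4; the shares of the remaining nine axes are not given in the text and are therefore left out. The function name `select_axes` is hypothetical.

```python
def select_axes(inertia_percent, n_variables):
    """Keep the axes whose share of inertia exceeds the average share.

    Returns (threshold, kept), where `kept` holds 1-based axis numbers.
    """
    threshold = 100.0 / n_variables
    kept = [i + 1 for i, pct in enumerate(inertia_percent) if pct > threshold]
    return threshold, kept

# Inertia of PC1..PC4 as reported in the text (in % of total variance)
reported = [25.84, 20.15, 12.72, 8.23]
threshold, kept = select_axes(reported, n_variables=13)
cumulative = sum(reported[i - 1] for i in kept)

print(f"threshold = {threshold:.2f}%")
print(f"kept axes = {kept}, cumulative inertia = {cumulative:.2f}%")
```

With these figures the four retained axes together account for 66.94% of the variance, which matches the rounded value of 67%.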

The objective of the statistical analysis is to create a typology based on variables relating to the triturating system, OMWW evacuation and localisation.

This typology was conducted on 13 active variables (

In the 2005-2006 period, Sfax had 405 mills, of which only 305 were functional and whose characteristics are gathered in

The geographic distribution of mills showed their concentration in urban environment. These mills triturated 3130 tons per day representing 34.43% (

Mills generated OMWW with different coefficients according to triturating system [

PCA identified four factors responsible for the data structure, explaining 67% of the total variance of the data set. They made it possible to group the selected parameters according to common features and to evaluate the incidence of each group on the overall variation in OMWW management.

negatively correlated with the rural environment, mills located in Mahres, Mahres’ tank, Graïba’s tank and owned tanks. The variable “project of transfer” is positively correlated with the urban environment. This is explained by the problems encountered by oleifactors, such as those related to traffic and the high cost of transporting olives and OMWW. The project of transfer would be the best solution for them.

PC2 explains 20.15% of the variance and was mainly determined by mill localisation and triturating system. The continuous centrifugation method was negatively correlated with the discontinuous press method and positively correlated with the peri-urban environment, triturating capacity and the use of OMWW as a fertilizer. In fact, the mills concentrated in this zone were essentially continuous and newly created, with a high triturating capacity, and generated a large amount of OMWW. This effluent is rich in organic matter, nitrogen (N), phosphorus (P), potassium (K) and magnesium (Mg) [

PC3 explains 12.72% of the variance and defined the OMWW evacuation in Mahres.

PC4 explains 8.23% of the variance and defined the OMWW evacuation in Graïba. To refine the above-mentioned groupings, plane projections are of great interest. The following paragraphs present the factorial distribution of the variables in the 1 × 2 and 1 × 3 planes.

According to this factorial representation we notice a distribution of two groups of variables illustrated in

According to this factorial representation we notice a distribution of three groups of variables illustrated in

The Hierarchical classification allowed selecting seven classes (

Class 2 grouped 36 classic mills located in the urban environment, triturating 941 tons/day. OMWW was evacuated to Agareb’s tank, but oleifactors did not encourage its use as a fertiliser. 34% of them have a project of transfer.

Class 3 grouped 75 mills located in the peri-urban environment, triturating 4170 tons/day. 64% were continuous. OMWW was evacuated to Agareb’s tank and oleifactors encouraged its use as a fertiliser. Only 27% of them have a project of transfer because their mills are generally newly created.

Class 4 grouped 15 mills located in Mahres, triturating 616 tons/day. 50% were classic. OMWW was evacuated to Mahres’ tank and oleifactors encouraged its use as a fertiliser. However, oleifactors of this group spread OMWW illegally.

Class 5 grouped 5 mills located in Graïba, triturating 184 tons/day. 75% were classic. OMWW was evacuated to Graïba’s tank and oleifactors encouraged its use as a fertiliser.

Class 6 grouped 49 mills located in the rural environment, triturating 1109 tons/day. 45% were classic and 11% were continuous. 20% of the OMWW generated by mills of this group was evacuated to Agareb’s tank and 80% was evacuated to owned tanks, to tanks belonging to other governorates near some mills, or given to transporters. Oleifactors of this class encouraged the use of OMWW as a fertiliser.

Class 7 grouped 24 mills located in the rural environment, triturating 1101 tons/day. 25% were classic, 25% were super pressure and 25% were continuous. OMWW was evacuated to owned tanks, to tanks belonging to other governorates near some mills, or given to transporters. Oleifactors of this class encouraged the use of OMWW as a fertiliser.
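The grouping of individuals into classes can be sketched as follows. The paper does not state which aggregation criterion was used, so this example assumes Ward's criterion (merge the two classes whose union least increases within-class inertia), a common choice after PCA. The function names and the six toy points are hypothetical. The second function computes the share of between-class inertia, the kind of quantity reported as the percentage of variance explained by a partition.

```python
def ward_clusters(points, n_clusters):
    """Agglomerative clustering with Ward's criterion; returns one label per point."""
    # Each cluster is stored as (member indices, centroid)
    clusters = [([i], list(p)) for i, p in enumerate(points)]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                members_a, cen_a = clusters[a]
                members_b, cen_b = clusters[b]
                na, nb = len(members_a), len(members_b)
                dist2 = sum((u - v) ** 2 for u, v in zip(cen_a, cen_b))
                # Increase in within-class inertia if a and b are merged
                cost = na * nb / (na + nb) * dist2
                if best is None or cost < best[0]:
                    best = (cost, a, b)
        _, a, b = best
        members_a, cen_a = clusters[a]
        members_b, cen_b = clusters[b]
        na, nb = len(members_a), len(members_b)
        merged = [(na * u + nb * v) / (na + nb) for u, v in zip(cen_a, cen_b)]
        clusters[a] = (members_a + members_b, merged)
        del clusters[b]  # b > a, so index a is unaffected
    labels = [0] * len(points)
    for k, (members, _) in enumerate(clusters):
        for i in members:
            labels[i] = k
    return labels

def between_class_share(points, labels):
    """Percentage of total inertia explained by the partition (between-class share)."""
    n, dim = len(points), len(points[0])
    g = [sum(p[j] for p in points) / n for j in range(dim)]
    total = sum((p[j] - g[j]) ** 2 for p in points for j in range(dim))
    between = 0.0
    for k in set(labels):
        members = [p for p, lab in zip(points, labels) if lab == k]
        c = [sum(p[j] for p in members) / len(members) for j in range(dim)]
        between += len(members) * sum((c[j] - g[j]) ** 2 for j in range(dim))
    return 100.0 * between / total

# Hypothetical factorial coordinates forming two well-separated groups
toy_points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
toy_labels = ward_clusters(toy_points, n_clusters=2)
print(toy_labels, round(between_class_share(toy_points, toy_labels), 2))
```

In the study itself the inputs would be the factorial coordinates of the 274 mills and the tree would be cut at seven classes; the quadratic search used here is only meant for small illustrative inputs.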

Thus, the statistical study showed that groups 6 and 7, whose mills were located in rural areas, can be a source of pollution, which is why decision makers must give priority to these areas. The other groups contributed to environmental pollution to a lesser degree because they use tanks controlled and managed by the State. Nevertheless, they also deserve special attention.

40% of oleifactors in urban and peri-urban areas encouraged the project of transferring olive mills from urban to farming areas, provided that all infrastructure is in place, which is why we propose in another work [

This study investigated data relative to OMWW management in 274 mills. PCA and HC were applied and resulted in four principal components, describing approximately 67% of the total variance, and seven distinct groups of individuals, describing approximately 70% of the total variance. They showed that OMWW management differed between urban and peri-urban areas on the one hand and rural areas on the other.

PCA allowed the reduction of the 13 variables to four PCs and HC allowed the reduction of the 274 individuals to seven classes. PC1 (25.84% of variance) was defined by mill localisation and OMWW evacuation. This correlation can be attributed to a particular behaviour of the decision maker, who proposes distinct solutions for OMWW management. PC2 (20.15% of variance) can be assigned to mill localisation and triturating system. PC3 (12.72% of variance) can be assigned to OMWW evacuation in Mahres. PC4 (8.23% of variance) can be assigned to OMWW evacuation in Graïba.

Thus, multivariate statistical analysis served as an excellent exploratory tool for the analysis and interpretation of complex data sets on OMWW management, especially when different disposal methods based on evaporation ponds [35,36], thermal concentration [37,38], physicochemical [