Persian translation of the article title

Using Bayesian networks with rule extraction to infer the risk of weed infestation in a corn crop

English title

Using Bayesian networks with rule extraction to infer the risk of weed infestation in a corn-crop

Article code	Publication year	Number of pages (English article)
28792	2009	15 pages (PDF)

Source

Publisher: Elsevier - Science Direct

Journal : Engineering Applications of Artificial Intelligence, Volume 22, Issues 4–5, June 2009, Pages 579–592

Translated keywords

Bayesian networks - naïve Bayes - rule extraction - weed infestation - data

English keywords

Bayesian network, Naïve Bayes, Rule extraction, Weed infestation, Kriging

Article preview

English abstract

This paper describes the modeling of a weed infestation risk inference system that implements a collaborative inference scheme based on rules extracted from two Bayesian network classifiers. The first Bayesian classifier infers a categorical variable value for the weed–crop competitiveness, using as input categorical variables for the total density of weeds and the corresponding proportions of narrow and broad-leaved weeds. The inferred categorical variable values for the weed–crop competitiveness, along with three other categorical variables extracted from estimated maps for the weed seed production and weed coverage, are then used as input for a second Bayesian network classifier to infer categorical variable values for the risk of infestation. Weed biomass and yield loss data samples are used to learn, in a supervised fashion, the probability relationships among the nodes of the first and second Bayesian classifiers, respectively. For comparison purposes, two types of Bayesian network structures are considered, namely an expert-based Bayesian classifier and a naïve Bayes classifier. The inference system focuses on knowledge interpretation by translating a Bayesian classifier into a set of classification rules. The results obtained for the risk inference in a corn-crop field are presented and discussed.
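The two-stage scheme described in the abstract can be sketched as two chained classifiers, where the class inferred by the first becomes an input feature of the second. The sketch below is only an illustration under stated assumptions: it uses a small naïve Bayes classifier written from scratch with Laplace smoothing, and the class name `CategoricalNaiveBayes`, the function `infer_risk`, the feature order and all training rows are hypothetical, not the paper's data or implementation.

```python
from collections import Counter, defaultdict


class CategoricalNaiveBayes:
    """Naive Bayes over categorical features with Laplace (add-one) smoothing."""

    def fit(self, rows, labels):
        self.classes = sorted(set(labels))
        self.class_counts = Counter(labels)
        # counts[(feature index, class)][value] = number of training rows
        self.counts = defaultdict(Counter)
        self.values = [set() for _ in rows[0]]
        for row, label in zip(rows, labels):
            for i, value in enumerate(row):
                self.counts[(i, label)][value] += 1
                self.values[i].add(value)
        return self

    def posterior(self, row):
        total = sum(self.class_counts.values())
        scores = {}
        for c in self.classes:
            p = self.class_counts[c] / total  # class prior
            for i, value in enumerate(row):
                # smoothed estimate of P(feature i = value | class c)
                p *= (self.counts[(i, c)][value] + 1) / (
                    self.class_counts[c] + len(self.values[i]))
            scores[c] = p
        norm = sum(scores.values())
        return {c: s / norm for c, s in scores.items()}

    def predict(self, row):
        post = self.posterior(row)
        return max(post, key=post.get)


def infer_risk(stage1, stage2, weed_features, map_features):
    """Chain the classifiers: inferred competitiveness feeds the risk classifier."""
    competitiveness = stage1.predict(weed_features)
    return stage2.predict((competitiveness,) + tuple(map_features))


# Hypothetical toy data: (total density, narrow-leaved share, broad-leaved share)
stage1 = CategoricalNaiveBayes().fit(
    [("low", "low", "low"), ("low", "high", "low"),
     ("high", "low", "high"), ("high", "high", "high")],
    ["low", "low", "high", "high"],  # competitiveness class
)
# Hypothetical toy data: (competitiveness, seed production, coverage, seed patches)
stage2 = CategoricalNaiveBayes().fit(
    [("low", "low", "low", "low"), ("low", "high", "low", "low"),
     ("high", "high", "high", "high"), ("high", "low", "high", "high")],
    ["low", "low", "high", "high"],  # risk class
)

risk = infer_risk(stage1, stage2, ("high", "low", "high"), ("high", "high", "low"))
print("inferred risk:", risk)
```

An expert-based network would replace the independence assumption inside `posterior` with a structure chosen by a domain expert; the chaining in `infer_risk` would stay the same.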

English introduction

Agricultural procedures may modify the ecological balance of a field due to the tilling procedures growers use to prepare the land, quite often leading to a population explosion or infestation of some inconvenient plants commonly known as weeds. Weed control is a fundamental part of all crop production systems. Weeds are a commonly known cause of yield reductions and an obstacle in harvest operations, and they lower crop quality by competing with the crop for limited resources such as water, nutrients and light. Oerke et al. (1994) estimated that a 10% loss of worldwide agricultural production might be a consequence of weed activity. In general, herbicides are the main components of weed management systems. Usually, herbicides are spread uniformly over the entire field. A uniform application rate is often based on a visual evaluation of the weed density, with no procedure used to evaluate the risks associated with under- and over-spraying (Faechner et al., 2002). However, weed infestation does not occur over the entire field, and the amount of herbicide could be reduced by spraying only over the weed patches (Wallinga et al., 1998 and Jurado-Expósito et al., 2004). The prediction of weed dispersion can be efficiently used in preventing infestations by applying herbicides only in specific regions (Jurado-Expósito et al., 2003 and Faechner et al., 2002). Reducing the quantity of herbicides potentially reduces herbicide residues in water, food crops and the environment, and it may prevent the development of weed resistance (Aitkenhead et al., 2003). In the literature, a considerable diversity of weed management decision models can be found. There are many different approaches, ranging from empirical functions to mechanistic simulation models. As surveyed by Wilkerson et al. (2002), some of the models are too simple, as they do not include all factors that can influence weed competition or other issues farmers consider when deciding how to manage weeds.
Other models can be excessively complex, given that many users might find it difficult to obtain the needed information or do not have the equipment required for acquiring the data. According to Wilkerson et al. (2002), weed management decision models must be built and evaluated from three perspectives: biological accuracy, quality of recommendations and ease of use. In addition, another important issue to be taken into account when building weed management systems is the interpretation of the model. The latter is of particular interest in the experiments conducted in this paper. There are few formalisms that can be used to model weed infestation in a crop field. Primot et al. (2006) developed 20 simple models (five are linear regression models and the other 15 are logistic regression models). The models were evaluated for their ability to discriminate fields with a high level of weed infestation from fields with a low level of infestation; the parameters of the 20 models were estimated using 3 years of experimental data. The models can be used to help farmers decide what type of weed control (chemical, mechanical or biological) to use. The risk of a weed-infested crop can be inferred from the mathematical modeling of the weed behavior, based on experimental data. Dynamic models for weed seed populations describe the population size at life-cycle t as a function of the population size at life-cycle t-1 using difference equations (Sakai, 2001 and Cousens and Mortimer, 1995). The dynamic models indicate that infestation depends not only on the weed density but also on the competitiveness of the weed species (Park et al., 2003, Firbank and Watkinson, 1985 and Kropff and Spitters, 1991). More recently, competitive indexes and weed ranking were used to quantify the weed competitiveness in a soybean field (Hock et al., 2006).
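As a rough illustration of the dynamic models mentioned above, in which the population size at one life-cycle is computed from the size at the previous life-cycle, the following sketch iterates a simple density-dependent update. The hyperbolic update rule, the function names and the parameter values are assumptions chosen for illustration only; they are not taken from the cited works.

```python
def next_population(n_prev, growth_rate, crowding):
    """One life-cycle step: N_t = growth_rate * N_{t-1} / (1 + crowding * N_{t-1})."""
    return growth_rate * n_prev / (1.0 + crowding * n_prev)


def simulate(n0, growth_rate, crowding, cycles):
    """Return the population sizes N_0 .. N_cycles as a list."""
    sizes = [n0]
    for _ in range(cycles):
        sizes.append(next_population(sizes[-1], growth_rate, crowding))
    return sizes


# Hypothetical parameters: 10 seeds per unit area initially, threefold growth
# per life-cycle at low density, and a crowding coefficient of 0.01.
trajectory = simulate(n0=10.0, growth_rate=3.0, crowding=0.01, cycles=20)
for cycle, size in enumerate(trajectory[:5]):
    print(cycle, round(size, 2))
```

With this particular update rule the population levels off at (growth_rate - 1) / crowding, which shows how density dependence limits an infestation even when growth at low density is fast.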
Although purely mathematical models can be used for modeling the risk of weed infestation with good performance, as described in several of the previous references, most of them lack flexibility and, more importantly, interpretability: they work as ‘black boxes’ where the user feeds a few values and the system outputs a diagnosis. A particular class of models is based on probability. Of special interest in this paper is the class of Bayesian network (BN) models, which are based on the probability that a given set of measurements defines objects as belonging to a certain class. In the literature, Bayesian-based methods have already been used for modeling similar problems (Hughes and Madden, 2003, Smith and Blackshaw, 2002 and Banerjee et al., 2005). In particular, Hughes and Madden (2003) proposed a risk assessment methodology to identify which exotic plant species, among those presented for import, are a threat (to agricultural and ecological systems) and which are not. Bayesian theory has also been employed in the agriculture domain as the basis for developing classification systems, as described in Granitto et al. (2002). In their work, the performance of a naïve Bayes classifier (BC) is used as the selection criterion for identifying a nearly optimal set of 12 seed characteristics, such as coloration, morphological and textural features, which are then used as classification parameters. Considering the seed identification problem, the work described in Granitto et al. (2005) compared the performance of a naïve Bayes classifier to that of an artificial neural network (NN) based classifier. In this particular experiment, the naïve Bayes classifier with an adequately selected set of classification features outperformed the NN based classifier. A similar result was also obtained in Marchant and Onyango (2003), but with a Bayesian classifier and a multilayer feed-forward neural network in a task for discriminating plants, weeds and soil in color images.
The main goal of this paper is to propose and describe the use of Bayesian network methods to infer the risk of weed infestation in a corn-crop, as well as to present and discuss the results obtained in a real application domain based on empirical data. The procedure is implemented as a collaborative system that integrates two classification tasks. The first uses a Bayesian network to infer the competitiveness of weeds, expressed by their biomass, using as input the total density of weeds and the corresponding narrow and broad-leaved proportions. The second task assesses the risk of infestation, expressed by the yield loss, using as input the previously inferred competitiveness as well as features extracted from the weed seed density, weed coverage and weed seed patches. The last three variables are estimated with a geostatistics method called kriging (Brooker, 1979 and Isaaks and Srivastava, 1989) and image objects (Gonzalez and Woods, 2002) from weed seed density and weed coverage data samples. In addition, the paper also presents the translation of the induced Bayesian networks into a set of classification rules, aiming at a more comprehensible knowledge representation. As mentioned before, this is an important aspect of building a knowledge-based system, since it gives the system credibility, a quality that other types of representation lack. Therefore, the main idea of the conducted experiments is not to show that the translation method is better than traditional classifiers (C4.5, for instance) or rule extraction methods. The claim is that it is possible to take advantage of both the causal knowledge representation (which can be adequately represented in a BN or BC) and the high accuracy of a Bayesian classifier to obtain a set of classification rules (extracted from the BC) as a knowledge base. For both classification tasks implemented by the collaborative system, two different Bayesian network structures are used for comparison purposes.
One is induced by the naïve Bayes algorithm (Duda and Hart, 1973) using empirical data and the other, an unrestricted Bayesian network, is designed and refined by an expert using the same empirical data. The networks in this paper are referred to as naïve Bayes and expert-based networks, respectively. Due to their different architectures, the two Bayesian networks perform differently, depending on the available information. A set of probabilistic classification rules is then extracted from each of the Bayesian networks using a Markov-based strategy proposed in Hruschka et al. (2008). A pruning strategy is proposed to reduce the number of rules in cases where the Markov-based strategy does not remove categorical variables. The pruning strategy is mainly motivated by the fact that it requires no extra computational effort: pruning is done by keeping only the rules whose estimated probability is higher than a predefined threshold. This paper is an extended and revised version of two earlier conference papers, namely Bressan et al. (2007a) and Bressan et al. (2007b). The remainder of this paper is organized as follows. Section 2 describes the basics of Bayesian networks and naïve Bayes classifiers and discusses the importance of improving their understandability. Section 3 focuses on two important issues: the approach used to collect and interpolate empirical data, and the construction of the collaborative system that integrates two Bayesian classifiers. Section 4 presents the results of the collaborative system, focusing on the results of the individual classifiers, that is, the Bayesian network and the naïve Bayes classifiers. Finally, Section 5 presents some concluding remarks and highlights the next steps for this research work.
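The threshold-based pruning described above can be sketched in a few lines. This is a simplified illustration, not the Markov-based extraction method of Hruschka et al. (2008): it assumes the class posteriors for each combination of antecedent values are already available, and the variable names, the probability table and the threshold value are all hypothetical.

```python
def extract_rules(variables, posteriors):
    """Turn each antecedent combination into one rule for its most probable class."""
    rules = []
    for values, class_probs in posteriors.items():
        best = max(class_probs, key=class_probs.get)
        antecedent = " AND ".join(
            f"{var} = {val}" for var, val in zip(variables, values))
        rules.append((f"IF {antecedent} THEN risk = {best}", class_probs[best]))
    return rules


def prune_rules(rules, threshold):
    """Keep only the rules whose estimated probability is higher than the threshold."""
    return [(text, prob) for text, prob in rules if prob > threshold]


# Hypothetical posteriors P(risk | competitiveness, coverage).
variables = ("competitiveness", "coverage")
posteriors = {
    ("high", "high"): {"high": 0.90, "low": 0.10},
    ("high", "low"): {"high": 0.55, "low": 0.45},
    ("low", "high"): {"high": 0.40, "low": 0.60},
    ("low", "low"): {"high": 0.05, "low": 0.95},
}

rules = extract_rules(variables, posteriors)
kept = prune_rules(rules, threshold=0.7)
for text, prob in kept:
    print(f"{text}  (p = {prob:.2f})")
```

Pruning is a single filter over probabilities that were already computed during extraction, which is consistent with the remark that it needs no extra computational effort. The cost is that antecedent combinations with uncertain outcomes are left without a rule.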

English conclusion

This work explores Bayesian network based methods to infer the risk of weed infestation in a corn-crop. The proposed inference system is implemented as a collaboration between two classification tasks. The first one infers the competitiveness of weeds (expressed by the biomass) and the second infers the risk of infestation (expressed by the yield loss), using as input the inferred competitiveness, the weed seed density, weed coverage and weed seed patches. The last three features are inferred from kriging and image objects. For both classification tasks, two different Bayesian network structures, a naïve Bayes structure and an expert-based network structure, were used for comparison purposes. The numeric parameters of both Bayesian models were learned from the empirical data collected from a corn-crop field. A hybrid approach, implemented by the BayesRule method, which articulates Bayes and categorical rules, was used to improve the models' understandability by extracting classification rules from each model. The Markov blanket concept was used in the BayesRule method to reduce the number and the complexity of classification rules. When pruning is applied, the number of rules tends to be smaller and the comprehensibility tends to be higher. On the other hand, having fewer rules may imply having a less detailed overview of the problem (with fewer rules and fewer antecedents). Thus, the trade-off between accuracy and complexity is a very important issue to be analyzed in each specific application domain. In this work, for the expert-based network, the Markov blanket concept was sufficient to prune the rule set efficiently, since the results indicate 72.5% and 66.3% agreement without and with the pruning strategy, respectively. In addition, the results reveal that the expert-based Bayesian network classifier yields a higher accuracy than the naïve Bayes classifier. In the former, the application of the pruning strategy made no difference in the results.
The strong and unrealistic assumption (that all the features are independent given the class), which is an intrinsic aspect of any naïve Bayes classifier, may have contributed to this behavior. It is worth mentioning that the results presented are specific to a particular crop field, subject to the conditions described in Section 3.1. Further work includes the use of extensive simulations and experiments to generalize the obtained results. It is also worth looking into the use of the proposed pruning strategy in other domains in order to confirm its relevance.
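The Markov blanket concept referred to in the conclusion has a standard definition: for a node in a Bayesian network, it is the set made up of the node's parents, its children, and the other parents of its children. Given that set, the node is conditionally independent of every other variable, which is why variables outside it can be dropped from rule antecedents. A minimal sketch follows; the graph, its node names and the function `markov_blanket` are hypothetical and do not reproduce the expert-based network of the paper.

```python
def markov_blanket(parents, node):
    """Return the parents, children and children's other parents of `node`.

    `parents` maps every node of the directed acyclic graph to a list of its parents.
    """
    children = {child for child, ps in parents.items() if node in ps}
    blanket = set(parents.get(node, ()))
    blanket |= children
    for child in children:
        blanket |= set(parents[child])  # co-parents of each child
    blanket.discard(node)
    return blanket


# Hypothetical network structure, for illustration only.
parents = {
    "density": [],
    "coverage": [],
    "soil": [],
    "competitiveness": ["density"],
    "risk": ["competitiveness", "coverage"],
    "seed_patches": ["risk", "soil"],
}

print(sorted(markov_blanket(parents, "risk")))
```

In this toy graph, `density` influences `risk` only through `competitiveness`, so it falls outside the blanket and would not need to appear in a rule that already conditions on `competitiveness`.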