ترجمه فارسی عنوان مقاله

دو مرحله از استدلال موردی مبتنی بر مجتمع کردن الگوریتم ژنتیک با مکانیسم داده کاوی

عنوان انگلیسی

Two stages of case-based reasoning – Integrating genetic algorithm with data mining mechanism

کد مقاله	سال انتشار	تعداد صفحات مقاله انگلیسی
22106	2008	11 صفحه PDF

منبع

Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)

Journal : Expert Systems with Applications, Volume 35, Issues 1–2, July–August 2008, Pages 262–272

ترجمه کلمات کلیدی

استدلال موردی پایه - مورد بازیابی - الگوریتم ژنتیک

کلمات کلیدی انگلیسی

Case-base reasoning, Case retrieve, Genetic algorithm

دانلود رایگان 2 صفحه اول مقاله لاتین (PDF)

پیش نمایش مقاله

چکیده انگلیسی

Case-based reasoning (CBR) is a paradigm, concept and instinctive mechanism for problem solving. Recently, CBR has been widely integrated with some AI algorithms and applied to various kinds of problems. The ill-defined and unstructured problems are suitably solved by CBR. This research proposes a hybrid CBR mechanism including two stages. In stage I, the genetic algorithm is adopted to improve efficiency of case retrieving process. Compared to traditional CBR, the proposed mechanism could reduce about 14% case evaluations, but still achieved 90% satisfactory results. In stage II, the knowledge discovering and data mining (KDD) processes are implemented to produce the refined information from the retrieved cases. Because these retrieved cases and target problem satisfy similar or even same conditions, the outcome of KDD would be more valuable for reference. In addition to retrieved cases of stage I, the proposed mechanism provides direction and relevant knowledge for decision makers in their decision supporting and revising processes in stage II. The proposed CBR mechanism also deals with efficiency and outcome quality issues of traditional CBR.

مقدمه انگلیسی

Case-based reasoning (CBR) is a paradigm, concept and instinctive mechanism for problem solving. Similarly to human problem solving process, CBR retrieves past experiences to reuse for target problem. Of course, the solutions of past cases may need to be revised for applying. The successful problem-solving experiences are then retained for further reusing (Aamodt & Plaza, 1994). These are well-known 4R processes (retrieve, reuse, revise, and retain) of traditional CBR. For recently decade, CBR has been applied to various kinds of problem solving. The ill-defined and unstructured problems are suitably solved by CBR (Belecheanu, Pawar, Barson, Bredehorst, & Weber, 2003). Some applications are as follows: • Construction industry: Cirovic and Cekic (2002) applied CBR to support construction project in preliminary design phase. They retrieved pervious cases in historical IPDB (Integrated Project DataBase), and then stored in CKB (Construction Knowledge Base) for quality designing. Juan, Shin, and Perng (2006) integrated CBR with genetic algorithm (GA) for housing designing. They devoted to customization housing layout design. • Diagnosis system: Guiu, Ribe, Mansilla, and Fabrega (1999) and Golobardes, Llora, Salamo, and Marti (2002) both used CBR for breast cancer diagnosis from Mammary Biopsy Image and micro calcifications separately. Chang (2005) applied CBR to screen children with developmental delay. In practice, these diseases can be early detected from symptoms and increase the cure rate. • SCM/CRM supporting: Choy and Lee (2002) used CBR to select suppliers for outsourcing. They evaluated cases (i.e., suppliers) according to technical capability, quality assessment and organizational profile. Further, Choy, Fan, and Lo (2003) extended the above research to CRM and developed intelligent customer-SRM system (ISRMS) by CBR. Cheng and Cheng (2004) predicted yarn qualities from cotton fibers properties by CBR. In past, these problems usually solved by mathematic model, e.g., regression. But, there are too many hypotheses to apply in real world. CBR seems to be another good option to deal with these kinds of problems. • Strategic planning: Changchien and Lin (2005) produced marketing plans for marketing personnel of Telecommunication Company. Through CBR they analyzed situations, e.g., external environmental audit, industry audit, etc., and produced key marketing policies, including product, price, place, and promotion. Lim, Ahn, and Lee (2005) applied CBR to formulate strategies for competing stakeholder management. After stakeholder analyzing, RDAP (reactive, defensive, accommodative, or proactive) strategies are retrieved, revised, and implemented by CBR mechanism. • Other applications: Belecheanu et al. (2003) referred to past records to improve uncertain information, e.g., industrial requirements, for new product development especially in the concurrent engineering approach. Gwen (2003) suggested the application of CBR to knowledge management. Choobineh and Lo (2005/2006) intended to improve database conceptual design by CBR. For all above mentioned applications, CBR is easier to understand, implement and outperform than others compared algorithms. In practices, there is some well-developed software to facilitate CBR concept implementation, including the commercial packages (such as KATE, Spotlight, and ESTEEM) and free software (such as CBR-Works and CASPIAN). For illustration, in the CBR-Work 4.0 user Interface as Fig. 1, decision makers can set conditions of target problem and then submit to system. CBR-Work will provide similar case(s) for the references of decision makers.However, traditional CBR has some problems. Traditional CBR has to evaluate all cases in the case-base to return the most similar case(s). The efficiency of CBR is obviously negatively related to the case-base size. Thus, a number of approaches have devoted to decrease the effort for case evaluation. For illustration, K-means is the most popular. K/CBR, CBR integrated with K-means approach, clusters all cases first and only evaluates the case of the most similar cluster for case retrieving. In addition, Chang and Lai (2005) attempted self-organizing map (SOM). In their research, SOM/CBR outperformed than K/CBR. Both K/CBR and SOM/CBR improve the efficiency of CBR. However, the performances of these two revised CBR mechanisms are closely related to case representation and indexing approaches (Shin & Han, 1999). Hence, their superior performances are not stable and can not be promised. CBR has been widely integrated with other mechanisms and applied to various territories. Marling, Sqalli, Rissland, Hector, and Aha (2002) surveyed various CBR integrations. Some systems, e.g., ADIOP, CADRE, CADSYN, CHARADE, COMPOSER, IDIOM, JULIA, and Weigel’s, integrated CBR with constraint satisfaction problem (CSP) algorithm. Some systems, e.g., ANAPRON, AUGUSTE project, CABARET, CAMPER, GREBE, GYMEL and SAXEX, combined CBR with rule-based reasoning (RBR) mechanism. There are also hybrids of CBR and model-based reasoning (MBR), e.g., CASEY, FORMTOOL and SOPHIST. These integrations could improve and enhance traditional CBR. There are some researches integrating CBR with other AI techniques. Genetic algorithm (GA) seems to be a good option for CBR efficiency improving. As mentioned before, Juan et al. (2006) combined CBR with GA for housing layout customization. In their proposed approach, GA facilitates the solution revision. In addition, Shin and Han (1999) used GA to support CBR for enhancing classification accuracy. In their research, a weight vector is determined by GA to improve case indexing and retrieving process. Finally, Kim and Han (2001) applied GA to determine case representation and indexing for case maintaining. These integrations of CBR and GA show superior performance. On the other hand, the retrieved cases may be helpless and meaningless to support decision makers. If decision makers input parameters with loose conditions, too many similar cases might be retrieved. It would create information overload. Even if CBR only retrieved a few cases, the retrieved cases are descriptions of “just cases”, which could not provide knowledge. Decision makers would require refined information and more relevant knowledge. This research proposes a revised CBR mechanism, named GCBR, to integrate CBR with genetic algorithm and to combine with data mining techniques. GCBR consists of two stages. In stage I, GCBR integrates CBR with GA to form core algorithm. It intends to improve traditional CBR efficiency steadily no matter what kinds of case representation and indexing approaches. In stage II, GCBR adopts data mining techniques to provide the further refined and potential knowledge from retrieved cases. GCBR deals with efficiency problem in stage I and focus on quality improvement of outcomes in stage II. The remainder of this paper is organized as follows. Section 2 presents the proposed CBR model. Section 3 reports the experimental results. Section 4 depicts the procedures embedded in GCBR. Section 5 provides conclusions.

نتیجه گیری انگلیسی

This research proposes a revised CBR mechanism, named GCBR, not only to improve efficiency of traditional CBR, but also to provide more refined knowledge. GCBR consists of two stages, which integrate CBR with genetic algorithm and KDD approach, respectively. In stage I, genetic algorithm is implemented to retrieve cases. As reported in the experiment results, the processing time could be improved by reducing about 14% case evaluations, but still keeping about 90% satisfactory results. Compared with K/CBR or SOM/CBR, GCBR has not to pre-process cases. GCBR inherits the essence of genetic algorithm and uses natural selection concept that the fitter chromosome would have higher survival probability. Thus, the retrieving process is irrelevant to any kinds of cases presentation and indexing approaches. As the result, the superior performance of GCBR is stable. Furthermore, GCBR provides decision maker knowledge rather than just cases. Thus, the quality of decision outcome would be improved. In stage II, KDD approaches are applied to these retrieved cases. Because these cases and target are similar, the mining knowledge should be valuable for references. GCBR could be further embedded as core algorithm of diagnosis and recommender systems. It could be applied to various fields, e.g., shopping recommendation, schedule planning, fraud detection and the risk diagnosis, and medical symptom diagnosis, and so on.