دانلود مقاله ISI انگلیسی شماره 1403
ترجمه فارسی عنوان مقاله

تحقیق تجربی از ترکیب تجزیه و تحلیل مولفه های اصلی با تجزیه و تحلیل تفکیک چند متغیره و رگرسیون لجستیک برای پیش بینی شکست کسب و کار

عنوان انگلیسی
Empirical research of hybridizing principal component analysis with multivariate discriminant analysis and logistic regression for business failure prediction
کد مقاله سال انتشار تعداد صفحات مقاله انگلیسی
1403 2011 10 صفحه PDF
منبع

Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)

Journal : Expert Systems with Applications, Volume 38, Issue 5, May 2011, Pages 6244–6253

ترجمه کلمات کلیدی
پیش بینی شکست کسب و کار () - تجزیه و تحلیل مولفه های اصلی () - تجزیه و تحلیل تفکیک چند متغیره () - رگرسیون لجستیک (لوجیت) -
کلمات کلیدی انگلیسی
Business failure prediction BFP,Principal component analysis PCA, Multivariate discriminant analysis MDA,Logistic regression logit,
پیش نمایش مقاله
پیش نمایش مقاله  تحقیق تجربی از ترکیب تجزیه و تحلیل مولفه های اصلی با تجزیه و تحلیل تفکیک چند متغیره و رگرسیون لجستیک برای پیش بینی شکست کسب و کار

چکیده انگلیسی

Predicting business failure of listed companies is a hot topic because of the emergency of financial crisis in developed countries recently. The two classical statistical methods of multivariate discriminant analysis (MDA) and logistic regression (logit) have taken a key role in the area of business failure prediction (BFP). However, they are frequently criticized for the relative low predictive performance compared with some newly developed predictive methods. The present research contributes to the construction of a new hybrid method for BFP by integrating principal component analysis (PCA) with MDA and logit to help them produce dominating predictive performance, a pilot study demonstrating the effectiveness of employing PCA with MDA and logit for BFP, even a pilot study demonstrating the usefulness of PCA in BFP. The hybrid method is implemented by using stepwise method of MDA, stepwise method of logit, and independent sample t test as the preprocessing procedure, on the basis of which PCA is further used to extract features for MDA and logit. The optimal feature set for a specific task of BFP is determined by an empirical means of splitting all available data for thirty times to obtain the optimal preprocessing procedure for PCA. In the empirical research, this study attempts to investigate whether or not the new hybrid method can produce dominating performance in short-term BFP of Chinese listed companies. For comparison, the most preferred filter approach of stepwise method of MDA for short-term BFP in China was employed to select optimal features for MDA and logit. Meanwhile, the use of PCA on all available data to extract features for MDA and logit to make predictions is also employed to make a comparison. Empirical results indicate that the new method, namely: the hybridization of PCA with MDA and logit, can produce dominating predictive performance in short-term BFP of Chinese listed companies. The preferred preprocessing procedure of PCA for short-term BFP of Chinese listed companies is stepwise method of MDA and corresponding performance of MDA and logit respectively outperforms all the other methods. Meanwhile, predictive performance of MDA and logit with the optimal PCA procedure is not significantly different.

مقدمه انگلیسی

Business failure occurs when a company has serious losses and when the company becomes insolvent with liabilities. This phenomenon causes great loss to stock holders, managers, investors, and employees because of the rapid change in global economy. Industrials and academies need effective methods that can predict business failure accurately. Business failure prediction (BFP) is a specific application of nonlinear analysis and linear analysis in real world. The inside principle of BFP is to find hidden patterns in data by using intelligent and statistical techniques. Financial ratios and cash flow information are chief variables in BFP (McGurr & DeVaney, 1998). The research mode of BFP is to find more accurate predictive models and predictors. Data for BFP is usually collected from public information of companies. Financial ratios take a key role in predicting business failure as various predictors. Thus, collecting as much financial ratios as possible as predictors to make a prediction is a fundamental step in the area. If financial ratios are at hand, methods of feature selection or feature extraction are commonly used to find optimal predictors for a specific task of BFP in a standalone means. Assume that all optimal features and data are ready, predictive models can be constructed on the basis of the data by using methodologies and technologies from statistic, artificial intelligence, data mining, and business computing. When constructing a method for BFP, the problem of finding optimal features and finding optimal models should both be taken into consideration. Models used in this area chiefly belong to the two categories of statistical models and intelligent models (Ahn and Kim, 2009, Chauhan et al., 2009, Cho et al., 2010, Hardle et al., 2009, Hu and Ansell, 2009, Kim and Kang, 2010, Kumar and Ravi, 2007, Li et al., 2011, Lin et al., 2011, Li and Sun, 2010, Li and Sun, 2011, Li et al., 2010, Ravisankar and Ravi, 2010, Sun and Li, 2009, Tsai, 2009, Tseng and Hu, 2010 and Yang et al., 1999). Thus, appliers classify methods for BFP into two groups, that is, the group of statistical method and the group of intelligent method. Till now, classical statistical methods of multivariate discriminant analysis (MDA) and logistic regression (logit) are the most popular in the area of BFP (Altman & Saunders, 1998). However, lots of researches criticize the relative low predictive performance of the two classical statistical methods when comparing their predictive performance with that of newly developed intelligent predictive method. The application of classical statistical methods in predicting business failure witnesses the foundation of the BFP. The two classical statistical methods of MDA and logit were applied to solve the problem of BFP long time ago and achieved satisfactory performance. Altman (1968) firstly emploied the method of MDA to predict business failure. Ohlson (1980) pioneered the study of using logit to predict business failure. Since the use of MDA and logit to predict business failure, the two classical statistical methods have all along served as challenge methods for the problem. For example, Taffler (1983) emploied the Z-score model of MDA with the data from UK to predict short-term business failure. Grice and Ingram (2001) examined three research questions using Altman’s Z-score method, that is, (1) Is the method useful for predicting business failure in recent periods; (2) Is the method useful for predicting business failure of non-manufacturing companies; (3) Is the method useful for predicting financial stress conditions, with the conclusion that Altman’s Z-score method is still useful to predict financial distress conditions. Recently, the two statistical methods are still vital in predicting business failure, for example, Yim and Mitchell (2005) used logit models and MDA as baseline methods for compassion with hybrid networks in predicting business failure. Tseng and Lin (2005) proposed a quadratic interval logit model by using a quadratic programming approach to deal with binary features in BFP. Jones and Hensher (2007) attempted to evaluate the theoretical and empirical significance of the presented multinomial nested logit model for the explanation and prediction of business failure. Hensher, Jones, and Greene (2007) used a multinomial error component logit model, an extension of the more familiar mixed logit model, to predict business failure. In addition to the use of the two classical statistical methods of MDA and logit to predict business failure in standalone way, some researches also attempt to integrate them with newly developed predictive methods; for example, Laitinen and Laitinen (2000) attempted to combine logit model with Taylor’s series expansion to predict short-term business failure. Cheng, Chen, and Fu (2006) attempted to predict business failure by combining logit model with neural network, which combination retains advantages of both methods and avoids disadvantages of both methods. Hua, Wang, Xu, Zhang, and Liang (2007) utilized logistic regression to decrease the empirical risk of support vector machine in solving the task of BFP. Sun and Li (2008b) used MDA and logit as components to construct multi-classifiers system for BFP. This research addresses prediction of business failure by hybridizing principal component analysis (PCA) with the two classical statistical methods of MDA and logit to construct a new hybrid predictive method. The three filters of stepwise method of MDA, stepwise method of logit, and independent sample t test are integrated into the hybrid method as preprocessing procedure of PCA. The optimal feature set for a specific task of BFP is determined by an empirical means of splitting all available data for thirty times to obtain the optimal preprocessing procedure for PCA. This study attempts to make an investigation on whether or not the employment of PCA can help MDA and logit produce dominating predictive performance. What is of interest is whether the feature extraction method of PCA can produce more optimal features in helping MDA and logit use more information inside data to make a prediction. In order to fulfill this objective, this study uses PCA in three means. The commonly used filter approaches in BFP of Chinese listed companies are stepwise method of MDA, stepwise method of logit, ANOVA, and independent sample t test. However, the last two filters produce the same features with the data for short-term BFP of Chinese listed companies. Thus, the study uses PCA respectively on preprocessed features selected by stepwise method of MDA, preprocessed features selected by stepwise method of logit, and preprocessed features selected by independent sample t test. Corresponding features extracted are respectively named MPCA features, LPCA features, and TPCA features. The way of using PCA on all available features is also employed to make a comparison. Corresponding features are called PCA features. Stepwise method of MDA is the most preferable filter approach of short-term BFP of Chinese listed companies ( Li and Sun, 2008 and Li and Sun, 2009). Corresponding feature set selected by stepwise method of MDA is called MDA feature set (MDAFS). Predictive performance of MDA and logit on the four PCA feature sets is to be compared with that of MDA and logit on MDAFS. This study is a pilot research focusing on hybridizing PCA with MDA and logit to predict business failure. Till now, seldom researches demonstrate the effectiveness of PCA in the area of BFP, which guarantees the originality of this research. The paper is organized as follows. Section 2 presents the new hybrid method for BFP by integrating PCA with MDA and logit. Section 3 presents the design and results of the empirical research of the use of the new hybrid method with data from Chinese listed companies. Section 4 addresses detailed analysis and discussion on the results. Section 5 makes conclusion and outlines further research directions.

نتیجه گیری انگلیسی

In summary, the conclusion of this research is that the employment of PCA with the two classical statistical methods of MDA and logit can help them produce significantly better predictive performance in short-term BFP of Chinese listed companies. This study made comparisons between the new hybrid method and the two classifiers by using the most effective filter approach of stepwise method of MDA to help them produce a prediction. This study employed PCA in four different means that is, the use of PCA on all available features, the use of PCA on the features selected by stepwise method of MDA, the use of PCA on the features selected by stepwise method of logit, and the use of PCA on the features selected by independent sample t test. The best way of employing PCA in the two methods of MDA and logit is to use PCA to extract features on the processed results of stepwise method of MDA. MPCA features helped both MDA and logit produce the most superior predictive performance inside each method for short-term BFP of Chinese listed companies. However, the best predictive performance of MDA and logit was not significantly different, though MDA achieved better mean predictive accuracy than logit. This study provided some evidence that the use of PCA to find more effective information from initial data for the two classical statistical methods of MDA and logit is feasible. This type of procedure can help them produce significantly better predictive performance in short-term BFP of Chinese listed companies. Some further researches are valuable: whether or not the employment of PCA can help MDA and logit produce better predictive performance in BFP with data from other countries should be further investigated. Investigation on new means of employing PCA in the area of BFP is also valuable.