Mining stock category association and cluster on Taiwan stock market
|Article code||Year of publication||English article||Persian translation||Word count|
|15707||2008||11-page PDF||Order||6565 words|
Publisher : Elsevier - Science Direct
Journal : Expert Systems with Applications, Volume 35, Issues 1–2, July–August 2008, Pages 19–29
One of the most important problems in modern finance is finding efficient ways to summarize and visualize stock market data so that individuals and institutions obtain useful information about market behavior for investment decisions. The enormous amount of valuable data generated by the stock market has attracted researchers to explore this problem domain using different methodologies. This paper investigates stock market investment issues on the Taiwan stock market using a two-stage data mining approach. In the first stage, the Apriori algorithm, an association rule methodology, is implemented to mine knowledge and illustrate knowledge patterns and rules in order to propose stock category associations and possible stock category investment collections. In the second stage, the K-means algorithm, a cluster analysis methodology, is implemented to explore stock clusters in order to mine stock category clusters for investment information. By doing so, this paper proposes several possible Taiwan stock market portfolio alternatives under different circumstances.
The stock market is one of the most popular forms of investment due to its high expected profit. However, higher expected profit also implies higher risk. Thus, numerous studies have proposed different analysis methods to assist investors in analysis and decision-making. At the same time, many individual investors, stockbrokers, and financial analysts attempt to predict stock market price activities and their potential development. This mass behavior runs counter to the counsel of many academic studies, which contend that predicting stock market development is ineffective. This point of view is codified in what is generally called the efficient markets hypothesis (Fama, 1991; Haugen, 1997). There are three degrees of market efficiency. The first degree is the strong form of the efficient markets hypothesis, which states that all information that is knowable is immediately factored into the market's price for a security. If this is true, then all price predictors are wasting their time, even if they have access to private information. The second degree is the semi-strong form, which holds that all public information is considered to have been reflected in the price as soon as it becomes known, but that possessors of private information can use that information for profit. The third degree is the weak form, which holds only that any information gained from examining the security's past trading history is reflected in the price. Since past trading history is public information, the weak form is a specialization of the semi-strong form, which itself is a specialization of the strong form of the efficient markets hypothesis. Because of these different degrees of market efficiency, academic researchers investigate the efficient markets hypothesis by exploring unknown and valuable knowledge in historical data, using techniques such as data mining.
Enke and Thawornwong (2005) introduce an information gain technique used in machine learning for data mining to evaluate the predictive relationships of numerous financial and economic variables. Neural network models for level estimation and classification are then examined for their ability to provide an effective forecast of future values. Boginski, Butenko, and Pardalos (2006) propose a network representation of stock market data, referred to as the market graph, which is constructed by calculating cross-correlations between pairs of stocks based on opening price data over a certain period of time. Chun and Park (2005) propose a learning technique that extracts new case vectors using Dynamic Adaptive Ensemble CBR (DAE CBR). The main idea of DAE CBR originates from finding combinations of parameters and then updating and applying an optimal CBR model to an application or domain area. These concepts are investigated against the backdrop of a practical application involving the prediction of a stock market index. In addition, Rapach and Wohar (2006) analyze in-sample and out-of-sample tests of stock return predictability in an effort to better understand the nature of the empirical evidence on return predictability. That study finds that certain financial variables display significant in-sample and out-of-sample predictive ability with respect to stock returns. Overall, most articles treat stock market analysis as a time series problem, and there have been few studies using stock market efficiency to explore the possible cause-and-effect relationships among different stock categories or the influence of outside factors. This paper investigates stock market investment issues in the Taiwan stock market by implementing a two-stage data mining approach.
First, the Apriori algorithm, an association rule methodology, mines knowledge from historical data; this knowledge is expressed as patterns and rules in order to propose stock category associations and possible stock investment collections. Next, the K-means algorithm, a cluster analysis methodology, explores the clustering of stocks in order to mine this information for investment. Thus, by using two different data mining approaches, this paper presents possible investment portfolios from two perspectives: stock market association knowledge and cluster knowledge. The rest of this paper is organized as follows. In Section 2, we describe the Taiwan stock market. Section 3 presents the research design. Section 4 introduces the proposed data mining system, which includes system framework, relational database design, and physical database design. Section 4 presents the data mining approach, including the Apriori and K-means algorithms. Section 5 describes the data mining results. Research findings and discussions are presented in Section 6, and Section 7 presents a brief conclusion.
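To make the first stage more concrete, the following is a minimal sketch of Apriori-style association rule mining, not the authors' implementation. It assumes each trading day is encoded as the set of stock categories that rose that day; the category names, the sample days, and the support and confidence thresholds are all hypothetical and chosen only for illustration.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset meeting min_support."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)
    # Level-1 candidates: every single item seen in the data.
    candidates = {frozenset([item]) for t in transactions for item in t}
    frequent = {}
    k = 1
    while candidates:
        # Keep only the candidates whose support reaches the threshold.
        level = {}
        for c in candidates:
            support = sum(1 for t in transactions if c <= t) / n
            if support >= min_support:
                level[c] = support
        frequent.update(level)
        # Join frequent k-itemsets into (k+1)-candidates and prune any
        # candidate that has an infrequent k-subset (the Apriori property).
        k += 1
        candidates = set()
        for a, b in combinations(list(level), 2):
            union = a | b
            if len(union) == k and all(
                frozenset(s) in level for s in combinations(union, k - 1)
            ):
                candidates.add(union)
    return frequent


def association_rules(frequent, min_confidence):
    """Return (antecedent, consequent, support, confidence) tuples."""
    found = []
    for itemset, support in frequent.items():
        for r in range(1, len(itemset)):
            for left in combinations(sorted(itemset), r):
                antecedent = frozenset(left)
                confidence = support / frequent[antecedent]
                if confidence >= min_confidence:
                    found.append(
                        (antecedent, itemset - antecedent, support, confidence)
                    )
    return found


# Hypothetical data: categories that rose on each of five trading days.
days = [
    {"electronics", "semiconductor", "finance"},
    {"electronics", "semiconductor"},
    {"finance", "cement"},
    {"electronics", "semiconductor", "cement"},
    {"finance"},
]

frequent = apriori(days, min_support=0.5)
found = association_rules(frequent, min_confidence=0.8)
for antecedent, consequent, support, confidence in found:
    print(sorted(antecedent), "->", sorted(consequent), support, confidence)
```

In this toy data, a rule such as "electronics rises, so semiconductor rises" would be read as a candidate pairing of categories for an investment collection; real thresholds would need to be tuned to the actual data.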
Conclusion
This paper considers that a stock market has strong associations with both inside and outside factors. Some stock index categories rise or drop together, or are influenced by domestic or foreign economic, social, and political situations. For individual or institutional investors, finding indications of trends in stock market association is a valuable task. Data mining of the stock market, together with analysis and interpretation of the properties of the data mining results, gives new insights into possible associations in stock markets. In this paper, we use two data mining approaches, the Apriori algorithm and K-means, for association rule and clustering analysis. By doing so, this research finds that different possible portfolios of stock category investments can be implemented in the Taiwan stock market. Thus, this case study of implementing data mining approaches and integrating them into stock market research on the Taiwan stock market is an example for future research and implementation.
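As a rough illustration of the clustering approach mentioned above, the sketch below runs a plain K-means (Lloyd's algorithm) over stock categories. It is an assumption-laden example rather than the paper's method: the two features per category (an average return and a volatility figure), their values, and the choice of two clusters are hypothetical, since this excerpt does not state which variables were clustered.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, iterations=100, seed=0):
    """Return (labels, centroids) for a list of numeric tuples."""
    rng = random.Random(seed)
    # Initialize centroids with k distinct data points.
    centroids = rng.sample(points, k)
    labels = None
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the run has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = tuple(
                    sum(axis) / len(members) for axis in zip(*members)
                )
    return labels, centroids


# Hypothetical features per category: (average return, volatility).
features = {
    "electronics": (1.2, 3.1),
    "semiconductor": (1.0, 2.9),
    "finance": (0.2, 1.0),
    "cement": (0.1, 0.8),
}

names = list(features)
labels, centroids = kmeans([features[name] for name in names], k=2)
clusters = dict(zip(names, labels))
print(clusters)
```

Categories that land in the same cluster behave similarly on the chosen features, which is the kind of grouping an investor could use to compare or diversify across stock categories. The cluster numbers themselves are arbitrary labels.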