یک روش داده کاوی برای کشف دانش خرده فروشی با در نظر گرفتن اثر همجواری قفسه - مجاورت در فروش
کد مقاله | سال انتشار | تعداد صفحات مقاله انگلیسی |
---|---|---|
21403 | 2006 | 18 صفحه PDF |
Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)
Journal : Decision Support Systems, Volume 42, Issue 3, December 2006, Pages 1503–1520
چکیده انگلیسی
Recent marketing research has suggested that in-store environmental stimuli, such as shelf-space allocation and product display, has a great influence upon consumer buying behavior and may induce substantial demand. Prior work in this area, however, has not considered the effect of spatial relationships, such as the shelf-space adjacencies of distinct items, on unit sales. This paper, motivated in great part by the prominent beer and diapers example, uses data mining techniques to discover the implicit, yet meaningful, relationship between the relative spatial distance of displayed products and the items' unit sales in a retailer's store. The purpose of the developed mining scheme is to identify and classify the effects of such relationships. The managerial implications of the discovered knowledge are crucial to the retailer's strategic formation in merchandising goods. This paper proposes a novel representation scheme and develops a robust algorithm based on association analysis. To show its efficiency and effectiveness, an intensive experimental study using self-defined simulation data was conducted. The authors believe that this is the first academically researched attempt at exploring this emerging area of the merchandising problem using data mining.
مقدمه انگلیسی
Over the past three decades, merchandisers have relied heavily on marketing stimuli to increase their sales volume. Marketing research has suggested that the in-store stimuli (such as display, layout, atmosphere, and shelf-space arrangement) has a great influence on consumer buying behavior and may encourage sales by maximizing impulse buying and cross-selling. For example, a factorial experiment revealed that an in-store display of an item creates excitement and increases the average amount purchased [13]. A case study showed that effective store layout could stimulate demand to the point of doubling the sales rate by making it easier to find items and creating a positive image or feeling [43]. Psychological experiments show that elements of a store's atmosphere, like lighting, color, music, and aisle width, may have a greater influence on shopping behavior than characteristics of the product itself [19] and [27]. The positive curvilinear relationship between an item's shelf-space and its sales has been verified empirically for a wide variety of consumer goods [16] and [24]. Prior studies on in-store environmental stimuli, or merchandising techniques, have not considered the effect of spatial relationships, such as the shelf-space adjacencies of distinct items, on unit sales. As the famous beer and diapers example reveals [4], not considering the effects of side-by-side displays of items commonly purchased together may cause a retailer to miss out on tremendous revenue potential. The visual effect of adjacency can stimulate impulse purchases that account for 70% of buying decisions in a supermarket [5]. In light of this potential, this paper attempts to discover the implicit, yet meaningful, relationship between the relative spatial “distance” of displayed products and the items' unit sales in a retail store using data mining techniques. Special focus is placed on building a novel representation scheme for the historical transaction data and on developing an efficient and robust algorithm for knowledge mining. The proposed approaches measure and classify the effects of spatial adjacency of distinct items on increased sales. Our data mining approach differs from the well-known market basket analysis in several aspects. Our approach takes the product-to-shelf assignment information into account and incorporates the transaction time into the data stream dynamically. In contrast, the market basket analysis mainly determines what products customers purchase together in a static fashion, disregarding the product-to-shelf information. Therefore, our approach requires a differently formatted data-warehouse that must have spatial and temporal contents, and demands a more sophisticated algorithm for mining the dynamic transaction data effectively and efficiently. For the purposes of this study, we have assumed that all the required historical data is readily available and has been stored in the data-warehouse. To manage sales, a retail company must continue storing information in its databases on when the items are on the shelf and where they are placed. By properly pre-processing and integrating this data using the standard ETL tools provided by data warehouse software, one can obtain all this information in the format specified in this paper. Based on the simplified scenario, the problem is defined, the representation scheme is proposed, and the mining algorithm is developed using the association rules. The more sophisticated mining techniques, like the one discussed in this paper, are superior to traditional approaches in retail knowledge discovery, such as the market basket analysis or frequent-buyer program [7]. In some cases, the fact that items sell well together is obvious, such as laundry detergent and fabric softener [34], greeting cards and seasonal candy, or coffee and coffee makers. Occasionally, however, the fact that certain items would sell well together is far from obvious, such as in the case of diapers and beer [4] or bottled juice and cold remedies [7]. The true reason behind such purchase patterns remain unclear; it may be due to their close proximity in shelf location or other consumer behavior we have yet to discover. In this regard, the market basket analysis or frequent-buyer program is unable to provide satisfactory results. The proposed scheme attempts to dig for obscure clues by introducing the spatial relationship and transaction time information into the mining techniques. These approaches are separated not only by function and required data content but also by their managerial implications. The frequent-buyer program focuses on a consumer-level analysis to investigate individual purchase habits, such as a customer's affinity analysis. The market basket analysis, on the other hand, is mainly devoted to a market-level analysis. Due to the crossover of consumer traffic among stores, market-level analysis classifies the demand relationships across product categories into complement, independent, or substitute within the consumer choice process [39]. In contrast, our mining scheme is a store-level merchandise technique that identifies the effects of shelf-space proximity on unit sales over a finite time horizon and classifies the patterns as positive, independent, or negative. A positive pattern refers to a positive effect of shelf arrangement of distinct products on sales, meaning that placing specific product assortments side-by-side or in close proximity will trigger supplemental sales due to factors such as increased impulse purchasing and cross-selling. The negative pattern refers to a negative effect of such a spatial relationship. The discovered relationship between shelf patterns and unit sales is crucial for effective decision-making and strategic planning in merchandising goods. For example, retailers can rearrange their shelf-space to increase impulse buying, and the store manager can measure the effect on revenue. The beer and diapers example has suggested the potential of utilizing spatial relationships [4]. In another related example [30], Seven-Eleven Japan has a policy of adjusting its store layout and product placement multiple times every day to reflect the changing purchase patterns at different hours of the day, so that customers can easily find their favorite items. In this regard, this paper is very likely the first academic research that explores this emerging, high-potential area. The remainder of this paper is organized as follows. The related literature is reviewed in Section 2. Section 3 defines the problem context and proposes a representation scheme, and algorithmic development based on that scheme is detailed in Section 4. Using self-defined simulation data, an intensive experimental study was carried out and the results are discussed in Section 5. Special emphasis is placed on a comparative study between the proposed scheme and the traditional scheme, the Apriori algorithm, in terms of efficiency and effectiveness. The final section is devoted to recapping the research contributions and potential applications, discussing the limitations of the research, and offering directions for future research.
نتیجه گیری انگلیسی
Traditional data-driven analysis techniques such as the frequent-buyer program or the market basket analysis can only provide a profile of customers' purchasing affinities; they tell us what combinations are in their shopping carts, but cannot tell us why. The reason why some products are frequently bought together, like detergent and fabric softener, is apparent, while other combinations, like bottled juice and cold remedies or beer and diapers, are not so easily explained. This research seeks a possible answer to these questions by investigating spatial relationships between displayed products and their impact on sales that result from the visual effects of adjacency on impulse buying and cross-selling. In this regard, this paper opens a new research dimension by treating the spatial relationship as an important marketing tool in retailing and merchandising. In addition, the proposed mining scheme focuses on a store-level analysis in a dynamic fashion, which remedies the inherent shortcoming of the existing association analysis in dealing with the diverse and changing retail store environment. Our extensive and well-designed experiment has shown promising results that are numerically sound and computationally efficient. As with most management science applications, however, the proposed scheme is purely theoretical. In our study, the retailer's transaction data was assumed to include spatial content, such as product-to-shelf assignments, in a dynamic fashion. Furthermore, the proposed scheme can only work well in a designated scenario, which requires changes of such assignments. These assumptions and restrictive conditions may limit its applicability, yet they do not decrease its merit. Our exploratory research has demonstrated the technical feasibility of the proposed scheme and may also be applicable in practice, provided that adequate data is available. Our paper may contribute to the areas of data mining and data-warehousing and may also help data-gathering content in databases. These areas are extremely valuable in decision-making and strategic formation in future merchandise planning. The representation scheme and knowledge-mining algorithm proposed in this paper represent a positive initiative in the emerging area of data-driven marketing, or so-called database marketing [38]. Using the proposed scheme, the positive or negative adjacent relationships among distinct products can be discovered in the first stage of implementation. In the second stage, the manager can develop an effective merchandising strategy by using this information to rearrange shelf-space and product placement in the store. For example, a manager can place products in the same area that have positive adjacent effects, such as displaying beer just outside the diaper aisle, and can place products as far apart as possible if a negative effect between the items is found. Additionally, a catalog or on-line merchant could use the information to determine the display and layout of its catalog or on-screen design, and direct marketers could use the knowledge to determine which new products should be introduced and which items should be bundled together with frequently bought items and offer them to their prior customers. In relation to the proposed scheme, there are some critical managerial issues that should be further studied. First, the issue of how the obtained positive and negative shelf patterns can be optimally implemented into the existing layout in a store remains unclear. Second, if the total space is given when the shelf-space layout has not yet been determined, how can we determine the shelf-space layout and product placement simultaneously, so that some objective criteria, such as total revenue or profit margin, can be optimized? Third, if the total space can be expanded via capital investment, how can the space, shelf layout, and location assignment be optimally determined? These questions should be further investigated in future research. An empirical study should be conducted to validate the effectiveness of any theoretical scheme. Since database reformatting and data-gathering procedures are time-consuming and costly, empirical research in this area is challenging and may be difficult to accomplish. We believe, however, that our research idea and proposed scheme is a first step towards a novel approach of embracing data mining techniques and merchandise planning.