برنامه های کاربردی شبکه عصبی در رفتار مصرف کننده
|کد مقاله||سال انتشار||مقاله انگلیسی||ترجمه فارسی||تعداد کلمات|
|1809||2010||9 صفحه PDF||سفارش دهید||1 کلمه|
Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)
Journal : Journal of Consumer Psychology, Volume 20, Issue 3, July 2010, Pages 381–389
This article introduces the concepts and terminology of artificial neural networks. The approach is demonstrated on data that represent a domain of interest to the consumer psychologist. ANNs are mathematical models that are commonly used in business applications to model relationships between variables. A key strength of ANNs is their flexibility, i.e. their ability to easily accommodate linear and non-linear relationships without any a priori functional form specification. They can easily be used to study topics of interest to consumer psychologists such as persuasion, influence, segmentation etc and can offer distinct advantages over traditional statistical techniques such as ANOVA and regression. We demonstrate the application of ANNs in three different areas: regression, non-linear principal component analysis and classification.
Artificial neural networks (ANNs) were originally developed by researchers in computer science as an analogy to the human brain (cf., Hinton, 1992). Neurons receive inputs from other neurons, combine them, and send signals on to other neurons. Some cognitive sciences use the network analog as a fairly literal model of the phenomena being studied, but other disciplines have begun exploring the utility of the network model as a statistical analytical tool. In particular, consider that some form of the general linear model—analysis of variance or regression—accounts for much of the findings in many social science literatures, even though researchers know that some phenomena are non-linear (e.g. response to fear appeals, the effects of repetition on attitudes). One of the strengths of ANNs is the ability to easily accommodate linear and non-linear relationships; indeed the researcher need not specify functional forms a priori. Distributions do not need to be specified, hence there are fewer opportunities for assumptions, such as multivariate normality, to be violated. Naturally, any modeling approach comes with strengths and weaknesses, and Table 1 summarizes key advantages and disadvantages of ANNs relative to the more frequently used general linear models (e.g. regression). An early disadvantage that had been heavily cited was the requirement of large sample sizes, yet increasingly marketers and consumer psychologists are working with extremely large datasets (e.g., companies' CRM databases, online records and the like).Broadly speaking, ANNs can be classified along two dimensions: 1) the type of learning algorithms used—where supervised and unsupervised are the two extreme pure forms, and 2) whether the networks are recursive or non-recursive. In this literature, the term “learning” comes from the analogy to the human brain, and refers to the iterative methods used to estimate the parameters of the model. To elaborate on the first dimension, in “supervised learning,” the dependent variable(s) is (are) observed, the learning algorithms define an error function to be minimized (e.g., squared distance between the observed and predicted variables, like in regression) and then a gradient-based optimization technique is used to estimate the parameters. In “unsupervised learning,” the dependent variable is not observed (much like the “truth” not being known when conducting a cluster analysis), but the algorithms estimate the parameters to implicitly or explicitly maximize similar objective functions (Cheng & Titterington, 1994). To understand the difference between recursive and non-recursive networks, consider Fig. 1. It depicts three types of layers (or nodes) within a network: 1) input, 2) hidden and 3) output. There can be zero or more hidden layers within a network. The input layer is analogous to independent variables, as this is where they enter the network. In Fig. 1, there are two inputs (x1 and x2), so there are two input nodes. The output layer is analogous to the prediction of the network model, whether it is forecasting, cluster membership, classification, probability of occurrence, etc. In Fig. 1, there is a single output variable, y′, so there is one output node. Depending upon the application, y′ could be the forecasted value of the observed dependent variable y. The arrows in Fig. 1 between the input layer and hidden layer, and between the hidden layer and output layer represent coefficients to be estimated.The nodes which are not input or output nodes are the hidden nodes. Hidden nodes are similar to the latent or unobservable variables that are often related to consumer attitudes or behavior (e.g. personality differences). They can capture complex non-linearities and interactions. There is only one layer in Fig. 1 because no hidden nodes are connected to other hidden nodes. The network depicted in Fig. 1 is non-recursive because there are no loops among the hidden nodes that would result in the output of a node also being an input to the same node. Non-recursive networks are also called “Feed-Forward” ANNs. From a mathematical standpoint, Feed-Forward ANNs can be written as Y = ƒ(X) + ε, where ε is the error term, Y is the observed dependent variable and ƒ(X) is some function of the independent variables, X. While there are applications where recurrent ANNs and unsupervised learning can be used in marketing research, this article focuses on feed-forward ANNs because of their close relationship to regression and applicability to behavioral research. The remainder of this article is organized as follows. The next section describes the relationship between feed-forward ANNs and statistics and includes a discussion of some of the limitations of ANNs from a statistical perspective. The following section provides some examples of neural networks that address specific applications. The article closes with conclusions and discussion.
نتیجه گیری انگلیسی
This article was intended to serve as an overview of neural networks as a statistical methodology that could be applied to domains of interest to consumer psychologists. We fully expect the typical consumer psychologist reader to continue the tradition in which experimental techniques and analysis of variance dominate in the pursuit of their theory building and testing scholarship. Yet it is always good practice to be aware of alternative analytical tools, for flexibility, should the need arise. In particular, we have painted ANNs as an extremely powerful technique. Part of that power is the versatility the network structures allow, and the fact that the functional forms are part of the analytical discovery. In this regard, and given their minimalist assumptions, e.g., little concern for distributional form, ANNs are related to nonparametric estimation. Techniques with greater flexibility tend also to be those with somewhat more ambiguity regarding, for example, the architectural form of the network (e.g., number of hidden nodes, or layers of same). In addition, the interpretation of the resulting weights and parameter estimates require some skill, as ANNs involve some art, as well as science, in their estimation. Nevertheless, cautioning a consumer psychologist to be careful when interpreting results is no different from similar concerns in analogous techniques, such as structural equations models.