Vague or fuzzy data find application in several fields, such as psychometry, reliability, marketing, quality control, ballistics, ergonomy, image recognition, artificial intelligence, etc. A typical problem where vague data arise is that of assigning numbers to subjective perceptions or to linguistic variables (such as “enough”, “good”, “sufficiently”, etc.). In fact, there are many cases where observations cannot be known or quantified exactly, and, thus, we can only provide an approximate description of them, or intervals to enclose them. For instance, “in measuring the influence of character size on the reading ability from a video display terminal […] the reading ability of the subject, which is essentially the experimental output, depends on his/her eyesight, age, the environment, individual responses, and even how tired is the individual. Some of these factors cannot be described accurately and […] the best description of these kinds of output is that they are fuzzy outputs” (Chang et al., 1996).
In the analysis of fuzzy data, the study of the relationship between a set of (crisp or fuzzy) independent variables and one or more dependent fuzzy variables is an important issue. Such a relationship can be studied, for instance, through fuzzy linear regression techniques. In particular, fuzzy linear regression was first introduced by Tanaka 1980 and Tanaka 1982. Several developments followed these pioneer papers, such as those studied by Chang and Lee 1994a, Chang and Lee 1994b, Chang and Lee 1994c and Chang and Lee 1996, Diamond (1990), Diamond and Kloeden (1994), Kacprzyk and Fedrizzi (1992), Kim et al. (1996), Moskowitz and Kim (1993), Peters (1994), Redden and Woodall 1994 and Redden and Woodall 1996, Savic and Pedrycz (1991), Tanaka (1987), Tanaka and Watada (1988), Tanaka and Ishibuchi (1991), Tanaka et al. (1995)Xizhao and Minghu (1992). Also several applications have been proposed: for instance, see Chang et al. (1996), Lee (1996), McCauley and Wang (1997).
Following Dubois and Prade (1980) and Zimmermann (1985), a fuzzy number may be defined as F=(α,β,γ)LR, where α denotes the center (or mode), β and γ are the left spread (or width) and right spread, respectively, L and R denote the left and right shape functions. When β=γ and the shape functions are specular (w.r.t. the center), we have a symmetrical fuzzy number, denoted by F∗=(α,β). It is common to define a symmetrical fuzzy number by using a triangular membership function (also shown in Fig. 1):
Full-size image (1 K)
Fig. 1.
Triangular membership function.
Figure options
In the following section, we propose a new approach to fuzzy linear regression analysis. Our treatment is not based on the usual linear programming techniques, but upon the idea of fitting a regression model through minimization of a distance function between the empirical fuzzy data and the model.
For the sake of simplicity, we will consider the regression problem from a descriptive point of view, thus avoiding inferential issues.
We have proposed a new approach to fuzzy regression analysis based on a decomposition of the relationship between a fuzzy-dependent variable and independent crisp variables into two components: one for the centers and one for the spreads of the dependent variable. Our approach, not based on linear programming techniques, is an alternative way to fuzzy linear regression analysis, which, we believe, should be closer to a “statistical” mind.