دانلود مقاله ISI انگلیسی شماره 24999
ترجمه فارسی عنوان مقاله

بررسی اثر غیر پارامتری سن رانندگان در تصادفات عقب پایان از طریق مدل رگرسیون لجستیک افزودنی

عنوان انگلیسی
Examining the nonparametric effect of drivers’ age in rear-end accidents through an additive logistic regression model
کد مقاله سال انتشار تعداد صفحات مقاله انگلیسی
24999 2014 8 صفحه PDF
منبع

Publisher : Elsevier - Science Direct (الزویر - ساینس دایرکت)

Journal : Accident Analysis & Prevention, Volume 67, June 2014, Pages 129–136

ترجمه کلمات کلیدی
رگرسیون لجستیک افزودنی - خطرات نسبی - اثر سن - جفت ذاتا همسان -
کلمات کلیدی انگلیسی
Additive logistic regression, Relative risks, Age effect, Inherently matched pairs,
پیش نمایش مقاله
پیش نمایش مقاله   بررسی اثر غیر پارامتری سن رانندگان در تصادفات عقب پایان از طریق مدل رگرسیون لجستیک افزودنی

چکیده انگلیسی

This study seeks to inspect the nonparametric characteristics connecting the age of the driver to the relative risk of being an at-fault vehicle, in order to discover a more precise and smooth pattern of age impact, which has commonly been neglected in past studies. Records of drivers in two-vehicle rear-end collisions are selected from the general estimates system (GES) 2011 dataset. These extracted observations in fact constitute inherently matched driver pairs under certain matching variables including weather conditions, pavement conditions and road geometry design characteristics that are shared by pairs of drivers in rear-end accidents. The introduced data structure is able to guarantee that the variance of the response variable will not depend on the matching variables and hence provides a high power of statistical modeling. The estimation results exhibit a smooth cubic spline function for examining the nonlinear relationship between the age of the driver and the log odds of being at fault in a rear-end accident. The results are presented with respect to the main effect of age, the interaction effect between age and sex, and the effects of age under different scenarios of pre-crash actions by the leading vehicle. Compared to the conventional specification in which age is categorized into several predefined groups, the proposed method is more flexible and able to produce quantitatively explicit results. First, it confirms the U-shaped pattern of the age effect, and further shows that the risks of young and old drivers change rapidly with age. Second, the interaction effects between age and sex show that female and male drivers behave differently in rear-end accidents. Third, it is found that the pattern of age impact varies according to the type of pre-crash actions exhibited by the leading vehicle.

مقدمه انگلیسی

Age has been identified as a crucial factor that affects the behavior of driving and is consequently associated with the risks of causing or being involved in accidents. Earlier studies (e.g. Massie et al., 1995 and Zhang et al., 1998) observed that young and old drivers are more likely to be involved in crashes and that old drivers also present the highest fatal involvement rate. Based on regression analyses, many other studies (e.g. Abdel-Aty and Radwan, 2000 and Yan et al., 2005) have discovered similar patterns by examining the risk-taking behaviors of drivers in different age groups. Generally, the overall age effect on accident risk exhibits a U-shaped trend. The underlying reasons are, firstly, that old drivers have slower perception and reaction times (Abdel-Aty et al., 1998), which may lead to an excessive load from mental activities while driving (Cantin et al., 2009). On the other hand, young drivers are more likely to exhibit less maturity and less experience in their driving skills (Borowsky et al., 2010), as well as speeding behaviors. Past accident analyses have largely relied on parametric statistical models, applied to observed crash data from police reports or experimental data from driving simulators (e.g. Martin et al., 2010) or eye tracers (e.g. Borowsky et al., 2010). In these models, the nonlinear effect of age is formulated by categorizing age into several predefined groups (e.g. Yan et al., 2005 and Martin and Lenguerrand, 2008) or using a quadratic spline function with predefined knots (Cummings et al., 2003b). In addition to these considerations, many other aspects have been applied to enrich the inspection of the age effect. For example, Lourens et al. (1999) performed a multivariate analysis with correction for annual mileage, and Kim et al. (2013) took the heterogeneous effects of age into account. Gwyther and Holland (2012) introduced a theory of the self-regulation behavior of age, in which old drivers tend to avoid driving in poor situations due to compromised physical condition. Undeniably, past studies have explored the age effect with various considerations. However, a more natural and flexible way of formulating the nonlinear effect of age is still lacking because inappropriate specifications can introduce additional bias into the estimation. For example, the results from models in which age is categorized into predefined groups will depend on the arbitrary definition of these groups. In addition, recent findings have indicated that the proportions of licensed drivers within different age groups have changed noticeably in many countries over the years (Sivak and Schoettle, 2012) and more notably the percentage of licensed drivers in the older group has increased rapidly for the last several decades. Therefore, it is necessary to inspect the mechanism used to connect accident risk to age through more flexible and precise specifications. Treated as a continuous variable, age has been examined with nonparametric specifications in many medical and epidemiological studies for a long time. For example, Durrleman and Simon (1989) characterized the nonlinear relationship between age and survival using Stanford Heart Transplant data, and Hultman et al. (2011) used smoothing splines to model the association between paternal age and offspring autism. However the nonparametric nature of age has not been investigated in traffic accident analyses. To this end, this study seeks to introduce an alternative approach based on a flexible specification of age in a more objective manner. An immediate impediment to developing models to examine age impact would be that the observed data do not cover those drivers who are not involved in any crashes during the research period, because crash data usually originate from police reports of accidents (Lourens et al., 1999). In this study, an inherently matched paired data structure is developed to overcome this issue, by matching the driver who was not at fault (struck) with the driver who was at fault (striking) in the same accident. The method is effective for evaluating relative accident risks by examining the characteristics of drivers. Similar to the case–control analysis (Agresti, 2002) used in epidemiology studies, the observations in a matched pair here consist of exactly one driver whose role is being at fault (striking) and one driver whose role is not being at fault (struck). The method actually also belongs to the quasi-induced conception (Yan et al., 2005). Therefore, this study is restricted to rear-end accidents involving only two vehicles, partly because such data are perfect for constituting the paired samples. Additive models are designed to supply nonparametric specifications for measuring the nonlinear effects of factors through flexible additive terms. In order to formulate a regression model able to capture more natural patterns of age impact, smooth functions of age are adopted as an additive term in this study, replacing the traditional linear term under fixed coefficients. Under such a modification, the additive models essentially constitute a more general family of models, which is advantageous in considering age impact. Generalized additive models (GAM) (Hastie and Tibshirani, 1986) provide an extension to the generalized linear models, including such additive terms. Under the GAM framework, past studies have attempted to model crash frequency using negative binomial additive models (Li et al., 2011 and Xie and Zhang, 2008). Yet, these models were based on subjects at more aggregated levels, for example accident counts of certain transportation facilities, and it is difficult to use these models to measure the age impact related to risk-taking behaviors for disaggregated subjects (drivers). Consequently, this study introduces the nonparametric version of regression models for examining drivers’ rear-end crash-involvement risk using an inherently matched paired data structure. On the other hand, age itself needs to be treated as a continuous variable in order for the sensitivity between aging and the exhibited risks of drivers to be understood, which has usually been ignored by past studies (e.g. Yan et al., 2005).

نتیجه گیری انگلیسی

Age has been verified as an important person-level attribute that affects the risk of causing or being involved in an accident. Since the distribution of ages of licensed drivers is presently changing rapidly in many countries, it is necessary to re-examine the age impact on the risk-taking behaviors of drivers, in depth, using recent data. When attempting to understand the contribution of age at each point in the continuum, the traditional methods in which age is commonly categorized into several predefined groups are less informative. To address this, this study seeks to introduce an alternative nonparametric regression analysis. While conventional methods are able to capture the nonlinear pattern of age to some degree, they are restricted since the magnitudes of age impact within each defined group are designed to be equal. Yet, age is in nature a continuous variable, and this study has discovered that its effect may vary significantly, even within a narrow range. A detailed nonlinear pattern of age is unlikely to be revealed by the traditional approaches in which the results are largely determined by the subjective design of the age categorizations or predefined knots in the spline functions. This study therefore contributes a flexible specification of age that better measures the accident risks. Generally, the real causes of an accident may be ascribed to multiple vehicles, even those not physically involved in the collision. Therefore, for the general case, it is difficult to identify an at-fault vehicle that is fully responsible for the accident, especially for accidents involving multiple vehicles in complicated situations. The subject of this study is thus designated to be the two-vehicle rear-end accidents, since this is the clearest type of accident for evaluating the relative risks of the drivers, even though the responsibilities can still be difficult to disentangle (between leading and rear vehicle) in certain cases. Fortunately, the pre-crash scenario of “Going Straight” provides a subset of samples where the full responsibility for the collisions can clearly be attributed to the rear vehicles. However, in other cases and even other types of accidents, this is a pervasive limitation on observed accident datasets. Therefore, in future accident data collection, it is suggested that additional details indicating the initial events that caused the crashes should be recorded, especially when the leading vehicle performed improper deceleration or stopped altogether before a collision. On the other hand, for future research, it is also necessary that we consider pre-crash situations and pay more carefulness to allocating driver responsibility. Evaluations of accident risks for drivers require the data to assign different roles to the drivers in order that the variables associated with high risks can be tested. Therefore, this study used a data structure with observations in inherently matched pairs so as to inspect the factors, including age, determining the risk of being at fault in a rear-end accident. The proposed data structure is similar to that used in case–control studies, and the observation selection mechanism is similar to that of twin studies. A pair of drivers in the same accident will share many variables and some will even be unmeasurable. Accordingly, the differences between two observations in a pair will not be due to any of the matching variables. This ensures that the standard error of the modeling parameters will not be influenced by these matching variables, leading to a higher power of estimation. Soothing functions provide excellent estimations within the range of variables in data. However, outside this range, the extrapolation from the estimated smoothing spline may experience amplified errors, especially for points far from the range. Fortunately, the range of ages is highly restricted in nature. In this study, ages range from 12 to 111, which covers most possible ages of drivers. Therefore, this study does not face such extrapolation issues. In this study, the relative risk of being an at-fault (striking) driver was connected to several factors, including age, gender, alcohol/drug usage and the physical condition of the driver. Based on the specification of the smoothing cubic spline function of age, the estimation explicitly provided the age impact on the log odds of being the “striking” driver in a rear-end accident. Also, the interaction effect between age and gender was investigated in a nonparametric manner. The interaction term in the regression model in fact produced two separate smooth functions, for male and female drivers. Controlling for other variables, the extracted age patterns indicate an approximate 4.5-year delay in male drivers’ driving performance and the risk of being at fault in a rear-end accident. This is an important phenomenon that is difficult to reveal using regular parametric specifications. The results not only confirm the U-shaped pattern of the age impact but further demonstrate that young and old drivers actually show higher age-sensitivity to risk. Therefore, it is argued that there are two important phases of driving performance and risk-taking behavior in the lives of drivers: the phase of growth in which the performance of the driver increases, and the phase of recession in which the performance declines. In order to mitigate the risks for young and old drivers, it is important to enhance traffic safety education for them, especially for young drivers. On the other hand, advanced vehicular technologies could be a valuable preventive measure for older drivers. For example, warning messages could be provided, or active driving-safety technologies could be used to compensate for driving mistakes due to reduced vehicle-maneuvering capacities and slower reaction times. The statistical evidence herein provides important findings and could promote further studies on improving driver's license policy. Because male and female drivers exhibit different risk behaviors at the same age, there could be separate policies for male and female drivers. According to the results, it is possible that the minimum age for applying a driver's license could be made higher for male than female drivers due to the larger risk for young male drivers. In order to justify such an action, future studies need to be conducted looking at several aspects. First, many other variables, for example drivers’ demographic and cultural attributes, could be introduced to control the influence from individual-level characteristics. Second, it would be useful to perform similar risk analyses on other types of accidents, and also on different datasets in order to produce more universal conclusions. Finally, driving-simulator-based studies offer a valuable approach that can provide highly controlled experimental environments. They can lead to more conclusive results than observed datasets because the roles of the drivers are clearer and can be specifically designed. In addition, for elderly drivers, the nonparametric analysis proposed here could be useful in determining the earliest age at which to check whether their driving capabilities are still sufficient. For example, the age of old drivers with equivalent risk levels to the youngest legal driving age could be chosen.