scholarly journals Driving Risk Assessment Using Near-Miss Events Based on Panel Poisson Regression and Panel Negative Binomial Regression

Entropy ◽  
2021 ◽  
Vol 23 (7) ◽  
pp. 829
Author(s):  
Shuai Sun ◽  
Jun Bi ◽  
Montserrat Guillen ◽  
Ana M. Pérez-Marín

This study proposes a method for identifying and evaluating driving risk as a first step towards calculating premiums in the newly emerging context of usage-based insurance. Telematics data gathered by the Internet of Vehicles (IoV) contain a large number of near-miss events which can be regarded as an alternative for modeling claims or accidents for estimating a driving risk score for a particular vehicle and its driver. Poisson regression and negative binomial regression are applied to a summary data set of 182 vehicles with one record per vehicle and to a panel data set of daily vehicle data containing four near-miss events, i.e., counts of excess speed, high speed brake, harsh acceleration or deceleration and additional driving behavior parameters that do not result in accidents. Negative binomial regression (AICoverspeed = 997.0, BICoverspeed = 1022.7) is seen to perform better than Poisson regression (AICoverspeed = 7051.8, BICoverspeed = 7074.3). Vehicles are separately classified to five driving risk levels with a driving risk score computed from individual effects of the corresponding panel model. This study provides a research basis for actuarial insurance premium calculations, even if no accident information is available, and enables a precise supervision of dangerous driving behaviors based on driving risk scores.

2021 ◽  
Vol 5 (1) ◽  
pp. 1-13
Author(s):  
Yopi Ariesia Ulfa ◽  
Agus M Soleh ◽  
Bagus Sartono

Based on data from the Directorate General of Disease Prevention and Control of the Ministry of Health of the Republic of Indonesia, in 2017, new leprosy cases that emerged on Java Island were the highest in Indonesia compared to the number of events on other islands. The purpose of this study is to compare Poisson regression to a negative binomial regression model to be applied to the data on the number of new cases of leprosy and to find out what explanatory variables have a significant effect on the number of new cases of leprosy in Java. This study's results indicate that a negative binomial regression model can overcome the Poisson regression model's overdispersion. Variables that significantly affect the number of new cases of leprosy based on the results of negative binomial regression modeling are total population, percentage of children under five years who had immunized with BCG, and percentage of the population with sustainable access to clean water.


2021 ◽  
Vol 10 (3) ◽  
pp. 226-236
Author(s):  
Khusnul Khotimah ◽  
Itasia Dina Sulvianti ◽  
Pika Silvianti

The number of leper in West Java is an example of the count data case. The analyzes commonly used in count data is Poisson regression. This research will determine the variables that influence the number of leper in West Java. The data used is the number of leper in West Java in 2019. This data has an overdispersion condition and spatial heterogenity. To handle overdispersion, the negative binomial regression model can be employed. While spatial heterogenity is overcome by adding adaptive bisquare kernel weight. This research resulted Geographically Weighted Negative Binomial Regression (GWNBR) with a weighting adaptive bisquare kernel classifies regency/city in West Java into ten groups based on the variables that sigfinicantly influence the number of leper. In general, the variable in the percentage of households with Clean and Healthy Behavior (PHBS) has a significant effect in all regency/city in West Java. Especially for Bogor Regency, Depok City, Bogor City, and Pangandaran Regency, the variable of the percentage of people poverty does not have a significant effect on the number leper.


2019 ◽  
pp. 232102221886979
Author(s):  
Radhika Pandey ◽  
Amey Sapre ◽  
Pramod Sinha

Identification of primary economic activity of firms is a prerequisite for compiling several macro aggregates. In this paper, we take a statistical approach to understand the extent of changes in primary economic activity of firms over time and across different industries. We use the history of economic activity of over 46,000 firms spread over 25 years from CMIE Prowess to identify the number of times firms change the nature of their business. Using the count of changes, we estimate Poisson and Negative Binomial regression models to gain predictability over changing economic activity across industry groups. We show that a Poisson model accurately characterizes the distribution of count of changes across industries and that firms with a long history are more likely to have changed their primary economic activity over the years. Findings show that classification can be a crucial problem in a large data set like the MCA21 and can even lead to distortions in value addition estimates at the industry level. JEL Classifications: D22, E00, E01


2013 ◽  
Vol 2 (2) ◽  
pp. 6
Author(s):  
PUTU SUSAN PRADAWATI ◽  
KOMANG GDE SUKARSA ◽  
I GUSTI AYU MADE SRINADI

Poisson regression was used to analyze the count data which Poisson distributed. Poisson regression analysis requires state equidispersion, in which the mean value of the response variable is equal to the value of the variance. However, there are deviations in which the value of the response variable variance is greater than the mean. This is called overdispersion. If overdispersion happens and Poisson Regression analysis is being used, then underestimated standard errors will be obtained. Negative Binomial Regression can handle overdispersion because it contains a dispersion parameter. From the simulation data which experienced overdispersion in the Poisson Regression model it was found that the Negative Binomial Regression was better than the Poisson Regression model.


Empirica ◽  
2019 ◽  
Vol 47 (4) ◽  
pp. 699-731
Author(s):  
Franz Hackl ◽  
Rudolf Winter-Ebmer

Abstract E-commerce has become an integral part of the world’s economy. In this study we investigate the impact of service quality in e-tailing on site visits and consumer demand. Such an analysis is important given the almost Bertrand-like competitive structure. Our analysis is based on a large representative data set obtained from a price comparison site covering essentially the complete Austrian e-tailing market. Customer evaluations for a broad range of 15 different service characteristics are condensed using factor analysis. Negative binomial regression analysis is used to measure the impact of service quality dimensions on referral requests to online shops for different product categories. Our results show that the most important service quality aspects are those related to the ordering process and the firm’s website performance.


Author(s):  
Samuel Olorunfemi Adams ◽  
Muhammad Ardo Bamanga ◽  
Samuel Olayemi Olanrewaju ◽  
Haruna Umar Yahaya ◽  
Rafiu Olayinka Akano

COVID-19 is currently threatening countries in the world. Presently in Nigeria, there are about 29,286 confirmed cases, 11,828 discharged and 654 deaths as of 6th July 2020. It is against this background that this study was targeted at modeling daily cases of COVID-19’s deaths in Nigeria using count regression models like; Poisson Regression (PR), Negative Binomial Regression (NBR) and Generalized Poisson Regression (GPR) model. The study aim at fitting an appropriate count Regression model to the confirmed, active and critical cases of COVID-19 in Nigeria after 118 days. The data for the study was extracted from the daily COVID-19 cases update released by the Nigeria Centre for Disease Control (NCDC) online database from February 28th, 2020 – 6th, July 2020. The extracted data were used in the simulation of Poisson, Negative Binomial, and Generalized Poisson Regression model with a program written in STATA version 14 and fitted to the data at a 5% significance level. The best model was selected based on the values of -2logL, AIC, and BIC selection test/criteria. The results obtained from the analysis revealed that the Poisson regression could not capture over-dispersion, so other forms of Poisson Regression models such as the Negative Binomial Regression and Generalized Poisson Regression were used in the estimation. Of the three count Regression models, Generalized Poisson Regression was the best model for fitting daily cumulative confirmed, active and critical COVID-19 cases in Nigeria when overdispersion is present in the predictors because it had the least -2log-Likelihood, AIC, and BIC. It was also discovered that active and critical cases have a positive and significant effect on the number of COVID-19 related deaths in Nigeria.


2019 ◽  
Vol 8 (1) ◽  
Author(s):  
Lulu Mahdiyah Sandjadirja ◽  
Muhammad Nur Aidi ◽  
Akbar Rizki

Poisson regression can be used to model rare events that consist of count data. Poisson regression application is carried out to find out external factors that affect the number of poor people in Indonesia by the province in 2016. The assumptions that must be met in this analysis are equdispersion. However, in real cases there is often a problem of overdispersion, ie the value of the variance is greater than the average value. High diversity can be caused by outliers. Expenditures on outliers have not been able to deal with the problem of overdispersion in Poisson Regression. One way to overcome this problem is to replace the Poisson distribution assumption with the Negative Binomial distribution. The results of the analysis show that the Negative Binomial Regression model without outliers is better than the Poisson Regression without outliers model indicated by a smaller AIC value. Based on the Negative Binomial Regression model without this outlier the external factors that affect the number of poor people in Indonesia by the province in 2016 are the percentage of households with floor conditions of houses with soil by province, population by province, percentage of unemployment to the total workforce by province and the percentage of the workforce against the working age population.


2021 ◽  
Vol 2123 (1) ◽  
pp. 012028
Author(s):  
Dian Handayani ◽  
A F Artari ◽  
W Safitri ◽  
W Rahayu ◽  
V M Santi

Abstract Crime rate is the number of reported crimes divided by total population. Several factors could contribute the variability of crime rates among areas. This study aims to model the relationship between crime rates among regencies and cities in the East Java Province (Indonesia) and some potentially explanatory variables based on Statistics Indonesia publication in 2020. The crime rate in the East Java Province was consistently at the top three after DKI Jakarta and North Sumatra during 2017 to 2019. Therefore, it is interesting for us to study further about the crime rate in the East Java. Our preliminary analysis indicates that there is an overdispersion in our sample data. To overcome the overdispersion, we fit Generalized Poisson and Negative Binomial regression. The ratio of deviance and degree of freedom based on Negative Binomial is slightly smaller (1.38) than Generalized Poisson (1.99). The results indicate that Negative Binomial and Generalized Poisson regression, compared to standard Poisson regression, are relatively fit to model our crime rate data. Some factors which contribute significantly (α=0.05) for the crime rate in the East Java Province under Negative Binomial as well as Generalized Poisson regression are percentage of poor people, number of households, unemployment rate, and percentage of expenditure.


2019 ◽  
Vol 188 (7) ◽  
pp. 1319-1327
Author(s):  
Alexis Robert ◽  
W John Edmunds ◽  
Conall H Watson ◽  
Ana Maria Henao-Restrepo ◽  
Pierre-Stéphane Gsell ◽  
...  

Abstract Understanding risk factors for Ebola transmission is key for effective prediction and design of interventions. We used data on 860 cases in 129 chains of transmission from the latter half of the 2013–2016 Ebola epidemic in Guinea. Using negative binomial regression, we determined characteristics associated with the number of secondary cases resulting from each infected individual. We found that attending an Ebola treatment unit was associated with a 38% decrease in secondary cases (incidence rate ratio (IRR) = 0.62, 95% confidence interval (CI): 0.38, 0.99) among individuals that did not survive. Unsafe burial was associated with a higher number of secondary cases (IRR = 1.82, 95% CI: 1.10, 3.02). The average number of secondary cases was higher for the first generation of a transmission chain (mean = 1.77) compared with subsequent generations (mean = 0.70). Children were least likely to transmit (IRR = 0.35, 95% CI: 0.21, 0.57) compared with adults, whereas older adults were associated with higher numbers of secondary cases. Men were less likely to transmit than women (IRR = 0.71, 95% CI: 0.55, 0.93). This detailed surveillance data set provided an invaluable insight into transmission routes and risks. Our analysis highlights the key role that age, receiving treatment, and safe burial played in the spread of EVD.


Sign in / Sign up

Export Citation Format

Share Document