• Title/Summary/Keyword: Negative Binomial Regression

Understanding the Entertainment Values in the Online Educational Videos

  • Jeong, Seong Bin;Lee, Justin Jemin;Kwak, Kyu Tae
    • Journal of Internet Computing and Services / v.19 no.5 / pp.77-87 / 2018
  • Since the inception of platform businesses in educational content, the prominence of online educational video has reshaped the educational environment. Educational content produced on the internet has given learners more flexible, student-centered access. In fact, the number of people watching educational content online, such as TED talks and YouTube videos, has increased during the past decade. The ways of delivering lectures and course information in online educational videos differ markedly from traditional lectures. In this paper, we aimed to examine and categorize online educational videos based on users' engagement and interest in the course content. For the study, a negative binomial regression analysis was applied to estimate the effects of the attributes of traditional lectures through a comparative analysis of online educational videos. Several attributes were identified as engaging factors in online educational videos: hybrid production combining education and entertainment, shorter duration, and the number of presenters. Based on the study, we suggest how to produce engaging educational content that attracts users' attention. Moreover, the results of the study may serve as a guide for providers making productive educational videos.

Analysis of Traffic Accident by Circular Intersection Type in Korea Using Count Data Model (가산자료 모형을 이용한 국내 원형교차로 유형별 교통사고 분석)

  • Kim, Tae Yang;Lee, Min Yeong;Park, Byung Ho
    • Journal of the Korean Society of Safety / v.32 no.5 / pp.129-134 / 2017
  • This study aims to develop traffic accident models by circular intersection type using count data models. The number of accidents, the number of fatal and injured persons (FSI), and EPDO are calculated from the traffic accident data of TAAS. The circular intersection accident models are developed through Poisson and negative binomial regression analysis. The main results of this study are as follows. First, the null hypotheses that there are no differences in the number of traffic accidents, FSI, and EPDO by type of circular intersection are rejected. Second, the scale of the intersection (median, large), number of approach roads, mean width and length of the exit road, and area of the circulating roadway and central island are selected as factors influencing the number of traffic accidents, FSI, and EPDO at rotaries. Third, the scale of the intersection (median), guide signs (speed limit, direction, roundabout), number of approach roads, entry angle, and area of the intersection and central island are adopted as factors influencing the number of traffic accidents, FSI, and EPDO at roundabouts. Finally, converting rotaries to roundabouts can be expected to reduce accidents.

Impact of Level of Physical Activity on Healthcare Utilization among Korean Adults (성인의 신체활동 정도가 의료이용에 미치는 영향)

  • Kim, Ji-Yun;Park, Seung-Mi
    • Journal of Korean Academy of Nursing / v.42 no.2 / pp.199-206 / 2012
  • Purpose: This study was done to identify the impact of physical activity on healthcare utilization among Korean adults. Methods: Drawing from the 2008 Korean National Health and Nutrition Examination Survey (NHANES IV-2), data from 6,521 adults who completed the Health Interview and Health Behavior Surveys were analyzed. Association between physical activity and healthcare utilization was tested using the $\chi^2$-test. Multiple logistic regression analysis was used to calculate the odds ratios of using outpatient and inpatient healthcare for different levels of physical activity after adjusting for predisposing, enabling, and need factors. A generalized linear model applying a negative binomial distribution was used to determine how the level of physical activity was related to use of outpatient and inpatient healthcare. Results: Physically active participants were 16% less likely to use outpatient healthcare (OR, 0.84; 95% CI, 0.74-0.97) and 23% less likely to use inpatient healthcare (OR, 0.77; 95% CI, 0.63-0.93) than physically inactive participants. Levels of outpatient and inpatient healthcare use decreased as levels of physical activity increased, after adjusting for relevant factors. Conclusion: An independent association between being physically active and lower healthcare utilization was ascertained among Korean adults, indicating a need to develop nursing intervention programs that encourage regular physical activity.

Impacts of Pre-signals on Traffic Crashes at 4-leg Signalized Intersections (전방신호기가 교통사고에 미치는 영향 연구)

  • Kim, Byeongeun;Lee, Youngihn
    • International Journal of Highway Engineering / v.15 no.4 / pp.135-146 / 2013
  • PURPOSES: This study aimed to analyze the impact of operating pre-signals at 4-leg signalized intersections and to present the primary road environment factors that need to be considered when installing pre-signals. METHODS: The shift-of-proportions safety effectiveness evaluation method, which assesses shifts in the proportions of target collision types to determine safety effectiveness, was applied to analyze traffic crashes by type. In addition, the Empirical Bayes before/after safety effectiveness evaluation method was adopted to analyze the impact of pre-signal installation. Negative binomial regression was conducted to determine the safety performance function (SPF). RESULTS: Pre-signals are effective in reducing the number of head-on, right-angle, and sideswipe collisions, as well as the total number of personal injury crashes and severe crashes. Also, each factor used as an independent variable in the SPF model is deemed to have a strong correlation with the total number of personal injury crashes and severe crashes, and to affect traffic crashes as a whole. CONCLUSIONS: This study suggests that the following should be considered when installing pre-signals at intersections: 1) U-turns allowed in the front and rear; 2) a high number of roads connecting to the intersection; 3) heavy right-turn traffic flows; 4) crosswalks installed in the front and rear; 5) insufficient left-turn lanes relative to left-turn traffic flows, or no left-turn-only lane.
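
The abstract above does not give its formulas, so the following is only a minimal sketch of how an Empirical Bayes before/after estimate is commonly combined with a negative binomial SPF. The weighting form `1 / (1 + k * prediction)`, the function names, and all numbers are assumptions chosen for illustration, and the variance correction used in full evaluations is omitted.

```python
def eb_expected_before(observed_before, spf_before, k):
    """Blend the SPF prediction with the observed count (Empirical Bayes).

    k is the negative binomial overdispersion parameter of the SPF.
    The weight form below is a commonly used one, assumed here.
    """
    weight = 1.0 / (1.0 + k * spf_before)
    return weight * spf_before + (1.0 - weight) * observed_before


def eb_effectiveness(observed_before, observed_after, spf_before, spf_after, k):
    """Ratio of observed 'after' crashes to the EB-expected 'after' crashes.

    Values below 1 suggest fewer crashes than expected without treatment.
    """
    expected_before = eb_expected_before(observed_before, spf_before, k)
    # Scale the EB estimate to the after period using the SPF ratio.
    expected_after = expected_before * (spf_after / spf_before)
    return observed_after / expected_after


# Hypothetical intersection: all numbers are made up for illustration.
expected = eb_expected_before(observed_before=12, spf_before=8.0, k=0.25)
theta = eb_effectiveness(observed_before=12, observed_after=8,
                         spf_before=8.0, spf_after=8.0, k=0.25)
```

With a larger `k` (more overdispersion), the weight on the SPF prediction shrinks and the estimate leans more on the site's own crash history.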

Prediction of the Number of Food Poisoning Occurrences by Microbes (원인균별 식중독 발생 건수 예측)

  • Yeo, In-Kwon
    • The Korean Journal of Applied Statistics / v.26 no.6 / pp.923-932 / 2013
  • This paper proposes a method to predict the number of foodborne disease outbreaks by microbe. The weekly data on food poisoning occurrences by microbe in Korea contain many zero-valued observations and exhibit dependence between outbreaks. To model both phenomena, the total number of food poisonings is predicted by an autoregressive model, and the probabilities of food poisoning occurrences by microbe (given the total number of food poisonings) are estimated by the baseline category logit model. The predicted number of foodborne disease outbreaks caused by a microbe is obtained by multiplying the predicted total number of outbreaks by the estimated probability of food poisoning by the corresponding microbe. The mean squared error and the mean absolute error are evaluated to compare the performance of the proposed method with that of the zero-inflated model.
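
The two-stage prediction described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the paper's implementation: the autoregressive stage is replaced by a fixed hypothetical total, and the function names, category labels, and linear predictors are invented for the example.

```python
import math


def baseline_category_probs(linear_predictors):
    """Baseline category logit: the baseline's linear predictor is fixed at 0."""
    exps = {name: math.exp(eta) for name, eta in linear_predictors.items()}
    denom = 1.0 + sum(exps.values())
    probs = {name: e / denom for name, e in exps.items()}
    probs["baseline"] = 1.0 / denom
    return probs


def split_total_by_category(predicted_total, category_probs):
    """Multiply the predicted total by each category's probability."""
    return {name: predicted_total * p for name, p in category_probs.items()}


# Hypothetical inputs: a predicted weekly total and two non-baseline microbes.
predicted_total = 6.0
probs = baseline_category_probs({"microbe_A": 0.0, "microbe_B": 0.0})
per_microbe = split_total_by_category(predicted_total, probs)
```

Because the category probabilities sum to one, the per-microbe predictions always add back up to the predicted total.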

Fitting Distribution of Accident Frequency of Freeway Horizontal Curve Sections & Development of Negative Binomial Regression Models (고속도로 평면선형상 사고빈도분포 추정을 통한 음이항회귀모형 개발 (기하구조요인을 중심으로))

  • 강민욱;도철웅;손봉수
    • Journal of Korean Society of Transportation / v.20 no.7 / pp.197-204 / 2002
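  • To predict and prevent traffic accidents, it is reasonable to identify how accidents relate to the road geometric elements that can actually be controlled during the road design process. That is, before a road is built, the designer should establish the relationship between geometric elements and accidents accurately from field data and reflect it in the design. To this end, identifying the frequency distribution of traffic accidents is the most basic task and must precede the development of an accident prediction model. Traffic accident counts are generally known to exhibit overdispersion, with variance greater than the mean, and therefore to follow a negative binomial distribution. Accordingly, before developing an accident model, this paper used goodness-of-fit tests to examine the distribution of traffic accidents on horizontal curve sections of the Honam Expressway, where road design elements and other potential accident-related factors at accident sites are relatively well documented. Five years (1996-2000) of Honam Expressway accident data from the Korea Highway Corporation were organized for the analysis, and the section-division method for horizontal alignment proposed by Kang and Son (2002) was used to analyze accidents on reverse-curve and single-curve sections. As expected, the goodness-of-fit analysis indicated that the negative binomial distribution was the probability distribution best suited to describing accident counts, and negative binomial regression models were then developed using maximum likelihood estimation. The negative binomial regression models based on the section-division method had a higher coefficient of determination than existing probabilistic regression models, and the geometric elements included in the models were the vehicle exposure factor, curve radius, and change in superelevation per unit distance.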
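
The overdispersion property mentioned in this abstract (variance greater than the mean) can be checked quickly on any set of counts. The sketch below is a simplified stand-in for the paper's goodness-of-fit tests and maximum likelihood estimation: it uses a method-of-moments estimate under the assumed variance form `mean + alpha * mean**2`, and the section counts are hypothetical.

```python
from statistics import mean, variance


def dispersion_summary(counts):
    """Compare sample mean and variance of count data.

    alpha_mom is a method-of-moments estimate of the negative binomial
    dispersion, assuming Var(Y) = mean + alpha * mean**2.
    """
    m = mean(counts)
    v = variance(counts)  # sample variance (n - 1 denominator)
    alpha = (v - m) / m ** 2 if m > 0 else float("nan")
    return {"mean": m, "variance": v, "alpha_mom": alpha, "overdispersed": v > m}


# Hypothetical accident counts per road section (not data from the paper).
section_counts = [0, 0, 1, 0, 3, 0, 7, 1, 0, 8]
summary = dispersion_summary(section_counts)
```

A Poisson model would imply a variance close to the mean; a clearly positive `alpha_mom` is one informal sign that a negative binomial model may fit better.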

Human Mastadenovirus Infections and Meteorological Factors in Cheonan, Korea

  • Oh, Eun Ju;Park, Joowon;Kim, Jae Kyung
    • Microbiology and Biotechnology Letters / v.49 no.2 / pp.249-254 / 2021
  • The study of the impact of weather on viral respiratory infections enables the assignment of causality to disease outbreaks caused by climatic factors. A better understanding of the seasonal distribution of viruses may facilitate the development of potential treatment approaches and effective preventive strategies for respiratory viral infections. We analyzed the incidence of human mastadenovirus infection using real-time reverse transcription polymerase chain reaction in 9,010 test samples obtained from Cheonan, South Korea, and simultaneously collected the weather data from January 1, 2012, to December 31, 2018. We used the data collected on the infection frequency to detect seasonal patterns of human mastadenovirus prevalence, which were directly compared with local weather data obtained over the same period. Descriptive statistical analysis, frequency analysis, t-test, and binomial logistic regression analysis were performed to examine the relationship between weather, particulate matter, and human mastadenovirus infections. Patients under 10 years of age showed the highest mastadenovirus infection rates (89.78%) at an average monthly temperature of 18.2℃. Moreover, we observed a negative correlation between human mastadenovirus infection and temperature, wind chill, and air pressure. The obtained results indicate that climatic factors affect the rate of human mastadenovirus infection. Therefore, it may be possible to predict the instance when preventive strategies would yield the most effective results.

The Impact of Government Assistance to State-owned Enterprises on Foreign Start-ups: Evidence from Yangtze River Delta

  • Risha, Omar Abu;Wang, Qingshi;Dou, Shanshan;Alhussam, Mohammed Ismail;Shi, Junguo
    • East Asian Economic Review / v.26 no.3 / pp.205-225 / 2022
  • Different types of corporate ownership may affect the environment among firms and could influence the decisions of new entities in the region. This study determines the role of state-owned enterprises (SOEs) in hindering new foreign manufacturing firms in the Yangtze River delta (YRD). The negative binomial regression is used for city-sector level data and the following points summarize the results: Firstly, the unique privileges that SOEs enjoy alongside governmental support create difficulties for foreign firms trying to establish themselves near existing SOEs. Secondly, although core cities are more attractive to foreign firms than peripheral cities, the role of core-periphery reveals that, in spite of all the regional advantages core cities could offer, whenever the share of SOEs is higher, the core-periphery system will have an adverse impact on new foreign firms. In other words, government preference for SOEs can suppress the attraction of foreign start-ups. However, after 2008, the governmental authorities finally succeeded in implementing their promising policy of fair treatment and competition in only the core cities.

The Effects of Sentiment and Readability on Useful Votes for Customer Reviews with Count Type Review Usefulness Index (온라인 리뷰의 감성과 독해 용이성이 리뷰 유용성에 미치는 영향: 가산형 리뷰 유용성 정보 활용)

  • Cruz, Ruth Angelie;Lee, Hong Joo
    • Journal of Intelligence and Information Systems / v.22 no.1 / pp.43-61 / 2016
  • Customer reviews help potential customers make purchasing decisions. However, the prevalence of reviews on websites pushes the customer to sift through them and changes the focus from a mere search to identifying which of the available reviews are valuable and useful for the purchasing decision at hand. To identify useful reviews, websites have developed different mechanisms to give customers options when evaluating existing reviews. Websites allow users to rate the usefulness of a customer review as helpful or not. Amazon.com uses a ratio-type helpfulness index, while Yelp.com uses a count-type usefulness index. This usefulness index provides helpful reviews to future potential purchasers. This study investigated the effects of sentiment and readability on useful votes for customer reviews. Similar studies on the relationship between sentiment and readability have focused on the ratio-type usefulness index utilized by websites such as Amazon.com. In this study, Yelp.com's count-type usefulness index for restaurant reviews was used to investigate the relationship between sentiment/readability and usefulness votes. Yelp.com's online customer reviews for stores in the beverage and food categories were used for the analysis. In total, 170,294 reviews containing information on a store's reputation and popularity were used. The control variables were the review length, store reputation, and popularity; the independent variables were the sentiment and readability, while the dependent variable was the number of helpful votes. The review rating is the moderating variable for the review sentiment and readability. The length is the number of characters in a review. The popularity is the number of reviews for a store, and the reputation is the general average rating of all reviews for a store. The readability of a review was calculated with the Coleman-Liau index. The sentiment is a positivity score for the review as calculated by SentiWordNet.
The review rating is a preference score selected from 1 to 5 (stars) by the review author. The dependent variable (i.e., usefulness votes) used in this study is a count variable. Therefore, the Poisson regression model, which is commonly used to account for the discrete and nonnegative nature of count data, was applied in the analyses. The increase in helpful votes was assumed to follow a Poisson distribution. Because the Poisson model assumes an equal mean and variance and the data were over-dispersed, a negative binomial distribution model that allows for over-dispersion of the count variable was used for the estimation. Zero-inflated negative binomial regression was used to model count variables with excessive zeros and over-dispersed count outcome variables. With this model, the excess zeros were assumed to be generated through a separate process from the count values and therefore should be modeled as independently as possible. The results showed that positive sentiment had a negative effect on gaining useful votes for positive reviews but no significant effect on negative reviews. Poor readability had a negative effect on gaining useful votes and was not moderated by the review star ratings. These findings yield considerable managerial implications. The results are helpful for online websites when analyzing their review guidelines and identifying useful reviews for their business. Based on this study, positive reviews are not necessarily helpful; therefore, restaurants should consider which type of positive review is helpful for their business. Second, this study is beneficial for businesses and website designers in creating review mechanisms to know which type of reviews to highlight on their websites and which type of reviews can be beneficial to the business. Moreover, this study highlights the review systems employed by websites to allow their customers to post rating reviews.
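
The zero-inflated negative binomial idea described in this abstract, where excess zeros come from a separate process, can be written out as a probability mass function. The sketch below assumes the mean/dispersion parameterization with variance `mu + alpha * mu**2`; the function names and parameter values are illustrative and do not come from the study.

```python
import math


def nb_pmf(y, mu, alpha):
    """Negative binomial pmf with mean mu and dispersion alpha (alpha > 0)."""
    r = 1.0 / alpha
    p = r / (r + mu)
    log_pmf = (math.lgamma(y + r) - math.lgamma(r) - math.lgamma(y + 1)
               + r * math.log(p) + y * math.log1p(-p))
    return math.exp(log_pmf)


def zinb_pmf(y, mu, alpha, pi_zero):
    """Zero-inflated NB: with probability pi_zero the count is a structural zero."""
    base = (1.0 - pi_zero) * nb_pmf(y, mu, alpha)
    return pi_zero + base if y == 0 else base


# Hypothetical parameters for a count of useful votes.
mu, alpha, pi_zero = 2.0, 1.5, 0.3
p_zero_nb = nb_pmf(0, mu, alpha)
p_zero_zinb = zinb_pmf(0, mu, alpha, pi_zero)
```

The zero-inflated model puts more mass on zero than the plain negative binomial with the same `mu` and `alpha`, and its overall mean shrinks to `(1 - pi_zero) * mu`.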

Characteristics of Geometric Conditions Affecting Freeway Traffic Safety at Nighttime, Sunrise, and Sunset (야간 및 일출몰 시간대 교통안전에 영향을 미치는 고속도로 기하구조 특성분석)

  • Hong, Sung-Min;Kim, Joon-Ki;Oh, Cheol
    • Journal of Korean Society of Transportation / v.30 no.4 / pp.95-106 / 2012
  • Drivers' ability to identify changes in freeway alignment and environment is one of the important factors associated with traffic safety on freeways. In particular, drivers' visibility and recognition capability depend strongly on the altitude of the sun at sunset, sunrise, and nighttime. The purpose of this study is to identify the characteristics of geometric conditions affecting crash occurrences at sunset, sunrise, and nighttime. Poisson and negative binomial regressions were adopted to predict freeway crash frequency. Freeway crash data from 2007 to 2010 were used to develop the crash frequency models. A set of variables representing the characteristics of geometric conditions was identified as significantly affecting crash occurrences. The results of this study would be useful in deriving effective countermeasures for preventing traffic crashes that mainly occur at sunset, sunrise, and nighttime on freeways.