• Title/Summary/Keyword: Count model

Search Result 514, Processing Time 0.023 seconds

Prediction Model for Popularity of Online Articles based on Analysis of Hit Count (온라인 게시글의 조회수 분석을 통한 인기도 예측)

  • Kim, Su-Do;Cho, Hwan-Gue
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.4
    • /
    • pp.40-51
    • /
    • 2012
  • Online discussion bulletin in Korea is not only a specific place where user exchange opinions but also a public sphere through which users discuss and form public opinion. Sometimes, there is a heated debate on a topic and any article becomes a political or sociological issue. In this paper, we propose how to analyze the popularity of articles by collecting the information of articles obtained from two well-known discussion forums such as AGORA and SEOPRISE. And we propose a prediction model for the article popularity by applying the characteristics of subject articles. Our experiment shown that the popularity of 87.52% articles have been saturated within a day after the submission in AGORA, but the popularity of 39% articles is growing after 4 days passed in SEOPRISE. And we observed that there is a low correlation between the period of popularity and the hit count. The steady increase of the hit count of an article does not necessarily imply the final hit count of the article at the saturation point is so high. In this paper, we newly propose a new prediction model called 'baseline'. We evaluated the predictability for popular articles using three models (SVM, similar matching and baseline). Through the results of performance evaluation, we observed that SVM model is the best in F-measure and precision, but baseline is the best in running time.

Novel nomogram-based integrated gonadotropin therapy individualization in in vitro fertilization/intracytoplasmic sperm injection: A modeling approach

  • Ebid, Abdel Hameed IM;Motaleb, Sara M Abdel;Mostafa, Mahmoud I;Soliman, Mahmoud MA
    • Clinical and Experimental Reproductive Medicine
    • /
    • v.48 no.2
    • /
    • pp.163-173
    • /
    • 2021
  • Objective: This study aimed to characterize a validated model for predicting oocyte retrieval in controlled ovarian stimulation (COS) and to construct model-based nomograms for assistance in clinical decision-making regarding the gonadotropin protocol and dose. Methods: This observational, retrospective, cohort study included 636 women with primary unexplained infertility and a normal menstrual cycle who were attempting assisted reproductive therapy for the first time. The enrolled women were split into an index group (n=497) for model building and a validation group (n=139). The primary outcome was absolute oocyte count. The dose-response relationship was tested using modified Poisson, negative binomial, hybrid Poisson-Emax, and linear models. The validation group was similarly analyzed, and its results were compared to that of the index group. Results: The Poisson model with the log-link function demonstrated superior predictive performance and precision (Akaike information criterion, 2,704; λ=8.27; relative standard error (λ)=2.02%). The covariate analysis included women's age (p<0.001), antral follicle count (p<0.001), basal follicle-stimulating hormone level (p<0.001), gonadotropin dose (p=0.042), and protocol type (p=0.002 and p<0.001 for short and antagonist protocols, respectively). The estimates from 500 bootstrap samples were close to those of the original model. The validation group showed model assessment metrics comparable to the index model. Based on the fitted model, a static nomogram was built to improve visualization. In addition, a dynamic electronic tool was created for convenience of use. Conclusion: Based on our validated model, nomograms were constructed to help clinicians individualize the stimulation protocol and gonadotropin doses in COS cycles.

Pedestrian Accident Severity Analysis and Modeling by Arterial Road Function (간선도로 기능별 보행사고 심각도 분석과 모형 개발)

  • Beck, Tea Hun;Park, Min kyu;Park, Byung Ho
    • International Journal of Highway Engineering
    • /
    • v.16 no.4
    • /
    • pp.111-118
    • /
    • 2014
  • PURPOSES: The purposes are to analyze the pedestrian accident severity and to develop the accident models by arterial road function. METHODS: To analyze the accident, count data and ordered logit models are utilized in this study. In pursuing the above, this study uses pedestrian accident data from 2007 to 2011 in Cheongju. RESULTS : The main results are as follows. First, daytime, Tue.Wed.Thu., over-speeding, male pedestrian over 65 old are selected as the independent variables to increase pedestrian accident severity. Second, as the accident models of main and minor arterial roads, the negative binomial models are developed, which are analyzed to be statistically significant. Third, such the main variables related to pedestrian accidents as traffic and pedestrian volume, road width, number of exit/entry are adopted in the models. Finally, Such the policy guidelines as the installation of pedestrian fence, speed hump and crosswalks with pedestrian refuge area, designated pedestrian zone, and others are suggested for accident reduction. CONCLUSIONS: This study analyzed the pedestrian accident severity, and developed the negative binomial accident models. The results of this study expected to give some implications to the pedestrian safety improvement in Cheongju.

Bayesian Analysis for the Zero-inflated Regression Models (영과잉 회귀모형에 대한 베이지안 분석)

  • Jang, Hak-Jin;Kang, Yun-Hee;Lee, S.;Kim, Seong-W.
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.4
    • /
    • pp.603-613
    • /
    • 2008
  • We often encounter the situation that discrete count data have a large portion of zeros. In this case, it is not appropriate to analyze the data based on standard regression models such as the poisson or negative binomial regression models. In this article, we consider Bayesian analysis for two commonly used models. They are zero-inflated poisson and negative binomial regression models. We use the Bayes factor as a model selection tool and computation is proceeded via Markov chain Monte Carlo methods. Crash count data are analyzed to support theoretical results.

A Development of Lagrangian Particle Dispersion Model (Focusing on Calculation Methods of the Concentration Profile) (라그란지안 입자확산모델개발(농도 계산방법의 검토))

  • 구윤서
    • Journal of Korean Society for Atmospheric Environment
    • /
    • v.15 no.6
    • /
    • pp.757-765
    • /
    • 1999
  • Lagrangian particle dispersion model(LPDM) is an effective tool to calculate the dispersion from a point source since it dose not induce numerical diffusion errors in solving the pollutant dispersion equation. Fictitious particles are released to the atmosphere from the emission source and they are then transported by the mean velocity and diffused by the turbulent eddy motion in the LPDM. The concentration distribution from the dispersed particles in the calculation domain are finally estimated by applying a particle count method or a Gaussian kernel method. The two methods for calculating concentration profiles were compared each other and tested against the analytic solution and the tracer experiment to find the strength and weakness of each method and to choose computationally time saving method for the LPDM. The calculated concentrations from the particle count method was heavily dependent on the number of the particles released at the emission source. It requires lots fo particle emission to reach the converged concentration field. And resulting concentrations were also dependent on the size of numerical grid. The concentration field by the Gaussian kernel method, however, converged with a low particle emission rate at the source and was in good agreement with the analytic solution and the tracer experiment. The results showed that Gaussian kernel method was more effective method to calculate the concentrations in the LPDM.

  • PDF

A hybrid-vehicular communication systems using a gaussian model for sending a safe message (안전 메시지 전달을 위해 가우시안 모델을 적용한 하이브리드 차량 통신 시스템)

  • Oh, Sang-Yeob
    • Journal of Digital Convergence
    • /
    • v.10 no.7
    • /
    • pp.161-166
    • /
    • 2012
  • When a car accident happened on a highway, the accident vehicle should broadcast a safe message to its neighbors in order to prevent a chain-reaction collision. Also, there is a problem that the estimation accuracy is low because of the memory limit from increasing the sampling count. In this paper, we proposes a HVC systems using a back-off algorithm applied to a gaussian model. And we proposes a MAC protocol preventing the communication delay by separating the neighbor count collection channel, data channel, and RSU communication channel. As a result, we show the frame reception success rate of our protocol improved about 10% than the previous protocol.

Update Cycle Detection Method of Control Limits using Control Chart Performance Evaluation Model (관리도 성능평가모형을 통한 관리한계선 갱신주기 탐지기법)

  • Kim, Jongwoo;Park, Cheong-Sool;Kim, Jun Seok;Kim, Sung-Shick;Baek, Jun-Geol
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.40 no.1
    • /
    • pp.43-51
    • /
    • 2014
  • Statistical process control (SPC) is an important technique for monitoring and managing the manufacturing process. In spite of its easiness and effectiveness, some problematic sides of application exist such that the SPC techniques are hardly reflect the changes of the process conditions. Especially, update of control limits at the right time plays an important role in acquiring a reasonable performance of control charts. Therefore, we propose the control chart performance evaluation index (CPEI) based on count data model to monitor and manage the performance of control charts. The CPEI could indicate the degree of control chart performance and be helpful to detect the proper update cycle of control limits in real time. Experiments using real manufacturing data show that the proper update intervals are made by proposed method.

Radiation detector deadtime and pile up: A review of the status of science

  • Usman, Shoaib;Patil, Amol
    • Nuclear Engineering and Technology
    • /
    • v.50 no.7
    • /
    • pp.1006-1016
    • /
    • 2018
  • Since the early forties, researchers from around the world have been studying the phenomenon of deadtime in radiation detectors. Many have attempted to develop models to represent this phenomenon. Two highly idealized models; paralyzable and non-paralyzable are commonly used by most individuals involved in radiation measurements. Most put little thought about the operating conditions and applicability of these ideal models for their experimental conditions. So far, there is no general agreement on the applicability of any given model for a specific detector under specific operating conditions, let alone a universal model for all detectors and all operating conditions. Further the related problem of pile-up is often confused with the deadtime phenomenon. Much work, is needed to devise a generalized and practical solution to these related problems. Many methods have been developed to measure and compensate for the detector deadtime count loss, and many researchers have addressed deadtime and pulse pile-up. The goal of this article is to summarize the state of science of deadtime; measurement and compensation techniques as proposed by some of the most significant work on these topics and to review the deadtime correction models applicable to present day radiation detection systems.

The study on the determinants of the number of job changes (중소기업 청년인턴 이직횟수 결정요인 분석)

  • Park, Sungik;Ryu, Jangsoo;Kim, Jonghan;Cho, Jangsik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.387-397
    • /
    • 2015
  • In this paper, the determinants of the number of job changes in the SMEs (small and medium enterprises) youth-intern project is analysed, utilizing SMEs youth-intern DB and employment insurance DB. Since the number of job changes are count data which take integer values other than negative values, general linear regression analysis becomes inappropriate. Therefore, four models such as Poisson regression model, zero inflated Poisson regression model, negative binomial regression model and zero inflated negative binomial regression model are tried to fit count data. A zero inflated negative binomial regression model is selected to be the best model. Major results are the followings. First, the number of job changes is shown to be significantly smaller in the treatment group than in the control group. Second, the number of job changes turns out to be significantly smaller in the young-age group than in the old-age group. Third, it is also shown that the number of job changes of man is significantly greater than that of woman. Lastly, the number of job changes in the bigger firm is shown to be significantly less than that of the smaller firm.

A simple zero inflated bivariate negative binomial regression model with different dispersion parameters

  • Kim, Dongseok
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.4
    • /
    • pp.895-900
    • /
    • 2013
  • In this research, we propose a simple bivariate zero inflated negative binomial regression model with different dispersion for bivariate count data with excess zeros. An application to the demand for health services shows that the proposed model is better than existing models in terms of log-likelihood and AIC.