• 제목/요약/키워드: Logistic models

검색결과 829건 처리시간 0.025초

서울 경마 경기 우승마 예측 모형 연구 (Analysis of Horse Races: Prediction of Winning Horses in Horse Races Using Statistical Models)

  • 최혜민;황나영;황찬경;송종우
    • 응용통계연구
    • /
    • 제28권6호
    • /
    • pp.1133-1146
    • /
    • 2015
  • 경마 산업은 국내 합법 사행산업의 대부분을 차지하고 있다. 그러나 사행성 도박이라는 인식 하에 여타 스포츠 산업에 비해 활발한 통계적 분석이 이루어지지 않고 있다. 본 연구의 목적은 다양한 데이터마이닝 기법을 이용하여 우승마를 예측하는 모형 개발에 있다. 모형 적합에 사용한 데이터는 한국 마사회에서 제공하는 자료를 바탕으로 하였으며, 경마 성적표, 경주마 정보, 기수 정보, 조교사 정보 등을 사용하였다. 예측 모형은 크게 두 모형으로 나누어 순위를 기반으로 한 모형과 기록을 기반으로 한 모형으로 적합하였고, 분석 방법으로는 선형회귀분석, 랜덤 포레스트, 로지스틱 회귀 분석을 사용하였다. 그 결과 말 기본 정보와 과거 우승 경력, 기수의 과거 우승 경력 등이 순위 예측에 큰 영향을 미치는 것을 알 수 있었다. 모형 적합에 사용되지 않은 최근 1개월 간 데이터를 이용하여 단승식, 복승식, 삼복승식으로 배팅한 결과 모형 간 큰 차이가 없었고, 모두 양의 수익을 얻을 수 있었다.

IP기반 유선인터넷전화 가입요인 도출을 위한 분석적 연구: 통신상품결합서비스의 영향

  • 하성호;양정원
    • 한국데이타베이스학회:학술대회논문집
    • /
    • 한국데이타베이스학회 2010년도 춘계국제학술대회
    • /
    • pp.187-199
    • /
    • 2010
  • Recently, Internet Telephony has become increasingly popular in telecommunication industry. However, previous research on Internet Telephony has focused on analyzing specific Internet Telephony solutions, identifying the Internet Telephony movement itself. The research on prediction models about Internet Telephony adoption has been minimal. The main propose of this study is to develop models for predicting transition intention from using traditional telephones to using Internet Telephony. To do so, this study uses data mining methods to analyze demands in the IT communications market and to provide management strategies for Internet telephony providers. Especially this study uses discriminant analysis, logistic regression, classification tree, and neural nets to develop the prediction models for the Internet Telephony adoption. The models are compared with each other and a superior model is chosen.

  • PDF

외국환 거래의 자금세탁 혐의도 점수모형 개발에 관한 연구 (Scoring models to detect foreign exchange money laundering)

  • 홍성익;문태희;손소영
    • 산업공학
    • /
    • 제18권3호
    • /
    • pp.268-276
    • /
    • 2005
  • In recent years, the money Laundering crimes are increasing by means of foreign exchange transactions. Our study proposes four scoring models to provide early warning of the laundering in foreign exchange transactions for both inward and outward remittances: logistic regression model, decision tree, neural network, and ensemble model which combines the three models. In terms of accuracy of test data, decision tree model is selected for the inward remittance and an ensemble model for the outward remittance. From our study results, the accumulated number of transaction turns out to be the most important predictor variable. The proposed scoring models deal with the transaction level and is expected to help the bank teller to detect the laundering related transactions in the early stage.

인터넷전화(VoIP)의 신규고객 유치를 지원하는 데이터마이닝 모델 (A Data-Mining Model to Support new Customer Acquisition for Internet Telephony(VoIP))

  • 하성호;양정원;송영미
    • Journal of Information Technology Applications and Management
    • /
    • 제17권2호
    • /
    • pp.133-154
    • /
    • 2010
  • Recently, Internet Telephony has become increasingly popular in telecommunication industry. However, previous research on Internet Telephony has focused on analyzing specific Internet Telephonysolutions, identifyingthe Internet Telephony movement itself. The research on prediction models about Internet Telephony adoption has been minimal. The main propose of this study is to develop models for predicting transition intention from using traditional telephones to using Internet Telephony. To do so, this study uses data mining methods to analyze demands in the IT communications market and to provide management strategies for Internet telephony providers. Especially this study uses discriminant analysis, logistic regression, classification tree, and neural nets to develop those prediction models toward Internet Telephony adoption. The models are compared with each other and a superior model is chosen.

  • PDF

Application and evaluation of machine-learning model for fire accelerant classification from GC-MS data of fire residue

  • Park, Chihyun;Park, Wooyong;Jeon, Sookyung;Lee, Sumin;Lee, Joon-Bae
    • 분석과학
    • /
    • 제34권5호
    • /
    • pp.231-239
    • /
    • 2021
  • Detection of fire accelerants from fire residues is critical to determine whether the case was arson or accidental fire. However, to develop a standardized model for determining the presence or absence of fire accelerants was not easy because of high temperature which cause disappearance or combustion of components of fire accelerants. In this study, logistic regression, random forest, and support vector machine models were trained and evaluated from a total of 728 GC-MS analysis data obtained from actual fire residues. Mean classification accuracies of the three models were 63 %, 81 %, and 84 %, respectively, and in particular, mean AU-PR values of the three models were evaluated as 0.68, 0.86, and 0.86, respectively, showing fine performances of random forest and support vector machine models.

방사선치료 시 다양한 기계학습을 이용한 선량품질관리 결과의 예측 (Prediction of Delivery Quality Assurance Via Machine Learning in Helical Tomotherapy)

  • 장경환
    • 대한방사선기술학회지:방사선기술과학
    • /
    • 제47권4호
    • /
    • pp.263-270
    • /
    • 2024
  • The objective of this study was to evaluate the accuracy and impact of leaf open time (LOT) and pitch using various machine learning models on EBT film-based delivery quality assurance (DQA) performed on 211 patients of helical tomotherapy (HT). We randomly selected passed (n=191) and failed (n=20) DQA measurements to evaluate the accuracy of the k-nearest neighbor (KNN), support vector machine (SVM), naive Bayes (NB) and logistic regression (LR) models using scale-dependent metrics such as the coefficient of determination (R2), mean squared error (MSE), and root MSE (RMSE). We evaluated the performance of the four prediction models in terms of the accuracy, precision, sensitivity, and F1-score using a confusion matrix, finding the NB and LR models to achieve optimal results. The results of this study are expected to reduce the workload of medical physicists and dosimetrists by predicting DQA results according to LOT and pitch in advance.

The Effect of Economic Growth and Urbanization on Poverty Reduction in Vietnam

  • NGUYEN, Huyen Thi Thanh;NGUYEN, Chau Van;NGUYEN, Cong Van
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제7권7호
    • /
    • pp.229-239
    • /
    • 2020
  • This article aims to measure the impact of economic growth and urbanization on poverty reduction in Vietnam, and verify whether economic growth and urbanization will help reduce poverty rates. Data for this study are tabular data related to growth, urbanization and poverty at the provincial level for the period of nine years, from 2006 to 2014 provided by the Vietnam General Statistics Office and the Vietnam General Department of Customs. The level of economic growth and urbanization mentioned in the study is reflected in such indicators as GDP value, exports value, imports value, urbanization rate and employment rate. The authors used logistic regression models with fixed-effects and logistic regression models with random effects. With 5% confidence level tested by the Chi-Square test of Hausman trial with the fixed-effect model, research results show that: (1) factors with significant negative impact on the poverty rate include imports value, urbanization rate and, employment rate; (2) factors that do not affect the poverty rate include exports value and GDP value. Based on the research results, this study proposes a number of policy recommendations to help promote economic growth, to sustain the urbanization process, and to contribute directly and positively to poverty reduction in Vietnam.

노인단독가구의 실제생계비와 주관적 생계비간의 차이분석 (A Study on the Gap between Actual Cost of Living and Subjective Cost of Living of the Elderly Households)

  • 성영애
    • 가정과삶의질연구
    • /
    • 제28권6호
    • /
    • pp.195-205
    • /
    • 2010
  • households using the 2007 Korean Retirement and Income Study. Elderly households were classified into three groups based on the comparisons among the costs of living and then the factors influencing the probability of belonging to each group were investigated using multiple logistic regression models. The major findings are as follows. Firstly, the subjective adequate cost of living was the highest and the subjective minimum cost of living was the lowest. The actual cost of living was in between. Secondly, 42.6% of elderly households belonged to Group1(whose actual cost of living was less than the subjective minimum cost of living), 30.0% was classified into Group2(whose actual cost of living was greater than the subjective minimum cost of living but less than the subjective adequate cost of living) and the actual cost of living of the remaining 27.4% was greater than the subjective adequate cost of living(Group3). Thirdly, income was the strongest factor influencing the probability of belonging to each group, but the influencing factors were different for the logistic models for Group1 and Group3 based on Group2.

로지스틱 회귀모형에서 최우추정량의 정확도 산정 (Assessing the accuracy of the maximum likelihood estimator in logistic regression models)

  • 이기원;손건태;정윤식
    • 응용통계연구
    • /
    • 제6권2호
    • /
    • pp.393-399
    • /
    • 1993
  • 반응이 두 가지로 나타나는 자료에서 설명변수와 반응변수와의 관계를 연구할 때 많이 사용되는 로지스틱 회귀모형에 대하여 그 모수들을 최우추정법으로 구할 때 추정량의 표준오차는 보통 로그우도함수의 2차도함수에 바탕을 두어 계산하게 된다. 한편 피셔정보량이 로그우도함수의 1차도함수를 제곱한 통계량의 기대값으로도 계산된다는 점에 착안하여 얻어지는 피셔정보량의 추정량도 이와 거의 비슷한 대표본 성질을 갖는 것으로 알려져 있다. 이러한 피셔정보량의 추정량들은 최우추정량을 구할 때의 반복 알고리즘과 깊은 관련을 갖고 있다. 어느 방법이 더 효과적으로 최우추정량을 계산하는 지 평균반복횟수를 비교하고 대표본분산의 추정량으로서 각 방법에서 계산되는 분산의 추정량들을 비교하였다.

  • PDF

Exploring the Health Production Model in Vietnam

  • NGUYEN, Tuyen Thi Mong;NGUYEN, Quyen Le Hoang Thuy To
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제8권12호
    • /
    • pp.391-397
    • /
    • 2021
  • One of the sustainable development goals is to promote good health and well-being for all people. Child health is a top priority since their health issues can have a detrimental impact on human capital development, which is a critical input for the growth model. This paper applies the health production model to explore the determinants that influence the health of children under the age of five. The results of a survey of 203 households in Ho Chi Minh City, Vietnam, were examined. Child health is measured using anthropometric indicators such as weight-for-age, height-for-age, and weight-for-height (ZWFH). Three separate multinomial logistic models are regressed to examine the drivers of child health as proxied by z-score weight for age, z-score height for age, and z-score weight for height. The significance of input variables relating to a child's attributes, household, and environment was validated by the findings. The inclusion of overweight besides under-nourished indexes is novel because it reflects the current trend of child over-nutrition. The findings of the study highlight the importance of a wide range of initiatives to enhance child health. Moreover, the genetic effect is found to be crowded out by environmental and household factors. The finding verifies that despite their parents' moderate height, the future generation of Vietnamese can achieve the desired height.