• Title/Summary/Keyword: 로지스틱 모형

Search Result 534, Processing Time 0.026 seconds

Study on Detection Technique for Cochlodinium polykrikoides Red tide using Logistic Regression Model under Imbalanced Data (불균형 데이터 환경에서 로지스틱 회귀모형을 이용한 Cochlodinium polykrikoides 적조 탐지 기법 연구)

  • Bak, Su-Ho;Kim, Heung-Min;Kim, Bum-Kyu;Hwang, Do-Hyun;Enkhjargal, Unuzaya;Yoon, Hong-Joo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.13 no.6
    • /
    • pp.1353-1364
    • /
    • 2018
  • This study proposed a method to detect Cochlodinium polykrikoides red tide pixels in satellite images using a logistic regression model of machine learning technique under Imbalanced data. The spectral profiles extracted from red tide, clear water, and turbid water were used as training dataset. 70% of the entire data set was extracted and used for as model training, and the classification accuracy of the model was evaluated using the remaining 30%. At this time, the white noise was added to the spectral profile of the red tide, which has a relatively small number of data compared to the clear water and the turbid water, and over-sampling was performed to solve the unbalanced data problem. As a result of the accuracy evaluation, the proposed algorithm showed about 94% classification accuracy.

Development of fertilizer-distributed algorithms based on crop growth models (작물생육모형 기반 비료시비량 분배 알고리즘 개발)

  • Doyun Kim;Yejin Lee;Tae-Young Heo
    • The Korean Journal of Applied Statistics
    • /
    • v.36 no.6
    • /
    • pp.619-629
    • /
    • 2023
  • Fertilizers are crucial for increasing crop yield, but using too much of them without taking into account the nutrients that the crops need can increase costs for farm management and have a negative impact on the environment. Through smart agriculture, fertilizers can be applied as needed at the right time to reflect the growth characteristics of crops, reducing the burden of fertilizer losses and providing economical nutrient management. In this study, we use the total dry weight of field-cultivated red pepper and green onion grown in various growing environments to fit a nonlinear model-based crop growth model using different growth curves (logistic, Gompertz, Richards, and double logistic curve), and we propose a fertilizer distributed algorithm based on crop growth rate.

Projection of the student number by logistic function and proportional moving average model (로지스틱함수모형과 비례이동평균모형에 의한 학생 수 추계와 분석)

  • Song, Pil-Jun;Kim, Jong-Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.3
    • /
    • pp.503-511
    • /
    • 2010
  • The goal of this paper is to suggest an algorithm to get the number of student on the elementary, meddle and high-school for the forecasting of the numbers of student by the moving average method using a proportional expression. Comparing with the results of Korean education statistical system 2005, 2006, and 2007, the results of this paper are better than those of the Korean education statistical system.

Development of model for prediction of land sliding at steep slopes (급경사지 붕괴 예측을 위한 모형 개발)

  • Park, Ki-Byung;Joo, Yong-Sung;Park, Dug-Keun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.4
    • /
    • pp.691-699
    • /
    • 2011
  • Land sliding is one of well-known nature disaster. As a part of effort to reduce damage from land sliding, many researchers worked on increasing prediction ability. However, because previous studies are conducted mostly by non-statisticians, previously proposed models were hardly statistically justifiable. In this paper, we predicted the probability of land sliding using the logistic regression model. Since most explanatory variables under consideration were correlated, we proposed the final model after backward elimination process.

The Rating of Korean Basketball League Teams in 2006-2007 Season: Taking Account of Home-Court Advantage (홈팀의 이점을 고려한 KBL 2006-2007 시즌 경기력 평가)

  • Lee, Seung-Chun;Byun, Jong-Seok
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.5
    • /
    • pp.687-695
    • /
    • 2008
  • It is widely known that the home advantage plays an important factor for determining victory or defeat in sport leagues. Thus a ranking system of sport league should take account of the home advantage as a key factor. Various statistical models are studied to rate the Korean Basketball league teams in 2006-2007 season. Among them, the model equation provided by Harville and Smith (1994) is useful for constructing two ranking systems. Both systems give quite reasonable quantifications of the team's ability and the home advantage.

Inferential Problems in Bayesian Logistic Regression Models (베이지안 로지스틱 회귀모형에서의 추론에 대한 연구)

  • Hwang, Jin-Soo;Kang, Sung-Chan
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.6
    • /
    • pp.1149-1160
    • /
    • 2011
  • Model selection and hypothesis testing problems in Bayesian inference are still debated between scholars. Bayesian factors traditionally used as a criterion in Bayesian hypothesis testing and model selection, are easy to understand but sometimes hard to compute. In addition, there are other model selection criterions such as DIC(Deviance Information Criterion) by Spiegelhalter et al. (2002) and Bayesian P-values for testing. In this paper, we briefly introduce the Bayesian hypothesis testing and model selection procedure. In addition we have applied a Bayesian inference to Swiss banknote data by a fitting logistic regression model and computing several test statistics to see if they provide consistent results.

Various Graphical Methods for Assessing a Logistic Regression Model (로지스틱회귀모형의 평가를 위한 그래픽적 방법)

  • Kim, Kyung Jin;Kahng, Myung Wook
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1191-1208
    • /
    • 2015
  • Most statistical methods are dependent on the summary statistic. However, with graphical approaches, it is easier to identify the characteristics of the data and detect information that cannot be obtained by the summary statistic. We present various graphical methods to assess the adequacy of models in logistic regression that include checking log-density ratio, structural dimension, marginal model plot, chi-residual plot, and CERES plot. Through simulation data, we investigate and compare the results of graphical approaches under diverse conditions.

Analysis of Horse Races: Prediction of Winning Horses in Horse Races Using Statistical Models (서울 경마 경기 우승마 예측 모형 연구)

  • Choe, Hyemin;Hwang, Nayoung;Hwang, Chankyoung;Song, Jongwoo
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1133-1146
    • /
    • 2015
  • The Horse race industry has the largest proportion of the domestic legal gambling industry. However, there is limited statistical analysis on horse races versus other sports. We propose prediction models for winning horses in horse races using data mining techniques such as logistic regression, linear regression, and random forest. Horse races data are from the Korea Racing Authority and we use horse racing reports, information of racehorses, jockeys, and horse trainers. We consider two models based on ranks and time records. The analysis results show that prediction of ranks is affected by information on racehorses, number of wins of racehorses and jockeys. We place wagers for the last month of races based on our prediction models that produce serious profits.

Development for City Bus Dirver's Accident Occurrence Prediction Model Based on Digital Tachometer Records (디지털 운행기록에 근거한 시내버스 운전자의 사고발생 예측모형 개발)

  • Kim, Jung-yeul;Kum, Ki-jung
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.15 no.1
    • /
    • pp.1-15
    • /
    • 2016
  • This study aims to develop a model by which city bus drivers who are likely to cause an accident can be figured out based on the information about their actual driving records. For this purpose, from the information about the actual driving records of the drivers who have caused an accident and those who have not caused any, significance variables related to traffic accidents are drawn, and the accuracy between models is compared for the classification models developed, applying a discriminant analysis and logistic regression analysis. In addition, the developed models are applied to the data on other drivers' driving records to verify the accuracy of the models. As a result of developing a model for the classification of drivers who are likely to cause an accident, when deceleration ($X_{deceleration}$) and acceleration to the right ($Y_{right}$) are simultaneously in action, this variable was drawn as the optimal factor variable of the classification of drivers who had caused an accident, and the prediction model by discriminant analysis classified drivers who had caused an accident at a rate up to 62.8%, and the prediction model by logistic regression analysis could classify those who had caused an accident at a rate up to 76.7%. In addition, as a result of the verification of model predictive power of the models showed an accuracy rate of 84.1%.

Application of the Neural Network to Predict the Adolescents' Computer Entertainment Behavior (청소년의 컴퓨터 오락추구 행동을 예측하기 위한 신경망 활용)

  • Lee, Hyejoo;Jung, Euihyun
    • The Journal of Korean Association of Computer Education
    • /
    • v.16 no.2
    • /
    • pp.39-48
    • /
    • 2013
  • This study investigates the predictive model of the adolescents' computer entertainment behavior using neural network with the KYPS data (3449 in the junior high school; 1725 boys and 1724 girls). This study compares the results of neural network(model 1) to the logistic regression model and neural network(model 2) with the exact same variables used in logistic regression. The results reveal that the prediction of neural network model 1 is the highest among three models and with gender, computer use time, family income, the number of close friends, the number of misdeed friends, individual study time, self-control, private education time, leisure time, self-belief, stress, adaptation to school, and study related worries, the neural network model 1 predicts the computer entertainment behavior more efficiently. These results suggest that the neural network could be used for diagnosing and adjusting the adolescents' computer entertainment behavior.

  • PDF