• Title/Summary/Keyword: Probability Score

Search Result 295, Processing Time 0.021 seconds

Comparative Evaluation of Machine Learning Models for Predicting Soccer Injury Types

  • Davronbek Malikov;Jaeho Kim;Jung Kyu Park
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.27 no.2_1
    • /
    • pp.257-268
    • /
    • 2024
  • Soccer is type of sport that carries a high risk of injury. Injury is not only cause in the unlucky soccer carrier and also team performance as well as financial effects can be worse since soccer is a team-based game. The duration of recovery from a soccer injury typically relies on its type and severity. Therefore, we conduct this research in order to predict the probability of players injury type using machine learning technologies in this paper. Furthermore, we compare different machine learning models to find the best fit model. This paper utilizes various supervised classification machine learning models, including Decision Tree, Random Forest, K-Nearest Neighbors (KNN), and Naive Bayes. Moreover, based on our finding the KNN and Decision models achieved the highest accuracy rates at 70%, surpassing other models. The Random Forest model followed closely with an accuracy score of 62%. Among the evaluated models, the Naive Bayes model demonstrated the lowest accuracy at 56%. We gathered information about 54 professional soccer players who are playing in the top five European leagues based on their career history. We gathered information about 54 professional soccer players who are playing in the top five European leagues based on their career history.

[Reivew]Prediction of Cervical Cancer Risk from Taking Hormone Contraceptivese

  • Su jeong RU;Kyung-A KIM;Myung-Ae CHUNG;Min Soo KANG
    • Korean Journal of Artificial Intelligence
    • /
    • v.12 no.1
    • /
    • pp.25-29
    • /
    • 2024
  • In this study, research was conducted to predict the probability of cervical cancer occurrence associated with the use of hormonal contraceptives. Cervical cancer is influenced by various environmental factors; however, the human papillomavirus (HPV) is detected in 99% of cases, making it the primary attributed cause. Additionally, although cervical cancer ranks 10th in overall female cancer incidence, it is nearly 100% preventable among known cancers. Early-stage cervical cancer typically presents no symptoms but can be detected early through regular screening. Therefore, routine tests, including cytology, should be conducted annually, as early detection significantly improves the chances of successful treatment. Thus, we employed artificial intelligence technology to forecast the likelihood of developing cervical cancer. We utilized the logistic regression algorithm, a predictive model, through Microsoft Azure. The classification model yielded an accuracy of 80.8%, a precision of 80.2%, a recall rate of 99.0%, and an F1 score of 88.6%. These results indicate that the use of hormonal contraceptives is associated with an increased risk of cervical cancer. Further development of the artificial intelligence program, as studied here, holds promise for reducing mortality rates attributable to cervical cancer.

A Study on Nursing Needs of Patients in the Recovery Room (회복실 환자의 간호요구도에 관한 연구 - 일 종합병원을 중심으로 -)

  • Kim Eun-Kyoung;Chae Soon-Ok;Kwon Kun-Sook;Kim Yun-Jeung;Hong Mun-He;Kim Me-Hee;Kim Nam-Sun;Lee Kyu-Eun
    • Journal of Korean Academy of Fundamentals of Nursing
    • /
    • v.9 no.1
    • /
    • pp.86-100
    • /
    • 2002
  • Purpose: The purpose of the study was done to identify the nursing care needs of patients in the recovery room. Method: The subjects in this study were 127 patients in a recovery room between 6/9/2001 and 24/9/2001. The instrument used for this study was the descriptive questionnaire developed by Shin Hyun-Jin (1999). The data was analysed by frequency, percentage, mean, standard deviation, t-test, ANOVA, and factor analysis using the SPSS program. Result: 1) Kaiser - Meyer -O1kin sample appropriateness was 799 and Bartlett's test of sphericity significant probability was .000. 2) The mean score for nursing care need of patients in the recovery room was $4.17{\pm}.51$ of a total possible score of 5. The score of nursing need for different parameters was as follows : Educational need ($4.31{\pm}.49$), physical need ($4.27{\pm}.47$), emotional need ($4.11{\pm}.52$), environmental need ($3.99{\pm}.56$). 3) Differences in the needs for nursing care according to the demographics were significant for gender, marital status, operation experience, and departments consulted. General characteristic variables significantly related to nursing need were as follows: Physical need significantly related to the departments consulted (F=2.23, p=.036). Educational need significantly related to the marital status (F=2.55. p=.012), departments consulted (F=2.30, p= 031). Emotional need significantly related to the marital status (F=2.22, p=.028). Environmental need significantly related to the gender (t=-2.44, p= .016), marital status (F=2.01, p= .046). operation experience (t=-1.99. p= .048). Conclusion: Nursing care needs of patients in the recovery room are significantly related to educational need, physical need, emotional need and environmental need. Intervention plans and program need to be developed to improve strategies to meet nursing needs of patients in the recovery room.

  • PDF

The Significance Test on the AHP-based Alternative Evaluation: An Application of Non-Parametric Statistical Method (AHP를 이용한 대안 평가의 유의성 분석: 비모수적 통계 검정 적용)

  • Park, Joonsoo;Kim, Sung-Chul
    • The Journal of Society for e-Business Studies
    • /
    • v.22 no.1
    • /
    • pp.15-35
    • /
    • 2017
  • The method of weighted sum of evaluation using AHP is widely used in feasibility analysis and alternative selection. Final scores are given in forms of weighted sums and the alternative with largest score is selected. With two alternatives, as in feasibility analysis, the final score greater than 0.5 gives the selection but there remains a question that how large is large enough. KDI suggested a concept of 'grey area' where scores are between 0.45 and 0.55 in which decisions are to be made with caution, but it lacks theoretical background. Statistical testing was introduced to answer the question in some studies. It was assumed some kinds of probability distribution, but did not give the validity on them. We examine the various cases of weighted sum of evaluation score and show why the statistical testing has to be introduced. We suggest a non-parametric testing procedure which does not assume a specific distribution. A case study is conducted to analyze the validity of our suggested testing procedure. We conclude our study with remarks on the implication of analysis and the future way of research development.

EEG Signal Classification Algorithm based on DWT and SVM for Driving Robot Control (주행로봇제어를 위한 DWT와 SVM기반의 EEG신호 분류 알고리즘)

  • Lee, Kibae;Lee, Chong Hyun;Bae, Jinho;Lee, Jaeil
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.52 no.8
    • /
    • pp.117-125
    • /
    • 2015
  • In this paper, we propose a classification algorithm based on the obtained EEG(Electroencephalogram) signal for the control of 'left' and 'right' turnings of which a driving system composed of EEG sensor, Labview, DAQ, Matlab and driving robot. The proposed algorithm uses features extracted from frequency band information obtained by DWT (Discrete Wavelet Transform) and selects features of high discrimination by using Fisher score. We, also propose the number of feature vectors for the best classification performance by using SVM(Support Vector Machine) classifier and propose a decision pending algorithm based on MLD (Maximum Likelihood Decision) to prevent malfunction due to misclassification. The selected four feature vectors for the proposed algorithm are the mean of absolute value of voltage and the standard deviation of d5(2-4Hz) and d2(16-32Hz) frequency bands of P8 channel according to the international standard electrode placement method. By using the SVM classifier, we obtained 98.75% accuracy and 1.25% error rate. Also, when we specify error probability of 70% for decision pending, we obtained 95.63% accuracy and 0% error rate by using the proposed decision pending algorithm.

STANDARDIZATION STUDY FOR THE KOREAN VERSION OF THE LURIA-NEBRASKA NEUROPSYCHOLOGICAL BATTERY FOR CHILDREN I : SCALE CONSTRUCTION, RELIABILITY & NORMS FOR THE KOREAN VERSION OF LNNB-C (한국판 아동용 Luria-Nebraska 신경심리 검사의 표준화 연구 I: 척도 제작, 신뢰도 및 뇌손상 진단을 위한 규준 산출)

  • Shin, Min-Sup
    • Journal of the Korean Academy of Child and Adolescent Psychiatry
    • /
    • v.5 no.1
    • /
    • pp.54-69
    • /
    • 1994
  • The purpose of present study was to develop the Korean Version of Luria-Nebraska Neuropsychological Battery for Children(LNNB-C), to examine the reliability of it, and to establish the norms for determining the probability of brain damage. The normative group used to standardize the Korean version of LNNB-C was composed of 147 children between the age of 8 and 12(body 74, girl 73). The clinical group consisted of 19 brain damaged, 16 ADHD, and 16 psychiatric controls. The inter-scorer reliability was 96.3%, indicating that the stability of the scoring system for the Korean version of LNNB-C is good. The reliability coefficients(Cronbach's ${\alpha}$) of LNNB-C scales were ranged .51 to .91, which are similar to those of original LNNB-C. To establish the norms for detecting brain damage, the means and standard deviations for normative group were used to calculate T-scores for each scale. To determine a critical level that could successfully predict a normal child's average score at a given age, first the average score of normative group was calculated, and this score was then entered a regression equation with age to predict the average(baseline) acore. Finally, some issues on constructing the Korean version of LNNB-C and the cultural differences between Korean and American children in performing LNNB-C were discussed.

  • PDF

Development of the Three-tier Test Items for the Thinking Skills of the Scientific Inquiry (과학적 탐구 사고력의 3단계 선다형 평가 연구)

  • Lee, Moo
    • Journal of The Korean Association For Science Education
    • /
    • v.18 no.4
    • /
    • pp.643-650
    • /
    • 1998
  • In order to assess students' higher mental abilities, such as scientific inquiry thinking skills, the essay type items would be more adequate than the multiple choice itmes. However, due to the present condition in which a huge number of students take the examination at the same time, it is inevitable to use the multiple choice type. For this reason, it is necessary to develop a new type of multiple choice items which can reduce the disadvantages of the traditional multiple choice type and can achieve a similar level of validity as subjective type assessment. The three-tier multiple choice test items which can be used for a large sample of students and especially for scientific inquiry thinking abilities, are proposed and examined. The three-tier multiple choice test items asked firstly conclusion or the results of calculation or experimental apparatus, secondly the processes of calculation or of developing conclusion, thirdly asking relevant scientific concepts. For the item analysis, 1 point was given to the correct answer, while 0 point was given to the wrong one. The data were processed through the computer program developed in Turbo C 2.0 language with an IBM compatable personal computer. The average score in the sub-items asking for scientific concepts was lower than that in the sub-items asking for results or processes. The score of guessing by chance in the three-tier multiple choice items was only 0.13%, so that the probability of making correct answers by just guessing would be extremely low. The three-tier multiple choice items, even if they are objective items, are thought to assess thinking skills of the scientific inquiry meaningfully excluding the possibility of guessing by chance.

  • PDF

The Odd Pair Family's Dietary management in rural, Korea - Comparison with the Pair Family - (농촌거주 외짝가족의 식생활관리 -부부가족과의 비교-)

  • Rhie Seung Gyo;Chung Kum Ju;Won Hyang Rye
    • The Korean Journal of Community Living Science
    • /
    • v.16 no.1
    • /
    • pp.89-103
    • /
    • 2005
  • Recently the rural Korea has been remarkedly changed of family and social value in accordance with the development of industry. The lower economic class made by social economic growth is widespread with increasing aged, specially odd pair family in rural. The purpose of this study was to investigate to help and keep improve health of rural lower economic class, family system by comparing and analyzing the dietary management, between pair and odd pair family, and to get the data helpful the right guidance for rural. The subjects 1870 collected in 9 provinces by sampling with probability proportional to size (PPS). Questionnaire about dietary habit, food cultivation, production and preservation survey was conducted by trained interviewers. The main results were as follows : 1) The characteristics of odd pair families, head of household was female(77%), over 65 years(84.9%), small family(1.76 persons) and lower education(male 7.5 years, female 3.1 years) status. 2) As the states of diets of odd pair family, having breakfast(87.1 %) but one or two kinds of side dishes(31.3 %) only possible to guess lower status of food intake balance. Nutritional supplements(21. 7 %) was lower than that of paired family. 3) The aspects of dietary habit of odd pair family, no instant foods(70.7%), no snack(38.4%) no dine out(69.2%) were common. 4) Dietary habit scores were 7.78 points of odd pair family compared 8.34 points of paired family. 5) Food purchase place of odd pair family was market(44.2%) but super-market(42.7%) of paired family. 6)In odd pair family, seldom traditional dish preparation(62.0%) but prepared winter kimchi(81.9%), comparing seldom traditional dish(38.6%) and winter kimchi(96.4%) in paired family. 7)The food cultivation state was surveyed, pepper( 42.2 %) and chinese cabbage( 43.9 %) were consumed after cultivation, but sesame(59.4%), bean sprout(90.2%), tofu(92.8%) and egg(93.3%) were consumed by purchase in odd pair family.8) Food cultivation score of odd pair family was 2.98/12points significantly lower than 4.50/12 points of paired family(p<0.01). 9) At the status of fermentation food production in odd pair family, Duenjang(72.1 %) and Gochujang(69.7%) Kanjang(68.3%) Kimchi(82.1 %) and Meju(68.3%) were high rate of production, but more frequently producted in pair family. 10) The score of fermentation food production of odd pair family was 8.57/12points but significantly lower than 10.24/12 points of pair family(p<0.0001). 11) Food preservation score 0.48/6 points in odd pair family was not significantly different than that of pair family(1.07/6points).

  • PDF

Analyzing the discriminative characteristic of cover letters using text mining focused on Air Force applicants (텍스트 마이닝을 이용한 공군 부사관 지원자 자기소개서의 차별적 특성 분석)

  • Kwon, Hyeok;Kim, Wooju
    • Journal of Intelligence and Information Systems
    • /
    • v.27 no.3
    • /
    • pp.75-94
    • /
    • 2021
  • The low birth rate and shortened military service period are causing concerns about selecting excellent military officers. The Republic of Korea entered a low birth rate society in 1984 and an aged society in 2018 respectively, and is expected to be in a super-aged society in 2025. In addition, the troop-oriented military is changed as a state-of-the-art weapons-oriented military, and the reduction of the military service period was implemented in 2018 to ease the burden of military service for young people and play a role in the society early. Some observe that the application rate for military officers is falling due to a decrease of manpower resources and a preference for shortened mandatory military service over military officers. This requires further consideration of the policy of securing excellent military officers. Most of the related studies have used social scientists' methodologies, but this study applies the methodology of text mining suitable for large-scale documents analysis. This study extracts words of discriminative characteristics from the Republic of Korea Air Force Non-Commissioned Officer Applicant cover letters and analyzes the polarity of pass and fail. It consists of three steps in total. First, the application is divided into general and technical fields, and the words characterized in the cover letter are ordered according to the difference in the frequency ratio of each field. The greater the difference in the proportion of each application field, the field character is defined as 'more discriminative'. Based on this, we extract the top 50 words representing discriminative characteristics in general fields and the top 50 words representing discriminative characteristics in technology fields. Second, the number of appropriate topics in the overall cover letter is calculated through the LDA. It uses perplexity score and coherence score. Based on the appropriate number of topics, we then use LDA to generate topic and probability, and estimate which topic words of discriminative characteristic belong to. Subsequently, the keyword indicators of questions used to set the labeling candidate index, and the most appropriate index indicator is set as the label for the topic when considering the topic-specific word distribution. Third, using L-LDA, which sets the cover letter and label as pass and fail, we generate topics and probabilities for each field of pass and fail labels. Furthermore, we extract only words of discriminative characteristics that give labeled topics among generated topics and probabilities by pass and fail labels. Next, we extract the difference between the probability on the pass label and the probability on the fail label by word of the labeled discriminative characteristic. A positive figure can be seen as having the polarity of pass, and a negative figure can be seen as having the polarity of fail. This study is the first research to reflect the characteristics of cover letters of Republic of Korea Air Force non-commissioned officer applicants, not in the private sector. Moreover, these methodologies can apply text mining techniques for multiple documents, rather survey or interview methods, to reduce analysis time and increase reliability for the entire population. For this reason, the methodology proposed in the study is also applicable to other forms of multiple documents in the field of military personnel. This study shows that L-LDA is more suitable than LDA to extract discriminative characteristics of Republic of Korea Air Force Noncommissioned cover letters. Furthermore, this study proposes a methodology that uses a combination of LDA and L-LDA. Therefore, through the analysis of the results of the acquisition of non-commissioned Republic of Korea Air Force officers, we would like to provide information available for acquisition and promotional policies and propose a methodology available for research in the field of military manpower acquisition.

Risk Factors for the Probability of Pregnancy Following Synchronization Protocols in Dairy Cows (젖소에서 배란동기화 프로그램 적용 후 임신율에 영향을 미치는 요인 분석 연구)

  • Jeong, Jae-Kwan;Kang, Hyun-Gu;Jung, Young-Hun;Hur, Tai-Young;Kim, III-Hwa
    • Journal of Veterinary Clinics
    • /
    • v.31 no.5
    • /
    • pp.382-388
    • /
    • 2014
  • The objective of this study was to determine the risk factors associated with pregnancy following 3 synchronization protocols in dairy cows. Data were collected on 1,952 cows from 22 dairy farms, including synchronization protocols ($PGF_{2{\alpha}}$ + estradiol benzoate [PG+EB], Ovsynch, and CIDR-ovsynch), cow parity, body condition score (BCS), and dates of previous calving, insemination and conception. The odds ratio (OR) for pregnancy were analyzed by logistic regression using the LOGISTIC procedure in SAS. The analysis revealed that farm (p = 0.005), cow parity (p = 0.0001), BCS (p < 0.005), and AI season (p < 0.05) significantly affected and calving to AI interval tended to affect (p < 0.1) the probability for pregnancy. Although synchronization protocols did not affect the probability for pregnancy (p > 0.05), cow parity and synchronization protocols showed a significant interaction (p < 0.005); the OR (0.60) was significantly lower (p < 0.0001) for multiparous cows compared to primiparous cows using PG+EB, whereas the OR (1.44) tended to be higher (p < 0.1) for multiparous cows compared to primiparous cows using the Ovsynch, and the probability for pregnancy did not differ between multiparous and primiparous cows using the CIDR-ovsynch (p > 0.05). Cows with BCS ${\geq}$ 3.00 were more likely pregnant (OR: 1.41) compared with cows having BCS ${\leq}$ 2.75, whereas cows inseminated during summer had a lower OR (0.73) compared with those inseminated during spring. Cows with a calving to AI interval > 150 days were more likely to be pregnant (OR: 1.20) compared with cows with a calving to AI interval ${\leq}$ 150 days. In conclusion, the OR for pregnancy following synchronization protocols in dairy cows was affected by farm, parity, BCS, calving to AI interval of the cow, and AI season, and there was a significant interaction between cow parity and synchronization protocols; the OR for pregnancy was lower for multiparous cows compared with primiparous cows using the PG+EB protocol.