• Title/Summary/Keyword: Scoring model

Search Result 269, Processing Time 0.027 seconds

Design and Implementation of an Automatic Scoring Model Using a Voting Method for Descriptive Answers (투표 기반 서술형 주관식 답안 자동 채점 모델의 설계 및 구현)

  • Heo, Jeongman;Park, So-Young
    • Journal of the Korea Society of Computer and Information
    • /
    • v.18 no.8
    • /
    • pp.17-25
    • /
    • 2013
  • TIn this paper, we propose a model automatically scoring a student's answer for a descriptive problem by using a voting method. Considering the model construction cost, the proposed model does not separately construct the automatic scoring model per problem type. In order to utilize features useful for automatically scoring the descriptive answers, the proposed model extracts feature values from the results, generated by comparing the student's answer with the answer sheet. For the purpose of improving the precision of the scoring result, the proposed model collects the scoring results classified by a few machine learning based classifiers, and unanimously selects the scoring result as the final result. Experimental results show that the single machine learning based classifier C4.5 takes 83.00% on precision while the proposed model improve the precision up to 90.57% by using three machine learning based classifiers C4.5, ME, and SVM.

The Automated Scoring of Kinematics Graph Answers through the Design and Application of a Convolutional Neural Network-Based Scoring Model (합성곱 신경망 기반 채점 모델 설계 및 적용을 통한 운동학 그래프 답안 자동 채점)

  • Jae-Sang Han;Hyun-Joo Kim
    • Journal of The Korean Association For Science Education
    • /
    • v.43 no.3
    • /
    • pp.237-251
    • /
    • 2023
  • This study explores the possibility of automated scoring for scientific graph answers by designing an automated scoring model using convolutional neural networks and applying it to students' kinematics graph answers. The researchers prepared 2,200 answers, which were divided into 2,000 training data and 200 validation data. Additionally, 202 student answers were divided into 100 training data and 102 test data. First, in the process of designing an automated scoring model and validating its performance, the automated scoring model was optimized for graph image classification using the answer dataset prepared by the researchers. Next, the automated scoring model was trained using various types of training datasets, and it was used to score the student test dataset. The performance of the automated scoring model has been improved as the amount of training data increased in amount and diversity. Finally, compared to human scoring, the accuracy was 97.06%, the kappa coefficient was 0.957, and the weighted kappa coefficient was 0.968. On the other hand, in the case of answer types that were not included in the training data, the s coring was almos t identical among human s corers however, the automated scoring model performed inaccurately.

Research on the E-Commerce Credit Scoring Model Using the Gaussian Density Function

  • Xiao, Qiang;He, Rui-chun;Zhang, Wei
    • Journal of Information Processing Systems
    • /
    • v.11 no.2
    • /
    • pp.173-183
    • /
    • 2015
  • At present, it is simple to the electronic commerce credit scoring model, as a brush credit phenomenon in E-commerce has emerged. This phenomenon affects the judgment of consumers and hinders the rapid development of E-commerce. In this paper, that E-commerce credit evaluation model that uses a Gaussian density function is put forward by density test and the analysis for the anomalies of E-commerce credit rating, it can be fond out the abnormal point in credit scoring, these points were calculated by nonlinear credit scoring algorithm, thus it can effectively improve the current E-commerce credit score, and enhance the accuracy of E-commerce credit score.

Scoring models to detect foreign exchange money laundering (외국환 거래의 자금세탁 혐의도 점수모형 개발에 관한 연구)

  • Hong, Seong-Ik;Moon, Tae-Hee;Sohn, So-Young
    • IE interfaces
    • /
    • v.18 no.3
    • /
    • pp.268-276
    • /
    • 2005
  • In recent years, the money Laundering crimes are increasing by means of foreign exchange transactions. Our study proposes four scoring models to provide early warning of the laundering in foreign exchange transactions for both inward and outward remittances: logistic regression model, decision tree, neural network, and ensemble model which combines the three models. In terms of accuracy of test data, decision tree model is selected for the inward remittance and an ensemble model for the outward remittance. From our study results, the accumulated number of transaction turns out to be the most important predictor variable. The proposed scoring models deal with the transaction level and is expected to help the bank teller to detect the laundering related transactions in the early stage.

Development of Scoring Model on Customer Attrition Probability by Using Data Mining Techniques

  • Han, Sang-Tae;Lee, Seong-Keon;Kang, Hyun-Cheol;Ryu, Dong-Kyun
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.1
    • /
    • pp.271-280
    • /
    • 2002
  • Recently, many companies have applied data mining techniques to promote competitive power in the field of their business market. In this study, we address how data mining, that is a technique to enable to discover knowledge from a deluge of data, Is used in an executed project in order to support decision making of an enterprise. Also, we develope scoring model on customer attrition probability for automobile-insurance company using data mining techniques. The development of scoring model in domestic insurance is given as an example concretely.

Polyclass in Data Mining (데이터 마이닝에서의 폴리클라스)

  • 구자용;박헌진;최대우
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.489-503
    • /
    • 2000
  • Data mining means data analysis and model selection using various types of data in order to explore useful information and knowledge for making decisions. Examples of data mining include scoring for credit analysis of a new customer and scoring for churn management, where the customers with high scores are given special attention. In this paper, scoring is interpreted as a modeling process of the conditional probability and polyclass scoring method is described. German credit data, a PC communication company data and a mobile communication company data are used to compare the performance of polyclass scoring method with that of the scoring method based on a tree model.

  • PDF

Development of a Risk Scoring Model to Predict Unexpected Conversion to Thoracotomy during Video-Assisted Thoracoscopic Surgery for Lung Cancer

  • Ga Young Yoo;Seung Keun Yoon;Mi Hyoung Moon;Seok Whan Moon;Wonjung Hwang;Kyung Soo Kim
    • Journal of Chest Surgery
    • /
    • v.57 no.3
    • /
    • pp.302-311
    • /
    • 2024
  • Background: Unexpected conversion to thoracotomy during planned video-assisted thoracoscopic surgery (VATS) can lead to poor outcomes and comparatively high morbidity. This study was conducted to assess preoperative risk factors associated with unexpected thoracotomy conversion and to develop a risk scoring model for preoperative use, aimed at identifying patients with an elevated risk of conversion. Methods: A retrospective analysis was conducted of 1,506 patients who underwent surgical resection for non-small cell lung cancer. To evaluate the risk factors, univariate analysis and logistic regression were performed. A risk scoring model was established to predict unexpected thoracotomy conversion during VATS of the lung, based on preoperative factors. To validate the model, an additional cohort of 878 patients was analyzed. Results: Among the potentially significant clinical variables, male sex, previous ipsilateral lung surgery, preoperative detection of calcified lymph nodes, and clinical T stage were identified as independent risk factors for unplanned conversion to thoracotomy. A 6-point risk scoring model was developed to predict conversion based on the assessed risk, with patients categorized into 4 groups. The results indicated an area under the receiver operating characteristic curve of 0.747, with a sensitivity of 80.5%, specificity of 56.4%, positive predictive value of 1.8%, and negative predictive value of 91.0%. When applied to the validation cohort, the model exhibited good predictive accuracy. Conclusion: We successfully developed and validated a risk scoring model for preoperative use that can predict the likelihood of unplanned conversion to thoracotomy during VATS of the lung.

Development of a Scoring Model for Evaluating the Rural Healthy and Longevity Village Project using DEA and AHP (DEA와 AHP기법을 이용한 농촌건강장수마을사업 평가모형 개발)

  • Suh, Kyo;Han, Yi-Cheol;Lee, Ji-Min;Lee, Jeong-Jae
    • Journal of Korean Society of Rural Planning
    • /
    • v.12 no.4 s.33
    • /
    • pp.1-11
    • /
    • 2006
  • Recently many administrative institutes try to improve the viability of rural villages. For increasing the viability, not only infrastructures but internal vitality is necessary in rural villages. Nonetheless, most of governmental projects have been focused on infrastructures. For this reason, RDA(Rural Development Administration) designed and performed the RHL(Rural Healthy and Longevity village) project. This RHL project is not easy to evaluate the outcome because it consists of very intangible project items. In this paper, we developed a scoring model to evaluate the result of the RHL project. The scoring model based on DEA(Data Envelopment Analysis) was suggested to evaluate the quantity of personal activities in each village. Personal activities are classified into five categories: regional life, social life, productive life, outdoor life and indoor life. Evaluating indices of each category are developed and weighting values are evaluated by AHP(Analytic Hierarchy Process). The developed model was applied to Kumsan village and examined its applicability.

A Novel Molecular Grading Model: Combination of Ki67 and VEGF in Predicting Tumor Recurrence and Progression in Non-invasive Urothelial Bladder Cancer

  • Chen, Jun-Xing;Deng, Nan;Chen, Xu;Chen, Ling-Wu;Qiu, Shao-Peng;Li, Xiao-Fei;Li, Jia-Ping
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.13 no.5
    • /
    • pp.2229-2234
    • /
    • 2012
  • Purpose: To assess efficacy of Ki67 combined with VEGF as a molecular grading model to predict outcomes with non-muscle invasive bladder cancer (NMIBC). Materials: 72 NMIBC patients who underwent transurethral resection (TUR) followed by routine intravesical instillations were retrospectively analyzed in this study. Univariate and multivariate analyses were performed to confirm the prognostic values of the Ki67 labeling index (LI) and VEGF scoring for tumor recurrence and progression. Results: The novel molecular grading model for NMIBC contained three molecular grades including mG1 (Ki67 $LI{\leq}25%$, VEGF $scoring{\leq}8$), mG2 (Ki67 LI>25%, VEGF $scoring{\leq}8$; or Ki67 $LI{\leq}25%$, VEGF scoring > 8), and mG3 (Ki67 LI > 25%, VEGF scoring > 8), which can indicate favorable, intermediate and poor prognosis, respectively. Conclusions: The described novel molecular grading model utilizing Ki67 LI and VEGF scoring is helpful to effectively and accurately predict outcomes and optimize personal therapy.

Automatic scoring of mathematics descriptive assessment using random forest algorithm (랜덤 포레스트 알고리즘을 활용한 수학 서술형 자동 채점)

  • Inyong Choi;Hwa Kyung Kim;In Woo Chung;Min Ho Song
    • The Mathematical Education
    • /
    • v.63 no.2
    • /
    • pp.165-186
    • /
    • 2024
  • Despite the growing attention on artificial intelligence-based automated scoring technology as a support method for the introduction of descriptive items in school environments and large-scale assessments, there is a noticeable lack of foundational research in mathematics compared to other subjects. This study developed an automated scoring model for two descriptive items in first-year middle school mathematics using the Random Forest algorithm, evaluated its performance, and explored ways to enhance this performance. The accuracy of the final models for the two items was found to be between 0.95 to 1.00 and 0.73 to 0.89, respectively, which is relatively high compared to automated scoring models in other subjects. We discovered that the strategic selection of the number of evaluation categories, taking into account the amount of data, is crucial for the effective development and performance of automated scoring models. Additionally, text preprocessing by mathematics education experts proved effective in improving both the performance and interpretability of the automated scoring model. Selecting a vectorization method that matches the characteristics of the items and data was identified as one way to enhance model performance. Furthermore, we confirmed that oversampling is a useful method to supplement performance in situations where practical limitations hinder balanced data collection. To enhance educational utility, further research is needed on how to utilize feature importance derived from the Random Forest-based automated scoring model to generate useful information for teaching and learning, such as feedback. This study is significant as foundational research in the field of mathematics descriptive automatic scoring, and there is a need for various subsequent studies through close collaboration between AI experts and math education experts.