• Title/Summary/Keyword: Statistical Learning

Search Result 1,300, Processing Time 0.028 seconds

Simple Graphs for Complex Prediction Functions

  • Huh, Myung-Hoe;Lee, Yong-Goo
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.3
    • /
    • pp.343-351
    • /
    • 2008
  • By supervised learning with p predictors, we frequently obtain a prediction function of the form $y\;=\;f(x_1,...,x_p)$. When $p\;{\geq}\;3$, it is not easy to understand the inner structure of f, except for the case the function is formulated as additive. In this study, we propose to use p simple graphs for visual understanding of complex prediction functions produced by several supervised learning engines such as LOESS, neural networks, support vector machines and random forests.

A Program for Statistical Education through Simulation

  • SookHee Choi
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.1
    • /
    • pp.251-259
    • /
    • 1999
  • The purpose of this study is to develope a program for statistics education. This program deals with simulation which is helpful in understanding some elementary statistical concepts. This program under multimedia environment which includes sound video animation etc. doesn't show only the result but make it possible for students to execute the program by stages. This type of dynamic learning is efficient to overcome the limits of teaching materials or classroom work. Also it can interest students greatly. By executing it the students can understand the method and meaning of simulation and acquire concepts of probability and statistical inference naturally.

  • PDF

Korean College Students' English Learning Motivation and Listening Proficiency

  • Yang, Eun-Mi
    • English Language & Literature Teaching
    • /
    • v.17 no.2
    • /
    • pp.93-114
    • /
    • 2011
  • The aim of this study is twofold. First, this study aimed to explore how Korean university students' English learning motivation is related to their English listening proficiency and study time. Second, it attempted to interpret the English learning motivation linking the two different motivation theories: self-determination theory and L2 motivational self system. The constructs of the students' L2 learning motivation were investigated with the data obtained through the questionnaire from 122 sophomore students. A factor analysis was conducted to extract the major factors of motivation. As a result, 6 factors were extracted: Intrinsic Pleasure, Identified Value Regulation, Intrinsic Accomplishment, Introjected Regulation, External Regulation, and Identified Regulation. The Interrelatedness among the assessment results on the L2 listening proficiency (pre and post test), listening study time, and motivation factors was measured by correlation coefficients. The statistical results indicated that pre-test scores were significantly related to Identified Regulation and Identified Value Regulation toward English learning, and post-test results had significant correlation with Intrinsic Accomplishment and Identified Regulation. However, no motivation subtypes showed statistical association with the students' listening study time. The results were attempted to be interpreted both under L2 motivational self system and self-determination framework to better illuminate the motivation theory with more explanatory power.

  • PDF

A Study on an Instructional Model and Statistical Thinking Levels to Help Minority Students with Low-SES and Learning Difficulty (교육소외 학생들을 위한 수업모형과 통계이해수준에 관한 연구)

  • Baek, Jung-Hwan;ChoiKoh, Sang-Sook
    • The Mathematical Education
    • /
    • v.50 no.3
    • /
    • pp.263-284
    • /
    • 2011
  • We took note of the fact that there were not many studies on improvement of mathematics learning in the field of statistics for the minority students from the families who belonged to the Low-SES. This study was to help them understand the concepts and principles of mathematics, motivate them for mathematics learning, and have them feel familiar with it. The subjects were 12 students from the low-SES families among the sophomores of 00 High School in Gyeonggi-do. Although it could not be achieved effectively in the short-term of learning for the slow learners, their understanding of basic concepts and confidence, interests and concerns in statistical learning were remarkably changed, compared to their work in the beginning period. Our discourse classes using various topics and examples were well perceived by the students whose performance was improved up to the 3rd thinking level of Mooney's framework. Also, a meaningful instructional model for slow learners(IMSL) was found through the discourse.

Price Determinant Factors of Artworks and Prediction Model Based on Machine Learning (작품 가격 추정을 위한 기계 학습 기법의 응용 및 가격 결정 요인 분석)

  • Jang, Dongryul;Park, Minjae
    • Journal of Korean Society for Quality Management
    • /
    • v.47 no.4
    • /
    • pp.687-700
    • /
    • 2019
  • Purpose: The purpose of this study is to investigate the interaction effects between price determinants of artworks. We expand the methodology in art market by applying machine learning techniques to estimate the price of artworks and compare linear regression and machine learning in terms of prediction accuracy. Methods: Moderated regression analysis was performed to verify the interaction effects of artistic characteristics on price. The moderating effects were studied by confirming the significance level of the interaction terms of the derived regression equation. In order to derive price estimation model, we use multiple linear regression analysis, which is a parametric statistical technique, and k-nearest neighbor (kNN) regression, which is a nonparametric statistical technique in machine learning methods. Results: Mostly, the influences of the price determinants of art are different according to the auction types and the artist 's reputation. However, the auction type did not control the influence of the genre of the work on the price. As a result of the analysis, the kNN regression was superior to the linear regression analysis based on the prediction accuracy. Conclusion: It provides a theoretical basis for the complexity that exists between pricing determinant factors of artworks. In addition, the nonparametric models and machine learning techniques as well as existing parameter models are implemented to estimate the artworks' price.

Comparison of Scala and R for Machine Learning in Spark (스파크에서 스칼라와 R을 이용한 머신러닝의 비교)

  • Woo-Seok Ryu
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.18 no.1
    • /
    • pp.85-90
    • /
    • 2023
  • Data analysis methodology in the healthcare field is shifting from traditional statistics-oriented research methods to predictive research using machine learning. In this study, we survey various machine learning tools, and compare several programming models, which utilize R and Spark, for applying R, a statistical tool widely used in the health care field, to machine learning. In addition, we compare the performance of linear regression model using scala, which is the basic languages of Spark and R. As a result of the experiment, the learning execution time when using SparkR increased by 10 to 20% compared to Scala. Considering the presented performance degradation, SparkR's distributed processing was confirmed as useful in R as the traditional statistical analysis tool that could be used as it is.

Analysis of massive data in astronomy (천문학에서의 대용량 자료 분석)

  • Shin, Min-Su
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.6
    • /
    • pp.1107-1116
    • /
    • 2016
  • Recent astronomical survey observations have produced substantial amounts of data as well as completely changed conventional methods of analyzing astronomical data. Both classical statistical inference and modern machine learning methods have been used in every step of data analysis that range from data calibration to inferences of physical models. We are seeing the growing popularity of using machine learning methods in classical problems of astronomical data analysis due to low-cost data acquisition using cheap large-scale detectors and fast computer networks that enable us to share large volumes of data. It is common to consider the effects of inhomogeneous spatial and temporal coverage in the analysis of big astronomical data. The growing size of the data requires us to use parallel distributed computing environments as well as machine learning algorithms. Distributed data analysis systems have not been adopted widely for the general analysis of massive astronomical data. Gathering adequate training data is expensive in observation and learning data are generally collected from multiple data sources in astronomy; therefore, semi-supervised and ensemble machine learning methods will become important for the analysis of big astronomical data.

Prediction of English Premier League Game Using an Ensemble Technique (앙상블 기법을 통한 잉글리시 프리미어리그 경기결과 예측)

  • Yi, Jae Hyun;Lee, Soo Won
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.5
    • /
    • pp.161-168
    • /
    • 2020
  • Predicting outcome of the sports enables teams to establish their strategy by analyzing variables that affect overall game flow and wins and losses. Many studies have been conducted on the prediction of the outcome of sports events through statistical techniques and machine learning techniques. Predictive performance is the most important in a game prediction model. However, statistical and machine learning models show different optimal performance depending on the characteristics of the data used for learning. In this paper, we propose a new ensemble model to predict English Premier League soccer games using statistical models and the machine learning models which showed good performance in predicting the results of the soccer games and this model is possible to select a model that performs best when predicting the data even if the data are different. The proposed ensemble model predicts game results by learning the final prediction model with the game prediction results of each single model and the actual game results. Experimental results for the proposed model show higher performance than the single models.

Effects of Blended-TBL on Students' Self-Regulated Learning

  • PARK, Eunsook
    • Educational Technology International
    • /
    • v.10 no.1
    • /
    • pp.137-155
    • /
    • 2009
  • The purpose of this research is to develop Blended-TBL(Team Based Learning) model that emphasizes the active participation and teamwork of students in on-off blended learning environment, and apply it into the college course and explore whether self-regulated learning between one group pretest and posttest is different. For this, this research investigated the concept and the characteristics of Team Based Learning, and developed the Blended-TBL Model to apply it into the college course, and finally prove effects of Blended-TBL model on self-regulated learning using Motivated Strategies for Learning Questionnaire (MSLQ). The participants in this study were 57 college students. They participated in on-off blended-TBL course for 15weeks. Participants followed the content grounded and the problem solving steps in collaborative team-based learning. This research practiced a quantitative research to find out the statistical difference of the self-regulated learning between pretest and posttest using SPSS. The result revealed that Blended-TBL students improved self-regulated learning including motivation, cognitive, metacognitive, and resource management. Based on this result, this research discussed the effects of Blended-TBL on Self-Regulated Learning and suggested the further study.

Development of an e-Learning Environment for Blended Learning

  • Ahn, Jeong-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.2
    • /
    • pp.345-353
    • /
    • 2006
  • Over the past few years, training professionals have become more pragmatic in their approach to technology-based media by using it to augment traditional forms of training delivery, such as classroom instruction and text-based materials. This trend has led to the rise of the term blended learning. Blended learning, an environment of e-learning, is a powerful learning solution created through a mixture of face-to-face and online learning delivered through a mix of media and superior learning experiences. In this article we design and implement an e-learning environment for blended learning. The environment focused on following factors: learning activity and participation of learners, and real time feedback of instructor.

  • PDF