• Title/Summary/Keyword: Statistical learning

A Co-Evolutionary Computing for Statistical Learning Theory

  • Jun Sung-Hae
    • International Journal of Fuzzy Logic and Intelligent Systems / v.5 no.4 / pp.281-285 / 2005
  • Learning and evolving are two fundamentals of data mining. Compared with classical learning theory, which is based on an objective function that minimizes training error, evolutionary computing has recently offered an efficient approach to constructing an optimal model without minimizing training error. The global search that evolutionary computing performs over the solution space can resolve the local optima problems of learning models. In this research, we combine a co-evolving algorithm with statistical learning theory and propose a co-evolutionary computing method that overcomes the local optima problems of statistical learning theory. We apply the proposed model to classification and prediction problems. In the experiments, we verify the improved performance of our model using data sets from the UCI Machine Learning Repository and KDD Cup 2000.

The Role of Distributional Cues in the Acquisition of Verb Argument Structures

  • Kim, Mee-Sook
    • Language and Information / v.7 no.1 / pp.87-99 / 2003
  • This paper investigates the role of input frequency in the acquisition of verb argument structures, based on distributional information from a corpus of utterances derived from the English CHILDES database (MacWhinney 1993). It has been widely accepted that children successfully learn verb argument structures through innate language mechanisms, such as linking rules that connect verb meanings to their syntactic structures. In contrast, an approach to language acquisition called “statistical language learning” has recently claimed that children could succeed in acquiring syntactic structures in the absence of innate language mechanisms by making use of distributional properties of the input. In this paper, I evaluate the feasibility of statistical learning for acquiring verb argument structures, based on distributional information about locative verbs in parental input. The naturalistic data allow us to investigate to what extent the statistical learning approach can and cannot help children learn the syntax of locative verbs. Based on the results of the English database analysis, I show that parental input contains rich statistical information for learning the syntactic possibilities of locative verbs, despite some limitations of the statistical learning approach.

Development of a Dynamic Geometry Environment to Collect Learning History Data

  • Mun, Kill-Sung;Han, Beom-Soo;Han, Kyung-Soo;Ahn, Jeong-Yong
    • Journal of the Korean Data and Information Science Society / v.18 no.2 / pp.375-384 / 2007
  • As teaching that uses ICT becomes more popular, many studies on dynamic geometry environments (DGEs) are under way. An important factor emphasized in these studies is the practical use of learners' learning activities. In this study, we first define learning history data in a DGE. Second, we develop a prototype DGE that can collect and analyze learning history data automatically. The environment makes it possible not only to track learning history but also to create and manage new learning objects.

Machine Learning vs. Statistical Model for Prediction Modelling: Application in Medical Imaging Research (예측모형의 머신러닝 방법론과 통계학적 방법론의 비교: 영상의학 연구에서의 적용)

  • Leeha Ryu;Kyunghwa Han
    • Journal of the Korean Society of Radiology / v.83 no.6 / pp.1219-1228 / 2022
  • Clinical prediction models have been increasingly published in radiology research. In particular, as radiomics research is being actively conducted, prediction models are developed using traditional statistical models as well as machine learning in order to account for high-dimensional data. In this review, we investigated the statistical and machine learning methods used in clinical prediction model research and briefly summarized each analytical method for statistical models, machine learning, and statistical learning. Finally, we discussed several considerations for choosing a prediction modeling method.

Virtual Learning Environments for Statistics Education and Applications for Official Statistics

  • Mittag Hans-Joachim
    • Proceedings of the Korean Statistical Society Conference / 2004.11a / pp.307-312 / 2004
  • In our fast-moving information and knowledge society, skills and know-how rapidly become outdated. Virtual learning environments play a key role in meeting today's growing demand for customized educational and vocational training and lifelong learning. The scope of multimedia-based and web-supported education is illustrated by means of an interdisciplinary multimedia project, 'New Statistics', funded by the German government. The project output contains more than 70 learning modules covering the complete curriculum of an introductory statistics course. All modules are based on a statistical laboratory and on a multitude of Java applets, animations and case studies. The paper focuses on presenting the statistical laboratory and the applets. These components represent the main project pillars and are particularly suitable for international use, independently of the original project framework. This article also demonstrates the application of Java applets and other multimedia developments from the educational world to official statistics for interactive presentation of statistical information.

Application of data mining and statistical measurement of agricultural high-quality development

  • Yan Zhou
    • Advances in nano research / v.14 no.3 / pp.225-234 / 2023
  • In this study, we aim to use big data resources and statistical analysis to obtain reliable guidance for achieving high-quality, high-yield agricultural production. To this end, soil type data, rainfall and temperature data, and annual wheat production are collected for a specific region. Using statistical methodology, the acquired data were cleaned to remove incomplete and defective records. We then applied several machine learning classification methods to distinguish between different factors and their influence on final crop yields. The efficacy of the machine learning methods is discussed by comparing the models' predictions using two statistical quantities, the correlation factor and the mean squared error between predicted and actual crop yields. The results show that machine learning methods predict crop yields with high accuracy. Moreover, the random forest (RF) classification approach provides the best results among the classification methods used in this study.
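
The two comparison quantities named in this abstract, the correlation factor and the mean squared error between predicted and actual yields, can be sketched as follows. This is a minimal illustration only: the abstract does not give its formulas, so the Pearson correlation is assumed here, and the yield values are hypothetical numbers, not data from the study.

```python
import math

def mean_squared_error(actual, predicted):
    """Average squared difference between actual and predicted values."""
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)

def pearson_correlation(actual, predicted):
    """Pearson correlation coefficient (assumed meaning of 'correlation factor')."""
    n = len(actual)
    mean_a = sum(actual) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_a) * (p - mean_p) for a, p in zip(actual, predicted))
    var_a = sum((a - mean_a) ** 2 for a in actual)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_a * var_p)

# Hypothetical crop yields (illustrative values only).
actual_yield = [3.0, 4.0, 5.0, 6.0]
predicted_yield = [2.5, 4.5, 5.0, 6.0]

mse = mean_squared_error(actual_yield, predicted_yield)
corr = pearson_correlation(actual_yield, predicted_yield)
print(f"MSE = {mse:.3f}, correlation = {corr:.3f}")
```

A lower mean squared error and a correlation closer to 1 would both indicate predictions that track the actual yields more closely.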

Fostering Students' Statistical Thinking through Data Modelling

  • Ken W. Li
    • Research in Mathematical Education / v.26 no.3 / pp.127-146 / 2023
  • Statistical thinking is broadly defined; the present study focuses on it in the context of regression modelling. To foster students' statistical thinking within this context, teaching should no longer be seen as a transfer of knowledge from teacher to students but as a process of engaging students with learning activities in which they develop ownership of knowledge. This study addresses collaborative learning contexts; students were divided into small groups in order to increase opportunities for peer collaboration. Each group of students was asked to do a regression project after class. Through doing the project, they learnt to organize and connect previously accrued piecemeal statistical knowledge in an integrated manner. They could also clarify misunderstandings and solve problems through verbal exchanges among themselves. They gave a clear and lucid account of the model they had built and showed collaborative interactions when presenting their projects in front of the class. A survey was conducted to solicit their feedback on how peer collaboration would facilitate the learning of statistics. Almost all students found their interaction with their peers productive; they focused on the development of statistical thinking with concerted effort.

A Study on the Research Trends of Smart Learning (스마트교육 연구동향에 대한 분석 연구)

  • Kim, Hyang-Hwa;Oh, Dong-In;Heo, Gyun
    • Journal of Fisheries and Marine Sciences Education / v.26 no.1 / pp.156-165 / 2014
  • The purpose of this study was to identify research trends in smart learning. To this end, we examined research characteristics such as subject or keyword, research method, data collection method, and statistical analysis method. A total of 2,865 articles published from 1995 to 2013 were gathered from five Korean academic journals related to smart learning. Of these, 596 papers were analyzed by keyword, research area, research method, data collection method, and statistical analysis method. The findings were as follows: (a) Smart learning papers with keywords such as u-learning, m-learning, and smart-learning emerged after 2006. Papers on ICT-related topics increased sharply after 2000 but decreased after 2006. Papers with e-learning-related keywords increased steadily from 2000 through 2013. (b) Design accounted for the largest share of smart learning research, and management for the smallest. (c) Development was the main research method. Questionnaires and experiments were the main data collection methods. The t-test and frequency analysis were the main statistical analysis methods.

Statistical Modeling of Learning Curves with Binary Response Data (이항 반응 자료에 대한 학습곡선의 모형화)

  • Lee, Seul-Ji;Park, Man-Sik
    • Communications for Statistical Applications and Methods / v.19 no.3 / pp.433-450 / 2012
  • As a worker performs a certain operation repeatedly, he tends to become familiar with the job and to complete it in less time. That is, efficiency improves because of his accumulated knowledge, experience and skill with regard to the operation. The time invested per unit of output decreases as the operation is repeated. This phenomenon is referred to as the learning curve effect. A learning curve is a graphical representation of the changing rate of learning. According to the previous literature, learning curve effects are determined by subjective, pre-assigned factors. In this study, we propose a new statistical model that clarifies the learning curve effect by means of a basic cumulative distribution function. This work mainly focuses on the statistical modeling of binary data. We employ the Newton-Raphson method for estimation and the delta method for the construction of confidence intervals. We also perform a real data analysis.
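
The estimation steps named in this abstract (a cumulative distribution function as the learning curve, Newton-Raphson estimation, and delta-method confidence intervals) can be sketched roughly as follows. The abstract does not specify the model, so everything concrete here is an assumption: the logistic CDF stands in for the "basic cumulative distribution function", the success probability at trial t is taken as F(a + b*t), the interval is a 95% Wald interval, and the binary outcomes are made-up illustrative data.

```python
import math

def logistic_cdf(z):
    """Logistic CDF, assumed here as the learning-curve shape."""
    return 1.0 / (1.0 + math.exp(-z))

def score_and_information(a, b, trials, outcomes):
    """Gradient of the log-likelihood and the Fisher information entries."""
    g0 = g1 = i00 = i01 = i11 = 0.0
    for t, y in zip(trials, outcomes):
        p = logistic_cdf(a + b * t)
        w = p * (1.0 - p)
        g0 += y - p
        g1 += (y - p) * t
        i00 += w
        i01 += w * t
        i11 += w * t * t
    return (g0, g1), (i00, i01, i11)

def invert_2x2(i00, i01, i11):
    """Inverse of a symmetric 2x2 matrix, returned as (c00, c01, c11)."""
    det = i00 * i11 - i01 * i01
    return i11 / det, -i01 / det, i00 / det

def fit_learning_curve(trials, outcomes, max_iter=50, tol=1e-10):
    """Newton-Raphson maximum likelihood fit; returns (a, b, covariance)."""
    a = b = 0.0
    for _ in range(max_iter):
        (g0, g1), info = score_and_information(a, b, trials, outcomes)
        c00, c01, c11 = invert_2x2(*info)
        step_a = c00 * g0 + c01 * g1
        step_b = c01 * g0 + c11 * g1
        a += step_a
        b += step_b
        if abs(step_a) + abs(step_b) < tol:
            break
    # Covariance estimate: inverse information at the final estimate.
    _, info = score_and_information(a, b, trials, outcomes)
    return a, b, invert_2x2(*info)

def success_probability_ci(a, b, cov, t, z=1.96):
    """Delta-method interval for p(t) = F(a + b*t)."""
    c00, c01, c11 = cov
    p = logistic_cdf(a + b * t)
    # Gradient of p(t) with respect to (a, b) is p(1-p) * (1, t).
    variance = (p * (1.0 - p)) ** 2 * (c00 + 2.0 * t * c01 + t * t * c11)
    half_width = z * math.sqrt(variance)
    return p - half_width, p, p + half_width

# Hypothetical binary outcomes (1 = task completed successfully) over 10 trials.
trials = list(range(1, 11))
outcomes = [0, 0, 1, 0, 0, 1, 1, 0, 1, 1]

a_hat, b_hat, cov_hat = fit_learning_curve(trials, outcomes)
lower, p_hat, upper = success_probability_ci(a_hat, b_hat, cov_hat, t=10)
print(f"a = {a_hat:.3f}, b = {b_hat:.3f}")
print(f"P(success at trial 10) = {p_hat:.3f}, 95% CI ({lower:.3f}, {upper:.3f})")
```

A positive slope estimate would correspond to a success probability that rises with repetition. Because the delta-method interval is symmetric, it can extend outside [0, 1] for probabilities near the boundaries; building the interval on the linear predictor scale and transforming it is a common alternative.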