• 제목/요약/키워드: Statistical learning

검색결과 1,329건 처리시간 0.019초

A Co-Evolutionary Computing for Statistical Learning Theory

  • Jun Sung-Hae
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제5권4호
    • /
    • pp.281-285
    • /
    • 2005
  • Learning and evolving are two basics for data mining. As compared with classical learning theory based on objective function with minimizing training errors, the recently evolutionary computing has had an efficient approach for constructing optimal model without the minimizing training errors. The global search of evolutionary computing in solution space can settle the local optima problems of learning models. In this research, combining co-evolving algorithm into statistical learning theory, we propose an co-evolutionary computing for statistical learning theory for overcoming local optima problems of statistical learning theory. We apply proposed model to classification and prediction problems of the learning. In the experimental results, we verify the improved performance of our model using the data sets from UCI machine learning repository and KDD Cup 2000.

The Role of Distributional Cues in the Acquisition of Verb Argument Structures

  • Kim, Mee-Sook
    • 한국언어정보학회지:언어와정보
    • /
    • 제7권1호
    • /
    • pp.87-99
    • /
    • 2003
  • This paper investigates the role of input frequency in the acquisition of verb argument structures based on distributional information of a corpus of utterances derived from the English CHILDES database (MacWhinney 1993). It has been widely accepted that children successfully learn verb argument structures by innate language mechanisms, such as linking rules which connect verb meanings and its syntactic structures. In contrast, an approach to language acquisition called “statistical language learning” has currently claimed that children could succeed in acquiring syntactic structures in the absence of innate language mechanisms, making use of distributional properties of the input. In this paper, I evaluate the feasibility of the statistical learning in acquiring verb argument structures, based on distributional information about locative verbs in parental input. The naturalistic data allow us to investigate to what extent the statistical learning approach can and cannot help children succeed in learning the syntax of locative verbs. Based on the results of English database analysis, I show that there is rich statistical information for learning the syntactic possibilities of locative verbs in parental input, despite some limitations in the statistical learning approach.

  • PDF

Development of a Dynamic Geometry Environment to Collect Learning History Data

  • Mun, Kill-Sung;Han, Beom-Soo;Han, Kyung-Soo;Ahn, Jeong-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • 제18권2호
    • /
    • pp.375-384
    • /
    • 2007
  • As teachings that use the ICT are more popular, many studies on the dynamic geometry environment(DGE) are under way. An important factor emphasized in the studies is to practical use learning activities of learners. In this study, we first define the learning history data in DGE. Second we develop a prototype of the DGE that is able to collect and analyze the learning history data automatically. The environment enables not only to grasp leaning history but also to create and manage new learning objects.

  • PDF

예측모형의 머신러닝 방법론과 통계학적 방법론의 비교: 영상의학 연구에서의 적용 (Machine Learning vs. Statistical Model for Prediction Modelling: Application in Medical Imaging Research)

  • 유리하;한경화
    • 대한영상의학회지
    • /
    • 제83권6호
    • /
    • pp.1219-1228
    • /
    • 2022
  • 최근 영상의학 연구 분야에서 영상 인자를 포함한 임상 예측 모형의 수요가 증가하고 있고, 특히 라디오믹스 연구가 활발하게 이루어지면서 기존의 전통적인 회귀 모형뿐만 아니라 머신러닝을 사용하는 연구들이 많아지고 있다. 본 종설에서는 영상의학 분야에서 예측 모형 연구에 사용된 통계학적 방법과 머신 러닝 방법들을 조사하여 정리하고, 각 방법론에 대한 설명과 장단점을 살펴보고자 한다. 마지막으로 예측 모형 연구에서 분석 방법 선택에서의 고려사항을 정리해 보고자 한다.

Virtual Learning Environments for Statistics Education and Applications for Official Statistics

  • Mittag Hans-Joachim
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2004년도 학술발표논문집
    • /
    • pp.307-312
    • /
    • 2004
  • In our fast-moving information and knowledge society, skills and know-how rapidly become outdated. Virtual learning environments play a key role in meeting today's growing demand for customized educational and vocational training and lift-long teaming. The scope of multimedia-based and web-supported education is illustrated by means of an interdisciplinary multimedia project 'New Statistics' funded by the German government. The project output contains more than 70 learning modules covering the complete curriculum of an introductory statistics course. All modules are based on a statistical laboratory and on a multitude of Java applets, animations and case studies. The paper focuses on presenting the statistical laboratory and the applets. These components present the main project pillars and are particularly suitable for international use, independently from the original project framework. This article also demonstrates the application of Java applets and other multimedia developments from the educational world to official statistics for interactive presentation of statistical information.

  • PDF

Application of data mining and statistical measurement of agricultural high-quality development

  • Yan Zhou
    • Advances in nano research
    • /
    • 제14권3호
    • /
    • pp.225-234
    • /
    • 2023
  • In this study, we aim to use big data resources and statistical analysis to obtain a reliable instruction to reach high-quality and high yield agricultural yields. In this regard, soil type data, raining and temperature data as well as wheat production in each year are collected for a specific region. Using statistical methodology, the acquired data was cleaned to remove incomplete and defective data. Afterwards, using several classification methods in machine learning we tried to distinguish between different factors and their influence on the final crop yields. Comparing the proposed models' prediction using statistical quantities correlation factor and mean squared error between predicted values of the crop yield and actual values the efficacy of machine learning methods is discussed. The results of the analysis show high accuracy of machine learning methods in the prediction of the crop yields. Moreover, it is indicated that the random forest (RF) classification approach provides best results among other classification methods utilized in this study.

Fostering Students' Statistical Thinking through Data Modelling

  • Ken W. Li
    • 한국수학교육학회지시리즈D:수학교육연구
    • /
    • 제26권3호
    • /
    • pp.127-146
    • /
    • 2023
  • Statistical thinking has a broad definition but focuses on the context of regression modelling in the present study. To foster students' statistical thinking within the context, teaching should no longer be seen as transfer of knowledge from teacher to students but as a process of engaging with learning activities in which they develop ownership of knowledge. This study aims at collaborative learning contexts; students were divided into small groups in order to increase opportunities for peer collaboration. Each group of students was asked to do a regression project after class. Through doing the project, they learnt to organize and connect previously accrued piecemeal statistical knowledge in an integrated manner. They could also clarify misunderstandings and solve problems through verbal exchanges among themselves. They gave a clear and lucid account of the model they had built and showed collaborative interactions when presenting their projects in front of class. A survey was conducted to solicit their feedback on how peer collaboration would facilitate learning of statistics. Almost all students found their interaction with their peers productive; they focused on the development of statistical thinking with concerted effort.

통계학 용어의 증보 (Statistical terms in Korean)

  • 허명회
    • 응용통계연구
    • /
    • 제34권4호
    • /
    • pp.575-578
    • /
    • 2021
  • 통계학 용어의 국문화에 관련하여 1980년대 이래 한국통계학회의 활동을 돌아보고 2000년 이래 대두된 새 용어들을 제안한다. 기계학습과 관련된 통계학 용어가 속히 정립되어야 하고 전통적 용어들에 대하여도 지속적인 업데이트가 필요하다.

스마트교육 연구동향에 대한 분석 연구 (A Study on the Research Trends of Smart Learning)

  • 김향화;오동인;허균
    • 수산해양교육연구
    • /
    • 제26권1호
    • /
    • pp.156-165
    • /
    • 2014
  • The purpose of this study was to find research trends of smart learning. For this, we identified the research's characteristics such as the subject or keyword of research, method, data collection, and statistical analysis method. The 2,865 articles published from 1995 to 2013 were gathered from five Korean academic journals related to smart learning. Among them, research keyword, areas, research method, data collection method, and statistical analysis method were analyzed on 596 papers. The findings of this study were as follows: (a) Smart learning papers such keyword likes u-learning, m-learning, and smart-learning were emerging after 2006. Smart learning papers with ICT related topics were highly increased after 2000, but they were decreased after 2006. Smart learning papers with e-learning related keywords were steadily increased after 2000 through 2013. (b) The research field of deign had the highest portion in smart learning research, but managing had the lowest portion. (c) Development was mainly used as a research method. Both questionnaire and experiment were mainly used for collecting data methods. T-test and frequency analysis were mainly used as statistical analysis methods.

이항 반응 자료에 대한 학습곡선의 모형화 (Statistical Modeling of Learning Curves with Binary Response Data)

  • 이슬지;박만식
    • Communications for Statistical Applications and Methods
    • /
    • 제19권3호
    • /
    • pp.433-450
    • /
    • 2012
  • 연구자가 같은 작업을 반복적으로 수행할 때, 작업 효율성은 연구에 관련된 지식, 경험, 기술이 축적되면서 향상된다. 결과를 얻기 위해 연구에 투자하는 시간은 같은 작업을 반복함으로써 줄일 수 있다. 이러한 현상을 학습곡선 효과(learning curve effect)라고 일컫는다. 학습곡선(learning curves)은 학습의 변화를 시각적으로 나타낸 것으로 이전의 학습곡선 연구에서는 시간을 일정한 구간으로 나누어 구간별 작업에 대한 숙련도의 평균 차이 여부를 확인하였다. 이러한 방법은 구간을 어떻게 나눌 것인가 하는 기준이 존재하지 않으며, 더욱이 이항 반응 자료로 모형을 적합하기 어려운 문제점을 가지고 있다. 본 연구에서는 이산형 확률변수 중 이항 반응 자료(베르누이자료)에 대한 학습곡선의 통계적 모형에 초점을 맞추고자 한다. 누적확률분포의 특성을 이용하여 모수를 추정하기 위해서 뉴튼-랩슨 방법(Newton-Raphson method)을 사용하였고, 이 연구에서 제안한 모형의 점근적 분포를 구하였다.