• Title/Summary/Keyword: educational data mining

Characterizing Patterns of Experience of Harmful Shops among Adolescents Using Decision Tree Models (데이터마이닝을 이용한 청소년 유해업소 출입경험에 영향을 주는 요인)

  • Sohn, Aeree
    • Korean Journal of Health Education and Promotion / v.31 no.3 / pp.15-26 / 2014
  • Objective: This study was conducted to explore a predictive model of middle and high school students' experience of visiting harmful shops. Methods: The survey was conducted online using a self-administered questionnaire on the homepage of the education ministry's student health information center. Participants were 1,888 middle school students and 1,563 high school students from 107 schools in Korea. The collected data were processed using the SPSS Classification Trees 18.0 program and examined using a data mining decision tree model. Results: In this study, 6.9% of all subjects were found to have visited harmful sex-industry establishments and 81.8% to have visited harmful game establishments. Smoking, living with parents, and school grade were significant predictors of experience of harmful sex-industry establishments. Perceived disruption of study, drinking, living with parents, stress, and satisfaction with school life were significant predictors of experience of harmful game establishments. Conclusions: These results suggest that educational approaches tailored to these conditions should be developed to prevent access to harmful shops.

Analysis of the Current Status of Edutech in Korean Language Education

  • JinHee KIM;HoSung WOO
    • Fourth Industrial Review / v.3 no.2 / pp.11-17 / 2023
  • Purpose - Interest in edutech has recently increased in the field of language education because COVID-19 made classroom teaching difficult. Accordingly, we analyze research topics related to e-learning before and after COVID-19 and examine the implications for the future of Korean language education. Research design, data, and methodology - This study compiled a list of papers to be analyzed by searching RISS for e-learning terms applicable to Korean language education. The collected data were converted to electronic documents, keywords were extracted using text mining techniques, word frequencies were checked, and the results were then viewed through cloud visualization. Result - Research on e-learning in the field of Korean language education increased rapidly in 2021 and 2022. In particular, extensive research on online learning methods has been actively conducted because of the difficulties of face-to-face learning in the COVID-19 era. There have been many studies on teaching and learning methods such as flipped learning, hybrid learning, blended learning, mobile learning, and smart learning. Conclusion - Research so far has mainly focused on online class management methods. Future research should therefore work to develop educational content and teaching methods that use specific ICT technologies. These efforts will contribute to advancing the smart education that future education aims for.
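
The keyword-extraction and word-frequency step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the stop-word list, the `keyword_frequencies` helper, and the sample titles are all hypothetical, and the tokenizer only handles lowercase ASCII words.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real study would use a fuller, language-specific one.
STOP_WORDS = {"a", "an", "and", "the", "of", "in", "for", "on", "to", "with"}

def keyword_frequencies(titles, top_n=5):
    """Count how often each non-stop-word token appears across all titles."""
    counts = Counter()
    for title in titles:
        tokens = re.findall(r"[a-z]+", title.lower())
        counts.update(t for t in tokens if t not in STOP_WORDS)
    return counts.most_common(top_n)

# Made-up titles for illustration only, not data from the study.
sample_titles = [
    "Flipped learning in online Korean classes",
    "Blended learning for Korean language learners",
    "Mobile learning and online feedback",
]
top_keywords = keyword_frequencies(sample_titles, top_n=3)
print(top_keywords)
```

The resulting (word, count) pairs are the kind of input a word-cloud renderer would take, with the count controlling the size of each word.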

Stock News Dataset Quality Assessment by Evaluating the Data Distribution and the Sentiment Prediction

  • Alasmari, Eman;Hamdy, Mohamed;Alyoubi, Khaled H.;Alotaibi, Fahd Saleh
    • International Journal of Computer Science & Network Security / v.22 no.2 / pp.1-8 / 2022
  • This work provides a reliable, classified stock dataset merged with Saudi stock news. This dataset allows researchers to analyze and better understand the realities, impacts, and relationships between stock news and stock fluctuations. The data were collected from the Saudi stock market via the Corporate News (CN) and Historical Data Stocks (HDS) datasets. As their names suggest, CN contains news, and HDS provides information on how stock values change over time. Both datasets cover the period from 2011 to 2019, have 30,098 rows, and have 16 variables, four of which they share and 12 of which differ. The combined dataset presented here therefore includes 30,098 published news pieces and information about stock fluctuations across nine years. Stock news polarity has been interpreted in various ways by native Arabic speakers associated with the stock domain, so this polarity was categorized manually based on Arabic semantics. As the Saudi stock market contributes substantially to the international economy, this dataset is essential for stock investors and analysts. The dataset has been prepared for educational and scientific purposes, motivated by the scarcity of data describing the impact of Saudi stock news on stock activity. It will, therefore, be useful across many sectors, including stock market analytics, data mining, statistics, machine learning, and deep learning. The data were evaluated by testing the distribution of the data over classes and the accuracy of sentiment prediction. The results show that the distribution of polarity over sectors can be considered balanced. An NB model was developed to evaluate data quality based on sentiment classification, and it achieved 68% accuracy, which the authors take as evidence of the data's reliability. The authors conclude that the evaluation results support the dataset's reliability, readiness, and quality.

Analysis of the Core Concepts of Middle School Informatics Textbook Using Big Data Analysis Techniques (빅데이터 분석 방법을 이용한 중학교 정보 교과서 핵심 개념 분석)

  • Woon, Daewoong;Choe, Hyunjong
    • Journal of Creative Information Culture / v.5 no.2 / pp.157-164 / 2019
  • Big data has recently been utilized and developed in many areas of our society. Big data analysis techniques are frequently used in politics, the economy, and society to uncover meanings hidden in the data. In education, big data analysis has been used in some case studies that analyze educational data, but analysis of the curriculum and its direction is still inadequate. Therefore, this study aims to identify and analyze the core concepts of middle school informatics textbooks using big data analysis techniques. Text mining was used as the big data analysis method for the textbook analysis. Through the core concepts of middle school informatics textbooks identified using these techniques, we could confirm the concepts to be emphasized in the textbooks and the possibility of using big data in the field of education.

Analysis of Characteristics of Clusters of Middle School Students Using K-Means Cluster Analysis (K-평균 군집분석을 활용한 중학생의 군집화 및 특성 분석)

  • Jaebong, Lee
    • Journal of The Korean Association For Science Education / v.42 no.6 / pp.611-619 / 2022
  • The purpose of this study is to explore the possibility of applying big data analysis to provide appropriate feedback to students using evaluation data in science education, at a time when interest in educational data mining has increased. We use the evaluation data of 2,576 students who answered 24 questions of the national assessment of educational achievement, and we use K-means cluster analysis, an unsupervised machine learning method, for clustering. As a result of clustering, students were divided into six clusters. Middle-ranking students are divided into more clusters than upper-ranking or lower-ranking students. According to the results of the cluster analysis, the most important factor influencing clustering is academic achievement, and each cluster shows different characteristics in terms of content domains, subject competencies, and affective characteristics. Among the affective domains, learning motivation is important in the lower-ranking achievement cluster, and in terms of subject competencies, scientific inquiry and problem-solving competency as well as scientific communication competency have a major influence. In the content domain, achievement in motion and energy and in matter is an important factor in distinguishing the characteristics of the clusters. As a result, we can provide students with customized feedback for learning based on the characteristics of each cluster. We discuss implications of these results for science education, such as the possible uses of the study results, balanced learning across content domains, enhancement of subject competency, and improvement of scientific attitude.
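
As a rough sketch of the clustering technique named in this abstract, the following implements plain K-means (Lloyd's algorithm) on toy data. Everything here is an assumption made for illustration: the `kmeans` helper, the (achievement score, motivation score) pairs, the choice of k=3 rather than the six clusters the study reports, and the simplification of starting the centroids at the first k points.

```python
import math

def kmeans(points, k, iterations=20):
    """Plain Lloyd's algorithm; centroids start at the first k points (a simplification)."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels, centroids

# Made-up (achievement score, motivation score) pairs, not data from the study.
students = [
    (92, 4.5), (60, 3.0), (35, 2.0), (88, 4.0),
    (40, 2.5), (95, 4.8), (58, 3.2), (38, 1.8),
]
labels, centroids = kmeans(students, k=3)
print(labels)
print(centroids)
```

Because the features are on very different scales, the achievement score dominates the distance here; a real analysis would normally standardize the features first and try several initializations.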

An Exploratory Study of e-Learning Satisfaction: A Mixed Methods of Text Mining and Interview Approaches (이러닝 만족도 증진을 위한 탐색적 연구: 텍스트 마이닝과 인터뷰 혼합방법론)

  • Sun-Gyu Lee;Soobin Choi;Hee-Woong Kim
    • Information Systems Review / v.21 no.1 / pp.39-59 / 2019
  • E-learning has improved educational effectiveness by making it possible to learn anytime and anywhere, moving away from traditional rote instruction. As the use of e-learning systems grows with the popularity of e-learning, it has become important to measure e-learning satisfaction. In this study, we used a mixed research method, which combines qualitative and quantitative research, to identify the satisfaction factors of e-learning. For the quantitative research, we collected reviews from Udemy.com by text mining. We then classified lectures into high-rated and low-rated groups and applied topic modeling to derive factors from the reviews. For the qualitative research, we conducted in-depth one-to-one interviews with e-learning learners. By combining these results, we were able to derive factors of e-learning satisfaction and dissatisfaction. Based on these factors, we suggested ways to improve e-learning satisfaction. Whereas past research was mainly survey-based, this study collects actual data by text mining. The academic significance of this study is that the results of the topic modeling are combined with factors based on the information system success model.

Development of newly recruited privates on-the-job Training Achievements Group Classification Model (신병 주특기교육 성취집단 예측모형 개발)

  • Kwak, Ki-Hyo;Suh, Yong-Moo
    • Journal of the military operations research society of Korea / v.33 no.2 / pp.101-113 / 2007
  • The period of military service will be phased down by 2014 according to 'The law of National Defense Reformation' issued by the Ministry of National Defense. For this reason, the ROK army provides differentiated education to newly recruited privates for more effective individual performance in on-the-job training. For the training to be more effective, it is essential to predict the degree of achievement of new privates in the training. Thus, we used data mining techniques to develop a classification model that classifies new privates into one of two achievement groups, so that different educational methods can be applied to each group. The target variable for this model is a binary variable whose value is either 'a group of general control' or 'a group of special control'. We developed four pure classification models using Neural Network, Decision Tree, Support Vector Machine, and Naive Bayesian. We also built four hybrid models, each of which combines the k-means clustering algorithm with one of these four mining techniques. Experimental results demonstrated that the best-performing model was the hybrid of k-means and Neural Network. We expect that these classification models could support various military education programs and improve educational performance.

Text Mining-Based Emerging Trend Analysis for e-Learning Contents Targeting for CEO (텍스트마이닝을 통한 최고경영자 대상 이러닝 콘텐츠 트렌드 분석)

  • Kyung-Hoon Kim;Myungsin Chae;Byungtae Lee
    • Information Systems Review / v.19 no.2 / pp.1-19 / 2017
  • Original scripts of e-learning lectures for the CEOs of corporation S were analyzed using topic analysis, a text mining method. Twenty-two topics were extracted based on keywords chosen from five years of records, from 2011 to 2015. Research analysis was then conducted on various issues. Promising topics were selected through evaluation and element analysis of the members of each topic. In management and economics, members demonstrated high satisfaction and interest toward topics in marketing strategy, human resource management, and communication. In the humanities, philosophy, history of war, and history drew high interest and satisfaction, whereas in lifestyle, mind health drew high interest and satisfaction. Studies were also conducted to identify topics on the proportion of content, but these studies failed to increase member satisfaction. In the field of IT, educational content responds sensitively to changing times, but it may not increase the interest and satisfaction of members. The present study found that content production for CEOs should draw out deep implications for value innovation through technology application instead of stopping at the delivery of technical information. Previous studies classified content superficially, based on the name of the content program, when analyzing the status of content operation. Text mining, however, can derive deep content and subject classifications from the content of unstructured script data. This approach can reveal current shortages and necessary fields if the service content of the themes is displayed by year. This study was based on data obtained from influential e-learning companies in Korea. Obtaining practical results was difficult because data were not acquired from portal sites or social networking services. E-learning content trends for CEOs were analyzed. Data analysis was also conducted on the intellectual interests of CEOs in each field.

Working in a Risky Environment: Coping and Risk Handling Strategies Among Small-scale Miners in Ghana

  • Wireko-Gyebi, Rejoice Selorm;Arhin, Albert Abraham;Braimah, Imoro;King, Rudith Sylvana;Lykke, Anne Mette
    • Safety and Health at Work / v.13 no.2 / pp.163-169 / 2022
  • Background: It is estimated that about 13 million artisanal and small-scale miners carry out their activities under harsh, precarious, unfriendly, and risky conditions. Yet our understanding of the extent to which these workers use personal protective equipment (PPE) and navigate the various risks and hazards they face is still limited. This article has two main objectives. First, it explores the extent of PPE usage among artisanal and small-scale miners for the prevention of hazards and risks. Second, it examines the coping strategies used by these miners in response to experiences of occupational injuries and risks. Methods: A cross-sectional survey of small-scale miners was conducted in six communities across three districts in Ghana, West Africa. A mixed methods approach was adopted. A total of 148 small-scale miners participated in the study. Six focus group discussions (FGDs) were held across the six communities. The data were analysed using descriptive statistics. Chi-square tests were used to analyse the relationship between some socio-demographic characteristics (sex, age, and educational background) and the usage of PPE. Open-ended questions and responses from FGDs were analysed based on the content and verbatim quotations from miners. Results: Findings suggest that 78% of the miners interviewed do not use the appropriate PPE, citing reasons such as cost and the personal discomfort associated with using PPE. There was no significant relationship between socio-demographic characteristics (i.e., sex, age, education, and major mining activity) and the usage of PPE. The study further revealed four main coping strategies used by miners to handle the risks: rest, taking unprescribed medication and hard drugs, registration with a health insurance scheme, and savings and investments. Conclusion: This study shows that very few artisanal miners use PPE despite the significant hazards and risks to which they are exposed. The study recommends that the government put in place measures to ensure that miners adhere to health and safety regulations before undertaking mining activities. This means that health and safety plans and the use of PPE should be linked to the license acquisition process for miners.
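
The chi-square test of independence mentioned in the Methods can be illustrated with a small sketch. The `chi_square_independence` helper and the 2x2 counts below are hypothetical and chosen only for illustration; they are not the study's data. The closed-form p-value used here holds only for one degree of freedom.

```python
import math

def chi_square_independence(table):
    """Pearson chi-square statistic and degrees of freedom for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of the row and column variables.
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof

# Made-up counts: rows = sex (male, female), columns = (uses PPE, does not use PPE).
observed = [[20, 80], [12, 36]]
statistic, dof = chi_square_independence(observed)

# With 1 degree of freedom, the p-value equals erfc(sqrt(statistic / 2)).
p_value = math.erfc(math.sqrt(statistic / 2))
print(statistic, dof, p_value)
```

A large p-value, as with these toy counts, means the data give no evidence of a relationship between the two variables, which is the kind of result the abstract reports for socio-demographic characteristics and PPE usage.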

A Delphi Study on Competencies of Future Green Architectural Engineer (근미래 친환경 건축분야 엔지니어에게 필요한 역량에 대한 델파이 연구)

  • Kang, So Yeon;Kim, Taeyeon;Lee, Jungwoo
    • Journal of Engineering Education Research / v.21 no.3 / pp.56-65 / 2018
  • With the rapid advance of technologies, including information and communication technologies, jobs are evolving faster than ever. Architectural engineering is no exception, and green architectural engineering is emerging fast as a promising new field. In this study, a Delphi study of expert architectural engineers was conducted to identify (1) near-future prospects of the field, (2) near-future emerging jobs, (3) competencies needed for these jobs, and (4) educational content necessary to build these competencies in green architectural engineering. An initial Delphi survey consisting of open-ended questions in these four areas was conducted and yielded 65 items after duplicate removal and semantic refinement. Further refinement through second and third Delphi rounds resulted in 40 items that the 13 architectural engineering experts largely agree upon as future prospects for green architectural engineering. Findings indicate that the demand for green architectural engineering and the need for automatic energy control systems are expected to increase. Collaboration with other fields is also becoming more and more important in green architectural engineering. Professional work management skills such as knowledge convergence, problem solving, collaboration, and creativity in linking components from various related areas also appear to be increasingly needed. The critical skills for the near future are found to be building environment control techniques (thermal, light, sound, and air), data processing techniques such as data mining, energy monitoring, and the control and utilization of environmental analysis software. Experts also agree that a new curriculum for green building architecture should be developed, with more converging subjects across disciplines, to provide future-ready professional skills and experience. Major topics to be covered in the near future include building environment studies, building energy management, energy reduction systems, indoor air quality, global environment and natural phenomena, and machinery and electrical facilities. The architectural engineering community should be concerned with building up the competencies identified in this Delphi study in preparation for a fast-advancing future.