• Title/Summary/Keyword: educational data mining

Search Result 65, Processing Time 0.022 seconds

Characterizing Patterns of Experience of Harmful Shops among Adolescents Using Decision Tree Models (데이터마이닝을 이용한 청소년 유해업소 출입경험에 영향을 주는 요인)

  • Sohn, Aeree
    • Korean Journal of Health Education and Promotion
    • /
    • v.31 no.3
    • /
    • pp.15-26
    • /
    • 2014
  • Objective: This study was conducted in order to explore the predictive model of the experience of harmful shops in middle and high school students. Methods: The survey was conducted using a self-administered questionnaire method online via the homepage of the education ministry's student health information center. Participants were 1,888 middle school students and 1,563 high school students from 107 schools in Korea. The collected data were processed using the SPSS classification trees 18.0 program and examined using data mining decision tree model. Results: In this study, 6.9% of all subjects were found to have been to sex industry harmful place and 81.8% game place. The results revealed that smoking, living with parents, and school grade were significant predictors for experience of sex industry harmful place. The perception of study disrupts, drinking, living with parents, stress, and satisfaction of school life were significant predictors for experience of game harmful place. Conclusions: These results suggest that an educational approach should be developed by tailored conditions to prevent the access to harmful shops.

Analysis of the Current Status of Edutech in Korean Language Education

  • JinHee KIM;HoSung WOO
    • Fourth Industrial Review
    • /
    • v.3 no.2
    • /
    • pp.11-17
    • /
    • 2023
  • Purpose - Recently, in the field of language education, interest in edutech has increased due to difficulties in classroom teaching due to COVID-19. Accordingly, we would like to analyze research topics related to e-learning before and after COVID-19 and examine the implications for the future Korean language education field. Research design, data, and methodology - This study organized a list of papers to be analyzed by searching for e-learning terms applicable to Korean language education in RISS. The collected data was electronically documented, keywords were extracted using text mining techniques, and word frequencies were checked, and then viewed through cloud visualization. Result - It was confirmed that research on e-learning in the field of Korean language education has increased rapidly in 2021 and 2022. In particular, extensive research on online learning methods has been actively conducted due to the difficulties of face-to-face learning in the COVID-19 era. There have been many studies on teaching and learning methods, such as flipped learning, hybrid learning, blended learning, mobile learning, and smart learning. Conclusion - Since the research so far has mainly focused on online class management methods. Therefore, future research suggests that efforts should be made to develop educational contents and teaching methods using specific ICT technologies. These efforts will contribute to advancing smart education that future education aims for.

Stock News Dataset Quality Assessment by Evaluating the Data Distribution and the Sentiment Prediction

  • Alasmari, Eman;Hamdy, Mohamed;Alyoubi, Khaled H.;Alotaibi, Fahd Saleh
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.2
    • /
    • pp.1-8
    • /
    • 2022
  • This work provides a reliable and classified stocks dataset merged with Saudi stock news. This dataset allows researchers to analyze and better understand the realities, impacts, and relationships between stock news and stock fluctuations. The data were collected from the Saudi stock market via the Corporate News (CN) and Historical Data Stocks (HDS) datasets. As their names suggest, CN contains news, and HDS provides information concerning how stock values change over time. Both datasets cover the period from 2011 to 2019, have 30,098 rows, and have 16 variables-four of which they share and 12 of which differ. Therefore, the combined dataset presented here includes 30,098 published news pieces and information about stock fluctuations across nine years. Stock news polarity has been interpreted in various ways by native Arabic speakers associated with the stock domain. Therefore, this polarity was categorized manually based on Arabic semantics. As the Saudi stock market massively contributes to the international economy, this dataset is essential for stock investors and analyzers. The dataset has been prepared for educational and scientific purposes, motivated by the scarcity of data describing the impact of Saudi stock news on stock activities. It will, therefore, be useful across many sectors, including stock market analytics, data mining, statistics, machine learning, and deep learning. The data evaluation is applied by testing the data distribution of the categories and the sentiment prediction-the data distribution over classes and sentiment prediction accuracy. The results show that the data distribution of the polarity over sectors is considered a balanced distribution. The NB model is developed to evaluate the data quality based on sentiment classification, proving the data reliability by achieving 68% accuracy. So, the data evaluation results ensure dataset reliability, readiness, and high quality for any usage.

Analysis of the Core Concepts of Middle School Informatics Textbook Using Big Data Analysis Techniques (빅데이터 분석 방법을 이용한 중학교 정보 교과서 핵심 개념 분석)

  • Woon, Daewoong;Choe, Hyunjong
    • Journal of Creative Information Culture
    • /
    • v.5 no.2
    • /
    • pp.157-164
    • /
    • 2019
  • Big data is a field that has been utilized and developed in various fields in our society recently. Big data analysis techniques are frequently used to analyze various big data in various fields of politics, economy, and society to grasp various meanings hidden in the data. However, big data analysis is used some case studies of in fields of analysis of educational data, but analysis of the curriculum and direction is still inadequate. Therefore, this study aims to identify and analyze the core concepts of middle school informatics textbooks using big data analysis techniques. Text mining was used for big data analysis for informatics textbook analysis. Through the core concepts of middle school informatics textbooks identified using this techniques, we could confirm the concepts to be emphasized in the textbooks and the possibility of using big data in the field of education.

Analysis of Characteristics of Clusters of Middle School Students Using K-Means Cluster Analysis (K-평균 군집분석을 활용한 중학생의 군집화 및 특성 분석)

  • Jaebong, Lee
    • Journal of The Korean Association For Science Education
    • /
    • v.42 no.6
    • /
    • pp.611-619
    • /
    • 2022
  • The purpose of this study is to explore the possibility of applying big data analysis to provide appropriate feedback to students using evaluation data in science education at a time when interest in educational data mining has recently increased in education. In this study, we use the evaluation data of 2,576 students who took 24 questions of the national assessment of educational achievement. And we use K-means cluster analysis as a method of unsupervised machine learning for clustering. As a result of clustering, students were divided into six clusters. The middle-ranking students are divided into various clusters when compared to upper or lower ranks. According to the results of the cluster analysis, the most important factor influencing clusterization is academic achievement, and each cluster shows different characteristics in terms of content domains, subject competencies, and affective characteristics. Learning motivation is important among the affective domains in the lower-ranking achievement cluster, and scientific inquiry and problem-solving competency, as well as scientific communication competency have a major influence in terms of subject competencies. In the content domain, achievement of motion and energy and matter are important factors to distinguish the characteristics of the cluster. As a result, we can provide students with customized feedback for learning based on the characteristics of each cluster. We discuss implications of these results for science education, such as the possibility of using this study results, balanced learning by content domains, enhancement of subject competency, and improvement of scientific attitude.

Development of newly recruited privates on-the-job Training Achievements Group Classification Model (신병 주특기교육 성취집단 예측모형 개발)

  • Kwak, Ki-Hyo;Suh, Yong-Moo
    • Journal of the military operations research society of Korea
    • /
    • v.33 no.2
    • /
    • pp.101-113
    • /
    • 2007
  • The period of military personnel service will be phased down by 2014 according to 'The law of National Defense Reformation' issued by the Ministry of National Defense. For this reason, the ROK army provides discrimination education to 'newly recruited privates' for more effective individual performance in the on-the-job training. For the training to be more effective, it would be essential to predict the degree of achievements by new privates in the training. Thus, we used data mining techniques to develop a classification model which classifies the new privates into one of two achievements groups, so that different skills of education are applied to each group. The target variable for this model is a binary variable, whose value can be either 'a group of general control' or 'a group of special control'. We developed four pure classification models using Neural Network, Decision Tree, Support Vector Machine and Naive Bayesian. We also built four hybrid models, each of which combines k-means clustering algorithm with one of these four mining technique. Experimental results demonstrated that the highest performance model was the hybrid model of k-means and Neural Network. We expect that various military education programs could be supported by these classification models for better educational performance.

Working in a Risky Environment: Coping and Risk Handling Strategies Among Small-scale Miners in Ghana

  • Wireko-Gyebi, Rejoice Selorm;Arhin, Albert Abraham;Braimah, Imoro;King, Rudith Sylvana;Lykke, Anne Mette
    • Safety and Health at Work
    • /
    • v.13 no.2
    • /
    • pp.163-169
    • /
    • 2022
  • Background: It is estimated that about 13 million artisanal and small-scale miners carry out their activities under harsh, precarious, unfriendly, and risky conditions. Yet, our understanding of the extent to which these workers use personal protective equipment (PPE) and navigate through the various risks and hazards they face is still limited. This article has two main objectives. First, it explores the extent of usage of PPE among artisanal and small-scale miners for the prevention of hazards and risks. Second, it examines the coping strategies used by these miners as a response to experiences of occupational injuries and risks Methods: A cross-sectional survey of small-scale miners was conducted in six communities across three districts in Ghana, West Africa. The mixed methods approach was adopted. A total of 148 small-scale miners participated in the study. Six focus group discussions (FGDs) were held across the six communities. The data were analysed using descriptive statistics. Chi-square tests were used to analyse the relationship between some socio-demographic characteristics (sex, age, and educational background) and the usage of PPE. Open-ended questions and responses from FGDs were analysed based on the content and verbatim quotations from miners. Results: Findings suggest that 78% of the miners interviewed do not use the appropriate PPE citing reasons such as cost, and their personal discomfort associated with use of PPE. There was no significant relationship between socio-demographic characteristics (i.e., sex, age, education and major mining activity) and the usage of PPE. The study further revealed four main coping strategies used by miners to handle the risks. These are rest, taking unprescribed medication and hard drugs, registration with health insurance scheme and savings and investments. Conclusion: This study shows that very few artisanal miners use PPE despite the significant hazards and risks to which they are exposed. The study recommends to the government to put in place measures to ensure that miners adhere to health and safety regulations before undertaking mining activities. This means that health and safety plans and use of PPE should be linked to the license acquisition process for miners.

A Delphi Study on Competencies of Future Green Architectural Engineer (근미래 친환경 건축분야 엔지니어에게 필요한 역량에 대한 델파이 연구)

  • Kang, So Yeon;Kim, Taeyeon;Lee, Jungwoo
    • Journal of Engineering Education Research
    • /
    • v.21 no.3
    • /
    • pp.56-65
    • /
    • 2018
  • With rapid advance of technologies including information and communication technologies, jobs are evolving faster than ever. Architectural engineering is no exception in this regard, and the green architectural engineering is emerging fast as a promising new field. In this study, a Delphi study of expert architectural engineers are conducted to find out (1) near future prospects of the field, (2) near future emerging jobs, (3) competencies needed for these jobs, and (4) educational content necessary to build these competencies with regards to the green architectural engineering. Initial Delphi survey consisting of open-ended questions in the above four areas were conducted and came out with 65 items after duplicate removal and semantic refinements. Further refinements via second and third wave of Delphi results into 40 items that the 13 architectural engineering experts may largely agree upon as future prospects with regards to the green architectural engineering. Findings indicate that it is expected that the demand for green architectural engineering and needs for automatic energy control system increase. Also, collaborations with other fields is becoming more and more important in green architectural engineering. The professional work management skills such as knowledge convergence, problem solving, collaboration skills, and creativity linking components from various related areas seem to also be on the increasing need. Near future ready critical skills are found to be the building environment control techniques (thermal, light, sound, and air), the data processing techniques like data mining, energy monitoring, and the control and utilization of environmental analysis software. Experts also agree on new curriculum for green building architecture to be developed with more of converging subjects across disciplines for future ready professional skills and experiences. Major topics to be covered in the near future includes building environment studies, building energy management, energy reduction systems, indoor air quality, global environment and natural phenomena, and machinery and electrical facility. Architectural engineering community should be concerned with building up the competencies identified in this Delphi preparing for fast advancing future.

Social graph visualization techniques for public data (공공데이터에 적합한 다양한 소셜 그래프 비주얼라이제이션 알고리즘 제안)

  • Lee, Manjai;On, Byung-Won
    • Journal of the HCI Society of Korea
    • /
    • v.10 no.1
    • /
    • pp.5-17
    • /
    • 2015
  • Nowadays various public data have been serviced to the public. Through the opening of public data, the transparency and effectiveness of public policy developed by governments are increased and users can lead to the growth of industry related to public data. Since end-users of using public data are citizens, it is very important for everyone to figure out the meaning of public data using proper visualization techniques. In this work, to indicate the significance of widespread public data, we consider UN voting record as public data in which many people may be interested. In general, it has high utilization value by diplomatic and educational purposes, and is available in public. If we use proper data mining and visualization algorithms, we can get an insight regarding the voting patterns of UN members. To visualize, it is necessary to measure the voting similarity values among UN members and then a social graph is created by the similarity values. Next, using a graph layout algorithm, the social graph is rendered on the screen. If we use the existing method for visualizing the social graph, it is hard to understand the meaning of the social graph because the graph is usually dense. To improve the weak point of the existing social graph visualization, we propose Friend-Matching, Friend-Rival Matching, and Bubble Heap algorithms in this paper. We also validate that our proposed algorithms can improve the quality of visualizing social graphs displayed by the existing method. Finally, our prototype system has been released in http://datalab.kunsan.ac.kr/politiz/un/. Please, see if it is useful in the aspect of public data utilization.

Trend Analysis using Topic Modeling for Simulation Studies (토픽 모델링을 이용한 시뮬레이션 연구 동향 분석)

  • Na, Sang-Tae;Kim, Ja-Hee;Jung, Min-Ho;Ahn, Joo-Eon
    • Journal of the Korea Society for Simulation
    • /
    • v.25 no.3
    • /
    • pp.107-116
    • /
    • 2016
  • The recent diversification in terms of the scope and techniques used for simulations has highlighted the importance of analyzing state of the art trends and applying these for educational and study purposes. While qualitative methods such as literature research or experts' assessments have previously been used, such methods are in fact likely to reflect the subjective viewpoint of experts, and to involve too much time and money for the results obtained. For the purpose of an objective analysis, a quantitative analysis that included the examination of topics found in domestic academic journal articles was conducted in the present study. In this regard, simulation was found to be most actively used domestically in the electrical and electronic fields. In addition, simulation was also found to be employed for the purpose of education and entertainment in the social sciences. The results of this study are expected to help to facilitate the prediction of the direction of the development of not only the Korea Society for Simulation, but also domestic simulation studies. This study also raises the possibility of applying text mining to trend analysis, and proves that it can be a useful method for deriving future key topics and helping experts' decisions regarding quantitative data.