• Title/Summary/Keyword: Multi-learning System

Search Result 620, Processing Time 0.031 seconds

Multi-Dimensional Analysis Method of Product Reviews for Market Insight (마켓 인사이트를 위한 상품 리뷰의 다차원 분석 방안)

  • Park, Jeong Hyun;Lee, Seo Ho;Lim, Gyu Jin;Yeo, Un Yeong;Kim, Jong Woo
    • Journal of Intelligence and Information Systems
    • /
    • v.26 no.2
    • /
    • pp.57-78
    • /
    • 2020
  • With the development of the Internet, consumers have had an opportunity to check product information easily through E-Commerce. Product reviews used in the process of purchasing goods are based on user experience, allowing consumers to engage as producers of information as well as refer to information. This can be a way to increase the efficiency of purchasing decisions from the perspective of consumers, and from the seller's point of view, it can help develop products and strengthen their competitiveness. However, it takes a lot of time and effort to understand the overall assessment and assessment dimensions of the products that I think are important in reading the vast amount of product reviews offered by E-Commerce for the products consumers want to compare. This is because product reviews are unstructured information and it is difficult to read sentiment of reviews and assessment dimension immediately. For example, consumers who want to purchase a laptop would like to check the assessment of comparative products at each dimension, such as performance, weight, delivery, speed, and design. Therefore, in this paper, we would like to propose a method to automatically generate multi-dimensional product assessment scores in product reviews that we would like to compare. The methods presented in this study consist largely of two phases. One is the pre-preparation phase and the second is the individual product scoring phase. In the pre-preparation phase, a dimensioned classification model and a sentiment analysis model are created based on a review of the large category product group review. By combining word embedding and association analysis, the dimensioned classification model complements the limitation that word embedding methods for finding relevance between dimensions and words in existing studies see only the distance of words in sentences. Sentiment analysis models generate CNN models by organizing learning data tagged with positives and negatives on a phrase unit for accurate polarity detection. Through this, the individual product scoring phase applies the models pre-prepared for the phrase unit review. Multi-dimensional assessment scores can be obtained by aggregating them by assessment dimension according to the proportion of reviews organized like this, which are grouped among those that are judged to describe a specific dimension for each phrase. In the experiment of this paper, approximately 260,000 reviews of the large category product group are collected to form a dimensioned classification model and a sentiment analysis model. In addition, reviews of the laptops of S and L companies selling at E-Commerce are collected and used as experimental data, respectively. The dimensioned classification model classified individual product reviews broken down into phrases into six assessment dimensions and combined the existing word embedding method with an association analysis indicating frequency between words and dimensions. As a result of combining word embedding and association analysis, the accuracy of the model increased by 13.7%. The sentiment analysis models could be seen to closely analyze the assessment when they were taught in a phrase unit rather than in sentences. As a result, it was confirmed that the accuracy was 29.4% higher than the sentence-based model. Through this study, both sellers and consumers can expect efficient decision making in purchasing and product development, given that they can make multi-dimensional comparisons of products. In addition, text reviews, which are unstructured data, were transformed into objective values such as frequency and morpheme, and they were analysed together using word embedding and association analysis to improve the objectivity aspects of more precise multi-dimensional analysis and research. This will be an attractive analysis model in terms of not only enabling more effective service deployment during the evolving E-Commerce market and fierce competition, but also satisfying both customers.

Automatic TV Program Recommendation using LDA based Latent Topic Inference (LDA 기반 은닉 토픽 추론을 이용한 TV 프로그램 자동 추천)

  • Kim, Eun-Hui;Pyo, Shin-Jee;Kim, Mun-Churl
    • Journal of Broadcast Engineering
    • /
    • v.17 no.2
    • /
    • pp.270-283
    • /
    • 2012
  • With the advent of multi-channel TV, IPTV and smart TV services, excessive amounts of TV program contents become available at users' sides, which makes it very difficult for TV viewers to easily find and consume their preferred TV programs. Therefore, the service of automatic TV recommendation is an important issue for TV users for future intelligent TV services, which allows to improve access to their preferred TV contents. In this paper, we present a recommendation model based on statistical machine learning using a collaborative filtering concept by taking in account both public and personal preferences on TV program contents. For this, users' preference on TV programs is modeled as a latent topic variable using LDA (Latent Dirichlet Allocation) which is recently applied in various application domains. To apply LDA for TV recommendation appropriately, TV viewers's interested topics is regarded as latent topics in LDA, and asymmetric Dirichlet distribution is applied on the LDA which can reveal the diversity of the TV viewers' interests on topics based on the analysis of the real TV usage history data. The experimental results show that the proposed LDA based TV recommendation method yields average 66.5% with top 5 ranked TV programs in weekly recommendation, average 77.9% precision in bimonthly recommendation with top 5 ranked TV programs for the TV usage history data of similar taste user groups.

Engineering basic competencies level and educatkonal needs analysis of engineering college graduates (공대 졸업생들의 공학기초능력 수준과 교육 요구 분석)

  • Hahm, Seung-Yeon
    • 대한공업교육학회지
    • /
    • v.34 no.1
    • /
    • pp.196-209
    • /
    • 2009
  • The purpose of this study was to analyze engineering basic competencies about the time completed a course and educational needs of engineering college graduates. Survey method using questionnaire was the major research method of this study. A survey of 807 engineering college graduates was carried out. Questionnaire were made of the level of engineering basic competencies of engineering college graduates, its priorities of actual vocation and level of engineering basic competencies of major. Major results of the study, some recommendations for future researches were made as follows: The level of engineering basic competencies of engineering college graduates was 3.3 average(5 full marks). Engineering basic competencies that educational needs were high respectively, were an ability to communicate effectively, an ability to design a system, component, or process to meet desired needs, an ability to function on multi-disciplinary teams, an ability to understand global culture and cooperate internationally, an ability to design and conduct experiment as well as to analyze and interpret data, and an ability to engage in life-long learning.

Media Literacy Education in the Australian Curriculum: Media Art (호주 국가교육과정 예술과목 'Media Art' 에 나타난 미디어 리터러시 교육)

  • Park, Yoo-Shin
    • Cartoon and Animation Studies
    • /
    • s.48
    • /
    • pp.271-310
    • /
    • 2017
  • This paper examines the composition and the content of media art which is an art education subject in a national curriculum of Australia; and discusses implications for Korean education curriculums. Media covered by Media Art subject in Australia are the multi types of general media including TV, movie, video, newspaper, radio, video game, the internet, and mobile media; and their contents. The purpose of ACARA's media art education curriculum is to improve creative use, knowledge, understanding, and technology of communication techniques for multiple purposes and the audiences. Through the Media Art subject, both the students and the community are able to participate in the actual communications with the rich culture surrounding them and to develop the knowledge and understanding of the 5 core concepts of language, technology, system, audience and re-creation while testing the culture. The implication of this study is as the following. ACARA's media art education curriculum has been developed as an independent educational program and has a special significance within Australian education curriculums. Although ACARA's media art education curriculum is formed as an independent subject, it is suggested within the curriculum to instruct in close connection with other subjects upon execution. Its organization and elaborateness in curriculum composition are very effective in terms of the teacher's teaching-learning design and as well as the evaluation. This seems to show a good model of leading media literacy curriculum. ACARA's media art education curriculum can be a great reference in introducing media literacy to Korean national education curriculums.

Development of Sludge Concentration Estimation Method using Neuro-Fuzzy Algorithm (뉴로-퍼지 알고리즘을 이용한 슬러지 농도 추정 기법 개발)

  • Jang, Sang-Bok;Lee, Ho-Hyun;Lee, Dae-Jong;Kweon, Jin-Hee;Chun, Myung-Geun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.25 no.2
    • /
    • pp.119-125
    • /
    • 2015
  • A concentration meter is widely used at purification plants, sewage treatment plants and waste water treatment plants to sort and transfer high concentration sludge and to control the amount of chemical dosage. When the strange substance is contained in the sludge, however, the attenuation of ultrasonic wave could be increased or not be transmitted to the receiver. At that case, the value of concentration meter is higher than the actual density value or vibrated up and down. It has also been difficult to automate the residuals treatment process according to the problems as sludge attachment or damage of a sensor. Multi-beam ultrasonic concentration meter has been developed to solve these problems, but the failure of the ultrasonic beam of a specific concentration measurement value degrade the performance of the entire system. This paper proposes the method to improve the accuracy of sludge concentration rate by choosing reliable sensor values and learning them by proposed algorithm. The prediction algorithm is chosen as neuro-fuzzy model, which is tested by the various experiments.

A Comparative Study of Scientific Literacy and Core Competence Discourses as Rationales for the 21st Century Science Curriculum Reform (21세기 과학 교육과정 개혁 논리로서의 과학적 소양 및 핵심 역량 담론 비교 연구)

  • Lee, Gyeong-Geon;Hong, Hun-Gi
    • Journal of The Korean Association For Science Education
    • /
    • v.42 no.1
    • /
    • pp.1-18
    • /
    • 2022
  • The two most influential rationales for the 21st century science curriculum reform can be said to be core competence and scientific literacy. However, the relationship between the two has not been scrutinized but remained speculative - and this has made the harmonization of the general guideline and subject-matter curriculum difficult in Korean national curriculum system. This study compares the two discourses to derive implications for future science curriculum development. This study took a literature research approach. In chapter II, national curriculum or standards, position papers, and research articles were reviewed to delineate the historical development of the discourses. In chapter III and IV, the intersections of those two discourses are delineated. In chapter III, the commonalities of the two discourses are explicated with regard to crisis rhetoric, multi-faceted meanings (individual, community, and global aspects), organization of subject-matter content and teaching and learning method, and the role of high-stake exams. In chapter IV, their respective strengths and weaknesses are juxtaposed. In chapter V, it is suggested that understanding scientific literacy and core competence discourses to have a family resemblance as 21st century science curriculum reform rationale, after Wittgenstein and Kuhn. Finally, the ways to resolve the conflict between the two ideas from the general guideline and subject-matter curriculum over crisis rhetoric were explored.

A study on improving the accuracy of machine learning models through the use of non-financial information in predicting the Closure of operator using electronic payment service (전자결제서비스 이용 사업자 폐업 예측에서 비재무정보 활용을 통한 머신러닝 모델의 정확도 향상에 관한 연구)

  • Hyunjeong Gong;Eugene Hwang;Sunghyuk Park
    • Journal of Intelligence and Information Systems
    • /
    • v.29 no.3
    • /
    • pp.361-381
    • /
    • 2023
  • Research on corporate bankruptcy prediction has been focused on financial information. Since the company's financial information is updated quarterly, there is a problem that timeliness is insufficient in predicting the possibility of a company's business closure in real time. Evaluated companies that want to improve this need a method of judging the soundness of a company that uses information other than financial information to judge the soundness of a target company. To this end, as information technology has made it easier to collect non-financial information about companies, research has been conducted to apply additional variables and various methodologies other than financial information to predict corporate bankruptcy. It has become an important research task to determine whether it has an effect. In this study, we examined the impact of electronic payment-related information, which constitutes non-financial information, when predicting the closure of business operators using electronic payment service and examined the difference in closure prediction accuracy according to the combination of financial and non-financial information. Specifically, three research models consisting of a financial information model, a non-financial information model, and a combined model were designed, and the closure prediction accuracy was confirmed with six algorithms including the Multi Layer Perceptron (MLP) algorithm. The model combining financial and non-financial information showed the highest prediction accuracy, followed by the non-financial information model and the financial information model in order. As for the prediction accuracy of business closure by algorithm, XGBoost showed the highest prediction accuracy among the six algorithms. As a result of examining the relative importance of a total of 87 variables used to predict business closure, it was confirmed that more than 70% of the top 20 variables that had a significant impact on the prediction of business closure were non-financial information. Through this, it was confirmed that electronic payment-related information of non-financial information is an important variable in predicting business closure, and the possibility of using non-financial information as an alternative to financial information was also examined. Based on this study, the importance of collecting and utilizing non-financial information as information that can predict business closure is recognized, and a plan to utilize it for corporate decision-making is also proposed.

Ensemble of Nested Dichotomies for Activity Recognition Using Accelerometer Data on Smartphone (Ensemble of Nested Dichotomies 기법을 이용한 스마트폰 가속도 센서 데이터 기반의 동작 인지)

  • Ha, Eu Tteum;Kim, Jeongmin;Ryu, Kwang Ryel
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.4
    • /
    • pp.123-132
    • /
    • 2013
  • As the smartphones are equipped with various sensors such as the accelerometer, GPS, gravity sensor, gyros, ambient light sensor, proximity sensor, and so on, there have been many research works on making use of these sensors to create valuable applications. Human activity recognition is one such application that is motivated by various welfare applications such as the support for the elderly, measurement of calorie consumption, analysis of lifestyles, analysis of exercise patterns, and so on. One of the challenges faced when using the smartphone sensors for activity recognition is that the number of sensors used should be minimized to save the battery power. When the number of sensors used are restricted, it is difficult to realize a highly accurate activity recognizer or a classifier because it is hard to distinguish between subtly different activities relying on only limited information. The difficulty gets especially severe when the number of different activity classes to be distinguished is very large. In this paper, we show that a fairly accurate classifier can be built that can distinguish ten different activities by using only a single sensor data, i.e., the smartphone accelerometer data. The approach that we take to dealing with this ten-class problem is to use the ensemble of nested dichotomy (END) method that transforms a multi-class problem into multiple two-class problems. END builds a committee of binary classifiers in a nested fashion using a binary tree. At the root of the binary tree, the set of all the classes are split into two subsets of classes by using a binary classifier. At a child node of the tree, a subset of classes is again split into two smaller subsets by using another binary classifier. Continuing in this way, we can obtain a binary tree where each leaf node contains a single class. This binary tree can be viewed as a nested dichotomy that can make multi-class predictions. Depending on how a set of classes are split into two subsets at each node, the final tree that we obtain can be different. Since there can be some classes that are correlated, a particular tree may perform better than the others. However, we can hardly identify the best tree without deep domain knowledge. The END method copes with this problem by building multiple dichotomy trees randomly during learning, and then combining the predictions made by each tree during classification. The END method is generally known to perform well even when the base learner is unable to model complex decision boundaries As the base classifier at each node of the dichotomy, we have used another ensemble classifier called the random forest. A random forest is built by repeatedly generating a decision tree each time with a different random subset of features using a bootstrap sample. By combining bagging with random feature subset selection, a random forest enjoys the advantage of having more diverse ensemble members than a simple bagging. As an overall result, our ensemble of nested dichotomy can actually be seen as a committee of committees of decision trees that can deal with a multi-class problem with high accuracy. The ten classes of activities that we distinguish in this paper are 'Sitting', 'Standing', 'Walking', 'Running', 'Walking Uphill', 'Walking Downhill', 'Running Uphill', 'Running Downhill', 'Falling', and 'Hobbling'. The features used for classifying these activities include not only the magnitude of acceleration vector at each time point but also the maximum, the minimum, and the standard deviation of vector magnitude within a time window of the last 2 seconds, etc. For experiments to compare the performance of END with those of other methods, the accelerometer data has been collected at every 0.1 second for 2 minutes for each activity from 5 volunteers. Among these 5,900 ($=5{\times}(60{\times}2-2)/0.1$) data collected for each activity (the data for the first 2 seconds are trashed because they do not have time window data), 4,700 have been used for training and the rest for testing. Although 'Walking Uphill' is often confused with some other similar activities, END has been found to classify all of the ten activities with a fairly high accuracy of 98.4%. On the other hand, the accuracies achieved by a decision tree, a k-nearest neighbor, and a one-versus-rest support vector machine have been observed as 97.6%, 96.5%, and 97.6%, respectively.

A Study on Analyzing Sentiments on Movie Reviews by Multi-Level Sentiment Classifier (영화 리뷰 감성분석을 위한 텍스트 마이닝 기반 감성 분류기 구축)

  • Kim, Yuyoung;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.3
    • /
    • pp.71-89
    • /
    • 2016
  • Sentiment analysis is used for identifying emotions or sentiments embedded in the user generated data such as customer reviews from blogs, social network services, and so on. Various research fields such as computer science and business management can take advantage of this feature to analyze customer-generated opinions. In previous studies, the star rating of a review is regarded as the same as sentiment embedded in the text. However, it does not always correspond to the sentiment polarity. Due to this supposition, previous studies have some limitations in their accuracy. To solve this issue, the present study uses a supervised sentiment classification model to measure a more accurate sentiment polarity. This study aims to propose an advanced sentiment classifier and to discover the correlation between movie reviews and box-office success. The advanced sentiment classifier is based on two supervised machine learning techniques, the Support Vector Machines (SVM) and Feedforward Neural Network (FNN). The sentiment scores of the movie reviews are measured by the sentiment classifier and are analyzed by statistical correlations between movie reviews and box-office success. Movie reviews are collected along with a star-rate. The dataset used in this study consists of 1,258,538 reviews from 175 films gathered from Naver Movie website (movie.naver.com). The results show that the proposed sentiment classifier outperforms Naive Bayes (NB) classifier as its accuracy is about 6% higher than NB. Furthermore, the results indicate that there are positive correlations between the star-rate and the number of audiences, which can be regarded as the box-office success of a movie. The study also shows that there is the mild, positive correlation between the sentiment scores estimated by the classifier and the number of audiences. To verify the applicability of the sentiment scores, an independent sample t-test was conducted. For this, the movies were divided into two groups using the average of sentiment scores. The two groups are significantly different in terms of the star-rated scores.

A Diagnostic Study on High School Students' Health and Quality of Life - Based on the PRECEDE model - (고등학생의 건강 및 삶의 질에 대한 진단적 연구 - PRECEDE 모형을 근간으로 -)

  • Yoo Jae-Soon;Hong Yeo-Shin
    • The Journal of Korean Academic Society of Nursing Education
    • /
    • v.3
    • /
    • pp.78-98
    • /
    • 1997
  • Health education, as the most fundamental concept for national health promotion, alms for developing the self-care ability of the general public. High school days are regarded as the period when most important physical, mental and social developments occur, and most health-related behaviors are formed. School health education is one of the major learning resources influencing health potential in the home and community as well as for the individual student. High school health education in Korea has a fundamental systemic flaw in that health-related subjects are divided and taught under various subjects areas at school. In order to achieve the goal of school health education, it is essential to make a systematic assessment of the learner's concerns connected with his health and life, and the factors affecting them. So far, most of the research projects that had been carried out for improving high school health education were limited in their concerns to a particular aspect of health. Even though some had been done in view of comprehensive school health education, they failed to Include a health assessment of the learner. Therefore, in this study the high school students' concerns related to health and life were investigated in the first place on the basis of the PRECEDE model, developed by Green and others for the purpose of a comprehensive diagnostic research on high school health education. This study was done in two steps : one was the basic study for developing research instrument and the other was the main one. The former was conducted at five high schools in Seoul and Cheongju for 2 months-beginning in March, 1996. The students were asked to respond to questions related to their health and lives in unstructured open-ended question forms. On the basis of analysis of the basic study, the diagnostic instruments for the quality of life, health problems, health behavior and educational factors were constructed to be used for the collection of data for main study. An expert panel and the pilot study were used to improve content validity and reliability of the instruments. The reliability of the instruments was measured at between .7697 and .9611 by the Cronbach $\alpha$. The data for this study were collected from the sample consisted of the junior and senior classes of twenty general and vocational high schools in Seoul and Cheongju for two months period beginning in July, 1996. In analyzing the data, both t-test and $X^2$-test were done by using SAS-$PC^+$ Program to compare data between the sexes of the high school students and the types of high school. A canonical correlation analysis was carried out to determine the relationships among the diagnostic variables, and a multivariate multiple regression analysis was conducted by using LISREL 8.03 to ascertain the influences of variables on the high school students' health and quality of life. The results were as follows : 1) The findings of the hypothesis tests (1) The canonical correlation between the educational diagnosis variables and behavioral, epidemiological, social diagnosis variables was .7221, which was significant at the level of p<.001. (2) The canonical correlation between the educational diagnosis variables and the behavior variables was .6851, which also was significant (p<.001). (3) The canonical correlation between the behavioral diagnosis variables and the epidemiological variables was 4295, which was significant (p<.001). (4) The canonical correlation between the epidemiological diagnosis variables and the social variables was .6005, which was also significant (p<.001). Therefore, the relationship between each diagnosis variable suggested by the PRECEDE model had been experimentally proven to be valid, supporting the conceptual framework of the study as appropriate for assessing the multi-dimensional factors affecting high school students' health and quality of life. Health behavior self-efficacy, the level of parents' interest and knowledge of health, and the level of the perception of school health education, all of which are the educational diagnostic variables, are the most influential variables in students' health and quality of life. In particular, health behavior self-efficacy, a causative factor, was one of the main influential variables in their health and quality of life. Other diagnostic variables suggested in the steps of the PRECEDE model were found to have reciprocal relations rather than a unidirectional causative relationship. The significance of this research is that it has diagnosed the needs of high school health education by the learner-centered assessment of variety of factors related to the health and the life of the students. This research findings suggest an integrated system of school health education to be contrived to enhance the effectiveness of the education by strengthening the influential factors such as self-efficacy to improve the health and quality of the lives of high school students.

  • PDF