• Title/Summary/Keyword: Web data mining

Search Result 412, Processing Time 0.027 seconds

Analysis on the Usage of Internet Games for Children with Decision Tree Rules (의사결정규칙을 이용한 아동의 교육용 인터넷 게임 활용실태 분석)

  • Kim, Yong-Dae;Jung, Hui-Suk;Choi, Eun-Jeong;Park, Byung-Sun;Han, Jeong-Hye
    • Journal of The Korean Association of Information Education
    • /
    • v.5 no.3
    • /
    • pp.389-400
    • /
    • 2001
  • The Internet Game is widespreaded quickly on web, and there are many kinds of funny games for users to use easily, so that can be applied to ICT(Information Communication Technology)education. In this paper, we provide the analysis on the usage of Internet games for children and teachers that is conducted by the decision tree algorithm, which is one of the popular data mining techniques. The results show the pattern of children's and teachers' usages of Internet games.

  • PDF

A Method for User Sentiment Classification using Instagram Hashtags (인스타그램 해시태그를 이용한 사용자 감정 분류 방법)

  • Nam, Minji;Lee, EunJi;Shin, Juhyun
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.11
    • /
    • pp.1391-1399
    • /
    • 2015
  • In recent times, studies sentiment analysis are being actively conducted by implementing natural language processing technologies for analyzing subjective data such as opinions and attitudes of users expressed on the Web, blogs, and social networking services (SNSs). Conventionally, to classify the sentiments in texts, most studies determine positive/negative/neutral sentiments by assigning polarity values for sentiment vocabulary using sentiment lexicons. However, in this study, sentiments are classified based on Thayer's model, which is psychologically defined, unlike the polarity classification used in opinion mining. In this paper, as a method for classifying the sentiments, sentiment categories are proposed by extracting sentiment keywords for major sentiments by using hashtags, which are essential elements of Instagram. By applying sentiment categories to user posts, sentiments can be determined through the similarity measurement between the sentiment adjective candidates and the sentiment keywords. The test results of the proposed method show that the average accuracy rate for all the sentiment categories was 90.7%, which indicates good performance. If a sentiment classification system with a large capacity is prepared using the proposed method, then it is expected that sentiment analysis in various fields will be possible, such as for determining social phenomena through SNS.

Proposal of Research Methodology Using The Measurement of Perception Difference

  • YANG, Hoechang
    • Journal of Wellbeing Management and Applied Psychology
    • /
    • v.2 no.2
    • /
    • pp.39-45
    • /
    • 2019
  • The purpose of this study is to solve the problem of revision or abbreviation of questionnaires based on the previous studies suggested by many existing empirical studies. In addition, this study aims to provide the theoretical basis of the research method which has been variously approached since it presents the methodology that can directly measure the research object. For this purpose, this study proposed a more elaborate analysis method using the differences in perception of individuals who are interested in cognitive research. Specifically, the perception gap(D) can be used as an independent variable, a dependent variable, and a moderating variable. And this study suggested an effective research approach using the measurement of perception difference. The difference of perception suggested that it can be used as a measure to overcome the limitations of existing researches used it as independent variables or mediating variables that measure only one factor of expectation and performance or importance and satisfaction. In addition, it is highly likely that various analyzes on the perception differences, which are the result of measuring target factors for the same person, will be quite effective in the situation where follow-up of respondents is difficult. This study is expected to overcome various limitations reported by empirical studies such as scale utilization problem and follow-up survey difficulty. In future research, it was expected that the limitation of the factor derivation process in the research approach could be complemented by web crawling and text mining of big data analysis.

How to Promote the Korean Journal of Child Studies to an International Journal (아동학회지를 어떻게 국제화시킬 것인가?)

  • Huh, Sun
    • Korean Journal of Child Studies
    • /
    • v.37 no.1
    • /
    • pp.7-16
    • /
    • 2016
  • Objective: It aimed at proposing the Korean Journal of Child Studies' strategy to be promoted to international journal based on the style and format of scholarly journals and journal metrics. Methods: The review of the journal in not only print version, but also an online version was done from the perspective of style and format. The total citation and impact factor were manually calculated from Web of Science Core Collection. Results: More professional level manuscript editing is required for maintaining the consistency of the style and format. The verso page and back matters should be improved to international level. Journal homepage should be reconstructed by adopting digital standards for the journal, including journal article tag suite, CrossMark, FundRef, ORCID, and text and data mining. To become an international journal, transformation into English journal and deposition to PubMed Central is mandatory. Conclusion: Since the editor's and society members' performance is top-notch, it will be possible to promote the journal up to international level soon. Society should guarantee the term of editor for enough time and support her with full cost and complete consent.

Quality Analysis of Smart Application Contents for the Convenience of Care and Hospital Access (진료의 편의성과 병원 접근성 증진을 위한 스마트 어플리케이션 콘텐츠의 질적 분석)

  • Lee, Jae Bin;Kim, Ji Hye;Bok, Jeong Hee;Woo, Hyekyung
    • Korea Journal of Hospital Management
    • /
    • v.25 no.1
    • /
    • pp.1-12
    • /
    • 2020
  • Purposes: The aim of this study is to evaluate whether the contents of hospital reservation and reception applications(apps) are qualitatively useful in meeting the needs of medical consumers and improving hospital accessibility and convenience. Methodology: (1) identify consumer needs through social data web mining, (2) describe the status of key contents of mobile apps to improve accessibility and convenience of care, and (3) verify the quality of apps through validated tools Finding: The contents of 'mobile reservation function' and 'waiting time information provision' that can contribute to reduction of delay time of care and efficiency of desk work were supported, but the level of utilization was insufficient. The quality level of the app, including the level of consumers' needs, has shown a wide gap between the apps. Implications: The recent development of mobile apps for hospital accessibility and consumer needs has shown a wide gap in the quality of apps, including information and aesthetic. Therefore, it is necessary to develop apps based on user interface(UI), user experience(UX) based designs that can promote the usefulness and convenience of apps while monitoring needs of consumers continuously.

A Clustering Technique Using Association Rules for The Library and Information Science Terminology (연관규칙을 이용한 문헌정보학 전문용어 클러스터링 기법에 관한 연구)

  • Seung, Hyon-Woo;Park, Mi-Young
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.37 no.2
    • /
    • pp.89-105
    • /
    • 2003
  • In this paper, an effective method for clustering terminologies extracted from text is proposed, in order to develope a search engine to extract relevant information from large web documents. To prevent frequency of the meaningless association rules among general terminologies, only useful association rules among terminologies are produced using database tables which consist of domain-specific terminologies. Such association rules are produced by applying the Apriori algorithm after forming transaction units from groups of association rules in a document. A group of association rules produced from a terminology forms in a cluster.

A Study on the Preemptive Measure for Fake News Eradication Using Data Mining Algorithms : Focused on the M Online Community Postings (데이터 마이닝을 활용한 가짜뉴스의 선제적 대응을 위한 연구 : M 온라인 커뮤니티 게시물을 중심으로)

  • Lim, Munyeong;Park, Sungbum
    • Journal of Information Technology Services
    • /
    • v.18 no.1
    • /
    • pp.219-234
    • /
    • 2019
  • Fake news threaten democratic elections and causes social conflicts, resulting in major damage. However, the concept of fake news is hard to define, as there is a saying, "News is not fake, fake is not news." Fake news, however, has irreversible characteristics that can not be recovered or reversed completely through post-punishment of economic and political benefits. It is also rapidly spreading in the early days. Therefore, it is very important to preemptively detect these types of articles and prevent their blind proliferation. The existing countermeasures are focused on reporting fake news, raising the level of punishment, and the media & academia to determine the authenticity of the news. Researchers are also trying to determine the authenticity by analyzing its contents. Apart from the contents of fake news, determining the behavioral characteristics of the promoters and its qualities can help identify the possibility of having fake news in advance. The online community has a fake news interception and response tradition through its long-standing community-based activities. As a result, I attempted to model the fake news by analyzing the affirmation-denial analysis and posting behavior by securing the web board crawl of the 'M community' bulletin board during the 2017 Korean presidential election period. Random forest algorithm deemed significant. The results of this research will help counteract fake news and focus on preemptive blocking through behavioral analysis rather than post-judgment after semantic analysis.

A Study on Personalized Advertisement System Using Web Mining (웹 마이닝을 이용한 개인 광고기법에 관한 연구)

  • 김은수;송강수;이원돈;송정길
    • Journal of the Korea Society of Computer and Information
    • /
    • v.8 no.4
    • /
    • pp.92-103
    • /
    • 2003
  • Great many advertisements are serviced in on-line by development of electronic commerce and internet user's rapid increase recently. However, this advertisement service is stopping in one-side service of relevant advertisement rather than doing users' inclination analysis to basis. Therefore, want advertisement service that many websites are personalized for efficient service of relevant advertisement and service through relevant server's log analysis research and enforce. Take advantage of log data of local system that this treatise is not analysis of server log data and analyze user's Preference degree and inclination. Also, try to propose advertisement system personalized by making relevant site tributary category and give weight of relevant tributary. User's preference user preference which analysis is one part of cooperation fielder ring of web personalized techniques use information in visit site tributary and suppose internet user's action in visit number of times of relevant site and try inclination analysis of mixing form. Express user's preference degree by vector, and inclination analysis result uninterrupted data that simplicity application form is not regarded and techniques that propose inclination analysis change of data since with move data use and analyze newly and proposed so that can do continuous renewal and application as feedback Sikkim. Presented method that can choose advertisements of relevant tributary through this result and provide personalized advertisement service by applying process such as user inclination analysis in advertisement chosen.

  • PDF

Enhancing Classification Performance of Temporal Keyword Data by Using Moving Average-based Dynamic Time Warping Method (이동 평균 기반 동적 시간 와핑 기법을 이용한 시계열 키워드 데이터의 분류 성능 개선 방안)

  • Jeong, Do-Heon
    • Journal of the Korean Society for information Management
    • /
    • v.36 no.4
    • /
    • pp.83-105
    • /
    • 2019
  • This study aims to suggest an effective method for the automatic classification of keywords with similar patterns by calculating pattern similarity of temporal data. For this, large scale news on the Web were collected and time series data composed of 120 time segments were built. To make training data set for the performance test of the proposed model, 440 representative keywords were manually classified according to 8 types of trend. This study introduces a Dynamic Time Warping(DTW) method which have been commonly used in the field of time series analytics, and proposes an application model, MA-DTW based on a Moving Average(MA) method which gives a good explanation on a tendency of trend curve. As a result of the automatic classification by a k-Nearest Neighbor(kNN) algorithm, Euclidean Distance(ED) and DTW showed 48.2% and 66.6% of maximum micro-averaged F1 score respectively, whereas the proposed model represented 74.3% of the best micro-averaged F1 score. In all respect of the comprehensive experiments, the suggested model outperformed the methods of ED and DTW.

Does Rain Really Cause Toothache? Statistical Analysis Based on Google Trends

  • Jeon, Se-Jeong
    • Journal of dental hygiene science
    • /
    • v.21 no.2
    • /
    • pp.104-110
    • /
    • 2021
  • Background: Regardless of countries, the myth that rain makes the body ache has been worded in various forms, and a number of studies have been reported to investigate this. However, these studies, which depended on the patient's experience or memory, had obvious limitations. Google Trends is a big data analysis service based on search terms and viewing videos provided by Google LLC, and attempts to use it in various fields are continuing. In this study, we endeavored to introduce the 'value as a research tool' of the Google Trends, that has emerged along with technological advancements, through research on 'whether toothaches really occur frequently on rainy days'. Methods: Keywords were selected as objectively as possible by applying web crawling and text mining techniques, and the keyword "bi" meaning rain in Korean was added to verify the reliability of Google Trends data. The correlation was statistically analyzed using precipitation and temperature data provided by the Korea Meteorological Agency and daily search volume data provided by Google Trends. Results: Keywords "chi-gwa", "chi-tong", and "chung-chi" were selected, which in Korean mean 'dental clinic', 'toothache', and 'tooth decay' respectively. A significant correlation was found between the amount of precipitation and the search volume of tooth decay. No correlation was found between precipitation and other keywords or other combinations. It was natural that a very significant correlation was found between the amount of precipitation, temperature, and the search volume of "bi". Conclusion: Rain seems to actually be a cause of toothache, and if objective keyword selection is premised, Google Trends is considered to be very useful as a research tool in the future.