• Title/Summary/Keyword: 토픽 추적

Search Result 16, Processing Time 0.03 seconds

A Study on Issue Tracking on Multi-cultural Studies Using Topic Modeling (토픽 모델링을 활용한 다문화 연구의 이슈 추적 연구)

  • Park, Jong Do
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.53 no.3
    • /
    • pp.273-289
    • /
    • 2019
  • The goal of this study is to analyze topics discussed in academic papers on multiculture in Korea to figure out research trends in the field. In order to do topic analysis, LDA (Latent Dirichlet Allocation)-based topic modeling methods are employed. Through the analysis, it is possible to track topic changes in the field and it is found that topics related to 'social integration' and 'multicultural education in schools' are hot topics, and topics related to 'cultural identity and nationalism' are cold topics among top five topics in the field.

Topic-Network based Topic Shift Detection on Twitter (트위터 데이터를 이용한 네트워크 기반 토픽 변화 추적 연구)

  • Jin, Seol A;Heo, Go Eun;Jeong, Yoo Kyung;Song, Min
    • Journal of the Korean Society for information Management
    • /
    • v.30 no.1
    • /
    • pp.285-302
    • /
    • 2013
  • This study identified topic shifts and patterns over time by analyzing an enormous amount of Twitter data whose characteristics are high accessibility and briefness. First, we extracted keywords for a certain product and used them for representing the topic network allows for intuitive understanding of keywords associated with topics by nodes and edges by co-word analysis. We conducted temporal analysis of term co-occurrence as well as topic modeling to examine the results of network analysis. In addition, the results of comparing topic shifts on Twitter with the corresponding retrieval results from newspapers confirm that Twitter makes immediate responses to news media and spreads the negative issues out quickly. Our findings may suggest that companies utilize the proposed technique to identify public's negative opinions as quickly as possible and to apply for the timely decision making and effective responses to their customers.

Text Mining Analysis of News Articles Related to 'Space Hazard' ('우주 위험' 관련 뉴스 기사의 텍스트 마이닝 분석 연구)

  • Jo, Hoon;Sohn, Jungjoo
    • Journal of the Korean earth science society
    • /
    • v.43 no.1
    • /
    • pp.224-235
    • /
    • 2022
  • This study aimed to confirm the status of media reports on space hazards using topic modeling analysis of media articles that are related to space hazards for the past 12 years. Therefore, Latent Dirichlet Allocation (LDA) analysis was performed by collecting over 1200 space hazards articles between 2010 and 2021 on solar storm, artificial space objects, and natural space objects from BIGKins news platform. The articles related to solar storm focused on three topics: the effect of solar explosion on satellites; effect of solar explosion on radio communication in Korea, centered on the Korean Space Weather Center; and relationship between aircrew and space radiation. The articles related to artificial space objects focused on three topics: the threat of space garbage to satellite and space stations and the transition of useful objects into space junk; the relationship between space garbage and humanity as shown in movies; and the effort of developed countries for tracking, monitoring, and disposing of space garbage. The articles related to natural space objects focused on two topics: International Space Agency's tracking and monitoring of near-Earth asteroids and the countermeasures of collisions, and the evolution and extinction of dinosaurs and mammals, with a focus on the collisions of asteroids or comets. Therefore, this study confirmed that domestic media play a role in conveying dangers of space hazards and arousing the attention of public using a total of eight themes in various fields such as society and culture, and derived education method and policy on space hazards.

Adaptive User and Topic Modeling based Automatic TV Recommender System for Big Data Processing (빅 데이터 처리를 위한 적응적 사용자 및 토픽 모델링 기반 자동 TV 프로그램 추천시스템)

  • Kim, EunHui;Kim, Munchurl
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2015.07a
    • /
    • pp.195-198
    • /
    • 2015
  • 최근 TV 서비스의 가입자 및 TV 프로그램 콘텐츠의 급격한 증가에 따라 빅데이터 처리에 적합한 추천 시스템의 필요성이 증가하고 있다. 본 논문은 사용자들의 간접 평가 데이터 기반의 추천 시스템 디자인 시, 누적된 사용자의 과거 이용내역 데이터를 저장하지 않고 새로 생성된 사용자 이용내역 데이터를 학습하는 효율적인 알고리즘이면서, 시간 흐름에 따라 사용자들의 선호도 변화 및 TV 프로그램 스케줄 변화의 추적이 가능한 토픽 모델링 기반의 알고리즘을 제안한다. 빅데이터 처리를 위해서는 분산처리 형태의 알고리즘을 피할 수 없는데, 기존의 연구들 중 토픽 모델링 기반의 추론 알고리즘의 병렬분산처리 과정 중에 핵심이 되는 부분은 많은 데이터를 여러 대의 기계에 나누어 병렬분산 학습하면서 전역변수 데이터를 동기화하는 부분이다. 그런데, 이러한 전역데이터 동기화 기술에 있어, 여러 대의 컴퓨터를 병렬분산처리하기위한 하둡 기반의 시스템 및 서버-클라이언트간의 중재, 고장 감내 시스템 등을 모두 고려한 알고리즘들이 제안되어 왔으나, 네트워크 대역폭 한계로 인해 데이터 증가에 따른 동기화 시간 지연은 피할 수 없는 부분이다. 이에, 본 논문에서는 빅데이터 처리를 위해 사용자들을 클러스터링하고, 클러스터별 제안 알고리즘으로 전역데이터 동기화를 수행한 것과 지역 데이터를 활용하여 추론 연산한 결과, 클러스터별 지역별 TV프로그램 시청 토큰 별 은닉토픽 할당 테이블을 유지할 때 추천 성능이 더욱 향상되어 나오는 결과를 확인하여, 제안된 구조의 추천 시스템 디자인의 효율성과 합리성을 확인할 수 있었다.

  • PDF

The Analysis of North Korea's Economic Policy Trends through Topic Modeling (토픽모델링을 통한 북한의 경제정책 동향 분석)

  • Kang, Kyung Hwa
    • Smart Media Journal
    • /
    • v.9 no.4
    • /
    • pp.44-51
    • /
    • 2020
  • Since the mid-to-late 1990s, there have obviously been many changes in the North Korean economy. Since the change has been more pronounced since Kim Jong Un took power in 2012, the purpose of the paper is to track the trend of economic policy by timing. In this paper, I use LDA Topic Modeling, a text-mining analyzer method, to analyze the economics journal "Economic Research," which is a representative literature in the economic field published in North Korea. An in-depth analysis of the "economic research," which has an unrivaled position as an economic journal produced in North Korea, can be said to be an essential task in tracking the reality, limitations facing the economy and alternatives that North Korean authorities are aware of. Through the "Economic Research," where various topics of debate on the North Korean economy are hidden, the North Korean leader's economic policy flow is examined and the contents of the "change" intended by the current Kim Jong-un regime are analyzed.

A Reply Graph-based Social Mining Method with Topic Modeling (토픽 모델링을 이용한 댓글 그래프 기반 소셜 마이닝 기법)

  • Lee, Sang Yeon;Lee, Keon Myung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.24 no.6
    • /
    • pp.640-645
    • /
    • 2014
  • Many people use social network services as to communicate, to share an information and to build social relationships between others on the Internet. Twitter is such a representative service, where millions of tweets are posted a day and a huge amount of data collection has been being accumulated. Social mining that extracts the meaningful information from the massive data has been intensively studied. Typically, Twitter easily can deliver and retweet the contents using the following-follower relationships. Topic modeling in tweet data is a good tool for issue tracking in social media. To overcome the restrictions of short contents in tweets, we introduce a notion of reply graph which is constructed as a graph structure of which nodes correspond to users and of which edges correspond to existence of reply and retweet messages between the users. The LDA topic model, which is a typical method of topic modeling, is ineffective for short textual data. This paper introduces a topic modeling method that uses reply graph to reduce the number of short documents and to improve the quality of mining results. The proposed model uses the LDA model as the topic modeling framework for tweet issue tracking. Some experimental results of the proposed method are presented for a collection of Twitter data of 7 days.

Topic and Source Diversity of the Front Page in the New York Times, Chicago Tribune and the Los Angeles Times from 1950 to 2000 (20세기 하반기의 미 신문 1면 보도에 대한 다양성 분석: 뉴스 토픽과 정보원의 분포를 중심으로)

  • Shim, Hoon
    • Korean journal of communication and information
    • /
    • v.30
    • /
    • pp.175-201
    • /
    • 2005
  • This study investigates the diversity of news topic and source of the New York Times, Chicago Tribune, and the Los Angeles Times in the second half of the twentieth century. In probing the conventional traits of the contemporary press, the researcher traced the changing patterns and trends of news values in terms of news-gathering routine in order to evaluate the journalistic role conception in terms of social responsibility theory. Findings indicated that the American press as a neutral transmitter has been consistently violated by source and topic bias without any significant changes during the last five decades. The data, however, revealed the evident shift of the contemporary press from the heavy reliance of official source to the business/economic source. In addition, news topics such as business, health, and education have replaced the conventional popular topics such as crime and accidents. By contrast, it was revealed that the unconventional topics such as poverty, labor and minority still fail to receive the large attention from the target papers.

  • PDF

Implementation of IoT Home System based on MQTT (MQTT 기반 IoT 홈 시스템 구현)

  • Kim, U-zo;Choi, Jin-ku
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.20 no.1
    • /
    • pp.231-237
    • /
    • 2020
  • In this paper, we implemented a home IoT system based on MQTT protocol. In this system, data are collected from sensors in real time and transmitted to the server system. Based on collected data, home devices could be controlled automatically or manually. By using the MQTT protocol, we were able to see the data values of sensors collected in real time according to the topic setting. We implemented a system that automatically sets up home devices based on topic data, and it worked. The system is expected to be useful in applications that require monitoring and tracking of data in real time.

An Analysis of IT Trends Using Tweet Data (트윗 데이터를 활용한 IT 트렌드 분석)

  • Yi, Jin Baek;Lee, Choong Kwon;Cha, Kyung Jin
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.1
    • /
    • pp.143-159
    • /
    • 2015
  • Predicting IT trends has been a long and important subject for information systems research. IT trend prediction makes it possible to acknowledge emerging eras of innovation and allocate budgets to prepare against rapidly changing technological trends. Towards the end of each year, various domestic and global organizations predict and announce IT trends for the following year. For example, Gartner Predicts 10 top IT trend during the next year, and these predictions affect IT and industry leaders and organization's basic assumptions about technology and the future of IT, but the accuracy of these reports are difficult to verify. Social media data can be useful tool to verify the accuracy. As social media services have gained in popularity, it is used in a variety of ways, from posting about personal daily life to keeping up to date with news and trends. In the recent years, rates of social media activity in Korea have reached unprecedented levels. Hundreds of millions of users now participate in online social networks and communicate with colleague and friends their opinions and thoughts. In particular, Twitter is currently the major micro blog service, it has an important function named 'tweets' which is to report their current thoughts and actions, comments on news and engage in discussions. For an analysis on IT trends, we chose Tweet data because not only it produces massive unstructured textual data in real time but also it serves as an influential channel for opinion leading on technology. Previous studies found that the tweet data provides useful information and detects the trend of society effectively, these studies also identifies that Twitter can track the issue faster than the other media, newspapers. Therefore, this study investigates how frequently the predicted IT trends for the following year announced by public organizations are mentioned on social network services like Twitter. IT trend predictions for 2013, announced near the end of 2012 from two domestic organizations, the National IT Industry Promotion Agency (NIPA) and the National Information Society Agency (NIA), were used as a basis for this research. The present study analyzes the Twitter data generated from Seoul (Korea) compared with the predictions of the two organizations to analyze the differences. Thus, Twitter data analysis requires various natural language processing techniques, including the removal of stop words, and noun extraction for processing various unrefined forms of unstructured data. To overcome these challenges, we used SAS IRS (Information Retrieval Studio) developed by SAS to capture the trend in real-time processing big stream datasets of Twitter. The system offers a framework for crawling, normalizing, analyzing, indexing and searching tweet data. As a result, we have crawled the entire Twitter sphere in Seoul area and obtained 21,589 tweets in 2013 to review how frequently the IT trend topics announced by the two organizations were mentioned by the people in Seoul. The results shows that most IT trend predicted by NIPA and NIA were all frequently mentioned in Twitter except some topics such as 'new types of security threat', 'green IT', 'next generation semiconductor' since these topics non generalized compound words so they can be mentioned in Twitter with other words. To answer whether the IT trend tweets from Korea is related to the following year's IT trends in real world, we compared Twitter's trending topics with those in Nara Market, Korea's online e-Procurement system which is a nationwide web-based procurement system, dealing with whole procurement process of all public organizations in Korea. The correlation analysis show that Tweet frequencies on IT trending topics predicted by NIPA and NIA are significantly correlated with frequencies on IT topics mentioned in project announcements by Nara market in 2012 and 2013. The main contribution of our research can be found in the following aspects: i) the IT topic predictions announced by NIPA and NIA can provide an effective guideline to IT professionals and researchers in Korea who are looking for verified IT topic trends in the following topic, ii) researchers can use Twitter to get some useful ideas to detect and predict dynamic trends of technological and social issues.

A Study on Graph-based Topic Extraction from Microblogs (마이크로블로그를 통한 그래프 기반의 토픽 추출에 관한 연구)

  • Choi, Don-Jung;Lee, Sung-Woo;Kim, Jae-Kwang;Lee, Jee-Hyong
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.21 no.5
    • /
    • pp.564-568
    • /
    • 2011
  • Microblogs became popular information delivery ways due to the spread of smart phones. They have the characteristic of reflecting the interests of users more quickly than other medium. Particularly, in case of the subject which attracts many users, microblogs can supply rich information originated from various information sources. Nevertheless, it has been considered as a hard problem to obtain useful information from microblogs because too much noises are in them. So far, various methods are proposed to extract and track some subjects from particular documents, yet these methods do not work effectively in case of microblogs which consist of short phrases. In this paper, we propose a graph-based topic extraction and partitioning method to understand interests of users about a certain keyword. The proposed method contains the process of generating a keyword graph using the co-occurrences of terms in the microblogs, and the process of splitting the graph by using a network partitioning method. When we applied the proposed method on some keywords. our method shows good performance for finding a topic about the keyword and partitioning the topic into sub-topics.