• Title/Summary/Keyword: News Article

Search Result 234, Processing Time 0.023 seconds

User-Perspective Issue Clustering Using Multi-Layered Two-Mode Network Analysis (다계층 이원 네트워크를 활용한 사용자 관점의 이슈 클러스터링)

  • Kim, Jieun;Kim, Namgyu;Cho, Yoonho
    • Journal of Intelligence and Information Systems
    • /
    • v.20 no.2
    • /
    • pp.93-107
    • /
    • 2014
  • In this paper, we report what we have observed with regard to user-perspective issue clustering based on multi-layered two-mode network analysis. This work is significant in the context of data collection by companies about customer needs. Most companies have failed to uncover such needs for products or services properly in terms of demographic data such as age, income levels, and purchase history. Because of excessive reliance on limited internal data, most recommendation systems do not provide decision makers with appropriate business information for current business circumstances. However, part of the problem is the increasing regulation of personal data gathering and privacy. This makes demographic or transaction data collection more difficult, and is a significant hurdle for traditional recommendation approaches because these systems demand a great deal of personal data or transaction logs. Our motivation for presenting this paper to academia is our strong belief, and evidence, that most customers' requirements for products can be effectively and efficiently analyzed from unstructured textual data such as Internet news text. In order to derive users' requirements from textual data obtained online, the proposed approach in this paper attempts to construct double two-mode networks, such as a user-news network and news-issue network, and to integrate these into one quasi-network as the input for issue clustering. One of the contributions of this research is the development of a methodology utilizing enormous amounts of unstructured textual data for user-oriented issue clustering by leveraging existing text mining and social network analysis. In order to build multi-layered two-mode networks of news logs, we need some tools such as text mining and topic analysis. We used not only SAS Enterprise Miner 12.1, which provides a text miner module and cluster module for textual data analysis, but also NetMiner 4 for network visualization and analysis. Our approach for user-perspective issue clustering is composed of six main phases: crawling, topic analysis, access pattern analysis, network merging, network conversion, and clustering. In the first phase, we collect visit logs for news sites by crawler. After gathering unstructured news article data, the topic analysis phase extracts issues from each news article in order to build an article-news network. For simplicity, 100 topics are extracted from 13,652 articles. In the third phase, a user-article network is constructed with access patterns derived from web transaction logs. The double two-mode networks are then merged into a quasi-network of user-issue. Finally, in the user-oriented issue-clustering phase, we classify issues through structural equivalence, and compare these with the clustering results from statistical tools and network analysis. An experiment with a large dataset was performed to build a multi-layer two-mode network. After that, we compared the results of issue clustering from SAS with that of network analysis. The experimental dataset was from a web site ranking site, and the biggest portal site in Korea. The sample dataset contains 150 million transaction logs and 13,652 news articles of 5,000 panels over one year. User-article and article-issue networks are constructed and merged into a user-issue quasi-network using Netminer. Our issue-clustering results applied the Partitioning Around Medoids (PAM) algorithm and Multidimensional Scaling (MDS), and are consistent with the results from SAS clustering. In spite of extensive efforts to provide user information with recommendation systems, most projects are successful only when companies have sufficient data about users and transactions. Our proposed methodology, user-perspective issue clustering, can provide practical support to decision-making in companies because it enhances user-related data from unstructured textual data. To overcome the problem of insufficient data from traditional approaches, our methodology infers customers' real interests by utilizing web transaction logs. In addition, we suggest topic analysis and issue clustering as a practical means of issue identification.

Techno Populism and Algorithmic Manipulation of News in South Korea

  • Yoon, Sunny
    • Journal of Contemporary Eastern Asia
    • /
    • v.18 no.2
    • /
    • pp.33-48
    • /
    • 2019
  • The current Moon Jai-in administration in South Korea is facing serious challenges as a result of a scandal involving the manipulation of news online. Staff in Moon's camp are suspected of manipulating public opinion by creating millions of fake news comments online, contributing to Moon being elected president. This South Korean political scandal raises a number of theoretical issues with regard to new platform technologies and media manipulation. First, the incident exposes the technological limits of blocking manipulation of the news, partly because of the nature of social media and partly because of the nature of contemporary technology. Contemporary social media is often monopolistic in nature; with the majority of people are using the same platforms, and hence it is likely that they will be subject to forms of media manipulation. Second, the Korean case of news manipulation demonstrates a unique cultural aspect of Korean society. News comments and readers' replies have become a major channel of alternative news in Korea. This phenomenon is often designated as "reply journalism," since people are interested in reading the news replies of ordinary readers equally to reading news reports themselves. News replies are considered indicators of public opinion and are seen as affecting trias politica in Korean society. Third, the Korean incident of news manipulation implicates a new form of populism in the 21st century and the nature of democratic participation. This article aims to explicate key issues in media manipulation by including wider technological, cultural, and political aspects in the South Korean news media context.

Factors Influencing Subscribers' Voluntary Payment Behavior on an Online News Site: Focusing on the Role of Appreciation (온라인 뉴스 사이트에서 독자의 자발적 구독료 지불행위에 영향을 미치는 요인에 대한 연구: 공감의 역할을 중심으로)

  • Lee, Hyoung-Joo;Rhee, Hosung Timothy;Yang, Sung-Byung
    • Knowledge Management Research
    • /
    • v.14 no.4
    • /
    • pp.1-17
    • /
    • 2013
  • As online communities proliferate, online news sites have received great attention in news media research. Although most of the online news sites provide contents for free, some have adopted the Pay-What-You-Want (PWYW) model by offering a voluntary payment option to the readers. In this study, we investigate the factors which influence subscribers' voluntary payment behavior on an online news site. Drawing upon both the Stimulus-Organism-Response (SOR) framework and the Elaboration Likelihood Model (ELM), we hypothesize that appreciation has a direct effect on the subscribers' voluntary payment behavior, whereas central factors (positive emotional content, cognitive content) and peripheral factors (news sharing, news article length) of the news articles have indirect impacts on voluntary payment behavior through the enhanced appreciation. Based on an empirical analysis of 172 news articles from the Korean online news site that adopted the PWYW pricing model (i.e., Ohmynews.com), we find that appreciation plays a critical role in voluntary payment behavior and that peripheral factors have significant impacts on appreciation. However, the impacts of central factors on appreciation are not found. By identifying influencing factors of subscribers' voluntary payment behavior on online news sites for the first time, this paper suggests a prospective alternative profit model for online news providers faced with fierce competition.

  • PDF

A Study on Fake News Subject Matter, Presentation Elements, Tools of Detection, and Social Media Platforms in India

  • Kanozia, Rubal;Arya, Ritu;Singh, Satwinder;Narula, Sumit;Ganghariya, Garima
    • Asian Journal for Public Opinion Research
    • /
    • v.9 no.1
    • /
    • pp.48-82
    • /
    • 2021
  • This research article attempts to understand the current situation of fake news on social media in India. The study focused on four characteristics of fake news based on four research questions: subject matter, presentation elements of fake news, debunking tool(s) or technique(s) used, and the social media site on which the fake news story was shared. A systematic sampling method was used to select a sample of 90 debunked fake news stories from two Indian fact-checking websites, Alt News and Factly, from December 2019 to February 2020. A content analysis of the four characteristics of fake news stories was carefully analyzed, classified, coded, and presented. The results show that most of the fake news stories were related to politics in India. The majority of the fake news was shared via a video with text in which narrative was changed to mislead users. For the largest number of debunked fake news stories, information from official or primary sources, such as reports, data, statements, announcements, or updates were used to debunk false claims.

A Study on the Trends of Construction Safety Accident in Unstructured Text Using Topic Modeling (비정형 텍스트 기반의 토픽 모델링을 이용한 건설 안전사고 동향 분석)

  • Lee, Sang-Gyu
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.10
    • /
    • pp.176-182
    • /
    • 2018
  • In order to understand and track the trends of construction safety accident, this study shows the topic trends in the construction safety accident with LDA(Latent Dirichlet Allocation)-based topic modeling method for data analytics. Especially, it performs to figure out the main issue of construction safety accident with unstructured data analysis based on the topic modeling rather than a variety of structured data analysis for preventing to safety accident in construction industry. To apply this methodology, I randomly collected to 540 news article data about construction accident from January 2017 to February 2018. Based on the unstructured data with the LDA-based topic modeling, I found the 10 topics and identified key issues through 10 keyword in each 10 topics. I forecasted the topic issue related to construction safety accident based on analysis of time-series trends about the news data from January 2017 to February 2018. With this method, this research gives a hint about ways of using unstructured news article data to anticipate safety policy and research field and to respond to construction accident safety issues in the future.

A Study of Housing Environment Problems through the Daily newspapers ( I ) - The Change of a type of the Dong-A daily papers (1920~1990) - (일간지를 통해 본 주거환경문제의 연구 ( I ) - 동아일보 (1920년~1990년) 기사 유형의 변천 -)

  • 신경주
    • Journal of the Korean housing association
    • /
    • v.2 no.2
    • /
    • pp.41-53
    • /
    • 1991
  • This study discussed the change of housing environmental problems from the early 1900s to the present.The reason is to find the solution of serious housing environment problems. The documentary research method was used for this study.Articles of content analysis(N= 1129)were published in 1920(the first edition)to December. 31, 1990 which were The Dong - A daily news article about housing environment. The main content of this study was examined the change, such as the number of whole article by time series and importance of article(column number of article), classification of article subject, and the number of article by subject. On the basis of this data, was made by chronological classification of the change of housing environment problems for 70 years. Since overall results will become supply of right information about housing environment to fur peoples, will provide the oppronment that oneself ran participate the protection of housing environment, and further will take a part solution of housing environment problems.At the future, I am going to design deep analysis of article content by subject.

  • PDF

Fake News Checking Tool Based on Siamese Neural Networks and NLP (NLP와 Siamese Neural Networks를 이용한 뉴스 사실 확인 인공지능 연구)

  • Vadim, Saprunov;Kang, Sung-Won;Rhee, Kyung-hyune
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.05a
    • /
    • pp.627-630
    • /
    • 2022
  • Over the past few years, fake news has become one of the most significant problems. Since it is impossible to prevent people from spreading misinformation, people should analyze the news themselves. However, this process takes some time and effort, so the routine part of this analysis should be automated. There are many different approaches to this problem, but they only analyze the text and messages, ignoring the images. The fake news problem should be solved using a complex analysis tool to reach better performance. In this paper, we propose the approach of training an Artificial Intelligence using an unsupervised learning algorithm, combined with online data parsing tools, providing independence from subjective data set. Therefore it will be more difficult to spread fake news since people could quickly check if the news or article is trustworthy.

Topic Modeling of News Article about International Construction Market Using Latent Dirichlet Allocation (Latent Dirichlet Allocation 기법을 활용한 해외건설시장 뉴스기사의 토픽 모델링(Topic Modeling))

  • Moon, Seonghyeon;Chung, Sehwan;Chi, Seokho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.38 no.4
    • /
    • pp.595-599
    • /
    • 2018
  • Sufficient understanding of oversea construction market status is crucial to get profitability in the international construction project. Plenty of researchers have been considering the news article as a fine data source for figuring out the market condition, since the data includes market information such as political, economic, and social issue. Since the text data exists in unstructured format with huge size, various text-mining techniques were studied to reduce the unnecessary manpower, time, and cost to summarize the data. However, there are some limitations to extract the needed information from the news article because of the existence of various topics in the data. This research is aimed to overcome the problems and contribute to summarization of market status by performing topic modeling with Latent Dirichlet Allocation. With assuming that 10 topics existed in the corpus, the topics included projects for user convenience (topic-2), private supports to solve poverty problems in Africa (topic-4), and so on. By grouping the topics in the news articles, the results could improve extracting useful information and summarizing the market status.

A Study on the Application of the FRBR Model to Newspaper (신문의 FRBR 모형 적용에 관한 연구)

  • Chang, Inho
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.49 no.3
    • /
    • pp.333-349
    • /
    • 2015
  • This study examined the application of the FRBR model to newspapers and news articles. In order to meet the purpose that was mentioned above, we analyzed data items based on the level of newspapers and articles and discussed how the FRBR model may be applied. In terms of the level of a newspaper, each of newspapers, morning/evening paper, issue and edition are regarded as an individual work, and the relationship among them are considered to be the 'whole-part relationship'. Each article on the level of article basis was considered to be a work and was in a relationship of 'whole-part relationship' with the edition of each level of newspapers. Newspaper articles can be represented as texts, photographs, graphics, and tables, etc., and regarded as an individual work. Each work can be a part of the article on a newspaper or can be an independent article itself. Moreover, a uniform heading of each boxed article and running story is included in the work of each article and is forming a 'whole-part relationship'. Because of the changes of the newspaper name, the uniform title of each name regarded as a single binding. It is called the superwork and it is forming 'whole-part relationship' with each name.