• Title/Summary/Keyword: News Article Analysis


A Study on the Trends of Construction Safety Accident in Unstructured Text Using Topic Modeling (비정형 텍스트 기반의 토픽 모델링을 이용한 건설 안전사고 동향 분석)

  • Lee, Sang-Gyu
    • Journal of the Korea Academia-Industrial cooperation Society / v.19 no.10 / pp.176-182 / 2018
  • To understand and track trends in construction safety accidents, this study identifies topic trends using an LDA (Latent Dirichlet Allocation)-based topic modeling method. In particular, it seeks to identify the main issues in construction safety accidents through topic-model-based analysis of unstructured data, rather than the various structured-data analyses, in order to prevent safety accidents in the construction industry. To apply this methodology, I randomly collected 540 news articles about construction accidents published from January 2017 to February 2018. Applying LDA-based topic modeling to this unstructured data, I found 10 topics and identified key issues through 10 keywords for each of the 10 topics. I forecasted topic issues related to construction safety accidents based on an analysis of time-series trends in the news data from January 2017 to February 2018. This research suggests ways of using unstructured news article data to anticipate safety policy and research fields and to respond to future construction accident safety issues.

A Study of Housing Environment Problems through the Daily newspapers ( I ) - The Change of a type of the Dong-A daily papers (1920~1990) - (일간지를 통해 본 주거환경문제의 연구 ( I ) - 동아일보 (1920년~1990년) 기사 유형의 변천 -)

  • 신경주
    • Journal of the Korean housing association / v.2 no.2 / pp.41-53 / 1991
  • This study examines how housing environment problems changed from the early 1900s to the present, with the aim of finding solutions to serious housing environment problems. The documentary research method was used. The content analysis covered articles (N = 1129) about the housing environment published in The Dong-A Daily News from 1920 (the first edition) to December 31, 1990. The study examined changes in the total number of articles over time, the importance of articles (measured by the number of columns), the classification of article subjects, and the number of articles by subject. On the basis of these data, a chronological classification of the changes in housing environment problems over 70 years was made. The overall results are expected to supply accurate information about the housing environment to people, give individuals the opportunity to participate in protecting the housing environment, and contribute to solving housing environment problems. In the future, I plan to conduct an in-depth analysis of article content by subject.


Fake News Checking Tool Based on Siamese Neural Networks and NLP (NLP와 Siamese Neural Networks를 이용한 뉴스 사실 확인 인공지능 연구)

  • Vadim, Saprunov;Kang, Sung-Won;Rhee, Kyung-hyune
    • Proceedings of the Korea Information Processing Society Conference / 2022.05a / pp.627-630 / 2022
  • Over the past few years, fake news has become one of the most significant problems. Since it is impossible to prevent people from spreading misinformation, people should analyze the news themselves. However, this process takes time and effort, so the routine part of the analysis should be automated. There are many different approaches to this problem, but they analyze only the text and messages and ignore the images. The fake news problem should be solved using a complex analysis tool to reach better performance. In this paper, we propose an approach that trains an artificial intelligence model using an unsupervised learning algorithm, combined with online data parsing tools, which provides independence from subjective data sets. As a result, it will be more difficult to spread fake news, since people will be able to quickly check whether a news item or article is trustworthy.

Analysis entrepreneurship trends using keyword analysis of news article Big Data :2013~2022 (뉴스기사 빅데이터의 키워드분석을 활용한 창업 트렌드 분석:2013~2022 )

  • Jaeeog Kim;Byunghoon Jeon
    • Journal of Platform Technology / v.11 no.3 / pp.83-97 / 2023
  • This research aims to identify startup trends by analyzing a large number of news articles through semantic network analysis. Using the BIGKinds article analysis service provided by the Korea Press Foundation, 330,628 news articles from 19 newspapers published from January 2013 to December 2022 were comprehensively analyzed. The study focused on exploring the changes in key issues over the past decade, considering the impact of the social environment and global economic trends on entrepreneurship. We compared the number of news articles and the changes in issues before and after the COVID-19 pandemic, and visualized entrepreneurship trends through frequency analysis, relationship analysis, and correlation analysis. The results showed that the top entrepreneurship-related keywords are startup activation and commercialization, and that the linear correlation between COVID-19 and entrepreneurship keywords is almost negligible, although the number of news articles decreased during the pandemic, indicating that it still had an impact. In particular, the most frequently mentioned keyword was the Ministry of SMEs and Startups, the most frequently mentioned place was the United States, and mentions of persons were limited. The most frequently mentioned agency was the SBA, and the entrepreneurship sector is more affected by social issues than any other sector, with the important characteristic of an increased frequency of prompt access. This study supplies essential basic data for understanding and exploring issues and events related to entrepreneurship and suggests future research topics in the field.


Critical Discourse Analysis of Deinstitutionalization News Articles for the Disabled: Focusing on Fairclough's critical discourse analysis

  • JungHyun Kim
    • International Journal of Advanced Culture Technology / v.11 no.2 / pp.36-43 / 2023
  • This study aims to derive the linguistic meaning, production method, and social-practice implications of discourse by analyzing news reports on deinstitutionalization for people with disabilities. To this end, the discourse was analyzed by applying Fairclough's framework of critical discourse analysis. The subjects of analysis are news articles on the deinstitutionalization of people with disabilities on the N portal site, and the analysis period is one year, from January 1 to December 31, 2022. First, the surface meaning of the news discourse on deinstitutionalization was ideological, conveyed through the seriousness of the problems facing people with disabilities, the poor environment, and deinstitutionalization policy that is detached from reality. Second, the social meaning of the news discourse appeared from a realistic perspective, such as the structural causes of the problems facing people with disabilities and the need for sensible government policies and measures to put deinstitutionalization into practice. Finally, as socio-cultural practical implications, the news discourse proposed the development of a systematic and realistic deinstitutionalization management manual, practical government policy support, and changes in the perception of self-support for people with disabilities. The results of this study are expected to help find an alternative direction for reducing the gap between deinstitutionalization policies and practice in the field.

Article Data Prefetching Policy using User Access Patterns in News-On-demand System (주문형 전자신문 시스템에서 사용자 접근패턴을 이용한 기사 프리패칭 기법)

  • Kim, Yeong-Ju;Choe, Tae-Uk
    • The Transactions of the Korea Information Processing Society / v.6 no.5 / pp.1189-1202 / 1999
  • Compared with VOD data, NOD article data has the following characteristics: it is created at any time, has a short life cycle, is selected by a user not as a single article but as several articles, and has high temporal access locality. Because of these intrinsic features, user access patterns for NOD article data differ from those for VOD. Thus, building an NOD system using the existing techniques of VOD systems leads to poor performance. In this paper, we analyze the log file of a currently running electronic newspaper, show that the popularity distribution of NOD articles differs from the Zipf distribution of VOD data, and suggest a new popularity model for NOD article data, the MS-Zipf (Multi-Selection Zipf) distribution, together with its approximate solution. We also present a life cycle model of NOD article data, which shows changes in popularity over time. Using this life cycle model, we develop the LLBF (Largest Life-cycle Based Frequency) prefetching algorithm and analyze its performance by simulation. The LLBF algorithm achieves a hit ratio similar to that of other prefetching algorithms such as LRU (Least Recently Used), while decreasing the number of data replacements in article prefetching and reducing the overhead of prefetching on system performance. Using accurate user access patterns of NOD article data, we could correctly analyze the performance of an NOD server system and develop efficient policies for implementing one.
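
The abstract contrasts NOD popularity with the Zipf distribution usually assumed for VOD data. As a point of reference only, the following minimal sketch computes the classic Zipf popularity model, in which the item of rank i is requested with probability proportional to 1/i^s. It is not the paper's MS-Zipf model, whose formula the abstract does not give; the function names, the `skew` parameter, and the catalogue size are assumptions made for illustration.

```python
def zipf_popularity(n_items, skew=1.0):
    """Return access probabilities for ranks 1..n_items under a classic Zipf law.

    Assumption: this is the textbook Zipf model, not the MS-Zipf model
    proposed in the paper.
    """
    weights = [1.0 / (rank ** skew) for rank in range(1, n_items + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def top_k_coverage(probabilities, k):
    """Fraction of requests that go to the k most popular items."""
    return sum(probabilities[:k])


if __name__ == "__main__":
    # Hypothetical catalogue of 100 items.
    probs = zipf_popularity(n_items=100)
    print(f"top-10 coverage: {top_k_coverage(probs, 10):.3f}")
```

A skewed distribution like this is what makes caching or prefetching a small set of popular items worthwhile; the abstract's claim is that NOD access patterns deviate from it.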


Prediction of Stock Returns from News Article's Recommended Stocks Using XGBoost and LightGBM Models

  • Yoo-jin Hwang;Seung-yeon Son;Zoon-ky Lee
    • Journal of the Korea Society of Computer and Information / v.29 no.2 / pp.51-59 / 2024
  • This study examines the relationship between the release of news and individual stock returns. Investors utilize a variety of information sources to maximize stock returns when establishing investment strategies. News companies publish articles based on analysts' stock recommendation reports, enhancing the reliability of the information. Defining the release of a stock-recommendation news article as an event, we examine its economic impact and propose a binary classification model that predicts the stock return 10 days after the event. XGBoost and LightGBM models are applied, achieving accuracies of 75% and 71%, respectively. In addition, after categorizing the recommended stocks by listed market (KOSPI/KOSDAQ) and market capitalization (Big/Small), this study verifies the difference in model accuracy across the four sub-datasets. Finally, by conducting SHAP (Shapley Additive exPlanations) analysis, we identify the key variables in each model, reinforcing the interpretability of the models.

Text-Mining Analyses of News Articles on Schizophrenia (조현병 관련 주요 일간지 기사에 대한 텍스트 마이닝 분석)

  • Nam, Hee Jung;Ryu, Seunghyong
    • Korean Journal of Schizophrenia Research / v.23 no.2 / pp.58-64 / 2020
  • Objectives: In this study, we conducted an exploratory analysis of the current media trends on schizophrenia using text-mining methods. Methods: First, web-crawling techniques extracted text data from 575 news articles in 10 major newspapers between 2018 and 2019, which were selected by searching "schizophrenia" in Naver News. We developed a document-term matrix (DTM) and/or term-document matrix (TDM) through pre-processing techniques. Using the DTM and TDM, frequency analysis, co-occurrence network analysis, and topic model analysis were conducted. Results: Frequency analysis showed that keywords such as "police," "mental illness," "admission," "patient," "crime," "apartment," "lethal weapon," "treatment," "Jinju," and "residents" were frequently mentioned in news articles on schizophrenia. Within the article text, many of these keywords were highly correlated with the term "schizophrenia" and were also interconnected with each other in the co-occurrence network. The latent Dirichlet allocation model presented 10 topics comprising a combination of keywords: "police-Jinju," "hospital-admission," "research-finding," "care-center," "schizophrenia-symptom," "society-issue," "family-mind," "woman-school," and "disabled-facilities." Conclusion: The results of the present study highlight that in recent years, the media has been reporting violence in patients with schizophrenia, thereby raising an important issue of hospitalization and community management of patients with schizophrenia.
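
The pipeline described in this abstract (document-term matrix, then frequency and co-occurrence analysis) can be illustrated with a minimal sketch. It is not the authors' implementation: the function names are hypothetical, tokenization is plain whitespace splitting (Korean text would normally need morphological analysis), and co-occurrence is assumed to mean "both terms appear in the same document".

```python
from collections import Counter
from itertools import combinations


def build_dtm(documents):
    """Return (vocabulary, matrix); matrix[d][t] counts term t in document d."""
    # Assumption: whitespace tokenization stands in for real pre-processing.
    tokenized = [doc.lower().split() for doc in documents]
    vocabulary = sorted({term for tokens in tokenized for term in tokens})
    index = {term: i for i, term in enumerate(vocabulary)}
    matrix = [[0] * len(vocabulary) for _ in tokenized]
    for row, tokens in zip(matrix, tokenized):
        for term in tokens:
            row[index[term]] += 1
    return vocabulary, matrix


def term_frequencies(vocabulary, matrix):
    """Sum each column of the DTM to get corpus-wide term counts."""
    return Counter(
        {term: sum(row[i] for row in matrix) for i, term in enumerate(vocabulary)}
    )


def cooccurrence_counts(vocabulary, matrix):
    """Count, for each pair of terms, the documents in which both appear."""
    pairs = Counter()
    for row in matrix:
        present = [vocabulary[i] for i, count in enumerate(row) if count > 0]
        # `present` follows the sorted vocabulary, so each pair is (a, b) with a < b.
        pairs.update(combinations(present, 2))
    return pairs


if __name__ == "__main__":
    # Made-up toy documents, not data from the study.
    docs = ["police patient crime", "patient treatment", "police patient"]
    vocab, dtm = build_dtm(docs)
    print(term_frequencies(vocab, dtm).most_common(2))
    print(cooccurrence_counts(vocab, dtm).most_common(1))
```

The pair counts can serve as edge weights in a co-occurrence network, with terms as nodes.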

Factors Influencing Subscribers' Voluntary Payment Behavior on an Online News Site: Focusing on the Role of Appreciation (온라인 뉴스 사이트에서 독자의 자발적 구독료 지불행위에 영향을 미치는 요인에 대한 연구: 공감의 역할을 중심으로)

  • Lee, Hyoung-Joo;Rhee, Hosung Timothy;Yang, Sung-Byung
    • Knowledge Management Research / v.14 no.4 / pp.1-17 / 2013
  • As online communities proliferate, online news sites have received great attention in news media research. Although most online news sites provide content for free, some have adopted the Pay-What-You-Want (PWYW) model by offering readers a voluntary payment option. In this study, we investigate the factors that influence subscribers' voluntary payment behavior on an online news site. Drawing upon both the Stimulus-Organism-Response (SOR) framework and the Elaboration Likelihood Model (ELM), we hypothesize that appreciation has a direct effect on subscribers' voluntary payment behavior, whereas central factors (positive emotional content, cognitive content) and peripheral factors (news sharing, news article length) of the news articles have indirect impacts on voluntary payment behavior through enhanced appreciation. Based on an empirical analysis of 172 news articles from a Korean online news site that adopted the PWYW pricing model (i.e., Ohmynews.com), we find that appreciation plays a critical role in voluntary payment behavior and that peripheral factors have significant impacts on appreciation. However, no impacts of central factors on appreciation are found. By identifying the factors influencing subscribers' voluntary payment behavior on online news sites for the first time, this paper suggests a prospective alternative profit model for online news providers faced with fierce competition.


A Study on Fake News Subject Matter, Presentation Elements, Tools of Detection, and Social Media Platforms in India

  • Kanozia, Rubal;Arya, Ritu;Singh, Satwinder;Narula, Sumit;Ganghariya, Garima
    • Asian Journal for Public Opinion Research / v.9 no.1 / pp.48-82 / 2021
  • This research article attempts to understand the current situation of fake news on social media in India. The study focused on four characteristics of fake news based on four research questions: subject matter, presentation elements of fake news, debunking tool(s) or technique(s) used, and the social media site on which the fake news story was shared. A systematic sampling method was used to select a sample of 90 debunked fake news stories from two Indian fact-checking websites, Alt News and Factly, from December 2019 to February 2020. Through content analysis, the four characteristics of the fake news stories were carefully analyzed, classified, coded, and presented. The results show that most of the fake news stories were related to politics in India. The majority of the fake news was shared via a video with text in which the narrative was changed to mislead users. For the largest number of debunked fake news stories, information from official or primary sources, such as reports, data, statements, announcements, or updates, was used to debunk false claims.