Search | Korea Science

Automatic Determination of Usenet News Groups from User Profile (사용자 프로파일에 기초한 유즈넷 뉴스그룹 자동 결정 방법)

Kim, Jong-Wan;Cho, Kyu-Cheol;Kim, Hee-Jae;Kim, Byeong-Man
- Journal of the Korean Institute of Intelligent Systems
- /
- v.14 no.2
- /
- pp.142-149
- /
- 2004
It is important to retrieve exact information coinciding with user's need from lots of Usenet news and filter desired information quickly. Differently from email system, we must previously register our interesting news group if we want to get the news information. However, it is not easy for a novice to decide which news group is relevant to his or her interests. In this work, we present a service classifying user preferred news groups among various news groups by the use of Kohonen network. We first extract candidate terms from example documents and then choose a number of representative keywords to be used in Kohonen network from them through fuzzy inference. From the observation of training patterns, we could find the sparsity problem that lots of keywords in training patterns are empty. Thus, a new method to train neural network through reduction of unnecessary dimensions by the statistical coefficient of determination is proposed in this paper. Experimental results show that the proposed method is superior to the method using every dimension in terms of cluster overlap defined by using within cluster distance and between cluster distance.
https://doi.org/10.5391/JKIIS.2004.14.2.142 인용 PDF KSCI

PMCN: Combining PDF-modified Similarity and Complex Network in Multi-document Summarization

Tu, Yi-Ning;Hsu, Wei-Tse
- International Journal of Knowledge Content Development & Technology
- /
- v.9 no.3
- /
- pp.23-41
- /
- 2019
This study combines the concept of degree centrality in complex network with the Term Frequency $^*$ Proportional Document Frequency ($TF^*PDF$) algorithm; the combined method, called PMCN (PDF-Modified similarity and Complex Network), constructs relationship networks among sentences for writing news summaries. The PMCN method is a multi-document summarization extension of the ideas of Bun and Ishizuka (2002), who first published the $TF^*PDF$ algorithm for detecting hot topics. In their $TF^*PDF$ algorithm, Bun and Ishizuka defined the publisher of a news item as its channel. If the PDF weight of a term is higher than the weights of other terms, then the term is hotter than the other terms. However, this study attempts to develop summaries for news items. Because the $TF^*PDF$ algorithm summarizes daily news, PMCN replaces the concept of "channel" with "the date of the news event", and uses the resulting chronicle ordering for a multi-document summarization algorithm, of which the F-measure scores were 0.042 and 0.051 higher than LexRank for the famous d30001t and d30003t tasks, respectively.
https://doi.org/10.5865/IJKCT.2019.9.3.023 인용 PDF KSCI

Content-based Recommendation Based on Social Network for Personalized News Services (개인화된 뉴스 서비스를 위한 소셜 네트워크 기반의 콘텐츠 추천기법)

Hong, Myung-Duk;Oh, Kyeong-Jin;Ga, Myung-Hyun;Jo, Geun-Sik
- Journal of Intelligence and Information Systems
- /
- v.19 no.3
- /
- pp.57-71
- /
- 2013
Over a billion people in the world generate new news minute by minute. People forecasts some news but most news are from unexpected events such as natural disasters, accidents, crimes. People spend much time to watch a huge amount of news delivered from many media because they want to understand what is happening now, to predict what might happen in the near future, and to share and discuss on the news. People make better daily decisions through watching and obtaining useful information from news they saw. However, it is difficult that people choose news suitable to them and obtain useful information from the news because there are so many news media such as portal sites, broadcasters, and most news articles consist of gossipy news and breaking news. User interest changes over time and many people have no interest in outdated news. From this fact, applying users' recent interest to personalized news service is also required in news service. It means that personalized news service should dynamically manage user profiles. In this paper, a content-based news recommendation system is proposed to provide the personalized news service. For a personalized service, user's personal information is requisitely required. Social network service is used to extract user information for personalization service. The proposed system constructs dynamic user profile based on recent user information of Facebook, which is one of social network services. User information contains personal information, recent articles, and Facebook Page information. Facebook Pages are used for businesses, organizations and brands to share their contents and connect with people. Facebook users can add Facebook Page to specify their interest in the Page. The proposed system uses this Page information to create user profile, and to match user preferences to news topics. However, some Pages are not directly matched to news topic because Page deals with individual objects and do not provide topic information suitable to news. Freebase, which is a large collaborative database of well-known people, places, things, is used to match Page to news topic by using hierarchy information of its objects. By using recent Page information and articles of Facebook users, the proposed systems can own dynamic user profile. The generated user profile is used to measure user preferences on news. To generate news profile, news category predefined by news media is used and keywords of news articles are extracted after analysis of news contents including title, category, and scripts. TF-IDF technique, which reflects how important a word is to a document in a corpus, is used to identify keywords of each news article. For user profile and news profile, same format is used to efficiently measure similarity between user preferences and news. The proposed system calculates all similarity values between user profiles and news profiles. Existing methods of similarity calculation in vector space model do not cover synonym, hypernym and hyponym because they only handle given words in vector space model. The proposed system applies WordNet to similarity calculation to overcome the limitation. Top-N news articles, which have high similarity value for a target user, are recommended to the user. To evaluate the proposed news recommendation system, user profiles are generated using Facebook account with participants consent, and we implement a Web crawler to extract news information from PBS, which is non-profit public broadcasting television network in the United States, and construct news profiles. We compare the performance of the proposed method with that of benchmark algorithms. One is a traditional method based on TF-IDF. Another is 6Sub-Vectors method that divides the points to get keywords into six parts. Experimental results demonstrate that the proposed system provide useful news to users by applying user's social network information and WordNet functions, in terms of prediction error of recommended news.
https://doi.org/10.13088/jiis.2013.19.3.057 인용 PDF KSCI

Quality and Ratings in the Performances of TV News Programs (지상파뉴스의 품질과 시청률의 상관관계에 대한 연구)

Kim, Eujong;Oh, Hyun-kyung
- The Journal of the Korea Contents Association
- /
- v.19 no.12
- /
- pp.249-258
- /
- 2019
Changes in media technolgy affect the competitive status of broadcasting networks as news media. The competitive media environment has pushed broadcasting network news programs to find new ways for leveling their qualitative performance up and rating. This study focuses on the empirical relationship between the two key value, news quality in terms of fairness and in-depthness and news ratings. This study is based on the analysis of broadcasting network news texts and individual news item raitngs. Empirical relationship between news quality factors and ratings was proved positive. But the relationship between the length of news item and rating was proved negative.
https://doi.org/10.5392/JKCA.2019.19.12.249 인용 PDF KSCI

Urdu News Classification using Application of Machine Learning Algorithms on News Headline

Khan, Muhammad Badruddin
- International Journal of Computer Science & Network Security
- /
- v.21 no.2
- /
- pp.229-237
- /
- 2021
Our modern 'information-hungry' age demands delivery of information at unprecedented fast rates. Timely delivery of noteworthy information about recent events can help people from different segments of life in number of ways. As world has become global village, the flow of news in terms of volume and speed demands involvement of machines to help humans to handle the enormous data. News are presented to public in forms of video, audio, image and text. News text available on internet is a source of knowledge for billions of internet users. Urdu language is spoken and understood by millions of people from Indian subcontinent. Availability of online Urdu news enable this branch of humanity to improve their understandings of the world and make their decisions. This paper uses available online Urdu news data to train machines to automatically categorize provided news. Various machine learning algorithms were used on news headline for training purpose and the results demonstrate that Bernoulli Naïve Bayes (Bernoulli NB) and Multinomial Naïve Bayes (Multinomial NB) algorithm outperformed other algorithms in terms of all performance parameters. The maximum level of accuracy achieved for the dataset was 94.278% by multinomial NB classifier followed by Bernoulli NB classifier with accuracy of 94.274% when Urdu stop words were removed from dataset. The results suggest that short text of headlines of news can be used as an input for text categorization process.
https://doi.org/10.22937/IJCSNS.2021.21.2.27 인용 PDF KSCI

Predicting Stock Prices Based on Online News Content and Technical Indicators by Combinatorial Analysis Using CNN and LSTM with Self-attention

Sang Hyung Jung;Gyo Jung Gu;Dongsung Kim;Jong Woo Kim
- Asia pacific journal of information systems
- /
- v.30 no.4
- /
- pp.719-740
- /
- 2020
The stock market changes continuously as new information emerges, affecting the judgments of investors. Online news articles are valued as a traditional window to inform investors about various information that affects the stock market. This paper proposed new ways to utilize online news articles with technical indicators. The suggested hybrid model consists of three models. First, a self-attention-based convolutional neural network (CNN) model, considered to be better in interpreting the semantics of long texts, uses news content as inputs. Second, a self-attention-based, bi-long short-term memory (bi-LSTM) neural network model for short texts utilizes news titles as inputs. Third, a bi-LSTM model, considered to be better in analyzing context information and time-series models, uses 19 technical indicators as inputs. We used news articles from the previous day and technical indicators from the past seven days to predict the share price of the next day. An experiment was performed with Korean stock market data and news articles from 33 top companies over three years. Through this experiment, our proposed model showed better performance than previous approaches, which have mainly focused on news titles. This paper demonstrated that news titles and content should be treated in different ways for superior stock price prediction.
https://doi.org/10.14329/apjis.2020.30.4.719 인용 PDF

A Keyword Network Analysis on Health Disparity in Korea: Focusing on News and its application to Physical Education

Kim, Woo-Kyung
- Journal of the Korea Society of Computer and Information
- /
- v.24 no.3
- /
- pp.143-150
- /
- 2019
This study aimed to analyze the keyword related to Health Disparity in Korea through the method of keyword network analysis and to establish a basic database for suggesting ideas for prospective studies in physical education. To achieve the goal, this study crawled co-occured keyword with 'health' and 'disparity' from news casted in 20 different channels. The duration of the news was 3 months, from September 11th, 2018 to December 11th. The results are as follows. First, among the news during recent 3 months, there were 1,383 keyword related to health disparity and this study selected 173 keyword which had co-occured over 3 times. Second, the inclusiveness of the network was 97.674% and the density was .038. Third, analyzing news related to health disparity, 'mortality' was the most co-occured keyword and 'disparity', 'reinforcement', 'the most', 'health', '6 times', 'Seoul', 'half', 'medicine', and 'local' were shown similarly. And common keyword in 4 centrality were 13 keyword. Lastly, by analyzing eigenvector centrality, significantly different result has shown. 'Disparity' was the most co-occured keyword. Based on this result, this study showed the necessity for reinforcing the public physical education in public education system in Korea. In order to achieve it, the field of physical education must look beyond present elite-focused physical education to public physical activity.
https://doi.org/10.9708/jksci.2019.24.03.143 인용 PDF KSCI HTML

User-Perspective Issue Clustering Using Multi-Layered Two-Mode Network Analysis (다계층 이원 네트워크를 활용한 사용자 관점의 이슈 클러스터링)

Kim, Jieun;Kim, Namgyu;Cho, Yoonho
- Journal of Intelligence and Information Systems
- /
- v.20 no.2
- /
- pp.93-107
- /
- 2014
In this paper, we report what we have observed with regard to user-perspective issue clustering based on multi-layered two-mode network analysis. This work is significant in the context of data collection by companies about customer needs. Most companies have failed to uncover such needs for products or services properly in terms of demographic data such as age, income levels, and purchase history. Because of excessive reliance on limited internal data, most recommendation systems do not provide decision makers with appropriate business information for current business circumstances. However, part of the problem is the increasing regulation of personal data gathering and privacy. This makes demographic or transaction data collection more difficult, and is a significant hurdle for traditional recommendation approaches because these systems demand a great deal of personal data or transaction logs. Our motivation for presenting this paper to academia is our strong belief, and evidence, that most customers' requirements for products can be effectively and efficiently analyzed from unstructured textual data such as Internet news text. In order to derive users' requirements from textual data obtained online, the proposed approach in this paper attempts to construct double two-mode networks, such as a user-news network and news-issue network, and to integrate these into one quasi-network as the input for issue clustering. One of the contributions of this research is the development of a methodology utilizing enormous amounts of unstructured textual data for user-oriented issue clustering by leveraging existing text mining and social network analysis. In order to build multi-layered two-mode networks of news logs, we need some tools such as text mining and topic analysis. We used not only SAS Enterprise Miner 12.1, which provides a text miner module and cluster module for textual data analysis, but also NetMiner 4 for network visualization and analysis. Our approach for user-perspective issue clustering is composed of six main phases: crawling, topic analysis, access pattern analysis, network merging, network conversion, and clustering. In the first phase, we collect visit logs for news sites by crawler. After gathering unstructured news article data, the topic analysis phase extracts issues from each news article in order to build an article-news network. For simplicity, 100 topics are extracted from 13,652 articles. In the third phase, a user-article network is constructed with access patterns derived from web transaction logs. The double two-mode networks are then merged into a quasi-network of user-issue. Finally, in the user-oriented issue-clustering phase, we classify issues through structural equivalence, and compare these with the clustering results from statistical tools and network analysis. An experiment with a large dataset was performed to build a multi-layer two-mode network. After that, we compared the results of issue clustering from SAS with that of network analysis. The experimental dataset was from a web site ranking site, and the biggest portal site in Korea. The sample dataset contains 150 million transaction logs and 13,652 news articles of 5,000 panels over one year. User-article and article-issue networks are constructed and merged into a user-issue quasi-network using Netminer. Our issue-clustering results applied the Partitioning Around Medoids (PAM) algorithm and Multidimensional Scaling (MDS), and are consistent with the results from SAS clustering. In spite of extensive efforts to provide user information with recommendation systems, most projects are successful only when companies have sufficient data about users and transactions. Our proposed methodology, user-perspective issue clustering, can provide practical support to decision-making in companies because it enhances user-related data from unstructured textual data. To overcome the problem of insufficient data from traditional approaches, our methodology infers customers' real interests by utilizing web transaction logs. In addition, we suggest topic analysis and issue clustering as a practical means of issue identification.
https://doi.org/10.13088/jiis.2014.20.2.093 인용 PDF KSCI

A Study on the Legal Regulation of 'Fake News' in the Age of Social Network Services : Focusing on the French Les propositions de loi contre la manipulation de l' information (소셜네트워크서비스 시대 가짜뉴스의 법적 규제에 대한 고찰 : 프랑스 정보조작대처법을 중심으로)

Sunhye Kwak;Sungwook Lee
- Journal of Service Research and Studies
- /
- v.12 no.3
- /
- pp.144-157
- /
- 2022
This study began by pointing out the problem of domestic media reporting on 'fake news' regulations that frequently appear through the French 'Les proposals de loi control de l'information'case, while still approaching with different standards and perspectives on where to see fake news. In the age of 'social network services', the answer to what the media is, what the news is, and who the reporter is increasingly difficult. While reviewing the long history and background of the spread of fake news examined in this study, it was confirmed that could not determine the concept and scope of fake news, punished, regulated, controlled, or judged simply by one standard. From the perspective of 'freedom of expression' set by the law, we have the authority to express our opinions freely. In addition, 'online' space is a place where fake news is generated and spread, but at the same time, there is plenty of room to act as an antidote. In the end, the only alternative to the damage of long-term fake news will be to create a media environment that allows more high-quality "real news" to pour out, allowing us to develop our ability to judge reliable information through balanced competition among various news in the free market of ideas.
https://doi.org/10.18807/jsrs.2022.12.3.144 인용 PDF KSCI

Analysis of Fake News in the 2017 Korean Presidential Election

Go, Seon-gyu;Lee, Mi-ran
- Asian Journal for Public Opinion Research
- /
- v.8 no.2
- /
- pp.105-125
- /
- 2020
The purpose of this paper is to analyze 1) who created and distributed fake news, 2) the distribution channels of fake news, 3) who fake news has targeted, and 4) the effects on voting and the impact of fake news on Korean politics. In South Korea, fake news was mainly created by candidates or election campaigns. The reason is that in the wake of the impeachment of President Park Guen Hye, all the political parties in Korea used fake news as a means of mobilizing supporters for each of their candidates or parties to gain an advantage in situations involving political divisions and confrontations between the pro-impeachment, progressive young generation and anti-impeachment, conservative senior generation. Voters' media usage patterns were polarized through social network services (SNS) media and television. Fake news was mostly received through these two media outlets. According to the spreading structure of fake news in Korea, the younger generation generally uses SNS posts intended for unspecified individuals, and the older generation uses closed SNS like KakaoTalk or Naver's BAND. In the end, it is typically characteristic of the older generation to spread fake news through existing offline human networks. In the 2017 presidential election, fake news has been confirmed to have the effect of mobilizing supporters for each political party. In the presidential election, an increase in voter turnout was confirmed among those in their 20s and those in their 60s or older. Evidently, fake news influenced the election of Moon Jae-In. The influence of fake news is expected to grow further as ideological polarization and consequent political polarization continues to intensify in South Korea.
https://doi.org/10.15206/ajpor.2020.8.2.105 인용 PDF KSCI

Search Result 346, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)