Search | Korea Science

Text Extraction in HIS Color Space by Weighting Scheme

Le, Thi Khue Van;Lee, Gueesang
- Smart Media Journal / v.2 no.1 / pp.31-36 / 2013
Robust and efficient text extraction is very important for the accuracy of Optical Character Recognition (OCR) systems. Natural scene images with degradations such as uneven illumination, perspective distortion, complex backgrounds, and multicolored text pose many challenges for computer vision tasks, especially text extraction. In this paper, we propose a method for extracting text from signboard images that combines the mean shift algorithm with a weighting scheme of hue and saturation in HSI color space for the clustering algorithm. The number of clusters is determined automatically by mean shift-based density estimation, in which local clusters are estimated by repeatedly searching for higher-density points in the feature vector space. The weighting scheme of hue and saturation is used to formulate a new distance measure in cylindrical coordinates for text extraction. Experimental results on various natural scene images are presented to demonstrate the effectiveness of our approach.
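The abstract does not give the distance formula, so the following is only a minimal sketch of what a weighted hue-saturation distance could look like. The function names, the weights `w_hue` and `w_sat`, and the normalization of the hue difference are all assumptions for illustration, not the authors' formulation. Hue is treated as an angle in radians so that the difference wraps around the cylinder.

```python
import math


def hue_difference(h1, h2):
    """Smallest angular difference between two hues, in radians (0..pi)."""
    d = abs(h1 - h2) % (2 * math.pi)
    return min(d, 2 * math.pi - d)


def weighted_hs_distance(p, q, w_hue=0.5, w_sat=0.5):
    """Weighted distance between two (hue, saturation) pairs.

    Hue is an angle in radians; saturation is assumed to lie in 0..1.
    The weights are hypothetical parameters chosen for this sketch.
    """
    h1, s1 = p
    h2, s2 = q
    dh = hue_difference(h1, h2) / math.pi  # normalize hue difference to 0..1
    ds = abs(s1 - s2)
    return math.sqrt(w_hue * dh ** 2 + w_sat * ds ** 2)
```

A distance of this kind could be passed to a clustering step in place of the plain Euclidean distance; raising `w_hue` makes clusters separate mainly by color tone, while raising `w_sat` emphasizes color purity.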

Text Mining and Visualization of Papers Reviews Using R Language

Li, Jiapei;Shin, Seong Yoon;Lee, Hyun Chang
- Journal of information and communication convergence engineering / v.15 no.3 / pp.170-174 / 2017
Nowadays, people share and discuss scientific papers on social media such as Web 2.0, big data, online forums, blogs, Twitter, Facebook, and scholarly communities. In addition to a variety of metrics such as numbers of citations, downloads, and recommendations, paper review text is also an effective resource for the study of scientific impact. Social media tools improve the research process by recording a series of online scholarly behaviors. This paper examines the huge amount of paper reviews generated on social media platforms to explore the implicit information about research papers. We implemented text mining on review texts using the R language and present the results. We found that Zika virus was the research hotspot and that association research methods were widely used in 2016. We also mined the news reviews about one paper and derived the public opinion.
https://doi.org/10.6109/jicce.2017.15.3.170

Measuring a Valence and Activation Dimension of Korean Emotion Terms using in Social Media (소셜 미디어에서 사용되는 한국어 정서 단어의 정서가, 활성화 차원 측정)

Rhee, Shin-Young;Ko, Il-Ju
- Science of Emotion and Sensibility / v.16 no.2 / pp.167-176 / 2013
User-created text data are increasing rapidly owing to the development of social media. In opinion mining, users' opinions are extracted by analyzing their text. A primary goal of sentiment analysis, as a branch of opinion mining, is to extract users' opinions from text, which requires building a list of emotion terms. In this paper, we built a list of emotion terms for analyzing sentiment in social media, using Facebook as a representative platform. We collected data from Facebook, selected emotion terms, and measured the dimensions of valence and activation through a survey. As a result, we built a list of 267 emotion terms with their valence and activation dimensions.

Media coverage of the conflicts over the 4th Industrial Revolution in the Republic of Korea from 2016 to 2020: a text-mining approach

Yang, Jiseong;Kim, Byungjun;Lee, Wonjae
- Asian Journal of Innovation and Policy / v.11 no.2 / pp.202-221 / 2022
The media has depicted an abrupt socio-technological change in the Republic of Korea with the 4th Industrial Revolution. Because technologies cannot realize their potential without social acceptance, studying conflicts incurred by such a change is imperative. However, little literature has focused on conflicts caused by technologies. Therefore, the current study investigated media coverage regarding conflicts related to the 4th Industrial Revolution from 2016 to 2020 in the Republic of Korea, applying text-mining techniques. We found that the overall amount and coverage pattern conform to the issue attention cycle. Also, the three major topics ("SMEs & Startups," "Mobility Conflict," and "Human & Technology") indicate quarrels between conflicting social entities. Moreover, the temporal change in media coverage implies a political use of the term rather than a technological one. However, we also found the media's deliberative discussion on the socio-technological impact. This study is significant because we expanded the discussion on media coverage of technologies to the realm of social conflicts. Furthermore, we explored the news articles of the recent five years with a text-mining approach that enhanced the objectivity of the research.
https://doi.org/10.7545/ajip.2022.11.2.202

A Study on architectural historic of Hotel DIABUTSU (대불호텔의 건축사적 고찰)

Sohn, Jang-Won;Cho, Hee-Ra
- Journal of The Korean Digital Architecture Interior Association / v.11 no.3 / pp.27-34 / 2011
The DIABUTSU Hotel was the first hotel built in Korea, and it is commonly believed to have been built in 1888. However, many questions remain. This study was conducted to uncover the truth. Non-text media are a useful source for such a study, but they are not used in Korea. This study therefore relies on non-text media. The findings show that the DIABUTSU Hotel was built in 1884 as a Japanese-style two-story wooden building. HORI provided hospitality there, and many foreigners stayed. Underwood, Appenzeller, and Carles were at this hotel and wrote about it in 1885. The three-story building is commonly known as the first hotel, but this is incorrect. The first hotel was the Japanese-style wooden building built in 1884.

Machine Printed and Handwritten Text Discrimination in Korean Document Images

Trieu, Son Tung;Lee, Guee Sang
- Smart Media Journal / v.5 no.3 / pp.30-34 / 2016
Nowadays, there are many Korean documents that need to be identified as either printed or handwritten text. Early identification methods use structural features, which can be simple and easy to apply to text of a specific font, but their performance depends on the font type and characteristics of the text. Recently, the bag-of-words model has been used for identification, as it can be invariant to changes in font size, distortions, or modifications to the text. The method based on the bag-of-words model includes three steps: word segmentation using connected component grouping, feature extraction, and finally classification using an SVM (Support Vector Machine). In this paper, a method based on the bag-of-words model is proposed, using SURF (Speeded Up Robust Features), for identifying machine-printed and handwritten text in Korean documents. The experiment shows that the proposed method outperforms methods based on structural features.

Mass Media and Social Media Agenda Analysis Using Text Mining : focused on '5-day Rotation Mask Distribution System' (텍스트 마이닝을 활용한 매스 미디어와 소셜 미디어 의제 분석 : '마스크 5부제'를 중심으로)

Lee, Sae-Mi;Ryu, Seung-Eui;Ahn, Soonjae
- The Journal of the Korea Contents Association / v.20 no.6 / pp.460-469 / 2020
This study analyzes online news articles and cafe articles on the '5-day Rotation Mask Distribution System', which has recently emerged as an issue due to the COVID-19 outbreak, to identify the mass media and social media agendas containing media and public reactions. This study identified the difference between mass media and social media. For the analysis, we collected 2,096 full-text articles from Naver and 1,840 posts from Naver Cafe, and conducted word frequency analysis, word cloud, and LDA topic modeling analysis after data preprocessing and refinement. The analysis showed that social media featured real-life topics such as 'family members' purchase', 'the postponement of school opening', 'mask usage', and 'mask purchase', reflecting the characteristics of personal media. Social media was found to play a role in exchanging personal opinions, emotions, and information rather than delivering information. By applying the research method used in this study, social issues can be publicized through analysis of various media and used as a reference in the process of establishing a policy agenda that evolves into a government agenda.
https://doi.org/10.5392/JKCA.2020.20.06.460

Major concerns regarding food services based on news media reports during the COVID-19 outbreak using the topic modeling approach

Yoon, Hyejin;Kim, Taejin;Kim, Chang-Sik;Kim, Namgyu
- Nutrition Research and Practice / v.15 no.sup1 / pp.110-121 / 2021
BACKGROUND/OBJECTIVES: Coronavirus disease 2019 (COVID-19) cases were first reported in December 2019, in China, and an increasing number of cases have since been detected all over the world. The purpose of this study was to collect significant news media reports on food services during the COVID-19 crisis and identify public communication and significant concerns regarding COVID-19 for suggesting future directions for the food industry and services. SUBJECTS/METHODS: News articles pertaining to food services were extracted from the home pages of major news media websites such as BBC, CNN, and Fox News between March 2020 and February 2021. The retrieved data was sorted and analyzed using Python software. RESULTS: The results of text analytics were presented in the format of the topic label and category for individual topics. The food and health category presented the effects of the COVID-19 pandemic on food and health, such as an increase in delivery services. The policy category was indicative of a change in government policy. The lifestyle change category addressed topics such as an increase in social media usage. CONCLUSIONS: This study is the first to analyze major news media (i.e., BBC, CNN, and Fox News) data related to food services in the context of the COVID-19 pandemic. Text analytics research on the food services domain revealed different categories such as food and health, policy, and lifestyle change. Therefore, this study contributes to the body of knowledge on food services research, through the use of text analytics to elicit findings from media sources.
https://doi.org/10.4162/nrp.2021.15.S1.S110

Text-Mining Analyses of News Articles on Schizophrenia (조현병 관련 주요 일간지 기사에 대한 텍스트 마이닝 분석)

Nam, Hee Jung;Ryu, Seunghyong
- Korean Journal of Schizophrenia Research / v.23 no.2 / pp.58-64 / 2020
Objectives: In this study, we conducted an exploratory analysis of the current media trends on schizophrenia using text-mining methods. Methods: First, web-crawling techniques were used to extract text data from 575 news articles in 10 major newspapers between 2018 and 2019, which were selected by searching for "schizophrenia" in Naver News. We developed a document-term matrix (DTM) and/or term-document matrix (TDM) through pre-processing techniques. Using the DTM and TDM, frequency analysis, co-occurrence network analysis, and topic model analysis were conducted. Results: Frequency analysis showed that keywords such as "police," "mental illness," "admission," "patient," "crime," "apartment," "lethal weapon," "treatment," "Jinju," and "residents" were frequently mentioned in news articles on schizophrenia. Within the article text, many of these keywords were highly correlated with the term "schizophrenia" and were also interconnected with each other in the co-occurrence network. The latent Dirichlet allocation model presented 10 topics comprising a combination of keywords: "police-Jinju," "hospital-admission," "research-finding," "care-center," "schizophrenia-symptom," "society-issue," "family-mind," "woman-school," and "disabled-facilities." Conclusion: The results of the present study highlight that in recent years, the media has been reporting violence in patients with schizophrenia, thereby raising an important issue of hospitalization and community management of patients with schizophrenia.
https://doi.org/10.16946/kjsr.2020.23.2.58
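For readers unfamiliar with the matrices named in this abstract, the sketch below shows one simple way a document-term matrix and document-level co-occurrence counts could be built. It is an illustration only: the whitespace tokenizer, the function names `build_dtm` and `cooccurrence`, and the sample documents are assumptions, and the study's actual pre-processing is not described in the abstract.

```python
from collections import Counter
from itertools import combinations


def build_dtm(docs):
    """Return (vocabulary, matrix) where matrix[i][j] counts term j in document i."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({term for tokens in tokenized for term in tokens})
    matrix = []
    for tokens in tokenized:
        counts = Counter(tokens)
        matrix.append([counts[term] for term in vocab])
    return vocab, matrix


def cooccurrence(docs):
    """Count how many documents each unordered pair of terms appears in together."""
    pairs = Counter()
    for doc in docs:
        terms = sorted(set(doc.lower().split()))
        pairs.update(combinations(terms, 2))
    return pairs


# Made-up sample documents, not data from the study.
docs = ["police crime police", "patient treatment", "police patient"]
vocab, dtm = build_dtm(docs)
tdm = [list(column) for column in zip(*dtm)]  # the TDM is the transpose of the DTM
pair_counts = cooccurrence(docs)
```

Column sums of the DTM give the term frequencies used in frequency analysis, and the pair counts can serve as edge weights in a co-occurrence network.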

Analysis of Social Media Utilization based on Big Data-Focusing on the Chinese Government Weibo

Li, Xiang;Guo, Xiaoqin;Kim, Soo Kyun;Lee, Hyukku
- KSII Transactions on Internet and Information Systems (TIIS) / v.16 no.8 / pp.2571-2586 / 2022
The rapid popularity of government social media has generated huge amounts of text data, and the analysis of these data has gradually become the focus of digital government research. This study uses the Python language to analyze the big data of the Chinese provincial government Weibo. First, this study uses a web crawler approach to collect and statistically describe over 360,000 records from 31 provincial government microblogs in China, covering the period from January 2018 to April 2022. Second, a word segmentation engine is constructed, and these text data are analyzed using word cloud word frequencies as well as semantic relationships. Finally, the text data are analyzed for sentiment using natural language processing methods, and the text topics are studied using the LDA algorithm. The results of this study show that, first, the number and scale of posts on the Chinese government Weibo have grown rapidly. Second, government Weibo has certain social attributes, and the epidemics, people's livelihood, and services have become the focus of government Weibo. Third, the contents of government Weibo account for more than 30% of negative sentiments. The classified topics show that the epidemics and epidemic prevention and control overshadowed the other topics, which inhibits the diversification of government Weibo.
https://doi.org/10.3837/tiis.2022.08.006

Search Result 825, Processing Time 0.023 seconds
