Search | Korea Science

Adaptive Character Segmentation to Improve Text Recognition Accuracy on Mobile Phones (모바일 시스템에서 텍스트 인식 위한 적응적 문자 분할)

Kim, Jeong Sik;Yang, Hyung Jeong;Kim, Soo Hyung;Lee, Guee Sang;Do, Luu Ngoc;Kim, Sun Hee
- Smart Media Journal
- /
- v.1 no.4
- /
- pp.59-71
- /
- 2012
Since mobile phones are used as common communication devices, their applications are increasingly important to human's life. Using smart-phones camera to collect daily life environment's information is one of targets for many applications such as text recognition, object recognition or context awareness. Studies have been conducted to provide important information through the recognition of texts, which are artificially or naturally included in images and movies acquired from mobile phones. In this study, a character segmentation method that improves character-recognition accuracy in images obtained from mobile phone cameras is proposed. The proposed method first classifies texts in a given image to printed letters and handwritten letters since segmentation approaches for them are different. For printed letters, rough segmentation process is conducted, then the segmented regions are integrated, deleted, and re-segmented. Segmentation for the handwritten letters is performed after skews are corrected and the characters are classified by integrating them. The experimental result shows our method achieves a successful performance for both printed and handwritten letters as 95.9% and 84.7%, respectively.
PDF

A Computer-Aided Text Analysis to Explore Recruitment and Intellectual Polarization Strategies in ISIS Media

Khafaga, Ayman Farid
- International Journal of Computer Science & Network Security
- /
- v.22 no.8
- /
- pp.87-96
- /
- 2022
This paper employs a computer-aided text analysis (CATA) and a Critical Discourse Analysis (CDA) to explore the strategies of recruitment and intellectual polarization in ISIS (Islamic State in Iraq and Syria) media. The paper's main objective is to shed light on the efficacy of employing computer software in the linguistic analysis of texts, and the extent to which CATA software contribute to deciphering hidden meanings of texts as well as to arrive at concise and authentic results from these texts. More specifically, this paper attempts to demonstrate the contribution of CATA software represented in the two variables of Frequency Distribution Analysis (FDA) and Content Analysis (CA) in decoding the strategies of recruitment and intellectual polarization in one of ISIS 's digital publication: Rumiyah (a digital magazine published by ISIS). The analytical focus is on three strategies of recruitment and intellectual polarization: (i) lexicalization, (ii) intertextual religionisation, and (iii) justification. Two main findings are revealed in this study. First, the application of CATA software into the linguistic investigation of texts contributes effectively to the understanding of the thematic and ideological messages pertaining to the analyzed text. Second, the computational analysis guarantees concise, credible, authentic and ample results than is the case if the analysis is conducted without the work of computer software. The paper, therefore, recommends the integration of CATA software into the linguistic analysis of the various types of texts.
https://doi.org/10.22937/IJCSNS.2022.22.8.12 인용 PDF KSCI

A Study on the Consumer Boycott Participation Experience: Using Text Mining Analysis and In-depth Interview (소비자불매운동 참여 경험에 관한 연구: 텍스트마이닝 분석과 심층면접기법의 활용)

Han, Juno;Li, Xu;Hwang, Hyesun
- The Journal of the Korea Contents Association
- /
- v.22 no.2
- /
- pp.88-106
- /
- 2022
This study examined the social discourse on consumer boycott and explored consumer experience using text mining of mass media and social media data and the in-depth interview. The result showed that the topics of online news related to the boycott included the causes of the boycott, the responses of each actor in the process of the boycott, and the effects of the boycott. In the result of the in-depth interviews, it was found that the boycott has been decentralized and the participants had the experience of exploring and verifying information on their own. In the boycott process, there were mixed experiences due to the absence of substitutes and the marketing influence, and positive experiences of expressing one's thoughts and strengthening beliefs through the boycott.
https://doi.org/10.5392/JKCA.2022.22.02.088 인용 PDF KSCI HTML

Korean Text to Gloss: Self-Supervised Learning approach

Thanh-Vu Dang;Gwang-hyun Yu;Ji-yong Kim;Young-hwan Park;Chil-woo Lee;Jin-Young Kim
- Smart Media Journal
- /
- v.12 no.1
- /
- pp.32-46
- /
- 2023
Natural Language Processing (NLP) has grown tremendously in recent years. Typically, bilingual, and multilingual translation models have been deployed widely in machine translation and gained vast attention from the research community. On the contrary, few studies have focused on translating between spoken and sign languages, especially non-English languages. Prior works on Sign Language Translation (SLT) have shown that a mid-level sign gloss representation enhances translation performance. Therefore, this study presents a new large-scale Korean sign language dataset, the Museum-Commentary Korean Sign Gloss (MCKSG) dataset, including 3828 pairs of Korean sentences and their corresponding sign glosses used in Museum-Commentary contexts. In addition, we propose a translation framework based on self-supervised learning, where the pretext task is a text-to-text from a Korean sentence to its back-translation versions, then the pre-trained network will be fine-tuned on the MCKSG dataset. Using self-supervised learning help to overcome the drawback of a shortage of sign language data. Through experimental results, our proposed model outperforms a baseline BERT model by 6.22%.
https://doi.org/10.30693/SMJ.2023.12.1.32 인용 PDF

A Study on Recognition of Robot Barista Using Social Media Text Mining (소셜미디어 텍스트마이닝을 활용한 로봇 바리스타 인식 탐색 연구)

Han Jangheon;An Kabsoo
- Journal of Korea Society of Digital Industry and Information Management
- /
- v.20 no.2
- /
- pp.37-47
- /
- 2024
The food tech market, which uses artificial intelligence robots for the restaurant industry, is gradually expanding. Among them, the robot barista, a representative food tech case for the restaurant industry, is characterized by increasing the efficiency of operators and providing things for visitors to see and enjoy through a 24-hour unmanned operation. This research was conducted through text mining analysis to examine trends related to robot baristas in the restaurant industry. The research results are as follows. First, keywords such as coffee, cafe, certification, ordering, taste, interest, people, robot cafe, coffee barista expert, free, course, unmanned, and wine sommelier were highly frequent. Second, time, variety, possibility, people, process, operation, service, and thought showed high closeness centrality. Third, as a result of CONCOR analysis, a total of 5 keyword clusters with high relevance to the restaurant industry were formed. In order to activate robot barista in the future, it is necessary to pay more attention to functional development that can strengthen its functions and features, as well as online promotion through various events and SNS in the robot barista cafe.
https://doi.org/10.17662/ksdim.2024.20.2.037 인용 PDF HTML

A Deep Learning-based Depression Trend Analysis of Korean on Social Media (딥러닝 기반 소셜미디어 한글 텍스트 우울 경향 분석)

Park, Seojeong;Lee, Soobin;Kim, Woo Jung;Song, Min
- Journal of the Korean Society for information Management
- /
- v.39 no.1
- /
- pp.91-117
- /
- 2022
The number of depressed patients in Korea and around the world is rapidly increasing every year. However, most of the mentally ill patients are not aware that they are suffering from the disease, so adequate treatment is not being performed. If depressive symptoms are neglected, it can lead to suicide, anxiety, and other psychological problems. Therefore, early detection and treatment of depression are very important in improving mental health. To improve this problem, this study presented a deep learning-based depression tendency model using Korean social media text. After collecting data from Naver KonwledgeiN, Naver Blog, Hidoc, and Twitter, DSM-5 major depressive disorder diagnosis criteria were used to classify and annotate classes according to the number of depressive symptoms. Afterwards, TF-IDF analysis and simultaneous word analysis were performed to examine the characteristics of each class of the corpus constructed. In addition, word embedding, dictionary-based sentiment analysis, and LDA topic modeling were performed to generate a depression tendency classification model using various text features. Through this, the embedded text, sentiment score, and topic number for each document were calculated and used as text features. As a result, it was confirmed that the highest accuracy rate of 83.28% was achieved when the depression tendency was classified based on the KorBERT algorithm by combining both the emotional score and the topic of the document with the embedded text. This study establishes a classification model for Korean depression trends with improved performance using various text features, and detects potential depressive patients early among Korean online community users, enabling rapid treatment and prevention, thereby enabling the mental health of Korean society. It is significant in that it can help in promotion.
https://doi.org/10.3743/KOSIM.2022.39.1.091 인용 PDF KSCI

A Web-Based Multimedia Dictionary System Supporting Media Synchronization (미디어 동기화를 지원하는 웹기반 멀티미디어 전자사전 시스템)

Choi, Yong-Jun;Hwang, Do-Sam
- Journal of Korea Multimedia Society
- /
- v.7 no.8
- /
- pp.1145-1161
- /
- 2004
The purpose of this research is to establish a method for the construction of a multimedia electronic dictionary system by integrating the media data available from linguistic resources on the Internet. As the result of this study, existing text-oriented electronic dictionary systems can be developed into multimedia lexical systems with greater efficiency and effectiveness. A method is proposed to integrate the media data of linguistic resources on the Internet by a web browser. In the proposed method, a web browser carries out all the work related to integration of media data, and it does not need a dedicated server system. The system constructed by our web browser environment integrates text, image, and voice sources, and also can produce moving pictures. Each media is associated with the meaning of data so that the data integration and movement may be specified in the associations. SMIL documents are generated by analyzing the meaning of each data unit and they are executed in a web browser. The proposed system can be operated without a dedicated server system. And also, the system saves storage space by sharing the each media data distributed on the Internet, and makes it easier to update data.
PDF

Multimodal Media Content Classification using Keyword Weighting for Recommendation (추천을 위한 키워드 가중치를 이용한 멀티모달 미디어 콘텐츠 분류)

Kang, Ji-Soo;Baek, Ji-Won;Chung, Kyungyong
- Journal of Convergence for Information Technology
- /
- v.9 no.5
- /
- pp.1-6
- /
- 2019
As the mobile market expands, a variety of platforms are available to provide multimodal media content. Multimodal media content contains heterogeneous data, accordingly, user requires much time and effort to select preferred content. Therefore, in this paper we propose multimodal media content classification using keyword weighting for recommendation. The proposed method extracts keyword that best represent contents through keyword weighting in text data of multimodal media contents. Based on the extracted data, genre class with subclass are generated and classify appropriate multimodal media contents. In addition, the user's preference evaluation is performed for personalized recommendation, and multimodal content is recommended based on the result of the user's content preference analysis. The performance evaluation verifies that it is superiority of recommendation results through the accuracy and satisfaction. The recommendation accuracy is 74.62% and the satisfaction rate is 69.1%, because it is recommended considering the user's favorite the keyword as well as the genre.
https://doi.org/10.22156/CS4SMB.2019.9.5.001 인용 PDF KSCI HTML

The Persuasive Impact of Fit between Message Goals(Promotion vs. Prevention) and Modality of Message on Social Media (메시지 조절목표와 메시지 형식 간 적합성이 메시지 설득력에 미치는 영향)

Kim, Dong Hoo;Song, Young-A
- The Journal of the Korea Contents Association
- /
- v.21 no.2
- /
- pp.604-621
- /
- 2021
Examination of the concurrent evolution of communication tools and eating behaviors over recent decades reveals that social media and other forms of digital content have become powerful new driving forces for nutritional choices and food consumption. The purpose of this research was to examine the effect between goal orientation of message (promotion versus prevention) and the type of message (text versus image) on effectiveness of the message. The findings showed that individuals exposed to a promotion-focused message similarly responded to the message regardless of the type of the message. By contrast, those who exposed to a prevention-focused message showed significantly more positive responses to the message posted on the text-based social media than the message on the image-based social media. The findings indicated that, if presented effectively, social media could be harnessed to promote healthier eating habits and behaviors, prevent those which can be harmful, and ultimately improve an individual's daily food consumption and overall quality of life.
https://doi.org/10.5392/JKCA.2021.21.02.604 인용 PDF KSCI HTML

Images of Nurses Appeared in Media Reports Before and After Outbreak of COVID-19: Text Network Analysis and Topic Modeling (COVID-19 발생 전·후 언론보도에 나타난 간호사 이미지에 대한 텍스트 네트워크 분석 및 토픽 모델링)

Park, Min Young;Jeong, Seok Hee;Kim, Hee Sun;Lee, Eun Jee
- Journal of Korean Academy of Nursing
- /
- v.52 no.3
- /
- pp.291-307
- /
- 2022
Purpose: The aims of study were to identify the main keywords, the network structure, and the main topics of press articles related to nurses that have appeared in media reports. Methods: Data were media articles related to the topic "nurse" reported in 16 central media within a one-year period spanning July 1, 2019 to June 30, 2020. Data were collected from the Big Kinds database. A total of 7,800 articles were searched, and 1,038 were used for the final analysis. Text network analysis and topic modeling were performed using NetMiner 4.4. Results: The number of media reports related to nurses increased by 3.86 times after the novel coronavirus (COVID-19) outbreak compared to prior. Pre- and post-COVID-19 network characteristics were density 0.002, 0.001; average degree 4.63, 4.92; and average distance 4.25, 4.01, respectively. Four topics were derived before and after the COVID-19 outbreak, respectively. Pre-COVID-19 example topics are "a nurse who committed suicide because she could not withstand the Taewoom at work" and "a nurse as a perpetrator of a newborn abuse case," while post-COVID-19 examples are "a nurse as a victim of COVID-19," "a nurse working with the support of the people," and "a nurse as a top contributor and a warrior to protect from COVID-19." Conclusion: Topic modeling shows that topics become more positive after the COVID-19 outbreak. Individual nurses and nursing organizations should continuously monitor and conduct further research on nurses' image.
https://doi.org/10.4040/jkan.22002 인용 PDF KSCI

Search Result 836, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)