Search | Korea Science

A Sentiment Classification System Using Feature Extraction from Seed Words and Support Vector Machine (종자 어휘를 이용한 자질 추출과 지지 벡터 기계(SVM)을 이용한 문서 감정 분류 시스템의 개발)

Hwang, Jae-Won;Jeon, Tae-Gyun;Ko, Young-Joong
- 한국HCI학회:학술대회논문집
- /
- 2007.02a
- /
- pp.938-942
- /
- 2007
신문 기사 및 상품 평은 특정 주제나 상품을 대상으로 하여 글쓴이의 감정과 의견이 잘 나타나 있는 대표적인 문서이다. 최근 여론 조사 및 상품 의견 조사 등 다양한 측면에서 대용량의 문서의 의미적 분류 및 분석이 요구되고 있다. 본 논문에서는 문서에 나타난 내용을 기준으로 문서가 나타내고 있는 감정을 긍정과 부정의 두 가지 범주로 분류하는 시스템을 구현한다. 문서 분류의 시작은 감정을 지닌 대표적인 종자 어휘(seed word)로부터 시작하며, 자질의 선정은 한국어 특징상 감정 및 감각을 표현하는 명사, 형용사, 부사, 동사를 대상으로 한다. 가중치 부여 방법은 한글 유의어 사전을 통해 종자 어휘의 의미를 확장하여 각각의 가중치를 책정한다. 단어 벡터로 표현된 입력 문서를 이진 분류기인 지지벡터 기계를 이용하여 문서에 나타난 감정을 판단하는 시스템을 구현하고 그 성능을 평가한다.
PDF

A study about the aspect of translation on 'Kyo(驚)' in novel 『Kokoro』 -Focusing on novels translated in Korean and English (소설 『こころ』에 나타난 감정표현 '경(驚)'에 관한 번역 양상 - 한국어 번역 작품과 영어 번역 작품을 중심으로 -)

Yang, JungSoon
- Cross-Cultural Studies
- /
- v.51
- /
- pp.329-356
- /
- 2018
Types of emotional expressions are comprised of vocabulary that describes emotion and composition of sentences to express emotion such as an exclamatory sentence and a rhetorical question, expressions of interjection, adverbs of attitude for an idea, and a style of writing. This study is focused on vocabulary that describes emotion and analyzes the aspect of translation when emotional expression of 'Kyo(驚)' is shown in "Kokoro". As a result, the aspect of translation for expression of 'Kyo(驚)' showed that it was translated to vocabulary as suggested in the dictionary in some cases. However, it was not always translated as suggested in the dictionary. Vocabulary that describes the emotion of 'Kyo(驚)' in Japanese sentences is mostly translated to corresponding parts of speech in Korean. Some adverbs needed to add 'verbs' when they were translated. Different vocabulary was added or used to maximize emotion. However, the corresponding part of speech in English was different from Korean. Examples of Japanese sentences expressing 'Kyo(驚)' by verbs were translated to expression of participles for passive verbs such as 'surprise' 'astonish' 'amaze' 'shock' 'frighten' 'stun' in many cases. Idioms were also translated with focus on the function of sentences rather than the form of sentences. Those expressed in adverbs did not accompany verbs of 'Kyo(驚)'. They were translated to expression of participles for passive verbs and adjectives such as 'surprise' 'astonish' 'amaze' 'shock' 'frighten' 'stun' in many cases. Main agents of emotion were showat the first person and the third person in simple sentences. Translation of emotional expressions when a main agent was the first person showed that the fundamental word order of Japanese was translated as in Korean. However, adverbs of time and adverbs of degree were ended to be added. The first person as the main agent of emotion was positioned at the place of subject when it was translated in English. However, things or causes of events were positioned at the place of subject in some cases to show the degree of 'Kyo(驚)' which the main agent experienced. The expression of conjecture and supposition or a certain visual and auditory basis was added to translate the expression of emotion when the main agent of emotion was the third person. Simple sentences without the main agent of emotion showed that their subjects could be omitted even if they were essential components because they could be known through context in Korean. These omitted subjects were found and translated in English. Those subjects were not necessarily human who was the main agent of emotion. They could be things or causes of events that specified the expression of emotion.

The Blog Polarity Classification Technique using Opinion Mining (오피니언 마이닝을 활용한 블로그의 극성 분류 기법)

Lee, Jong-Hyuk;Lee, Won-Sang;Park, Jea-Won;Choi, Jae-Hyun
- Journal of Digital Contents Society
- /
- v.15 no.4
- /
- pp.559-568
- /
- 2014
Previous polarity classification using sentiment analysis utilizes a sentence rule by product reviews based rating points. It is difficult to be applied to blogs which have not rating of product reviews and is possible to fabricate product reviews by comment part-timers and managers who use web site so it is not easy to understand a product and store reviews which are reliability. Considering to these problems, if we analyze blogs which have personal and frank opinions and classify polarity, it is possible to understand rightly opinions for the product, store. This paper suggests that we extract high frequency vocabularies in blogs by several domains and choose topic words. Then we apply a technique of sentiment analysis and classify polarity about contents of blogs. To evaluate performances of sentiment analysis, we utilize the measurement index that use Precision, Recall, F-Score in an information retrieval field. In a result of evaluation, using suggested sentiment analysis is the better performances to classify polarity than previous techniques of using the sentence rule based product reviews.
https://doi.org/10.9728/dcs.2014.15.4.559 인용 PDF KSCI

Analyzing Contextual Polarity of Unstructured Data for Measuring Subjective Well-Being (주관적 웰빙 상태 측정을 위한 비정형 데이터의 상황기반 긍부정성 분석 방법)

Choi, Sukjae;Song, Yeongeun;Kwon, Ohbyung
- Journal of Intelligence and Information Systems
- /
- v.22 no.1
- /
- pp.83-105
- /
- 2016
Measuring an individual's subjective wellbeing in an accurate, unobtrusive, and cost-effective manner is a core success factor of the wellbeing support system, which is a type of medical IT service. However, measurements with a self-report questionnaire and wearable sensors are cost-intensive and obtrusive when the wellbeing support system should be running in real-time, despite being very accurate. Recently, reasoning the state of subjective wellbeing with conventional sentiment analysis and unstructured data has been proposed as an alternative to resolve the drawbacks of the self-report questionnaire and wearable sensors. However, this approach does not consider contextual polarity, which results in lower measurement accuracy. Moreover, there is no sentimental word net or ontology for the subjective wellbeing area. Hence, this paper proposes a method to extract keywords and their contextual polarity representing the subjective wellbeing state from the unstructured text in online websites in order to improve the reasoning accuracy of the sentiment analysis. The proposed method is as follows. First, a set of general sentimental words is proposed. SentiWordNet was adopted; this is the most widely used dictionary and contains about 100,000 words such as nouns, verbs, adjectives, and adverbs with polarities from -1.0 (extremely negative) to 1.0 (extremely positive). Second, corpora on subjective wellbeing (SWB corpora) were obtained by crawling online text. A survey was conducted to prepare a learning dataset that includes an individual's opinion and the level of self-report wellness, such as stress and depression. The participants were asked to respond with their feelings about online news on two topics. Next, three data sources were extracted from the SWB corpora: demographic information, psychographic information, and the structural characteristics of the text (e.g., the number of words used in the text, simple statistics on the special characters used). These were considered to adjust the level of a specific SWB. Finally, a set of reasoning rules was generated for each wellbeing factor to estimate the SWB of an individual based on the text written by the individual. The experimental results suggested that using contextual polarity for each SWB factor (e.g., stress, depression) significantly improved the estimation accuracy compared to conventional sentiment analysis methods incorporating SentiWordNet. Even though literature is available on Korean sentiment analysis, such studies only used only a limited set of sentimental words. Due to the small number of words, many sentences are overlooked and ignored when estimating the level of sentiment. However, the proposed method can identify multiple sentiment-neutral words as sentiment words in the context of a specific SWB factor. The results also suggest that a specific type of senti-word dictionary containing contextual polarity needs to be constructed along with a dictionary based on common sense such as SenticNet. These efforts will enrich and enlarge the application area of sentic computing. The study is helpful to practitioners and managers of wellness services in that a couple of characteristics of unstructured text have been identified for improving SWB. Consistent with the literature, the results showed that the gender and age affect the SWB state when the individual is exposed to an identical queue from the online text. In addition, the length of the textual response and usage pattern of special characters were found to indicate the individual's SWB. These imply that better SWB measurement should involve collecting the textual structure and the individual's demographic conditions. In the future, the proposed method should be improved by automated identification of the contextual polarity in order to enlarge the vocabulary in a cost-effective manner.
https://doi.org/10.13088/jiis.2016.22.1.083 인용 PDF KSCI

Emotion Classification in Dialogues Using Embedding Features (임베딩 자질을 이용한 대화의 감정 분류)

Shin, Dong-Won;Lee, Yeon-Soo;Jang, Jung-Sun;Lim, Hae-Chang
- Annual Conference on Human and Language Technology
- /
- 2015.10a
- /
- pp.109-114
- /
- 2015
대화 시스템에서 사용자 발화에 대한 감정 분석은 적절한 시스템 응답과 서비스를 제공하는데 있어 매우 중요한 정보이다. 본 연구에서는 단순한 긍, 부정이 아닌 분노, 슬픔, 공포, 기쁨 등 Plutchick의 8 분류 체계에 해당하는 상세한 감정을 분석 하는 데 있어, 임베딩 모델을 사용하여 기존의 어휘 자질을 효과적으로 사용할 수 있는 새로운 방법을 제안한다. 또한 대화 속에서 발생한 감정의 지속성을 반영하기 위하여 문장 임베딩 벡터와 문맥 임베딩 벡터를 자질로서 이용하는 방법에 대해 제안한다. 실험 결과 제안하는 임베딩 자질은 특히 내용어에 대해 기존의 어휘 자질을 대체할 수 있으며, 데이터 부족 문제를 다소 해소하여 성능 향상에 도움이 되는 것으로 나타났다.
PDF

A Study on Intelligent Chatbot Based on Emotional in Mobile Environment (모바일 환경에서의 감성 기반 지능형 챗봇 연구)

Yoon, Kyoung-Il;Yang, Jin-Sol;Jo, Yeong-Hoon;Chung, Kwang Sik
- Proceedings of the Korea Information Processing Society Conference
- /
- 2019.10a
- /
- pp.918-921
- /
- 2019
최근 4차 산업혁명으로 인해 인공지능 기술의 발달로 인해 사람의 감정을 모델링하는 연구와 챗봇에 관한 연구가 활발히 진행되고 있다. 챗봇과의 대화 중에서 감정 모델은 중요한 요소이지만, 서로 별개로 연구되었으며, 그 수 또한 부족하다. 보다 개인적이고 전문적인 서비스를 제공하기 위해선 챗봇 사용자의 감정을 분류할 수 있어야 한다. 이에 본 연구에서는 모바일 플랫폼에서의 특징을 고려한 신조어를 포함한 사용자가 입력한 문장의 감정을 파악하고 감정에 따른 응답을 구현하는 챗봇을 구현하는 방안에 대하여 제안한다.
https://doi.org/10.3745/PKIPS.y2019m10a.918 인용 PDF

A Comparative Study on Joy in Russian and Korean (기쁨의 의미연구 - 러시아어와 한국어의 비교를 중심으로 -)

Kim, Jung-Il
- Cross-Cultural Studies
- /
- v.41
- /
- pp.113-140
- /
- 2015
This paper explains how the basic and instinctive emotion "joy" is verbally expressed in Russian and Korean. In particular, the main concern of this pater is on the cultural context with which the emotion "joy" is related and the ways in which the emotion "joy" has a wide range of uses. The semantic and pragmatic characteristics of the uses of the expression "joy" can be explained through the cultural and historical backgrounds in both languages. In Russian, joy has two variants, radost' and udovol'stvie. It is very difficult to distinguish a significant difference between them; however, the former is mainly connected with more mental, spiritual, cultural, and religious contexts, whereas the latter is mainly related with more concrete, instantaneous contexts and daily life. The former produces an impression that has a more wide, spiritual, and macroscopic attitude toward a situation, whereas the latter produces an impression that has a microscopic and instantaneous attitude toward a situation. Compared with the Russian expressions, the Korean equivalents, 기쁨 and 즐거움, have a very similar opposition like that of the Russian. The former is based on a more logical and causal relation between an anticipation or desire and the current situations, whereas the latter is based on the participation of speakers in a situation and has a very instantaneous characteristic, like a udovol'stvie in Russian. Thus, it can be reasonable argued that the Russian udovol'stvie and the Korean 즐거움 share many similar semantic properties. In brief summary, in both languages there exists two distinctive variants that show a privative opposition to express the emotional concept of joy.

A Sentiment Analysis of Internet Movie Reviews Using String Kernels (문자열 커널을 이용한 인터넷 영화평의 감정 분석)

Kim, Sang-Do;Yoon, Hee-Geun;Park, Seong-Bae;Park, Se-Young;Lee, Sang-Jo
- Annual Conference on Human and Language Technology
- /
- 2009.10a
- /
- pp.56-60
- /
- 2009
오늘날 인터넷은 개인의 감정, 의견을 서로 공유할 수 있는 공간이 되고 있다. 하지만 인터넷에는 너무나 방대한 문서가 존재하기 때문에 다른 사용자들의 감정, 의견 정보를 개인의 의사 결정에 활용하기가 쉽지 않다. 최근 들어 감정이나 의견을 자동으로 추출하기 위한 연구가 활발하게 진행되고 있으며, 감정 분석에 관한 기존 연구들은 대부분 어구의 극성(polarity) 정보가 있는 감정 사전을 사용하고 있다. 하지만 인터넷에는 나날이 신조어가 새로 생기고 언어 파괴 현상이 자주 일어나기 때문에 사전에 기반한 방법은 한계가 있다. 본 논문은 감정 분석 문제를 긍정과 부정으로 구분하는 이진 분류 문제로 본다. 이진 분류 문제에서 탁월한 성능을 보이는 Support Vector Machines(SVM)을 사용하며, 문서들 간의 유사도 계산을 위해 문장의 부분 문자열을 비교하는 문자열 커널을 사용한다. 실험 결과, 실제 영화평에서 제안된 모델이 비교 대상으로 삼은 Bag of Words(BOW) 모델보다 안정적인 성능을 보였다.
PDF

Generating a Korean Sentiment Lexicon Through Sentiment Score Propagation (감정점수의 전파를 통한 한국어 감정사전 생성)

Park, Ho-Min;Kim, Chang-Hyun;Kim, Jae-Hoon
- KIPS Transactions on Software and Data Engineering
- /
- v.9 no.2
- /
- pp.53-60
- /
- 2020
Sentiment analysis is the automated process of understanding attitudes and opinions about a given topic from written or spoken text. One of the sentiment analysis approaches is a dictionary-based approach, in which a sentiment dictionary plays an much important role. In this paper, we propose a method to automatically generate Korean sentiment lexicon from the well-known English sentiment lexicon called VADER (Valence Aware Dictionary and sEntiment Reasoner). The proposed method consists of three steps. The first step is to build a Korean-English bilingual lexicon using a Korean-English parallel corpus. The bilingual lexicon is a set of pairs between VADER sentiment words and Korean morphemes as candidates of Korean sentiment words. The second step is to construct a bilingual words graph using the bilingual lexicon. The third step is to run the label propagation algorithm throughout the bilingual graph. Finally a new Korean sentiment lexicon is generated by repeatedly applying the propagation algorithm until the values of all vertices converge. Empirically, the dictionary-based sentiment classifier using the Korean sentiment lexicon outperforms machine learning-based approaches on the KMU sentiment corpus and the Naver sentiment corpus. In the future, we will apply the proposed approach to generate multilingual sentiment lexica.
https://doi.org/10.3745/KTSDE.2020.9.2.53 인용 PDF KSCI

Synonym Emotional Adjectives in Coordination: Analyzing [Emotional Adjective + '-ko(and)'] + Emotional Adjective] Structures in Korean (감정형용사 유의어 결합 연구 -[[감정형용사 + '-고'] + 감정형용사] 구성-)

Park, JINA;Jeong, Yong-Ho
- The Journal of the Convergence on Culture Technology
- /
- v.10 no.3
- /
- pp.565-577
- /
- 2024
This discussion looked at how emotional adjectives are connected in the format [[emotional adjective + '-ko(and)'] + emotional adjective]. As a result, it was confirmed that there are quite a few cases in which two or more emotional adjectives are used to express emotions in Korean. This can help Korean learners understand and express the individual lexical meanings of emotional adjectives more clearly by identifying emotional adjectives that are used together with the corresponding configuration. It was believed that it could help Korean language learners express complex emotions or create rich emotional expressions when expressing their emotions in Korean. It is hoped that the examples and frequency of [[emotional adjective+'-ko(and)'+emotional adjective] shown in this discussion will be of some help in teaching and learning Korean emotional vocabulary.
https://doi.org/10.17703/JCCT.2024.10.3.565 인용 PDF

Search Result 115, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)