• Title/Summary/Keyword: 텍스트 연구

Search Result 3,492, Processing Time 0.031 seconds

Sentimental Analysis Research Trends (감성분석 연구 동향)

  • Lee, Jung-Hoon
    • Annual Conference of KIPS
    • /
    • 2018.05a
    • /
    • pp.358-361
    • /
    • 2018
  • 비정형 데이터 증가로 텍스트 마이닝을 사용해 데이터를 분석하는 연구가 주목받고 있다. 감성분석은 단어와 문맥을 분석하여 텍스트의 감정을 파악하는 기술이다. 본 논문에서는 감성분석 연구 동향, 적용분야, 방법론에 관해 분석하고 기술하려 한다. 감성분석은 2001년 채팅의 감정을 분석하면서 시작되었고, 2008년부터 본격적으로 연구가 진행되었다. 감성분석은 SNS, 상품 후기, 영화평, 뉴스 기사 등 다양한 데이터에 적용되고 있으며, 사회이슈 찬반 분석과 장소 선호도 분석 등 다양한 연구에서 사용되었다. 감성분석 방법은 감성사전을 이용하는 방식과 기계학습을 사용하는 방식으로 나누어지며 분석 방법을 발전시키기 위한 연구가 진행되고 있다.

Examining the Intellectual Structure of Housing Studies in Korea with Text Mining and Factor Analysis (저자 프로파일링과 요인분석을 이용한 국내 주거학 분야의 지적 구조 분석)

  • Lee, Jae-Yun;Kim, Hee-Jeon;Ryoo, Jong-Duk
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.44 no.2
    • /
    • pp.285-308
    • /
    • 2010
  • This study analyzes the intellectual structure in domestic research of the Housing field, by utilizing text mining technique. Unlike the existing research that mainly uses text clustering in statistical analyses to identify subject specialties, core authors, and relationships between research areas, this study applied author profiling and factor analysis. To supplement the analysis of intellectual structure generated by text mining, and to perform evaluation on intellectual structure itself, two professionals in the housing field were interviewed. The intellectual structure, generated through text mining, was evaluated and showed its division of valid research areas that is slightly different from the traditional intellectual structure in the housing field.

Studies on the linguistic properties of the IT-People documents for an efficient Information Retrieval (IT 인물 관련 텍스트 정보의 효율적인 검색을 위한 Sub-language의 속성 연구)

  • Koh, Seung-Hui;Kim, So-Yeon;Cheon, Seung-Mi;Nam, Jee-Sun;Kim, Kweon-Yang;Park, Se-Young;Berlocher, Ivan
    • Annual Conference on Human and Language Technology
    • /
    • 2007.10a
    • /
    • pp.241-249
    • /
    • 2007
  • 본 연구는 IT 인물 관련 텍스트 정보의 효율적인 검색을 위하여 문서 내에서 인물과 관련된 정보를 담고 있는 문장들이 어떠한 특징을 가지고 실현되는가를 살펴보고 언어적 속성을 어떻게 구조화하고 형식화할 것인가를 논의하는 것을 목적으로 한다. 언어적 속성 분석을 위해서 전자신문 내에서 인물 관련 코퍼스를 수집하고 이들의 분석을 통해 다음과 같이 문제가 되는 특징들을 확인하였다. 즉 외래어 음차 표기문제, 복합명사 및 명사구 그리고 서술 명사적 표현의 문제 등으로 요약된다. IT라는 특정 영역에 대해 텍스트 내에서의 어휘-통사적 패턴을 분석하고 언어적 특징에 대한 효율적 기술을 위해서는 LGG 부분 문법 그래프 모델을 활용하도록 한다. 본 연구는 특정 영역인 IT 관련 문서에서 자연언어 텍스트를 대상으로 정보 검색할 때 문제가 되는 다양한 언어학적 현상들을 다루며, 향후보다 확장된 영역에서의 효율적 언어 처리에 대한 방법론적 대안을 제시할 수 있을 것으로 기대된다.

  • PDF

An e-Book Interface by Providing Visual Information of Hypertext Structure Will be Affect Learning Comprehension and Usability According to Learner's Learning Preferences (하이퍼텍스트의 정보구조를 제공한 e-Book 인터페이스 환경에서 학습자의 정보처리유형이 학업성취도 및 사용편의성에 미치는 효과)

  • Sung, Eun-Mo
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.2
    • /
    • pp.483-496
    • /
    • 2012
  • The purpose of this study is to examine difference of information processing style on lesson comprehension scores and usability ratings in e-Learning containing visual information structure. To address this goal, 68 university students were participated in this research. They were asked information processing style test, lesson comprehension test, and usability ratings after completed e-Learning lesson. According to the result, there was not significant difference between visual and verbal information process style on lesson comprehension as learn outcomes. However, students who are visual information processing style were significantly higher ratings than students who are verbal information processing style on 4 of 8 usability scales; awareness of lesson structure, awareness of lesson length, ease of navigation, and ease of lesson learning. These result indicate that there will be needed the design of aptitude treatment interaction for e-Book according to information processing style.

Abstruseness of Rimbaud's Barbare : Autotextuality and Meaning (랭보의 「야만」의 난해성 : '자기텍스트성'과 '의미')

  • Shin, Ok-Keun
    • Cross-Cultural Studies
    • /
    • v.43
    • /
    • pp.327-354
    • /
    • 2016
  • Rimbaud's prose poem, Barbare in Illuminations, is known for its abstruseness with regard to forms, themes, metaphors. This paper first analyzes the poem's grammatical structure to make sense of such an inscrutable piece of work, then discusses its autotextuality in order to decipher its meaning by comparison with Rimbaud's other works. Autotextuality, a method of literary interpretation of Rimbaud's prose poem presented by Steve Murphy, refers to the intertextuality between the author's works. Despite some previous researches focusing on the intertextuality of Barbare, previous authors have failed not only to find its meaning but also to determine its significance. The abstruseness of Rimbaud's Barbare is sometimes considered an example of the meaningless of Rimbaud's work. However, examining the textual structure and the autotextuality builds meaning, rather than rendering the work meaningless. Barbare which consists entirely of noun phrases and metaphors means destruction, fusion and the pure power of regeneration in the original context of Rimbaud's work. This poem is Rimbaud's answer to Baudelaire's poetic question, Any of where out of World, and presents a strange scenery that uses 'the eternal female voice' to reach the Vulcan in the North Pole. Interpretation of Barbare could provide a methodology for reading the difficult Illuminations. The kind of analyses used are, for example, analysis of the text, analysis of verbal indicators, autotextuality, and an understanding of the joy and the solitude in the silence of the poem. Understanding Barbare may provide a method of interpreting the abstruseness of Illuminations. Through this approach, we can connect and combine every fragment of the Illuminations, so that we can reconstruct the story and the adventure contained therein.

A Study on the Improvement of Retrieval Efficiency Based on the CRFMD (공통기술표현포맷에 기반한 다매체자료의 검색효율 향상에 관한 연구)

  • Park, Il-Jong;Jeong, Ki-Tai
    • Journal of the Korean Society for information Management
    • /
    • v.23 no.3 s.61
    • /
    • pp.5-21
    • /
    • 2006
  • In recent years, theories of image and sound analysis have been proposed to work with text retrieval systems and have progressed quickly with the rapid progress in data processing speeds. This study proposes a common representation format for multimedia documents (CRFMD) composed of both images and text to form a single data structure. It also shows that image classification of a given test set is dramatically improved when text features are encoded together with image features. CRFMD might be applicable to other areas of multimedia document retrieval and processing, such as medical image retrieval, World Wide Web searching, and museum collection retrieval.

Text Mining for Korean: Characteristics and Application to 2011 Korean Economic Census Data (한국어 텍스트 마이닝의 특성과 2011 한국 경제총조사 자료에의 응용)

  • Goo, Juna;Kim, Kyunga
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.7
    • /
    • pp.1207-1217
    • /
    • 2014
  • 2011 Korean Economic Census is the first economic census in Korea, which contains text data on menus served by Korean-food restaurants as well as structured data on characteristics of restaurants including area, opening year and total sales. In this paper, we applied text mining to the text data and investigated statistical and technical issues and characteristics of Korean text mining. Pork belly roast was the most popular menu across provinces and/or restaurant types in year 2010, and the number of restaurants per 10000 people was especially high in Kangwon-do and Daejeon metropolitan city. Beef tartare and fried pork cutlet are popular menus in start-up restaurants while whole chicken soup and maeuntang (spicy fish stew) are in long-lived restaurants. These results can be used as a guideline for menu development to restaurant owners, and for government policy-making process that lead small restaurants to choose proper menus for successful business.

Analysis of Interrelation between Image and Text as Fusion Relationship -Through Advertising Production Class- (융합적 관계로서의 이미지와 텍스트의 상호관계성 분석 연구 -광고 제작 수업을 통하여-)

  • Seo, Hwa-Jung;Huh, Yoon Jung
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.7
    • /
    • pp.155-162
    • /
    • 2018
  • This study explores the relationship between images and texts through advertising production using images and texts, and analyzes the student works with the semiotics of Roland Barth. Since Barth emphasized the interpreter's interpretation rather than the producer's intention in his work, he interpreted the work as a receiver. It was analyzed in terms of socio-cultural meaning of what students produced in the works. A total of 64 classes were held for the first two classes in D high school. The results of analyzing students' works after the advertisement production class are as follows. First, as a result of analyzing Barth 's myth structure model, advertisement image and text are symbols and have meaning. Second, advertising image and text complement each other and have the characteristic of interrelationship that constitutes meaning. Third, By attracting the socio-cultural implications inherent in the students' advertising, their values and interests could be discovered.

Properties of chi-square statistic and information gain for feature selection of imbalanced text data (불균형 텍스트 데이터의 변수 선택에 있어서의 카이제곱통계량과 정보이득의 특징)

  • Mun, Hye In;Son, Won
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.4
    • /
    • pp.469-484
    • /
    • 2022
  • Since a large text corpus contains hundred-thousand unique words, text data is one of the typical large-dimensional data. Therefore, various feature selection methods have been proposed for dimension reduction. Feature selection methods can improve the prediction accuracy. In addition, with reduced data size, computational efficiency also can be achieved. The chi-square statistic and the information gain are two of the most popular measures for identifying interesting terms from text data. In this paper, we investigate the theoretical properties of the chi-square statistic and the information gain. We show that the two filtering metrics share theoretical properties such as non-negativity and convexity. However, they are different from each other in the sense that the information gain is prone to select more negative features than the chi-square statistic in imbalanced text data.

The Systemic Functional Linguistics Analysis of Texts in Elementary Science Textbooks by Curriculum Revision (교육과정 변천에 따른 초등 과학 교과서 텍스트에 대한 체계기능언어학적 분석)

  • Maeng, Seung-Ho;Kim, Hye-Ree;Kim, Chan-Jong;Lee, Jeong-A
    • Journal of The Korean Association For Science Education
    • /
    • v.27 no.3
    • /
    • pp.242-252
    • /
    • 2007
  • This study analyzed the science texts covering 'air pressure' and 'wind' in common with every curriculum from the syllabus period to the $7^{th}$ curriculum in terms of Systemic Functional Linguistics. Important findings revealed in this study were as follows: In the aspect of ideational metafunction, the texts including much scientific information were reduced by curriculum revision. Most forms of information were 'definition' and 'fact' rather than 'principle'. In the aspect of interpersonal metafunction, the gap between students and texts were getting closer and the social position of students were concerned gradually by curriculum revisions. In the aspect of textual metafunction, the ratios of technical terminology and notation were reduced, however the amount of texts in science textbooks were reduced as well. While the subject was presented in the early texts, it was omitted as time went on. The consistency of subject and theme were reduced in the $7^{th}$ curriculum remarkably.