• Title/Summary/Keyword: 어휘사용빈도

Search Result 104, Processing Time 0.023 seconds

Appearance Frequency of 'Eco-Friendly' Emotion and Sensibility Words and their Changes (친환경 감성 어휘의 종류별 사용빈도 및 변화 양상)

  • Na, Young-Joo
    • Science of Emotion and Sensibility
    • /
    • v.14 no.2
    • /
    • pp.207-220
    • /
    • 2011
  • The purpose of this study is to investigate sensibility words related with eco-friendly in the two media fashion magazines and internet newspapers and to analysis their appearance frequency and changes by the year through 1999~2010. Most frequently used words are 'nature, eco, cotton, natural fiber, health, fresh, clear, preservation, harmony, com fiber, and Lohas'. The words are divided in 4 groups: 'Nature/Environment, Material/Fiber, Human, and Adjectives/Micell'. A point of appearing time is analyzed: 'ecology, memory-shape material, organic, spa' were used before 2000, 'nature environment, eco-friendly, stretch material, wellbeing, substitute, recycling' were in 2000-2001, 'smart material, eco material, green' in 2002-2003, 'coolbiz, Lohas, natural dye' in 2004-2005, 'herb medicine, sustainable, warmbiz' in 2006-2007, 'greensumer, greenlife, solar energy, forest bath' in 2008-2009. Looking into their changes, in early 2000, the words of eco-friendly emotion and sensibility had appeared frequently relatively, but later on they decreased, and again recently increased showing highest appearing frequency. 'Nature/Environment' words have appeared recently very much, while 'Human' sensibility words have not changed much or decreased a little. 'Adjective/Micell' words has increased little bit recently. 'Material/Fiber' words showed decrease at fashion magazine, while they increased at the pages of internet news.

  • PDF

Analysis of Keywords in national river occupancy permits by region using text mining and network theory (텍스트 마이닝과 네트워크 이론을 활용한 권역별 국가하천 점용허가 키워드 분석)

  • Seong Yun Jeong
    • Smart Media Journal
    • /
    • v.12 no.11
    • /
    • pp.185-197
    • /
    • 2023
  • This study was conducted using text mining and network theory to extract useful information for application for occupancy and performance of permit tasks contained in the permit contents from the permit register, which is used only for the simple purpose of recording occupancy permit information. Based on text mining, we analyzed and compared the frequency of vocabulary occurrence and topic modeling in five regions, including Seoul, Gyeonggi, Gyeongsang, Jeolla, Chungcheong, and Gangwon, as well as normalization processes such as stopword removal and morpheme analysis. By applying four types of centrality algorithms, including stage, proximity, mediation, and eigenvector, which are widely used in network theory, we looked at keywords that are in a central position or act as an intermediary in the network. Through a comprehensive analysis of vocabulary appearance frequency, topic modeling, and network centrality, it was found that the 'installation' keyword was the most influential in all regions. This is believed to be the result of the Ministry of Environment's permit management office issuing many permits for constructing facilities or installing structures. In addition, it was found that keywords related to road facilities, flood control facilities, underground facilities, power/communication facilities, sports/park facilities, etc. were at a central position or played a role as an intermediary in topic modeling and networks. Most of the keywords appeared to have a Zipf's law statistical distribution with low frequency of occurrence and low distribution ratio.

Characteristics of Environmental Color Image Vocabulary for Public Healthcare Facility (공공보건시설 환경색채이미지 어휘 특성)

  • Park, Heykyung;Oh, Jiyoung
    • Korea Science and Art Forum
    • /
    • v.31
    • /
    • pp.171-180
    • /
    • 2017
  • The purpose of this study is to analyze the characteristics of color image for establishing the color environment contributing to the promotion of public health in the public health facilities and to utilize it as data of public health color plan and index development. For this purpose, the results of the previous precedent studies were integrated and public health facilities were classified into medical facilities (general hospitals), health facilities (public health centers), and sub - healing facilities (elderly care facilities). We visited 18 public health facilities in total, measured the environmental color of with a spectroscopic, compared the results and the precedent studies results, and identified color image characteristics and future supplement points. The results are as follows. First, the previous studies related to the environment color image vocabulary of the public health facilities, it prefer comfortable, bright and positive image. Second, as a result of direct measurement the environmental color of the public health facilities, it is found that most of them use the high brightness and low saturation color of Y series. Third, as a result of analyzing vocabulary of environmental color image of public health facilities, 'natural' image showed the highest frequency, and other images such as 'gentle' and 'decent' appeared. It was difficult to understand the characteristics of the color image vocabularies of public health facilities. This study is a convergence study of color science and environmental design, and it extends the scope of multidisciplinary research related to design and it will be helpful in environmental planning on user's emotion.

The Processing and Representations of Ambiguos Morpheme in Korean Words : Centered in Aphasics. (한국어 중의적 형태소 표상양식과 처리 특성 : 실어증 환자를 중심으로)

  • 정재범;편성범;김태훈;남기춘
    • Proceedings of the Korean Society for Cognitive Science Conference
    • /
    • 2002.05a
    • /
    • pp.151-156
    • /
    • 2002
  • 중의적인 단어를 처리하는 방법에 대한 선행연구로, 첫째 문맥에 맞는 의미가 먼저 활성화된다는 가설과 둘째, 여러 뜻 중에 상대적인 빈도에 따라 많이 쓰이는 의미가 먼저 활성화되고, 그것이 문맥과 일치하지 않는다면, 다른 관련된 의미를 찾는다는 가설이 제기되었다. 마지막으로 문맥에 상관없이 모든 의미가 활성화 된 후 문맥을 고려하여 문맥에 적절한 의미를 선택한다는 가설이 있다. 본 연구에서는 '먹을', '감을' 등과 같이 2가지 의미의 품사가 다른 중의 어절과 '쥐어', '감어' 등과 같이 어절 문맥('어')이 주어진 어절의 의미 활성화가 어떻게 다른지를 조사하였다. 본 연구의 목적을 위해 점화어휘 판단 과제를 사용하였다. 실험 1의 결과는 SOA 150ms 조건에서 점화자극어절과 관련된 의미가 품사와 관련 없이 모두 활성화되었다. SOA 1000ms 조건에서는 상대적으로 많이 쓰이는 체언의 의미는 계속 활성화 되어 있는 반면, 용언의 의미 점화량은 감소하였다. 명칭성 실어증 환자인 SDK의 경우 SOA 150ms 조건에서는 일반인과 같은 형태소 처리특성을 보였으나 1000ms 조건에서는 달랐다. 다른 명칭성 실어증 환자인 BIS과 전반성 실어증 환자인 PSB는 SOA 150ms 조건과 1000ms 조건에서 일반인과 아주 다른 양상을 보였다. 이것은 실어증 환자의 타잎에 따라 형태소의 처리나 중의적인 의미 활성화가 일반인과는 다르다는 것을 보여준다. 실험 2에서는 어절 문맥이 있는 '먹어', '쥐어', '감어' 등과 같은 어절을 사용하였다. 실험 2의 결과는 SOA 150ms 조건일 때 어절문맥의 영향으로 용언의 의미만 촉진적 점화효과가 있었고, 체언의 의미는 활성화되지 않았다. 그러나 SOA 1000ms로 지연시켰을 때는 용언뿐만 아니라 체언의 의미도 촉진적 점화효과가 있었다. 실험 1과 2의 결과는 중의적인 한국어 어절의 경우에도 모든 의미가 활성화되나 어절 문맥이 존재할 때는 어절 문맥의 제약으로 어절 문맥에 맞는 한 가지 의미만 활성화된다는 것을 암시한다. 또한 이러한 결과는 한국어 어절이 분석된 형태가 아닌 어절 형태로 심성 어휘집에 저장되어 있다는 것을 암시한다. 실어증 환자의 경우 실험 1과 마찬가지로 환자의 수준이나 종류에 따라 다양한 반응을 보여주었다.

  • PDF

Development of Weather Forecast Sign Language Broadcasting System for the Hearing-Impaired (청각장애인을 위한 일기예보 수화방송 시스템 개발)

  • Oh, Juhyun;Jeon, Seong-Gyu;Eun, Junho;Kim, Minho;Kwon, Hyuk-Chul;Kim, Iktae;Kim, Jaihyun
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2013.06a
    • /
    • pp.401-404
    • /
    • 2013
  • 청각장애인을 위한 지상파방송 서비스 중 자막방송은 100%에 가까운 편성 비율을 달성하고 있지만, 화면을 가리는 수화방송은 5% 수준의 편성에 그치고 있다. 본 연구에서는 자막방송을 수화로 번역하여 그래픽 수화방송을 생성함으로써 수화방송의 비율을 높이고자 한다. 수화 단어들의 빈도를 파악하고 중요 단어부터 모션 캡처하기 위해 과거 3년간 일기예보 스크립트를 분석하였다. 자막방송 문장을 형태소별로 분석한 다음 중요 품사 위주로 단어 단위로 번역하고, 기 구축된 한국어 어휘의미망을 이용하여 수화사전에 없는 유의어와 하위어를 대표어로 대체하였다. 기계번역 기술이 수화통역사의 수준을 따라잡을 수는 없지만 향후 수화방송도 선택적 서비스가 가능해지고 수화통역사의 수화방송이 모든 프로그램에 편성될 때까지 본 시스템이 보조적 시청 수단으로 사용 가능할 것이다.

  • PDF

The Selection of the Most Painful Word in the Visual Analogue Scale(VAS) for Pain and the Psychosocial Factors in Association with Pain Assessment in Korean Adult Cancer Patients - for the Development of Korean Cancer Pain Assessment Tool(K-CPAT) by Delphi Method - ("표준형 성인 암성 통증 평가도구" 개발을 위한 시각통증등급의 최고통증강도 어휘 및 심리.사회적 평가 항목의 선정 - 델파이 방법을 이용 -)

  • Kim, Jin-Seo;Chun, Byung-Chul;Choi, Youn-Seon;Song, Chan-Hee;Yeom, Chang-Hwan;Lee, Myung-Aha;Lee, June-Young;Yoon, So-Young;Jang, Se-Kwon;Lee, Young-Hee;Lee, Kyoung-Uk;Lee, Chul;Park, Jean-No
    • Journal of Hospice and Palliative Care
    • /
    • v.6 no.1
    • /
    • pp.11-21
    • /
    • 2003
  • This paper addresses the minor differences in the description of pain in Korean language in order to develop a standarized cancer pain aneument tool for Korean adults, Korean Cancer Pain Assessement Tool. The subtle differences in the meaning of expressions used cannot be translated into English and therefore we omiltted the English abstract.

  • PDF

Automatic Text Summarization based on Selective Copy mechanism against for Addressing OOV (미등록 어휘에 대한 선택적 복사를 적용한 문서 자동요약)

  • Lee, Tae-Seok;Seon, Choong-Nyoung;Jung, Youngim;Kang, Seung-Shik
    • Smart Media Journal
    • /
    • v.8 no.2
    • /
    • pp.58-65
    • /
    • 2019
  • Automatic text summarization is a process of shortening a text document by either extraction or abstraction. The abstraction approach inspired by deep learning methods scaling to a large amount of document is applied in recent work. Abstractive text summarization involves utilizing pre-generated word embedding information. Low-frequent but salient words such as terminologies are seldom included to dictionaries, that are so called, out-of-vocabulary(OOV) problems. OOV deteriorates the performance of Encoder-Decoder model in neural network. In order to address OOV words in abstractive text summarization, we propose a copy mechanism to facilitate copying new words in the target document and generating summary sentences. Different from the previous studies, the proposed approach combines accurate pointing information and selective copy mechanism based on bidirectional RNN and bidirectional LSTM. In addition, neural network gate model to estimate the generation probability and the loss function to optimize the entire abstraction model has been applied. The dataset has been constructed from the collection of abstractions and titles of journal articles. Experimental results demonstrate that both ROUGE-1 (based on word recall) and ROUGE-L (employed longest common subsequence) of the proposed Encoding-Decoding model have been improved to 47.01 and 29.55, respectively.

Analyzing the Language Usage Characteristics of Korean Dark Web Users (국내 다크웹 사용자들의 언어 사용 특성 분석)

  • Youjin Lee;Dayeon Yim;Yongjae Lee
    • Annual Conference on Human and Language Technology
    • /
    • 2022.10a
    • /
    • pp.397-402
    • /
    • 2022
  • 익명 네트워크 기술에 기반한 다크웹은 일반 표면웹보다 더 강화된 익명성을 제공한다. 최근 이 익명성을 악용하여 다수의 다크웹 사용자들이 다크웹 내에서 범죄 행위를 모의하는 행위가 꾸준히 발생하고 있다. 특히, 국내 다크웹 사용자들은 마약 유포를 위한 방법을 공유하거나 성착취물 유포 행위 등에 직간접적으로 가담하고 있다. 이와 같은 범죄 행위들은 수사 기관의 눈을 피해 현재까지도 계속해서 발생하고 있어 국내 다크웹 범죄 동향 파악의 필요성이 증대되고 있다. 그러나 다크웹 특성상 범죄 행위를 논의하는 게시글을 수집하기가 어렵고, 다크웹 내에서의 언어 사용 특성에 대한 이해 부족으로 그동안 다크웹 사용자들이 어떤 내용의 범죄를 모의하는지 파악하기가 어려웠다. 본 논문에서는 국내 사용자들이 활동하는 다크웹 포럼들을 중심으로 사용자들의 언어 사용 특성을 연구하고, 이를 통해 다크웹에서 다뤄지는 범죄 유형들을 분석한다. 이를 위해, 자연어처리 기반의 분석 방법론을 적용하여 다크웹에서 공유되는 게시글을 수집하고 다크웹 사용자들의 은어와 특정 범죄군에서 선호되는 언어 특성을 파악한다. 특히 현재 다크웹 내에서 사용자들 사이에 관측되는 어휘들에 대한 기술통계 분석과 유의어 관계 분석을 수행하였고, 실제 다크웹 내에서 사용자들이 어떠한 범죄에 관심이 많은지를 분석하였으며, 더 나아가 수사의 효율성을 증대시키기 위한 소셜미디어, URL 인용 빈도에 대한 연구를 진행하였다.

  • PDF

Homonym Disambiguation based on Mutual Information and Sense-Tagged Compound Noun Dictionary (상호정보량과 복합명사 의미사전에 기반한 동음이의어 중의성 해소)

  • Heo, Jeong;Seo, Hee-Cheol;Jang, Myung-Gil
    • Journal of KIISE:Software and Applications
    • /
    • v.33 no.12
    • /
    • pp.1073-1089
    • /
    • 2006
  • The goal of Natural Language Processing(NLP) is to make a computer understand a natural language and to deliver the meanings of natural language to humans. Word sense Disambiguation(WSD is a very important technology to achieve the goal of NLP. In this paper, we describe a technology for automatic homonyms disambiguation using both Mutual Information(MI) and a Sense-Tagged Compound Noun Dictionary. Previous research work using word definitions in dictionary suffered from the problem of data sparseness because of the use of exact word matching. Our work overcomes this problem by using MI which is an association measure between words. To reflect language features, the rate of word-pairs with MI values, sense frequency and site of word definitions are used as weights in our system. We constructed a Sense-Tagged Compound Noun Dictionary for high frequency compound nouns and used it to resolve homonym sense disambiguation. Experimental data for testing and evaluating our system is constructed from QA(Question Answering) test data which consisted of about 200 query sentences and answer paragraphs. We performed 4 types of experiments. In case of being used only MI, the result of experiment showed a precision of 65.06%. When we used the weighted values, we achieved a precision of 85.35% and when we used the Sense-Tagged Compound Noun Dictionary, we achieved a precision of 88.82%, respectively.

A Study on the Development of English Inflectional Morphemes Based on the CHILDES Corpus (CHILDES 코퍼스를 기반으로 한 아동의 영어 굴절형태소 발달 연구)

  • Min, Myung Sook;Jun, Jongsup;Lee, Sun-Young
    • Korean Journal of Cognitive Science
    • /
    • v.24 no.3
    • /
    • pp.203-235
    • /
    • 2013
  • The goal of this paper is to test the findings about English-speaking children's acquisition of inflectional morphemes in the literature using a large-scale database. For this, we obtained a 4.7-million-word corpus from the CHILDES (Child Language Data Exchange System) database, and analyzed 1,630 British and American children's uses of English derivational morphemes up to age 7. We analyzed the type and token frequencies, type per token ratio (TTR), and the lexical diversity (D) for such inflectional morphemes as the present progressive -ing, the past tense -(e)d, the comparative and superlative -er/est with reference to children's nationality and age groups. To sum up our findings, the correlations between the D value and children's age varied from morpheme to morpheme; e.g. we found no correlation for -ing, a marginal correlation for -ed, and a strong correlation for -er/-est. Our findings are consistent with Brown's (1973) classical observation that children learn progressive forms earlier than the past tense marker. In addition, overgeneralization errors were frequently found for -ed, but rarely for -ing, showing a U-shaped developmental pattern at ages 2-3. Finally, American children showed higher D scores than British children, which showed that American children used inflectional morphemes for more word types compared with British children. The present study has its significance in testing the earlier findings in the literature by setting up well-defined methodology for analyzing the entire CHILDES database.

  • PDF