Search | Korea Science

Refinement of KorLex based on WordNet (워드넷 기반 한국어 명사 어휘의미망의 정제)

Hwang, Soon-Hee;Yoon, Ae-Sun
- Proceedings of the Korean Society for Cognitive Science Conference
- /
- 2005.05a
- /
- pp.267-272
- /
- 2005
최근 들어 온톨로지(ontology), 시소러스(thesaurus) 등과 함께 주목받고 있는 Princeton 대학의 워드넷(WordNet, 이하 PWN) 은 자연어 처리(NLP)와 관련하여 대안을 제시할 수 있는 어휘의미망(lexico-semantic network)이다. 또한 PWN을 기반으로 상이한 개별어 어휘의미망 구축이 여러 차례 시도되었고, 현재도 진행 중이다. 본 연구는 간접 구축 방식에 의한 어휘의미망 구축 시 요구되는 정제(refinement) 방식들을 검토하고, 이를 한국어 명사 어휘의미망(KL)에 적용하여 정확도 검증 방법의 한 대안으로 제시하였다. 또한 보다 정교한 정제 방법의 모색과 고찰은 향후 과제로 삼고자 한다.
PDF

Implementation of Word Sense Disambiguation System based on Korean WordNet (한국어 어휘의미망에 기반을 둔 어의 중의성 해소 시스템의 구현)

Kim, Minho;Hwang, Myeong-Jin;Shin, Jong-Hun;Kwon, Hyuk-Chul
- Annual Conference on Human and Language Technology
- /
- 2008.10a
- /
- pp.96-102
- /
- 2008
자연언어처리에서 어휘의 의미를 구분하는 것은 기계번역이나 정보검색과 같은 여러 응용 분야에서 매우 중요한 역할을 한다. 국내에서도 여러 어의 중의성 해소 시스템이 소개되었으나 대부분 시스템이 의미 부착 말뭉치를 이용한 감독 학습 방식을 기반으로 두고 있다. 본 논문은 한국어 어휘의미망을 이용한 비감독 어의 중의성 해소 시스템을 소개한다. 일반적으로 감독어의 중의성 해소 시스템은 비감독 어의 중의성 해소 시스템보다 성능은 좋으나 대규모의 의미 부착 말뭉치가 있어야 한다. 그러나 본 시스템은 한국어 어휘의미망과 의미 미부착 말뭉치에서 추출한 어휘 통계정보를 이용해, 의미 부착 말뭉치에서 추출한 의미별 통계 정보를 이용하는 감독 중의성 해소 방법과 같은 효과를 낸다. 본 시스템과 타 시스템의 성능 비교를 위해 'SENSEVAL-2' 평가 대회의 한국어 평가 데이터를 이용하였다. 실험 결과는 추출된 통계 정보를 바탕으로 우도비를 이용하였을 때 정확도 72.09%, 관계어 가중치를 추가로 이용하였을 때 정확도 77.02%로 감독 중의성 해소 시스템보다 높은 성능을 보였다.
PDF

Generation of Compound Nouns Using Automatic Constructed Syntactic Patterns and Semantic Network (자동 생성한 구문패턴과 의미망을 이용한 복합명사 생성)

Im, Ji-Hui;Choe, Ho-Seop;Ock, Cheol-Young
- Proceedings of the Korea Information Processing Society Conference
- /
- 2004.05a
- /
- pp.655-658
- /
- 2004
본 논문은 구문패턴과 의미망을 이용하여 복합명사를 생성하는 방법을 제안하고, 제안된 방법으로 설계 구현된 시스템, 복합명사 생성기(Compound Nouns Generator : CNG)를 제시한다. 복합명사에 관한 연구는 형태적, 구문적, 의미적인 관점에서의 분석과 생성에 이르기까지 폭넓게 진행되고 있다. 본 논문에서는, 사전에 등재된 표제어만을 복합명사로 인정하고, 나머지는 1차적으로 명사 연결구로 파악한다. 그리고 이것을 다시 신형 복합명사와 명사 연결구로 파악함으로써, 복합명사에 대한 명확한 기준을 제시하여 자연언어처리, 정보검색 등에서 효율성을 높이고자 하였다. 본 논문에서 제안한 시스템은 복합명사 확장을 위해 구문패턴을 자동 생성함으로써 시스템의 융통성을 향상시키고, 구문패턴과 의미망을 통해 생성된 복합명사와 명사 연결구의 말뭉치상의 빈도정보를 이용함으로써 고빈도의 명사 연결구가 복합명사로서 굳어질 수 있는 가능성을 보여주기도 한다. 또한, 구문패턴과 의미망을 통해 생성된 명사 연결구를 비교하여 생성된 신형 복합명사는 구문패턴에 의해 뜻풀이가 자동으로 생성된다.
PDF

Chinese Unsupervised Word Sense Disambiguation using WordNet (어휘의미망을 이용한 중국어 비감독 어의 중의성 해소)

Lian, Guang-Zhe;Kim, Minho;Kwon, Hyuk-Chul
- Proceedings of the Korea Information Processing Society Conference
- /
- 2012.04a
- /
- pp.365-368
- /
- 2012
어의 중의성 해소는 자연어처리에서 중요한 역할을 한다. 감독 중의성 해소 방법은 비감독 중의성 해소 방법보다 높은 성능을 나타내지만, 구축비용이 큰 대규모 의미부착 말뭉치가 필요하다. 본 논문에서는 중국어 어휘의미망(HowNet)과 의미 미부착 말뭉치를 이용한 중국어 비감독 어의 중의성 해소 방법을 제안한다. 의미 미부착 말뭉치에서 통계정보를 추출하고, 중국어 어휘 의미망에서 중의성 어휘의 의미별 형제어를 추출하여 중의성 어휘의 주변 문맥에 나타나는 어휘와 카이제곱검정(${\chi}^2$-test)에 의한 독립성 검정을 통해 어휘 간 연관성을 판단하고 중의성 해소를 한다. 본 논문에서 제안한 중의성 해소방법의 성능을 SemEval-2007 평가데이터에서 측정한 결과 명사와 동사에서 각각 64.7%, 49.4%를 나타냈다. 이는 SemEval-2007 중국어 비감독 중의성 해소에서 가장 높은 성능을 나타낸 시스템보다 13.1%, 13.9% 높은 성능이다.
https://doi.org/10.3745/PKIPS.y2012m04a.365 인용 PDF

Semantic Network Analysis of 2019 Gangwon-do Wild Fire News Reporting: Focusing on Media Agenda Analysis (2019년 강원도 화재 보도에 대한 언어망 분석: 미디어의제 분석을 중심으로)

Lee, Jeng Hoon
- The Journal of the Korea Contents Association
- /
- v.19 no.11
- /
- pp.153-167
- /
- 2019
This study aims to identify the media agenda and to compare each media agenda by media and by time period, analyzing the news about 2019 Gangwon-do's wild fire reported by 37 Korean news media. Using the topic modeling algorithm and semantic network analysis, this study inspected the configuration of the network media agenda and examined the intermedia agenda setting effect by using QAP correlation analysis. Results showed that the sensational media agenda with the attributes such as victim aid and political conflict and the similarity of each media agenda for this disaster reporting.
https://doi.org/10.5392/JKCA.2019.19.11.153 인용 PDF KSCI

The Method of Using the Automatic Word Clustering System for the Evaluation of Verbal Lexical-Semantic Network (동사 어휘의미망 평가를 위한 단어클러스터링 시스템의 활용 방안)

Kim Hae-Gyung;Yoon Ae-Sun
- Journal of the Korean Society for Library and Information Science
- /
- v.40 no.3
- /
- pp.175-190
- /
- 2006
For the recent several years, there has been much interest in lexical semantic network However it seems to be very difficult to evaluate the effectiveness and correctness of it and invent the methods for applying it into various problem domains. In order to offer the fundamental ideas about how to evaluate and utilize lexical semantic networks, we developed two automatic vol·d clustering systems, which are called system A and system B respectively. 68.455.856 words were used to learn both systems. We compared the clustering results of system A to those of system B which is extended by the lexical-semantic network. The system B is extended by reconstructing the feature vectors which are used the elements of the lexical-semantic network of 3.656 '-ha' verbs. The target data is the 'multilingual Word Net-CoroNet'. When we compared the accuracy of the system A and system B, we found that system B showed the accuracy of 46.6% which is better than that of system A. 45.3%.
https://doi.org/10.4275/KSLIS.2006.40.3.175 인용 PDF

The Design and Implementation of Korean History Web Courseware Using Semantic Network (의미망을 활용한 국사과 웹 코스웨어의 설계 및 구현)

Park, Chan-Ghu;Yun, Hong-Won
- The Journal of Korean Association of Computer Education
- /
- v.3 no.1
- /
- pp.177-189
- /
- 2000
This paper describes the design and implementation of Korean History Web courseware using semantic network in order to build learning environment in the viewpoint of cognitive flexibility theory. The most important thing in design for a courseware using semantic network is to build learning environment. The first step to do this is to analyze learning contents and after that we should define the type of link between learning subjects. We should develope the knowledge map which has the link of each type connected with every learning subject.
PDF

Component-based Reuse using Semantic Network (의미망을 이용한 컴포넌트 기반 재사용)

Han Jung-Soo;Kim Gui-Jug
- Proceedings of the Korea Information Processing Society Conference
- /
- 2004.11a
- /
- pp.357-360
- /
- 2004
본 논문은 소프트웨어의 효율적인 재사용을 위해 소스 코드 기반 컴포넌트 검색 방법을 제안하였다. 제안한 방법은 2단계로 이루어지는데, 먼저 라이브러리에 저장된 클래스를 기반으로 한 컴포넌트는 파싱 과정을 거쳐 의미망을 구성하고, 다음으로 사용자가 질의한 소스 코드를 이용하여 검색이 이루어진다. 소스 코드에서 추출된 식별자가 컴포넌트의 의미망을 활성화시켜 연관된 컴포넌트를 검색한다. 본 연구에서 제안한 검색방법은 프로그래머의 관심을 라이브러리 내에 있는 컴포넌트로 유도하여 재사용성을 높일 수 있으며, 프로그래밍 패턴을 제공함으로써 프로그래머로 하여금 프로그램의 가이드 라인으로 사용할 수 있도록 도움을 줄 수 있다.
PDF

Homonym disambiguation using syntactic pattern and recursive definition network (구문패턴과 순환 뜻풀이망을 이용한 동형이의어 분별)

이왕우;최호섭;옥철영
- Proceedings of the Korean Information Science Society Conference
- /
- 2002.04b
- /
- pp.457-459
- /
- 2002
뜻풀이에서 추출한 의미 정보를 이용만 통계시인 방법의 기존 동형이의어 분별 시스템에는 불필요한 의미 정보들을 많이 가지고 있었다. 그리고 동형이의어간의 의미정보가 서로 교차하는 부분이 많아 확률적인 결정에 오류를 발생시켰다. 본 논문에서는 뜻풀이에서 구문패턴을 분석하여 보다 정제된 의미 정보를 추출하였고, 구문패턴에 속하는 어휘들의 하위어를 사전에서 자동 추출하여 부족한 의미 정보를 보완하였다. 또한, 구문패턴으로 분별할 수 없는 일부 동형이의어들은 순환 뜻풀이 망(RDN)을 이용하여 동형이의어를 분별하였다. 이러한 방법으로 동형이의어 분별을 통해 기존 연구보다 8%의 정확률 향상을 가져왔다.
PDF

Automatic Construction of Syntactic Relation in Lexical Network(U-WIN) (어휘망(U-WIN)의 구문관계 자동구축)

Im, Ji-Hui;Choe, Ho-Seop;Ock, Cheol-Young
- Journal of KIISE:Software and Applications
- /
- v.35 no.10
- /
- pp.627-635
- /
- 2008
An extended form of lexical network is explored by presenting U-WIN, which applies lexical relations that include not only semantic relations but also conceptual relations, morphological relations and syntactic relations, in a way different with existing lexical networks that have been centered around linking structures with semantic relations. So, This study introduces the new methodology for constructing a syntactic relation automatically. First of all, we extract probable nouns which related to verb based on verb's sentence type. However we should decided the extracted noun's meaning because extracted noun has many meanings. So in this study, we propose that noun's meaning is decided by the example matching rule/syntactic pattern/semantic similarity, frequency information. In addition, syntactic pattern is expanded using nouns which have high frequency in corpora.
PDF KSCI

Search Result 896, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)