• Title/Summary/Keyword: word-net

Search Result 258, Processing Time 0.025 seconds

A Study on Phoneme Likely Units to Improve the Performance of Context-dependent Acoustic Models in Speech Recognition (음성인식에서 문맥의존 음향모델의 성능향상을 위한 유사음소단위에 관한 연구)

  • 임영춘;오세진;김광동;노덕규;송민규;정현열
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.5
    • /
    • pp.388-402
    • /
    • 2003
  • In this paper, we carried out the word, 4 continuous digits. continuous, and task-independent word recognition experiments to verify the effectiveness of the re-defined phoneme-likely units (PLUs) for the phonetic decision tree based HM-Net (Hidden Markov Network) context-dependent (CD) acoustic modeling in Korean appropriately. In case of the 48 PLUs, the phonemes /ㅂ/, /ㄷ/, /ㄱ/ are separated by initial sound, medial vowel, final consonant, and the consonants /ㄹ/, /ㅈ/, /ㅎ/ are also separated by initial sound, final consonant according to the position of syllable, word, and sentence, respectively. In this paper. therefore, we re-define the 39 PLUs by unifying the one phoneme in the separated initial sound, medial vowel, and final consonant of the 48 PLUs to construct the CD acoustic models effectively. Through the experimental results using the re-defined 39 PLUs, in word recognition experiments with the context-independent (CI) acoustic models, the 48 PLUs has an average of 7.06%, higher recognition accuracy than the 39 PLUs used. But in the speaker-independent word recognition experiments with the CD acoustic models, the 39 PLUs has an average of 0.61% better recognition accuracy than the 48 PLUs used. In the 4 continuous digits recognition experiments with the liaison phenomena. the 39 PLUs has also an average of 6.55% higher recognition accuracy. And then, in continuous speech recognition experiments, the 39 PLUs has an average of 15.08% better recognition accuracy than the 48 PLUs used too. Finally, though the 48, 39 PLUs have the lower recognition accuracy, the 39 PLUs has an average of 1.17% higher recognition characteristic than the 48 PLUs used in the task-independent word recognition experiments according to the unknown contextual factor. Through the above experiments, we verified the effectiveness of the re-defined 39 PLUs compared to the 48PLUs to construct the CD acoustic models in this paper.

Graph-Based Word Sense Disambiguation Using Iterative Approach (반복적 기법을 사용한 그래프 기반 단어 모호성 해소)

  • Kang, Sangwoo
    • The Journal of Korean Institute of Next Generation Computing
    • /
    • v.13 no.2
    • /
    • pp.102-110
    • /
    • 2017
  • Current word sense disambiguation techniques employ various machine learning-based methods. Various approaches have been proposed to address this problem, including the knowledge base approach. This approach defines the sense of an ambiguous word in accordance with knowledge base information with no training corpus. In unsupervised learning techniques that use a knowledge base approach, graph-based and similarity-based methods have been the main research areas. The graph-based method has the advantage of constructing a semantic graph that delineates all paths between different senses that an ambiguous word may have. However, unnecessary semantic paths may be introduced, thereby increasing the risk of errors. To solve this problem and construct a fine-grained graph, in this paper, we propose a model that iteratively constructs the graph while eliminating unnecessary nodes and edges, i.e., senses and semantic paths. The hybrid similarity estimation model was applied to estimate a more accurate sense in the constructed semantic graph. Because the proposed model uses BabelNet, a multilingual lexical knowledge base, the model is not limited to a specific language.

Ontology Mapping using Semantic Relationship Set of the WordNet (워드넷의 의미 관계 집합을 이용한 온톨로지 매핑)

  • Kwak, Jung-Ae;Yong, Hwan-Seung
    • Journal of KIISE:Databases
    • /
    • v.36 no.6
    • /
    • pp.466-475
    • /
    • 2009
  • Considerable research in the field of ontology mapping has been done when information sharing and reuse becomes necessary by a variety of ontology development. Ontology mapping method consists of the lexical, structural, instance, and logical inference similarity computing. Lexical similarity computing used in most ontology mapping methods performs an ontology mapping by using the synonym set defined in the WordNet. In this paper, we define the Super Word Set including the hypenym, hyponym, holonym, and meronym set and propose an ontology mapping method using the Super Word Set. The results of experiments show that our method improves the performance by up to 12%, compared with previous ontology mapping method.

Semi-automatic Ontology Modeling for VOD Annotation for IPTV (IPTV의 VOD 어노테이션을 위한 반자동 온톨로지 모델링)

  • Choi, Jung-Hwa;Heo, Gil;Park, Young-Tack
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.7
    • /
    • pp.548-557
    • /
    • 2010
  • In this paper, we propose a semi-automatic modeling approach of ontology to annotate VOD to realize the IPTV's intelligent searching. The ontology is made by combining partial tree that extracts hypernym, hyponym, and synonym of keywords related to a service domain from WordNet. Further, we add to the partial tree new keywords that are undefined in WordNet, such as foreign words and words written in Chinese characters. The ontology consists of two parts: generic hierarchy and specific hierarchy. The former is the semantic model of vocabularies such as keywords and contents of keywords. They are defined as classes including property restrictions in the ontology. The latter is generated using the reasoning technique by inferring contents of keywords based on the generic hierarchy. An annotation generates metadata (i.e., contents and genre) of VOD based on the specific hierarchy. The generic hierarchy can be applied to other domains, and the specific hierarchy helps modeling the ontology to fit the service domain. This approach is proved as good to generate metadata independent of any specific domain. As a result, the proposed method produced around 82% precision with 2,400 VOD annotation test data.

The Influence of Negative Emotions on Customer Contribution to Organizational Innovation in an Online Brand Community (온라인 브랜드 커뮤니티 내 부정적 감정들이 기업 혁신을 위한 고객 기여에 미치는 영향)

  • Jung, Suyeon;Lee, Hanjun;Suh, Yongmoo
    • Journal of Internet Computing and Services
    • /
    • v.14 no.4
    • /
    • pp.91-100
    • /
    • 2013
  • In recent years, online brand communities, whereby firms and customers interact freely, are emerging trend, because customers' opinions collected in these communities can help firms to achieve their innovation effectively. In this study, we examined whether customer opinions containing negative emotions have influence on their adoption for organizational innovation. To that end, we firstly classified negative emotions into five categories of detailed negative emotions such as Fear, Anger, Shame, Sadness, and Frustration. Then, we developed a lexicon for each category of negative emotions, using WordNet and SentiWordNet. From 81,543 customer opinions collected from MyStarbucksIdea.com which is Starbucks' brand community, we extracted terms that belong to each lexicon. We conducted an experiment to examine whether the existence, frequency and strength of terms with negative emotions in each category affect the adoption of customer opinions for organizational innovation. In the experiment, we statistically verified that there is a positive relationship between customer ideas containing negative emotions and their adoption for innovation. Especially, Frustration and Sadness out of the five emotions are significantly influential to organizational innovation.

Feature Generation of Dictionary for Named-Entity Recognition based on Machine Learning (기계학습 기반 개체명 인식을 위한 사전 자질 생성)

  • Kim, Jae-Hoon;Kim, Hyung-Chul;Choi, Yun-Soo
    • Journal of Information Management
    • /
    • v.41 no.2
    • /
    • pp.31-46
    • /
    • 2010
  • Now named-entity recognition(NER) as a part of information extraction has been used in the fields of information retrieval as well as question-answering systems. Unlike words, named-entities(NEs) are generated and changed steadily in documents on the Web, newspapers, and so on. The NE generation causes an unknown word problem and makes many application systems with NER difficult. In order to alleviate this problem, this paper proposes a new feature generation method for machine learning-based NER. In general features in machine learning-based NER are related with words, but entities in named-entity dictionaries are related to phrases. So the entities are not able to be directly used as features of the NER systems. This paper proposes an encoding scheme as a feature generation method which converts phrase entities into features of word units. Futhermore, due to this scheme, entities with semantic information in WordNet can be converted into features of the NER systems. Through our experiments we have shown that the performance is increased by about 6% of F1 score and the errors is reduced by about 38%.

Automatic WordNet mapping using word sense disambiguation (의미 애매성 해소를 이용한 WordNet 자동 매핑)

  • Lee, Chang-Ki;Lee, Geun-Bae
    • Annual Conference on Human and Language Technology
    • /
    • 2000.10d
    • /
    • pp.262-268
    • /
    • 2000
  • 본 논문에서는 어휘 의미 애매성 해소와 영어 대역어 사전 그리고 외국언어에 존재하는 개념체계를 이용하여 한국어 개념체계를 자동으로 구축하는 방법을 기술한다. 본 논문에서 사용하는 방법은 기존의 개념체계 구축 방법들에 비해 적은 노력과 시간을 필요로 한다. 또한 상기한 자동 구축 방법에서 사용하는 어휘 의미 애매성 해소를 위한 6가지 feature도 함께 설명한다.

  • PDF

지능형 전문가관리 프레임워크를 위한 주제 분야 계층 자동 생성

  • Yang, Geun-U;Lee, Sang-Ro
    • 한국경영정보학회:학술대회논문집
    • /
    • 2007.11a
    • /
    • pp.294-299
    • /
    • 2007
  • In this paper, we introduce the methodology for the automatic generation of the subject field hierarchy for Intellgent Expert Management Framework using WordNet. Intelligent Expert Management Framework, which is proposed as an appropriate method to manage valuable tacit knowledge within the organization, defines the expert profile structure and proposes the efficient method to automate the process to collect and update the expert profile information based on the profile structure defined. To increase the satisfaction level of users, additional intelligent search features are defined and users can be given the list of experts in related or similar expert fields when they perform expert searches based on the expert database being built. To enable automatic profiling of the organizational experts as well as intelligent expert searches, the subject field hierarchy, upon which the expert profiles are classified and expert searches for similar fields are performed, should be predefined. In this paper, we propose the WordNet library method that first eliminates the ambiguity of the senses of nominal data values, constructs the subject field hierarchy by overlapping the hypernym of the remaining senses, and lastly adjusts the derived hierarchy to the preference of users. Based on the proposed methodology, we expect to avoid the prohibitive costs in building large subject field hierarchies when manually done as well as maintain the objectivity of the hierarchies.

  • PDF

TagPlus: A Retrieval System using Synonym Tag in Folksonomy (TagPlus: 폭소노미에서 동의어 태그를 이용한 검색 시스템)

  • Lee, Sun-Sook;Yong, Hwan-Seung
    • Journal of Digital Contents Society
    • /
    • v.8 no.3
    • /
    • pp.255-262
    • /
    • 2007
  • Collaborative tagging describes the process by which many users add metadata in the form of keywords to shared content. Recently, collaborative tagging has grown in popularity on the web, on sites that allow users to tag bookmarks, photographs, videos and other content. In this paper, we analyze the structure and basic knowledge of collaborative tagging systems as well as their dynamical aspects. We also present a retrieval system, TagPlus, using synonym tag that is derived from WordNet database. Specifically, TagPlus, a synonym tag based system has users retrieve images from Flickr system. The proposed system show the images tagged by not only the tag that users input but also the synonyms that are synonyms with the tag.

  • PDF

Creation of the Conversion Table from Hangeul to the Roman Alphabet

  • Kim, Kyoung-Jing;Rhee, Sang-Burm
    • Proceedings of the IEEK Conference
    • /
    • 2002.07a
    • /
    • pp.321-324
    • /
    • 2002
  • For a rule-based conversion of Hangout into the Roman alphabet rather than a word-for-word conversion, one must come up with a faultless model for the Korean standard pronunciation rules, which are the basis of the Romanization. It is on this foundation that the Korean-Roman alphabet conversion table can be created. For linguistic modeling using PetriNet, modeling boundary and notation of modeling can be defined. In order to describe PetriNet, which is a dynamic modeling tool, as a static one, one can model the standard Korean pronunciation rules and the Hangout-Roman alphabet notation by conversion into incident matrix Thus, this research attempts to develop a mathematical modeling tool for a natural language using PetriNet, and create a Korean-Roman alphabet conversion table.

  • PDF