• Title/Abstract/Keywords: word-net

258 search results, processing time 0.025 seconds

Document Clustering Using Semantic Features and Fuzzy Relations

  • Kim, Chul-Won;Park, Sun
    • Journal of information and communication convergence engineering
    • /
    • Vol. 11, No. 3
    • /
    • pp.179-184
    • /
    • 2013
  • Traditional clustering methods are usually based on the bag-of-words (BOW) model. A disadvantage of the BOW model is that it ignores the semantic relationship among terms in the data set. To resolve this problem, ontology or matrix factorization approaches are usually used. However, a major problem of the ontology approach is that it is usually difficult to find a comprehensive ontology that can cover all the concepts mentioned in a collection. This paper proposes a new document clustering method using semantic features and fuzzy relations for solving the problems of ontology and matrix factorization approaches. The proposed method can improve the quality of document clustering because the clustered documents use fuzzy relation values between semantic features and terms to distinguish clearly among dissimilar documents in clusters. The selected cluster label terms can represent the inherent structure of a document set better by using semantic features based on non-negative matrix factorization, which is used in document clustering. The experimental results demonstrate that the proposed method achieves better performance than other document clustering methods.

The recent research wave in ecotourism research using keyword network analysis

  • 이재혁;손용훈
    • 농촌계획 (Rural Planning)
    • /
    • Vol. 22, No. 2
    • /
    • pp.45-55
    • /
    • 2016
  • Since the concept of ecotourism was introduced in 1970, many studies on ecotourism have appeared, and reviewing them is necessary for future ecotourism research. Some review studies on ecotourism exist. However, these approaches are limited by subjectivity, and some kinds of papers have not been reviewed. To overcome these limitations, this study uses keyword network analysis, a technique used in big data analysis. A total of 2,455 foreign studies and 163 domestic studies with 'ecotourism' in their keywords were analyzed. As a result, three clusters ('Sustainable tourism development', 'Ecological conservation', and 'Ecotourist analysis') appeared in ecotourism studies. In addition, these clusters are closely related to region: 'Sustainable tourism development' is related to Eurasia, Australia, and Europe; 'Ecological conservation' is related to Africa; and 'Ecotourist analysis' is related to North America. In particular, 'Resident participation' and 'Stakeholder' appeared many times in the Asia region. These results show that ecotourism studies are interpreted in regional contexts. This means that, because the single word 'ecotourism' is used in different contexts, a regional approach is needed for its exact use. In Korea, the keywords focus on ecotourists and development. As Korea has many ecotour villages, studies on resident participation need to be supplemented.

Schema Element Matching System using WordNet

  • 이민호;이원구;최윤수;윤화묵;최동훈;조민희;정한민
    • Korean Institute of Information Scientists and Engineers: Conference Proceedings
    • /
    • Korean Institute of Information Scientists and Engineers, Proceedings of the Korea Computer Congress 2012, Vol.39 No.1(C)
    • /
    • pp.122-124
    • /
    • 2012
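  • Matching schemas that are defined in various forms is essential for ensuring information interoperability. WordNet is a semantic lexicon of English that records synonym sets and the various semantic relations among them, and it can be used for automated text analysis and artificial intelligence applications. This paper proposes a system that matches schema elements by using WordNet to extract the sense set of each schema element name and measuring its similarity to the sense set of the corresponding element. The system reduces complex multiple-match relationships to single matches in a simple way, so users can work with it intuitively and easily. We expect it to be useful in various fields that require information interoperability, such as data integration, transformation, and distributed search.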
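The matching idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' system: the `SYNSETS` table is a hypothetical stand-in for WordNet synonym sets, the element names are invented, and Jaccard overlap with a greedy one-to-one assignment is only one assumed way to measure sense-set similarity and reduce multiple matches to single matches.

```python
import re

# Hypothetical synonym sets standing in for WordNet synsets (assumption).
SYNSETS = [
    {"author", "writer", "creator"},
    {"title", "name", "heading"},
    {"date", "day", "time"},
    {"publisher", "issuer"},
]

def tokens(element_name):
    """Split camelCase / snake_case element names into lowercase tokens."""
    spaced = re.sub(r"([a-z])([A-Z])", r"\1 \2", element_name)
    return [t.lower() for t in re.split(r"[\s_\-]+", spaced) if t]

def sense_set(element_name):
    """Union of the name's tokens and every synonym set a token belongs to."""
    senses = set()
    for tok in tokens(element_name):
        senses.add(tok)
        for syn in SYNSETS:
            if tok in syn:
                senses |= syn
    return senses

def jaccard(a, b):
    """Size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def match_schemas(source, target, threshold=0.2):
    """Greedy one-to-one matching: best-scoring pairs are fixed first."""
    scored = sorted(
        ((jaccard(sense_set(s), sense_set(t)), s, t)
         for s in source for t in target),
        key=lambda item: (-item[0], item[1], item[2]),
    )
    used_source, used_target, result = set(), set(), {}
    for score, s, t in scored:
        if score >= threshold and s not in used_source and t not in used_target:
            result[s] = t
            used_source.add(s)
            used_target.add(t)
    return result

# Invented element names for two schemas.
matches = match_schemas(
    ["creator", "doc_title", "pubDate"],
    ["writer", "heading", "issue_day", "publisher"],
)
```

Because each source and target element is consumed once, an element that scores above the threshold against several candidates ends up paired only with its best available partner.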

A Text Mining Analysis of HPV Vaccination Research Trends

  • 손예동;강희선
    • Child Health Nursing Research
    • /
    • Vol. 25, No. 4
    • /
    • pp.458-467
    • /
    • 2019
  • Purpose: The purpose of this study was to identify human papillomavirus (HPV) vaccination research trends by visualizing a keyword network. Methods: Articles about HPV vaccination were retrieved from the PubMed and Web of Science databases. A total of 1,448 articles published in 2006~2016 were selected. Keywords from the abstracts of these articles were extracted using the text mining program WordStat and standardized for analysis. Sixty-four keywords out of 287 were finally chosen after pruning. Social network analysis using NetMiner was applied to analyze the whole keyword network and the betweenness centrality of the network. Results: According to the results of the social network analysis, the central keywords with high betweenness centrality included "health education", "health personnel", "parents", "uptake", "knowledge", and "health promotion". Conclusion: To increase the uptake of HPV vaccination, health personnel should provide health education and vaccine promotion for parents and adolescents. Using social media, governmental organizations can offer accurate information that is easily accessible. School-based education will also be helpful.
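As a rough illustration of the betweenness centrality measure named in the abstract above, the sketch below builds a keyword co-occurrence network and scores each keyword with Brandes' algorithm for unweighted graphs. It is not NetMiner's implementation, and the keyword sets are invented toy data, not the study's 64 keywords.

```python
from collections import defaultdict, deque
from itertools import combinations

def cooccurrence_graph(keyword_sets):
    """Link every pair of keywords that appear in the same article."""
    graph = defaultdict(set)
    for keywords in keyword_sets:
        for a, b in combinations(sorted(set(keywords)), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph

def betweenness_centrality(graph):
    """Unnormalized betweenness for an undirected, unweighted graph (Brandes)."""
    centrality = {v: 0.0 for v in graph}
    for source in graph:
        # Breadth-first search that counts shortest paths from the source.
        order = []
        preds = {v: [] for v in graph}
        sigma = {v: 0 for v in graph}
        dist = {v: -1 for v in graph}
        sigma[source], dist[source] = 1, 0
        queue = deque([source])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Accumulate path dependencies from the farthest nodes back.
        delta = {v: 0.0 for v in graph}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != source:
                centrality[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: c / 2 for v, c in centrality.items()}

# Invented keyword sets, one per article.
articles = [
    ["parents", "health education"],
    ["health education", "uptake"],
    ["health education", "knowledge"],
]
scores = betweenness_centrality(cooccurrence_graph(articles))
```

In this toy network every shortest path between the other three keywords passes through "health education", so it receives the highest score, which is the sense in which a keyword is "central" under this measure.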

A WordNet-based Feature Merge Method for HyperText Classification

  • 노준호;김한준;장재영
    • Korea Information Processing Society: Conference Proceedings
    • /
    • Korea Information Processing Society 2012 Fall Conference
    • /
    • pp.406-409
    • /
    • 2012
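  • This paper presents a new approach for improving the automatic classification performance of hypertext documents. Unlike ordinary documents, hypertext documents are connected to one another through hyperlinks. This hyperlink information is highly relevant to the target document, and selecting features well from it requires a more precise approach. This paper proposes a feature processing technique that exploits hypertext link information based on semantic similarity between words. The proposed technique uses a word similarity function to extract features that are highly relevant to the target document from hyperlinked documents, and the similarity function uses the hypernym/hyponym relations of WordNet. It then merges those extracted features that represent semantically similar concepts, building a semantically more robust classification model. To validate the proposed technique, we ran experiments on the Web-KB document collection, and the results showed better performance than existing methods.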
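A small sketch may help make the feature-merging step in the abstract above concrete. Everything here is assumed for illustration: the `HYPERNYM` table is a hypothetical toy taxonomy standing in for WordNet, the similarity is a simple inverse path length over is-a links, and the threshold of 0.3 is arbitrary. The paper's actual similarity function is not given in the abstract.

```python
# Toy hypernym (is-a) taxonomy standing in for WordNet: child -> parent.
# All entries are hypothetical.
HYPERNYM = {
    "professor": "educator",
    "lecturer": "educator",
    "educator": "person",
    "student": "person",
    "seminar": "course",
    "course": "activity",
    "person": "entity",
    "activity": "entity",
}

def ancestors(word):
    """Return [word, parent, grandparent, ...] up to the taxonomy root."""
    chain = [word]
    while chain[-1] in HYPERNYM:
        chain.append(HYPERNYM[chain[-1]])
    return chain

def path_similarity(a, b):
    """1 / (1 + number of is-a edges on the shortest path between a and b)."""
    depth_from_a = {w: d for d, w in enumerate(ancestors(a))}
    for depth_from_b, w in enumerate(ancestors(b)):
        if w in depth_from_a:  # lowest common ancestor
            return 1.0 / (1 + depth_from_a[w] + depth_from_b)
    return 0.0

def merge_features(features, threshold=0.3):
    """Group features whose similarity reaches the threshold (union-find)."""
    parent = {f: f for f in features}

    def find(f):
        while parent[f] != f:
            parent[f] = parent[parent[f]]  # path compression
            f = parent[f]
        return f

    for i, a in enumerate(features):
        for b in features[i + 1:]:
            if path_similarity(a, b) >= threshold:
                parent[find(b)] = find(a)

    groups = {}
    for f in features:
        groups.setdefault(find(f), []).append(f)
    return sorted(sorted(g) for g in groups.values())

merged = merge_features(["professor", "lecturer", "student", "course", "seminar"])
```

Each merged group could then be treated as a single feature when training a classifier, which is one plausible reading of "merging features that represent semantically similar concepts".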

Sentiment analysis of nuclear energy-related articles and their comments on a portal site in Rep. of Korea in 2010-2019

  • Jeong, So Yun;Kim, Jae Wook;Kim, Young Seo;Joo, Han Young;Moon, Joo Hyun
    • Nuclear Engineering and Technology
    • /
    • Vol. 53, No. 3
    • /
    • pp.1013-1019
    • /
    • 2021
  • This paper reviewed the temporal changes in public opinion on nuclear energy in Korea with a big data analysis of nuclear energy-related articles and their comments posted on the portal site NAVER. All articles that included at least one of "nuclear energy," "nuclear power plant (NPP)," "nuclear power phase-out," or "anti-nuclear" in their titles or main text were extracted from those posted on NAVER in January 2010-December 2019. First, we performed annual word frequency analysis to identify which words had appeared most frequently in the articles. For that period, the most frequent words were "NPP," "nuclear energy," and "energy." In addition, "safety" has remained in the upper ranks since the Fukushima NPP accident. Then, we performed sentiment analysis of the pre-processed articles. The sentiment analysis showed that positive-tone articles were reported more frequently than negative-tone articles over the entire analysis period. Last, we performed sentiment analysis of the comments on the articles to examine the public's intention regarding nuclear issues. The analysis showed that the number of negative comments on articles each month, irrespective of the articles' positive or negative tone, was always larger than that of positive comments over the entire analysis period.

Butterfly Image Fashion Design in the Fashion Designer Brand 'Alexander McQueen'

  • 전세미;염혜정
    • 패션비즈니스 (Fashion Business)
    • /
    • Vol. 23, No. 4
    • /
    • pp.24-37
    • /
    • 2019
  • This study focused on the fashion designer brand 'Alexander McQueen' to determine how butterflies are used in modern fashion through the sensibility of a specific designer. To this end, both a literature review and empirical research were conducted. First, we examined the origin of the word and the appearance characteristics of butterflies based on prior research and a book, and also surveyed the tendencies of the fashion designer brand 'Alexander McQueen'. Second, out of 239 items announced in the brand's RTW (Ready to Wear) collections, ranging from the 2008 S/S collection to the 2018-9 F/W collection, 73 pieces deemed to be fashion using butterfly images were collected through www.samsung.net and www.firstview.com and then analyzed by time period and aesthetic characteristics. As a result, the analysis by time period was divided into fantasy, handicraft, and mix and match, and the aesthetic characteristics appeared in the order of compromise beauty, rhythmical beauty, and voluptuous beauty. Based on the results of this study, we hope that the information presented here on fashion using natural images will serve as basic material for similar research or design ideas, as an example of designs based on butterfly images.

Positive or negative? Public perceptions of nuclear energy in South Korea: Evidence from Big Data

  • Park, Eunil
    • Nuclear Engineering and Technology
    • /
    • Vol. 51, No. 2
    • /
    • pp.626-630
    • /
    • 2019
  • After several significant nuclear accidents, public attitudes toward nuclear energy technologies and facilities are considered to be one of the essential factors in the national energy and electricity policy-making process of several nations that employ nuclear energy as their key energy resource. However, it is difficult to explore and capture such an attitude, because the majority of prior studies analyzed public attitudes with a limited number of respondents and fragmentary opinion polls. In order to supplement this point, this study suggests a big data analyzing method with K-LIWC (Korean-Linguistic Inquiry and Word Count), sentiment and query analysis methods, and investigates public attitudes, positive and negative emotional statements about nuclear energy with the collected data sets of well-known social media and network services in Korea over time. Results show that several events and accidents related to nuclear energy have consistent or temporary effects on the attitude and ratios of the statements, depending on the kind of events and accidents. The presented methodology and the use of big data in relation to the energy industry is suggested as it can be helpful in addressing and exploring public attitudes. Based on the results, implications, limitations, and future research areas are presented.

An Analysis of Social Discussion on Preservation and Utilization of Modern Architectural Heritage using Semantic Network Analysis - Focussed on the former Busan Branch of Hansung Bank(Cheong-Ja Bldg) as a Modern Heritage -

  • 안재철
    • Journal of the Architectural Institute of Korea: Planning
    • /
    • Vol. 35, No. 7
    • /
    • pp.101-108
    • /
    • 2019
  • In this research, I conducted a semantic network analysis of media articles on the purchase, revitalization, and utilization of the former Busan branch of Hansung Bank, a modern architectural heritage site. I sought the most efficient analysis elements for examining the social arguments about preservation and utilization embedded in media articles. To this end, I examined degree centrality, which measures how many connections a word in the media articles has, and betweenness centrality, which measures a word's influence in controlling the flow of information. In addition, I examined the cohesive structure of each sub-network through keywords that express its theme well. In theoretical terms, this research is meaningful in that the social discussion embedded in mass media articles is grasped empirically through semantic network analysis of words. In methodological terms, when the influence of social discussion is assessed through the connections between words in media articles, the analysis of the cohesive structure of the semantic network works best when it includes nouns and adjectives and the distance between words is more than four words.
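The degree centrality measure and the word-distance setting mentioned in the abstract above can be sketched as follows. This is an assumed, simplified reading: words are linked when they occur within a fixed `window` of each other, the token list is invented, and the window of 2 is chosen only to keep the example small. It does not reproduce the study's tool or settings.

```python
from collections import defaultdict

def window_cooccurrence(tokens, window=4):
    """Link each pair of distinct words at most `window` positions apart."""
    edges = defaultdict(set)
    for i, word in enumerate(tokens):
        for j in range(i + 1, min(i + window + 1, len(tokens))):
            other = tokens[j]
            if other != word:
                edges[word].add(other)
                edges[other].add(word)
    return edges

def degree_centrality(edges):
    """Share of the other words in the network that each word is linked to."""
    n = len(edges)
    return {
        word: (len(neighbors) / (n - 1) if n > 1 else 0.0)
        for word, neighbors in edges.items()
    }

# Invented token sequence, already reduced to content words.
tokens = ["bank", "heritage", "preservation", "city",
          "purchase", "heritage", "utilization"]
centrality = degree_centrality(window_cooccurrence(tokens, window=2))
```

Widening the window adds more links per word, so the choice of word distance directly changes which words appear central, which is why the abstract treats it as a methodological parameter.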

A Knowledge Service Using Automatic Document Sharing based on Intelligent OMDR

  • 김수경;최호진
    • Korea Information Processing Society: Conference Proceedings
    • /
    • Korea Information Processing Society 2008 Fall Conference
    • /
    • pp.747-750
    • /
    • 2008
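  • This study aims at the overall application and use of Semantic Web technologies, such as ontologies, natural language processing, and metadata, for Semantic Web applications. To this end, domain ontologies for each knowledge topic of an organization or institution are built on OWL, while the existing WordNet, Dublin Core Meta Data, and the database schemas defined in the organization are built into an MDR. The two are interlinked to provide a way of combining the intelligent inference and rule services of ontologies with standardized metadata. This has high research value for the reuse and alignment of existing ontologies and metadata. In addition, when a user in the organization writes a document, natural language processing and ontology technologies are applied to the document's content to automatically suggest suitable terms and metadata, which improves the sharing and reusability of the document. Written documents are stored in an XML Based Intelligent Document Database and are provided through a Document Registry and Retrieval Agent to users who need to write or use similar documents. This minimizes the privatization of document knowledge and reduces the time and cost of rewriting similar documents or writing a particular document. Furthermore, if documents can also be searched and used through the Document Registry and Retrieval Agent on the web or on personal portable devices such as PDAs, we believe this could yield a practical application of ubiquitous computing and the Semantic Web in which the service is available anytime and anywhere.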