• 제목/요약/키워드: Semantic analysis

검색결과 1,355건 처리시간 0.034초

SVM 기계학습을 이용한 웹문서의 자동 의미 태깅 (Automatic semantic annotation of web documents by SVM machine learning)

  • 황운호;강신재
    • 한국산업정보학회논문지
    • /
    • 제12권2호
    • /
    • pp.49-59
    • /
    • 2007
  • 본 논문은 시맨틱 웹의 실현을 위해서는 필수적인 작업인 웹문서의 의미를 자동으로 태깅할 수 있는 시스템에 관한 것이다. 웹상의 방대한 자원을 일일이 사람이 수작업으로 의미를 태깅한다는 것은 사실상 불가능하기 때문에 한국어 웹문서를 대상으로 대량의 학습 데이터를 수집하고 자연어처리 기법과 시소러스를 이용하여 특징을 추출한 후 SVM 기계학습을 통하여 개념분류기를 구축하였다. 한국어의 특징을 파악하여 의미 태깅에 필요한 특징 정보를 추출하기 위해서 형태소 분석과 구문 분석을 하였다. 추출된 특징정보는 가도카와 시소러스의 의미코드를 이용하여 학습벡터로 구성되는데, 이는 유사한 단어나 구를 하나의 개념코드로 매핑하여 시스템의 재현율을 높이는 역할을 하게 된다. 실험결과 자동 의미 태깅 분야에서 본 접근방법의 가능성을 확인할 수 있었다.

  • PDF

사물인터넷 환경에서 대용량 스트리밍 센서데이터의 실시간·병렬 시맨틱 변환 기법 (Real-time and Parallel Semantic Translation Technique for Large-Scale Streaming Sensor Data in an IoT Environment)

  • 권순현;박동환;방효찬;박영택
    • 정보과학회 논문지
    • /
    • 제42권1호
    • /
    • pp.54-67
    • /
    • 2015
  • 최근 사물인터넷 환경에서는 발생하는 센서데이터의 가치와 데이터의 상호운용성을 증진시키기 위해 시맨틱웹 기술과의 접목에 대한 연구가 활발히 진행되고 있다. 이를 위해서는 센서데이터와 서비스 도메인 지식의 융합을 위한 센서데이터의 시맨틱화는 필수적이다. 하지만 기존의 시맨틱 변환기술은 정적인 메타데이터를 시맨틱 데이터(RDF)로 변환하는 기술이며, 이는 사물인터넷 환경의 실시간성, 대용량성의 특징을 제대로 처리할 수 없는 실정이다. 따라서 본 논문에서는 사물인터넷 환경에서 발생하는 대용량 스트리밍 센서데이터의 실시간 병렬처리를 통해 시맨틱 데이터로 변환하는 기법을 제시한다. 본 기법에서는 시맨틱 변환을 위한 변환규칙을 정의하고, 정의된 변환규칙과 온톨로지 기반 센서 모델을 통해 실시간 병렬로 센서데이터를 시맨틱 변환하여 시맨틱 레파지토리에 저장한다. 성능향상을 위해 빅데이터 실시간 분석 프레임워크인 아파치 스톰을 이용하여, 각 변환작업을 병렬로 처리한다. 이를 위한 시스템을 구현하고, 대용량 스트리밍 센서데이터인 기상청 AWS 관측데이터를 이용하여 제시된 기법에 대한 성능평가를 진행하여, 본 논문에서 제시된 기법을 입증한다.

GOVERNMENT-CIVIC GROUP CONFLICTS AND COMMUNICATION STRATEGY: A TEXT ANALYSIS OF TV DEBATES ON KOREA'S IMPORT OF U.S. BEEF

  • Cho, Seong Eun;Choi, Myunggoon;Park, Han Woo
    • Journal of Contemporary Eastern Asia
    • /
    • 제11권1호
    • /
    • pp.1-20
    • /
    • 2012
  • This study analyzes messages from Korean TV debates on the conflict over U.S. beef imports and the process of negotiations over the imports in 2008. The authors have conducted a content analysis and a semantic network analysis by using KrKwic and CONCOR. The data was drawn from nine TV debates aired by three major TV networks in Korea (MBC, KBS, and SBS) from 27 April 27 2008 to 6 July 2008. The results indicate substantial differences in the semantic structure between arguments by the government and those by civic groups. We also investigated the relationship between the terms frequently used by both sides (i.e., the government and civic groups), and the terms used exclusively by one side. There was a gradual increase in the number of terms frequently used by both sides over time, from the formation of the conflict to its escalation to its resolution. The results indicate the possibility of general agreement in conflict situations.

Emerging Gender Issues in Korean Online Media: A Temporal Semantic Network Analysis Approach

  • Lee, Young-Joo;Park, Ji-Young
    • Journal of Contemporary Eastern Asia
    • /
    • 제18권2호
    • /
    • pp.118-141
    • /
    • 2019
  • In South Korea, as awareness of gender equality increased since the 1990s, policies for gender equality and social awareness of equality have been established. Until recently, however, the gap between men and women in social and economic activities has not reached the globally desired level and led to social conflict throughout the country. In this study, we analyze the content of online news comments to understand the public perception of gender equality and the details of gender conflict and to grasp the emergence and diffusion process of emerging issues on gender equality. We collected text data from the online news that included the word 'gender equality' posted from January 2012 to June 2017 and also collected comments on each selected news item. Through text mining and the temporal semantic network analysis, we tracked the changes in discourse on gender equality and conflict. Results revealed that gender conflicts are increasing in the online media, and the focus of conflict is shifting from 'position and role inequality' to 'opportunity inequality'.

COVID-19 확산 방지를 위한 시맨틱 진단 및 추적시스템 (A Semantic Diagnosis and Tracking System to Prevent the Spread of COVID-19)

  • 순위샹;이용주
    • 한국전자통신학회논문지
    • /
    • 제15권3호
    • /
    • pp.611-616
    • /
    • 2020
  • 본 논문은 대도시에서의 COVID-19 바이러스 확산을 막기 위해, 대한민국 서울의 감염 상황에 대한 클러스터 분석을 통한 링크드 데이터 기반 시맨틱 진단 및 추적 시스템을 제안한다. 본 논문은 크게 3개의 섹션으로 구성되어 있는데, 클러스터 분석을 위해 서울의 감염자 정보를 수집하고, 중요한 감염 환자 속성을 추출하여 랜덤 포레스트를 기반으로 한 진단 모델을 구축하고, 그리고 링크드 데이터를 기반으로 한 추적 시스템을 설계하고 구현한다. 실험 결과 진단 모델의 정확도가 80% 이상으로 나타났으며, 더군다나 본 논문에서 제안한 추적 시스템은 기존 시스템들보다 더 유연하고 개방적이며 시맨틱 쿼리도 지원한다.

국내 소비자의 일본 패션제품에 대한 정치적 소비 연구 (Korean Consumers' Political Consumption of Japanese Fashion Products)

  • 최영현;이규혜
    • 한국의류학회지
    • /
    • 제44권2호
    • /
    • pp.295-309
    • /
    • 2020
  • In 2019, Japan announced trade regulations against Korean products; consequently, the sales of Japanese products in Korea dropped due to a Korean consumers' boycott. This study measured the Korean consumers' political consumption behavior toward Japanese fashion products. Unstructured text data from online media sources and consumer posted sources such as blog and SNS were collected. Text mining techniques and semantic network analysis were used to process unstructured data. This study used text mining techniques and semantic network analysis to process data. The results identified boycotting Japanese fashion products and buycotting alternative products and Korean brands due to consumers' political consumption. Two brand cases were investigated in detail. Online text data before and after the political action were compared and significant changes in consumption as well as emotional expressions were identified. Product related industry sectors were identified in terms of the political consumption of fashion: liquor, automobile and tourism industry sectors were closely linked to the fashion sector in terms of boycotting. More "boycott" and "buycott" fashion brands (reflected in consumer attitudes and feelings) were detected in consumer driven texts than in media driven sources.

의미네트워크를 활용한 초등학교 예비교사들의 물질 개념체계 분석 (An Analysis of Conceptual Structure in the Subjects related to Matter of Elementary School Pre-service Teachers using SNA Method)

  • 김도욱
    • 한국초등과학교육학회지:초등과학교육
    • /
    • 제37권1호
    • /
    • pp.39-53
    • /
    • 2018
  • The purpose of this study was to investigate the conceptual structure of subjects related to matter having pre-service elementary school teachers by applying semantic network analysis (SNA). The analyzed concepts in the subjects of matter were 6 words such as 'atom', 'molecule', 'ion', 'electron', 'matter' and 'particle'. The results of SNA of the concepts are as follows : 1. In the semantic network of 'atom', words having a high betweenness centrality were linked with the words based on both the scientific context and the everyday context. 2. The network of 'molecule' was analyzed to be more organized than the network of the 'atom'. 3. In the network of 'ion', the group of words of the scientific context was distinguished from the group of words of the everyday context. 4. The network of 'electron' was analyzed to be more oriented on electricity and magnetism in the field of physics. 5. In the network of 'matter', the words related to compounds were linked with knowledge of history of science. 6. The network of 'particle' was not structured with words based on particulate nature of matter.

"쾌적" 이미지의 평가 및 인식구조에 관한 연구 (A study on Appreciation and Perceptive Structure of "Keijeok" (Amenity) image)

  • 양진무;노경준;안정현
    • 환경영향평가
    • /
    • 제8권1호
    • /
    • pp.61-70
    • /
    • 1999
  • The purpose of this study is to understand an appreciation and a perceptive structure of "keijeok" image by Semantic Differential Technique and Factor Analysis. The data used in this study was obtained by the questionnaire survey carried out in Pusan metropolitan city. 15 adjective pairs in the survey were evaluated by the Semantic Differential scales graded 7 ranges from 1(very good) to 7(very bad). A total of 452 samples were collected by the survey and analyzed for this study. The results are as follows; First, 15 variables comprehended to "keijeok" image were estimated as a positive conception(LT 4.0). What's more, residents may perceive "keijeok" image as intangible and aesthetic aspect such as "fresh", "pleasant", "clean". Second, the result of factor analysis shows that factor I which express the major conceptual meaning of "keijeok" image tends to have intangible or aesthetic adjective pairs rather than concrete, whereas factor II which has the weaker meaning compared with factor I may represent a functional aspect of "keijeok" image. It can explain that the perceptive structure of "keijeok" image may be largely influenced by subjective sense, then added or concreted with objective conception or environmental situation. The results can be considered as an important matter which should be reflected at the stage of environmental planning for people's amiable and desirable place.

  • PDF

대통령 지시사항에 대한 의미연결망 분석 : 2001년~2009년의 정권별 패턴을 중심으로 (Semantic Network Analysis for the President Directions Item : Focusing on Patterns(2001~2009))

  • 정의룡
    • 문화기술의 융합
    • /
    • 제4권1호
    • /
    • pp.129-137
    • /
    • 2018
  • 본 연구는 그간 학문적, 이슈적 관심에서 간과되었던 "대통령 지시사항"에 대한 분석을 수행하였다. 이러한 분석을 통해 대통령의 행정부처 내에서 진행되었던 정책방향의 차이에 대한 이해를 제고시키고, 시대의 흐름을 이해하는데 기여할 뿐만 아니라 향후 정책발굴에 있어서도 도움이 될 수 있음을 확인할 수 있었다. 본 연구는 대통령 지시사항과 관련한 키워드 네트워크 구조에 대한 패턴변화를 의미연결망 분석을 통해 고찰함으로써 대통령의 정책방향 차이가 대통령의 의지와 제도적 맥락 간의 상호작용에 연동되고 있다는 시사점을 제공하는데 기여하고 있다.

간호사의 직장 내 괴롭힘에 대한 국내 연구 동향 분석: 의미연결망분석과 토픽모델링 중심 (A Study on Research Trend for Nurses' Workplace Bullying in Korea: Focusing on Semantic Network Analysis and Topic Modeling)

  • 최정실;김영지
    • 한국직업건강간호학회지
    • /
    • 제28권4호
    • /
    • pp.221-229
    • /
    • 2019
  • Purpose: The aim of this study was to identify core keywords and topic groups of workplace bullying researches in the past 10 years for better understanding research trend. Methods: The study was conducted in four steps: 1) collecting abstracts, 2) extracting and cleaning semantic morphemes, 3) building co-occurrence matrix and 4) analyzing network features and clustering topic groups. Results: 437 articles between 2010 and 2019 were retrieved from 5 databases (RISS, NDSL, Google scholar, DBPIA and Kyobo Scholar). Forty-one abstracts from these articles were extracted, and network analysis was conducted using semantic network module. The most important core keywords were 'turnover', 'intention', 'factor', 'program' and 'nursing'. Four topic groups were identified from Korean databases. Major topics were 'turnover' and 'organization culture'. Conclusion: After reviewing previous research, it has been found that turnover intention has been emphasized. Further research focused on various intervention is needed to relieve workplace bullying in nursing field.