• Title/Abstract/Keywords: Semantic analysis

Search results: 1,355 items (processing time: 0.028 s)

Relations between Reputation and Social Media Marketing Communication in Cryptocurrency Markets: Visual Analytics using Tableau

  • Park, Sejung;Park, Han Woo
    • International Journal of Contents
    • /
    • Vol. 17, No. 1
    • /
    • pp.1-10
    • /
    • 2021
  • Visual analytics is an emerging research field that combines the strengths of electronic data processing and human intuition-based social background knowledge. This study demonstrates useful visual analytics with Tableau in conjunction with semantic network analysis, using examples of sentiment flow and strategic communication via Twitter in the blockchain domain. We compared the sentiment flow over time and the language usage patterns of companies with a good reputation and firms with a poor reputation. In addition, this study explored the relations between reputation and marketing communication strategies. We found that cryptocurrency firms produced information more actively when public demand and transactions increased and when coin prices were high. Emotional language strategies on social media did not affect cryptocurrencies' reputations. The pattern in semantic representations of keywords was similar between companies with a good reputation and firms with a poor reputation. However, the reputable firms communicated on a wide range of topics, used more culturally focused strategies, and took greater advantage of social media marketing by expanding their outreach to other social media networks. Visual big data analytics provides business intelligence insights that help inform policy.

'영끌' 보도에 대한 언어망 분석: 뉴스 정보원 다양성을 중심으로 (Semantic Network Analysis of 'Young-Kl(panic buying)': Focusing on News Source Diversity)

  • 이정훈
    • 한국콘텐츠학회논문지
    • /
    • Vol. 21, No. 12
    • /
    • pp.23-33
    • /
    • 2021
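  • This study analyzed news articles about 'Young-Kl' ('영끌', panic buying) reported by a total of 11 news outlets, including daily newspapers, business newspapers, and terrestrial TV broadcasters, to identify the frames of the reports and of the quotations they contain. Using semantic network analysis, it compared quotation frames by outlet and by type of news source, and it also measured the types and frequencies of the sources quoted and the concentration index of each frame. The analysis showed that the news frames consisted of 10 topics and the quotation frames of 14 topics. Differences were observed among the quotation frames by outlet and by source type, but the concentration of frames from frequently quoted government, political, and business sources was relatively high. The study therefore provides empirical evidence that the numerical diversity of sources alone may be of limited use in establishing diversity of news frames.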

Using Syntax and Shallow Semantic Analysis for Vietnamese Question Generation

  • Phuoc Tran;Duy Khanh Nguyen;Tram Tran;Bay Vo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 17, No. 10
    • /
    • pp.2718-2731
    • /
    • 2023
  • This paper presents a method that uses syntax and shallow semantic analysis for Vietnamese question generation (QG). Specifically, our proposed technique investigates both the syntactic and the shallow semantic structure of each sentence. The main goal of our method is to generate questions from a single sentence. The generated questions are factoid questions, which require short, fact-based answers. In general, syntax-based analysis is one of the most popular approaches in the QG field, but it requires linguistic expert knowledge as well as a deep understanding of Vietnamese syntax rules. It is thus considered a high-cost and inefficient solution because of the significant human effort required to obtain qualified syntax rules. To deal with this problem, we collected Vietnamese syntax rules from a Vietnamese language textbook. Moreover, we used several natural language processing (NLP) techniques to analyze Vietnamese shallow syntax and semantics for the QG task: sentence segmentation, word segmentation, part-of-speech tagging, chunking, dependency parsing, and named entity recognition. We used human evaluation to assess the credibility of our model: we manually generated questions from the corpus and then compared them with the automatically generated questions. The empirical evidence shows that our proposed technique performs well, in that the generated questions are very similar to those created by humans.

EVALUATION OF STATIC ANALYSIS TOOLS USED TO ASSESS SOFTWARE IMPORTANT TO NUCLEAR POWER PLANT SAFETY

  • OURGHANLIAN, ALAIN
    • Nuclear Engineering and Technology
    • /
    • Vol. 47, No. 2
    • /
    • pp.212-218
    • /
    • 2015
  • We describe a comparative analysis of different tools used to assess safety-critical software used in nuclear power plants. To enhance the credibility of safety assessments and to optimize safety justification costs, Électricité de France (EDF) investigates the use of methods and tools for source code semantic analysis, to obtain indisputable evidence and help assessors focus on the most critical issues. EDF has been using the PolySpace tool for more than 10 years. Currently, new industrial tools based on the same formal approach, Abstract Interpretation, are available. Practical experimentation with these new tools shows that the precision obtained on one of our shutdown systems software packages is substantially improved. In the first part of this article, we present the analysis principles of the tools used in our experimentation. In the second part, we present the main characteristics of protection-system software, and why these characteristics are well adapted for the new analysis tools. In the last part, we present an overview of the results and the limitations of the tools.

A Study on Gamification Consumer Perception Analysis Using Big Data

  • Se-won Jeon;Youn Ju Ahn;Gi-Hwan Ryu
    • International Journal of Advanced Culture Technology
    • /
    • Vol. 11, No. 3
    • /
    • pp.332-337
    • /
    • 2023
  • The purpose of this study was to analyze consumers' perceptions of gamification. Based on the analyzed data, we aim to systematically organize the concept, game elements, and mechanisms of gamification. Gamification is now readily found in medical care, corporate marketing, and education. This study collected keywords from the portal sites Naver, Daum, and Google for 2018 to 2023 using TEXTOM, a social media analysis tool. The data were analyzed using text mining, semantic network analysis, and CONCOR analysis. Based on the collected data, we examined the relevance and clusters related to gamification. Four clusters were identified: 'Awareness of Gamification', 'Gamification Program', 'Future Technology of Gamification', and 'Use of Gamification'. Through social media analysis, we seek to identify consumers' perceptions of gamification use and to check market and consumer perceptions in order to address shortcomings. On this basis, we intend to develop a plan for utilizing gamification.

언어 네트워크 분석을 통한 IFLA의 학교도서관 가이드라인 비교·분석에 관한 연구 (A Comparative Analysis Study of IFLA School Library Guidelines Using Semantic Network Analysis)

  • 이병기
    • 한국도서관정보학회지
    • /
    • Vol. 51, No. 2
    • /
    • pp.1-21
    • /
    • 2020
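  • The purpose of this study is to identify the linguistic meaning of the IFLA School Library Guidelines through semantic network analysis. The IFLA School Library Guidelines exist in a first edition published in 2002 and a second edition revised in 2015. This study analyzed the 2002 and 2015 editions from the perspective of semantic networks and compared them with each other. Keywords were extracted from the target texts, and a semantic network was constructed from their co-occurrence relations. Centrality (degree centrality, closeness centrality, and betweenness centrality) was analyzed on the co-occurrence network. In addition, topic modeling was performed using the LDA function of NetMiner 4.0. The main results are as follows. First, in terms of centrality, keywords such as 'Program, Teaching, Reading, Inquiry, Literacy, Media' rank higher in the 2015 edition than in the 2002 edition. Second, the keywords 'Inquiry' and 'Achievement', which did not appear in the top centrality list of the 2002 edition, newly appear in the degree centrality and closeness centrality lists of the 2015 edition. Third, the topic modeling results show that, compared with the 2002 edition, the 2015 edition gives greater weight to topics on school library services, teacher librarians' teaching and learning activities, media and information literacy education, and participation in the curriculum.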
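
The co-occurrence network and degree centrality mentioned in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the keyword lists are hypothetical, "co-occurrence" is taken to mean appearing in the same sentence, and the code does not reproduce NetMiner 4.0 or the study's actual procedure.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_network(sentences):
    """Link every pair of distinct keywords that share a sentence."""
    neighbors = defaultdict(set)
    for keywords in sentences:
        for a, b in combinations(sorted(set(keywords)), 2):
            neighbors[a].add(b)
            neighbors[b].add(a)
    return neighbors


def degree_centrality(neighbors):
    """Normalized degree: number of neighbors divided by (n - 1)."""
    n = len(neighbors)
    return {node: len(adj) / (n - 1) for node, adj in neighbors.items()}


# Hypothetical keyword lists, one per sentence (not data from the study).
sentences = [
    ["library", "program", "teaching"],
    ["library", "reading", "literacy"],
    ["library", "inquiry"],
]
centrality = degree_centrality(cooccurrence_network(sentences))
```

In this toy input, 'library' co-occurs with every other keyword and so receives the maximum normalized degree. Closeness and betweenness centrality would need shortest-path computations that are omitted here.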

시맨틱 웹 자원의 랭킹을 위한 알고리즘: 클래스중심 접근방법 (A Ranking Algorithm for Semantic Web Resources: A Class-oriented Approach)

  • 노상규;박현정;박진수
    • Asia pacific journal of information systems
    • /
    • Vol. 17, No. 4
    • /
    • pp.31-59
    • /
    • 2007
  • We frequently use search engines to find relevant information in the Web but still end up with too much information. In order to solve this problem of information overload, ranking algorithms have been applied to various domains. As more information will be available in the future, effectively and efficiently ranking search results will become more critical. In this paper, we propose a ranking algorithm for the Semantic Web resources, specifically RDF resources. Traditionally, the importance of a particular Web page is estimated based on the number of key words found in the page, which is subject to manipulation. In contrast, link analysis methods such as Google's PageRank capitalize on the information which is inherent in the link structure of the Web graph. PageRank considers a certain page highly important if it is referred to by many other pages. The degree of the importance also increases if the importance of the referring pages is high. Kleinberg's algorithm is another link-structure based ranking algorithm for Web pages. Unlike PageRank, Kleinberg's algorithm utilizes two kinds of scores: the authority score and the hub score. If a page has a high authority score, it is an authority on a given topic and many pages refer to it. A page with a high hub score links to many authoritative pages. As mentioned above, the link-structure based ranking method has been playing an essential role in World Wide Web(WWW), and nowadays, many people recognize the effectiveness and efficiency of it. On the other hand, as Resource Description Framework(RDF) data model forms the foundation of the Semantic Web, any information in the Semantic Web can be expressed with RDF graph, making the ranking algorithm for RDF knowledge bases greatly important. The RDF graph consists of nodes and directional links similar to the Web graph. As a result, the link-structure based ranking method seems to be highly applicable to ranking the Semantic Web resources. 
However, the information space of the Semantic Web is more complex than that of WWW. For instance, WWW can be considered as one huge class, i.e., a collection of Web pages, which has only a recursive property, i.e., a 'refers to' property corresponding to the hyperlinks. However, the Semantic Web encompasses various kinds of classes and properties, and consequently, ranking methods used in WWW should be modified to reflect the complexity of the information space in the Semantic Web. Previous research addressed the ranking problem of query results retrieved from RDF knowledge bases. Mukherjea and Bamba modified Kleinberg's algorithm in order to apply their algorithm to rank the Semantic Web resources. They defined the objectivity score and the subjectivity score of a resource, which correspond to the authority score and the hub score of Kleinberg's, respectively. They concentrated on the diversity of properties and introduced property weights to control the influence of a resource on another resource depending on the characteristic of the property linking the two resources. A node with a high objectivity score becomes the object of many RDF triples, and a node with a high subjectivity score becomes the subject of many RDF triples. They developed several kinds of Semantic Web systems in order to validate their technique and showed some experimental results verifying the applicability of their method to the Semantic Web. Despite their efforts, however, there remained some limitations which they reported in their paper. First, their algorithm is useful only when a Semantic Web system represents most of the knowledge pertaining to a certain domain. In other words, the ratio of links to nodes should be high, or overall resources should be described in detail, to a certain degree for their algorithm to properly work. 
Second, the Tightly-Knit Community (TKC) effect, the phenomenon in which pages that are less important but densely connected receive higher scores than pages that are more important but sparsely connected, remains problematic. Third, a resource may have a high score not because it is actually important, but simply because it is very common and consequently has many links pointing to it. In this paper, we examine such ranking problems from a novel perspective and propose a new algorithm that can solve the problems left by previous studies. Our proposed method is based on a class-oriented approach. In contrast to the predicate-oriented approach adopted by previous research, a user, under our approach, determines the weight of a property by comparing its significance relative to the other properties when evaluating the importance of resources in a specific class. This approach stems from the idea that most queries are meant to find resources belonging to the same class in the Semantic Web, which consists of many heterogeneous classes in RDF Schema. This approach closely reflects the way that people evaluate things in the real world, and will turn out to be superior to the predicate-oriented approach for the Semantic Web. Our proposed algorithm can resolve the TKC effect and can also shed light on other limitations identified in previous research. In addition, we propose two ways to incorporate data-type properties, which have not been employed even when they bear on resource importance. We designed an experiment to show the effectiveness of our proposed algorithm and the validity of the ranking results, which had not been attempted in previous research. We also conducted a comprehensive mathematical analysis, which was overlooked in previous research. The mathematical analysis enabled us to simplify the calculation procedure.
Finally, we summarize our experimental results and discuss further research issues.
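
As background for the hub and authority scores described in this abstract, the following is a minimal Python sketch of Kleinberg's algorithm on a toy directed graph. It is not the class-oriented algorithm the paper proposes, nor the Mukherjea and Bamba variant. The function name `hits`, the edge list, and the fixed iteration count are assumptions made for illustration.

```python
def hits(edges, iterations=50):
    """Iteratively compute hub and authority scores for a directed graph."""
    nodes = sorted({node for edge in edges for node in edge})
    hub = {node: 1.0 for node in nodes}
    auth = {node: 1.0 for node in nodes}
    for _ in range(iterations):
        # Authority score: sum of the hub scores of nodes linking to it.
        auth = {n: sum(hub[s] for s, t in edges if t == n) for n in nodes}
        norm = sum(v * v for v in auth.values()) ** 0.5 or 1.0
        auth = {n: v / norm for n, v in auth.items()}
        # Hub score: sum of the authority scores of nodes it links to.
        hub = {n: sum(auth[t] for s, t in edges if s == n) for n in nodes}
        norm = sum(v * v for v in hub.values()) ** 0.5 or 1.0
        hub = {n: v / norm for n, v in hub.items()}
    return hub, auth


# Hypothetical link structure: (source, target) pairs.
edges = [("a", "c"), ("b", "c"), ("d", "c"), ("a", "b")]
hub, auth = hits(edges)
```

Here node "c" is referred to by the most nodes and ends up as the top authority, while "a" links to the most authoritative nodes and ends up as the top hub. Each round rescales the scores to unit Euclidean length so they stay bounded.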

Big Data Analysis on the Perception of Home Training According to the Implementation of COVID-19 Social Distancing

  • Hyun-Chang Keum;Kyung-Won Byun
    • International Journal of Internet, Broadcasting and Communication
    • /
    • Vol. 15, No. 3
    • /
    • pp.211-218
    • /
    • 2023
  • With the implementation of COVID-19 social distancing, interest in and use of 'home training' have been increasing rapidly. The purpose of this study is therefore to identify perceptions of 'home training' through big data analysis of social media channels and to provide basic data to the related business sector. Big data were collected from news and social content provided on Naver and Google. The data cover three years from March 22, 2020, when COVID-19 distancing was implemented in Korea. The collected data comprised 4,000 Naver blog posts, 2,673 news items, 4,000 cafe posts, 3,989 Knowledge iN posts, and 953 Google news items. TF and TF-IDF were computed through text mining, and semantic network analysis was then conducted on 70 keywords; Textom and Ucinet were used for the social big data analysis, and NetDraw was used for visualization. In the text mining analysis, 'home training' had the highest TF, at 4,045 occurrences, followed by 'exercise', 'Homt', 'house', 'apparatus', 'recommendation', and 'diet'. By TF-IDF, the main keywords were 'exercise', 'apparatus', 'home', 'house', 'diet', 'recommendation', and 'mat'. Based on these results, 70 high-frequency keywords were extracted, and semantic indicators and centrality analysis were then conducted. Finally, CONCOR analysis yielded four clusters: a 'purchase cluster', an 'equipment cluster', a 'diet cluster', and an 'execute method cluster'. From these four clusters, basic data for the 'home training' business sector were presented, based on consumers' main perceptions of 'home training' and the semantic network analysis.
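
To make the difference between the TF and TF-IDF rankings concrete, here is a small Python sketch. The abstract does not state which weighting Textom applies, so the formula used here (raw count times log(N / document frequency), summed over documents) is an assumption, and the three toy documents are hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return corpus-wide TF counts and summed TF-IDF weights per term."""
    n_docs = len(docs)
    tf = Counter(term for doc in docs for term in doc)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    tfidf = Counter()
    for doc in docs:
        for term, count in Counter(doc).items():
            tfidf[term] += count * math.log(n_docs / df[term])
    return tf, tfidf


# Hypothetical tokenized documents (not data from the study).
docs = [
    ["home", "training", "exercise", "mat"],
    ["home", "training", "diet"],
    ["home", "training", "exercise", "exercise"],
]
tf, tfidf = tf_idf(docs)
```

Under this formula, a term that occurs in every document, such as "home" in the toy corpus, gets a TF-IDF weight of zero however often it occurs, while terms concentrated in fewer documents move up the ranking.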

한국학 연구 논문의 의미 구조 기반 메타데이터 연구 (A Study on the Metadata based on the Semantic Structure of the Korean Studies Research Articles)

  • 송민선;고영만
    • 한국도서관정보학회지
    • /
    • Vol. 46, No. 3
    • /
    • pp.277-299
    • /
    • 2015
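  • The purpose of this study is to systematically structure metadata for building a semantic search system for research articles in Korean studies, a field with interdisciplinary characteristics. To this end, we first compared and analyzed existing studies that organized the semantic structure of the content of scholarly materials. We then analyzed the semantic structure of research articles needed in Korean studies by categorizing, by type, the author keywords included in Korean studies research articles. Based on the results of both tasks, we derived and systematized 16 semantic-structure metadata elements for building a semantic search system for Korean studies research articles. This study systematically presents a methodology for composing semantic metadata that can reflect the scholarly knowledge actually needed by Korean studies researchers. It is significant in particular because, in examining the content characteristics of Korean studies research materials, it classified, analyzed, and incorporated the keywords assigned by actual researchers.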

의미 유사도를 활용한 Distant Supervision 기반의 트리플 생성 성능 향상 (Improving The Performance of Triple Generation Based on Distant Supervision By Using Semantic Similarity)

  • 윤희근;최수정;박성배
    • 정보과학회 논문지
    • /
    • Vol. 43, No. 6
    • /
    • pp.653-661
    • /
    • 2016
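  • Existing pattern-based triple generation systems suffer from erroneous patterns, produced as a consequence of the distant supervision assumption, that degrade triple generation performance. To solve this problem, this paper proposes a method that removes erroneous patterns by measuring pattern confidence based on the semantic similarity between patterns and properties. Semantic similarity is measured by combining word embedding, an unsupervised learning method, with a WordNet-based lexical semantic similarity measure. In addition, canonical correlation analysis and dictionary-based translation are used to resolve the language and vocabulary mismatch between Korean patterns and English properties. According to the experimental results, the proposed semantic-similarity-based pattern confidence measure generated a set of triples with 10% higher precision than the existing method, demonstrating improved triple generation performance.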
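
The idea of discarding patterns whose meaning is far from the target property can be sketched as follows. This is a simplified illustration, not the paper's method: it uses only cosine similarity over toy embedding vectors and leaves out the WordNet-based measure, canonical correlation analysis, and dictionary-based translation. The vectors, the pattern strings, the `filter_patterns` helper, and the 0.5 threshold are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def filter_patterns(pattern_vecs, property_vec, threshold=0.5):
    """Keep patterns whose similarity to the property meets the threshold."""
    kept = {}
    for pattern, vec in pattern_vecs.items():
        score = cosine(vec, property_vec)
        if score >= threshold:
            kept[pattern] = score
    return kept


# Toy embeddings (assumed values) for one property and two candidate patterns.
property_vec = [1.0, 0.0, 1.0]
pattern_vecs = {
    "was born in": [0.9, 0.1, 0.8],
    "visited": [0.0, 1.0, 0.1],
}
kept = filter_patterns(pattern_vecs, property_vec)
```

With these toy vectors, only "was born in" clears the threshold and "visited" is treated as an erroneous pattern. In practice the threshold would need tuning on held-out data.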