• Title/Summary/Keyword: Text Mining Analysis

Search Result 1,217, Processing Time 0.025 seconds

A Study on the Analysis of Related Information through the Establishment of the National Core Technology Network: Focused on Display Technology (국가핵심기술 관계망 구축을 통한 연관정보 분석연구: 디스플레이 기술을 중심으로)

  • Pak, Se Hee;Yoon, Won Seok;Chang, Hang Bae
    • The Journal of Society for e-Business Studies
    • /
    • v.26 no.2
    • /
    • pp.123-141
    • /
    • 2021
  • As the dependence of technology on the economic structure increases, the importance of National Core Technology is increasing. However, due to the nature of the technology itself, it is difficult to determine the scope of the technology to be protected because the scope of the relation is abstract and information disclosure is limited due to the nature of the National Core Technology. To solve this problem, we propose the most appropriate literature type and method of analysis to distinguish important technologies related to National Core Technology. We conducted a pilot test to apply TF-IDF, and LDA topic modeling, two techniques of text mining analysis for big data analysis, to four types of literature (news, papers, reports, patents) collected with National Core Technology keywords in the field of Display industry. As a result, applying LDA theme modeling to patent data are highly relevant to National Core Technology. Important technologies related to the front and rear industries of displays, including OLEDs and microLEDs, were identified, and the results were visualized as networks to clarify the scope of important technologies associated with National Core Technology. Throughout this study, we have clarified the ambiguity of the scope of association of technologies and overcome the limited information disclosure characteristics of national core technologies.

Big Data Analysis of Social Media on Gangwon-do Tourism (강원도 관광에 대한 소셜 미디어 빅데이터 분석)

  • JIN, TIANCHENG;Jeong, Eun-Hee
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.14 no.3
    • /
    • pp.193-200
    • /
    • 2021
  • Recently, posts and opinions on tourist attractions are actively shared on social media. These social big data provide meaningful information to identify objective images of tourist destinations recognized by consumers. Therefore, an in-depth understanding of the tourist image is possible by analyzing these big data on tourism. The study is to analyze destination images in Gangwon-do using big data from social media. It is wanted to understand destination images in Gangwon-do using semantic network analysis and then provided suggestions on how to enhance image to secure differentiated competitiveness as a destination for tourists. According to the frequency analysis results, as tourism in Gangwon-do, Sokcho, Gangneung, and Yangyang were mentioned at a high level in that order, and the purpose of travel was restaurant tour, gourmet food, family trip, vacation, and experience. In particular, it was found that they preferred day trips, weekends, and experiences. Four suggestions were made based on the results. First, it is necessary to develop various types of hotels, accommodation facilities and experience-oriented tour packages. Second, it is necessary to develop a day-to-day travel package that utilizes proximity to the Seoul metropolitan area. Third, it is necessary to promote traditional restaurants and local food. Finally, it is necessary to develop tourist package suitable for healing and family travel. Through this research, the destination image of Gangwon-do was identified and a tourism marketing strategy was presented to improve competitiveness. It also provided a theoretical basis for the use of the big data of tourism consumers in the field of tourism business.

Research Trends and Knowledge Structure of Digital Transformation in Fashion (패션 영역에서 디지털 전환 관련 연구동향 및 지식구조)

  • Choi, Yeong-Hyeon;Jeong, Jinha;Lee, Kyu-Hye
    • Journal of Digital Convergence
    • /
    • v.19 no.3
    • /
    • pp.319-329
    • /
    • 2021
  • This study aims to investigate Korean fashion-related research trends and knowledge structures on digital transformation through information-based approaches. Accordingly, we first identified the current status of the relevant research in Korean academic literature by year and journal; subsequently, we derived key research topics through network analysis, and then analyzed major research trends and knowledge structures by time. From 2010 to 2020, we collected 159 studies published on Korean academic platforms, cleansed data through Python 3.7, and measured centrality and network implementation through NodeXL 1.0.1. The results are as follows: first, related research has been actively conducted since 2016, mainly concentrated in clothing and art areas. Second, the online platform, AR/VR, appeared as the most frequently mentioned topic, and consumer psychological analysis, marketing strategy suggestion, and case analysis were used as the main research methods. Through clustering, major research contents for each sub-major of clothing were derived. Third, major subject by period was considered, which has, over time, changed from consumer-centered research to strategy suggestion, and design development research of platforms or services. This study contributes to enhancing insight into the fashion field on digital transformation, and can be used as a basic research to design research on related topics.

A Study on the Analysis of Centrality and Brokerage Measures of Journal Citation Network - Focusing on KCI Journals - (학술지 인용 네트워크의 중심성과 중개성 분석에 관한 연구 - KCI 등재 학술지를 중심으로 -)

  • Lee, Soo-Sang
    • Journal of Korean Library and Information Science Society
    • /
    • v.50 no.4
    • /
    • pp.77-100
    • /
    • 2019
  • This study aims to analyze and compare centrality and brokerage measures of journal citation network focusing on textmining research. The analytic sample was 193 academic articles collected from 136 KCI journals published in 2018. The journal citation network was constructed based on citation relations. The characteristics, centralities, and brokerages of network was analyzed. The journal citation network consisted 136 nodes and 413 links with directed and weight. According to the five types of centrality(out-degree, in-degree, out-closeness, in-closeness, betweenness), journals of social sciences, engineering, and interdisciplinary research showed higher centrality. Social sciences, engineering and interdisciplinary research journals also showed higher brokerages as a result of brokerage analysis which identify five types of brokerage roles(coordinator, gatekeeper, representative, consultant, liaison). The centralities and brokerages of journals are positively correlated. This study suggested how to construct journal citation network from the articles focusing on certain topics. This was meaningful study in terms of conducting brokerage analysis and comparing it with centrality in the journal citation network.

Analysis of Research Trends in SIAM Journal on Applied Mathematics Using Topic Modeling (토픽모델링을 활용한 SIAM Journal on Applied Mathematics의 연구 동향 분석)

  • Kim, Sung-Yeun
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.7
    • /
    • pp.607-615
    • /
    • 2020
  • The purpose of this study was to analyze the research status and trends related to the industrial mathematics based on text mining techniques with a sample of 4910 papers collected in the SIAM Journal on Applied Mathematics from 1970 to 2019. The R program was used to collect titles, abstracts, and key words from the papers and to analyze topic modeling techniques based on LDA algorithm. As a result of the coherence score on the collected papers, 20 topics were determined optimally using the Gibbs sampling methods. The main results were as follows. First, studies on industrial mathematics were conducted in a variety of mathematics fields, including computational mathematics, geometry, mathematical modeling, topology, discrete mathematics, probability and statistics, with a focus on analysis and algebra. Second, 5 hot topics (mathematical biology, nonlinear partial differential equation, discrete mathematics, statistics, topology) and 1 cold topic (probability theory) were found based on time series regression analysis. Third, among the fields that were not reflected in the 2015 revised mathematics curriculum, numeral system, matrix, vector in space, and complex numbers were extracted as the contents to be covered in the high school mathematical curriculum. Finally, this study suggested strategies to activate industrial mathematics in Korea, described the study limitations, and proposed directions for future research.

Product Feature Extraction and Rating Distribution Using User Reviews (사용자 리뷰를 이용한 상품 특징 추출 및 평점 분배)

  • Son, Soobin;Chun, Jonghoon
    • The Journal of Society for e-Business Studies
    • /
    • v.22 no.1
    • /
    • pp.65-87
    • /
    • 2017
  • We propose a method to analyze the user reviews and ratings of the products in the online shopping mall and automatically extracts the features of the products to determine the characteristics of a product. By judging whether a rating is given by a specific feature of a product, our method distributes the score to each feature. Conventional methods force users to wastes time reading overflowing number of reviews and ratings to decide whether to buy the product or not. Moreover, it is difficult to grasp the merits and demerits of the product, because of the way reviews and ratings are provided. It is structured in a way that it is impossible to decide which rating is given to the which characteristics of the product. Therefore, in this paper, to resolve this problem, we propose a method to automatically extract the feature of the product from the user review and distribute the score to appropriate characteristics of the product by calculating the rating of each feature from the overall rating. proposed method collects product reviews and ratings, conducts morphological analysis, and extracts features and emotional words of the products. In addition, a method for determining the polarity of a sentence in which the feature appears is given a weight value for each feature. results of the experiment and the questionnaires comparing the existing methods show the usefulness of the proposed method. We also validates the results by comparing the analysis conducted by the product review experts.

An analysis of the signaling effect of FOMC statements (미 연준 통화정책방향 의결문의 시그널링 효과 분석)

  • Woo, Shinwook;Chang, Youngjae
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.3
    • /
    • pp.321-334
    • /
    • 2020
  • The US Federal Reserve (Fed) has decided to cut interest rates. When we look at the expression of the FOMC statements at the time of policy change period we can understand that Fed has been communicating with markets through a change of word selection. However, there is a criticism that the method of analyzing the expression of the decision sentence through the context can be subjective and limited in qualitative analysis. In this paper, we evaluate the signaling effect of FOMC statements based on previous research. We analyze decision making characteristics from the viewpoint of text mining and try to predict future policy trend changes by capturing changes in expressions between statements. For this purpose, a decision tree and neural network models are used. As a result of the analysis, it can be judged that the discrepancy indicators between statements could be used to predict the policy change in the future and that the US Federal Reserve has systematically implemented policy signaling through the policy statements.

A Study on Image Recognition of local Currency Consumers Using Big Data (빅데이터를 활용한 지역화폐 소비자 이미지 인식에 관한 연구)

  • Kim, Myung-hee;Ryu, Ki-hwan
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.11-17
    • /
    • 2022
  • Currently, the income and funds of the local economy are flowing out to the metropolitan area, and talented people, the driving force for regional development, also gather in the metropolitan area, and the local economy is facing a serious crisis. Local currency is issued by local governments and is a currency with auxiliary and complementary functions that can be used only within the area concerned. In order to revitalize the local economy, as local governments have focused their attention on the introduction of local currency, studies on the issuance and use of local currency are continuously being conducted. In this study, by using big data from data materials such as portals and SNS, the consumer image of local currency issued in local governments was identified through big data analysis, and based on the research results, the issuance and operation of local currency was conducted. The purpose is to present implications for The results of this study are as follows. First, by inducing local consumption through the policy issuance of local currency, it is showing the effect of increasing the economic income of the region. Second, local governments are exerting efforts to revitalize the economy and establish a virtuous cycle system for the local economy by issuing and distributing local currency. Third, the introduction of blockchain technology shows the stable operation of local currency. With academic significance, it was possible to grasp the changed appearance and effect of local currency through big data analysis and the policy direction of local currency.

Analysis of Research Trends in Tax Compliance using Topic Modeling (토픽모델링을 활용한 조세순응 연구 동향 분석)

  • Kang, Min-Jo;Baek, Pyoung-Gu
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.1
    • /
    • pp.99-115
    • /
    • 2022
  • In this study, domestic academic journal papers on tax compliance, tax consciousness, and faithful tax payment (hereinafter referred to as "tax compliance") were comprehensively analyzed from an interdisciplinary perspective as a representative research topic in the field of tax science. To achieve the research purpose, topic modeling technique was applied as part of text mining. In the flow of data collection-keyword preprocessing-topic model analysis, potential research topics were presented from tax compliance related keywords registered by the researcher in a total of 347 papers. The results of this study can be summarized as follows. First, in the keyword analysis, keywords such as tax investigation, tax avoidance, and honest tax reporting system were included in the top 5 keywords based on simple term-frequency, and in the TF-IDF value considering the relative importance of keywords, they were also included in the top 5 keywords. On the other hand, the keyword, tax evasion, was included in the top keyword based on the TF-IDF value, whereas it was not highlighted in the simple term-frequency. Second, eight potential research topics were derived through topic modeling. The topics covered are (1) tax fairness and suppression of tax offenses, (2) the ideology of the tax law and the validity of tax policies, (3) the principle of substance over form and guarantee of tax receivables (4) tax compliance costs and tax administration services, (5) the tax returns self- assessment system and tax experts, (6) tax climate and strategic tax behavior, (7) multifaceted tax behavior and differential compliance intentions, (8) tax information system and tax resource management. The research comprehensively looked at the various perspectives on the tax compliance from an interdisciplinary perspective, thereby comprehensively grasping past research trends on tax compliance and suggesting the direction of future research.

Digital Transformation: Using D.N.A.(Data, Network, AI) Keywords Generalized DMR Analysis (디지털 전환: D.N.A.(Data, Network, AI) 키워드를 활용한 토픽 모델링)

  • An, Sehwan;Ko, Kangwook;Kim, Youngmin
    • Knowledge Management Research
    • /
    • v.23 no.3
    • /
    • pp.129-152
    • /
    • 2022
  • As a key infrastructure for digital transformation, the spread of data, network, artificial intelligence (D.N.A.) fields and the emergence of promising industries are laying the groundwork for active digital innovation throughout the economy. In this study, by applying the text mining methodology, major topics were derived by using the abstract, publication year, and research field of the study corresponding to the SCIE, SSCI, and A&HCI indexes of the WoS database as input variables. First, main keywords were identified through TF and TF-IDF analysis based on word appearance frequency, and then topic modeling was performed using g-DMR. With the advantage of the topic model that can utilize various types of variables as meta information, it was possible to properly explore the meaning beyond simply deriving a topic. According to the analysis results, topics such as business intelligence, manufacturing production systems, service value creation, telemedicine, and digital education were identified as major research topics in digital transformation. To summarize the results of topic modeling, 1) research on business intelligence has been actively conducted in all areas after COVID-19, and 2) issues such as intelligent manufacturing solutions and metaverses have emerged in the manufacturing field. It has been confirmed that the topic of production systems is receiving attention once again. Finally, 3) Although the topic itself can be viewed separately in terms of technology and service, it was found that it is undesirable to interpret it separately because a number of studies comprehensively deal with various services applied by combining the relevant technologies.