• Title/Summary/Keyword: 키워드 빈도 분석 (keyword frequency analysis)

Analysis of relationship between frequency of crime occurrence and frequency of web search (범죄 발생 빈도수와 웹 검색 빈도수의 관계 분석 연구)

  • Park, Jung-Min;Park, Koo-Rack;Chung, Young-Suk
    • Journal of the Korea Convergence Society / v.9 no.5 / pp.15-20 / 2018
  • In modern society, crime is one of the major social problems. Crime has a great impact not only on victims but also on those around them, so it is important to predict crimes before they occur and to prevent them. Various studies have been conducted to predict crime, and one of the most important factors is the frequency of crime occurrence, which is widely used as basic data for prediction. However, official crime frequency figures are announced about 2 years after the statistical processing period. In this paper, we propose a frequency analysis of crime-related keywords retrieved from the web as a way to indirectly grasp the frequency of crime occurrence. The relationship between the web search frequency of crime-related keywords and the frequency of actual crime occurrence was analyzed using the correlation coefficient.
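
The correlation step described in this abstract can be sketched as follows. This is a minimal illustration assuming the Pearson coefficient (the abstract only says "correlation coefficient"); the function name `pearson` and the yearly counts are hypothetical and are not taken from the paper.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)

# Hypothetical yearly counts (illustrative values only, not from the paper).
search_counts = [120, 150, 170, 160, 210]  # web searches for a crime keyword
crime_counts = [30, 36, 41, 40, 52]        # recorded occurrences of that crime
r = pearson(search_counts, crime_counts)
```

A value of `r` close to 1 would suggest that search frequency tracks occurrence frequency, although correlation alone does not establish that one predicts the other.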

Covid 19 News Data Analysis and Visualization

  • Hur, Tai-Sung;Hwang, In-Yong
    • Journal of the Korea Society of Computer and Information / v.27 no.4 / pp.37-43 / 2022
  • In this paper, we calculate word frequency by date and region using news data related to COVID-19 distributed over about 8 months from December 2019 to July 2020, and use the results to visualize the correlation with data on the status of COVID-19 patients. News data was collected from BIG KINDS, a news big data system operated by the Korea Press Foundation. The visualization system proposed in this paper shows the news frequency of the selected region compared to all regions, the main keywords of the selected region, the regions associated with each main keyword, and changes over time in the selected region. Through this visualization, the main keywords and the trends in confirmed COVID-19 cases can be identified for past events.
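
Counting word frequency by date and region, as described above, might look like the following sketch. It assumes the articles have already been tokenized; the function name `keyword_counts_by_group` and the sample records are hypothetical.

```python
from collections import Counter, defaultdict

def keyword_counts_by_group(articles):
    """Count words per (date, region) pair.

    `articles` is an iterable of (date, region, tokens) tuples; tokenization
    is assumed to have been done upstream (e.g. by a morphological analyzer).
    """
    counts = defaultdict(Counter)
    for date, region, tokens in articles:
        counts[(date, region)].update(tokens)
    return counts

# Hypothetical records, for illustration only.
articles = [
    ("2020-02-20", "Daegu", ["confirmed", "case", "hospital"]),
    ("2020-02-20", "Daegu", ["confirmed", "mask"]),
    ("2020-02-20", "Seoul", ["mask", "supply"]),
]
counts = keyword_counts_by_group(articles)
top = counts[("2020-02-20", "Daegu")].most_common(1)
```

Keeping one `Counter` per (date, region) key makes it straightforward to compare a selected region against the overall totals by summing the counters.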

Analysis of Research Trends in Tax Compliance using Topic Modeling (토픽모델링을 활용한 조세순응 연구 동향 분석)

  • Kang, Min-Jo;Baek, Pyoung-Gu
    • The Journal of the Korea Contents Association / v.22 no.1 / pp.99-115 / 2022
  • In this study, domestic academic journal papers on tax compliance, tax consciousness, and faithful tax payment (hereinafter referred to as "tax compliance"), a representative research topic in the field of tax science, were comprehensively analyzed from an interdisciplinary perspective. To achieve the research purpose, topic modeling was applied as a text mining technique. Following the flow of data collection, keyword preprocessing, and topic model analysis, potential research topics were derived from the tax compliance related keywords registered by the researchers in a total of 347 papers. The results of this study can be summarized as follows. First, in the keyword analysis, keywords such as tax investigation, tax avoidance, and honest tax reporting system were among the top 5 keywords by simple term frequency, and they were also among the top 5 by TF-IDF value, which considers the relative importance of keywords. In contrast, the keyword tax evasion ranked among the top keywords by TF-IDF value but was not prominent by simple term frequency. Second, eight potential research topics were derived through topic modeling: (1) tax fairness and suppression of tax offenses, (2) the ideology of the tax law and the validity of tax policies, (3) the principle of substance over form and guarantee of tax receivables, (4) tax compliance costs and tax administration services, (5) the tax return self-assessment system and tax experts, (6) tax climate and strategic tax behavior, (7) multifaceted tax behavior and differential compliance intentions, and (8) tax information systems and tax resource management. By examining the various perspectives on tax compliance from an interdisciplinary standpoint, the research captures past research trends on tax compliance and suggests directions for future research.
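
The contrast between simple term frequency and TF-IDF noted in this abstract can be illustrated with a small sketch. It assumes one common TF-IDF variant (corpus-wide term count times log(N / document frequency)); the paper's exact weighting is not stated, and the function name and toy keyword lists are hypothetical.

```python
import math
from collections import Counter

def tf_and_tfidf(docs):
    """Return (tf, tfidf) aggregated over a corpus of token lists.

    tf[t]    = total occurrences of term t across all documents
    tfidf[t] = tf[t] * log(N / df[t]), where df[t] is the number of
               documents containing t and N is the number of documents.
    """
    n_docs = len(docs)
    tf = Counter()
    df = Counter()
    for tokens in docs:
        tf.update(tokens)
        df.update(set(tokens))
    tfidf = {t: tf[t] * math.log(n_docs / df[t]) for t in tf}
    return tf, tfidf

# Toy keyword lists (illustrative only).
docs = [
    ["audit", "evasion", "evasion", "evasion"],
    ["audit", "avoidance"],
    ["audit", "reporting"],
    ["audit"],
]
tf, tfidf = tf_and_tfidf(docs)
top_by_tf = tf.most_common(1)[0][0]
top_by_tfidf = max(tfidf, key=tfidf.get)
```

In this toy corpus a term that appears in every document leads on raw frequency but receives a TF-IDF score of zero, while a term concentrated in one document rises to the top, which mirrors the kind of ranking difference the abstract reports.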

A Study of Themes and Trends in Research of Global Maritime Economics through Keyword Network Analysis (키워드 네트워크 분석을 통한 세계 해운경제의 연구 주제와 동향에 대한 연구)

  • Jhang, Se-Eun;Lee, Su-Ho
    • Journal of Korea Port Economic Association / v.32 no.1 / pp.79-95 / 2016
  • This study identifies themes and trends in maritime economics and logistics by examining 303 papers published in international journals from 2000 to 2014 using keyword network analysis. Network analysis can be used because the collected data follow Zipf's law and the power law. Using degree centrality and betweenness centrality, we find the important keywords in each five-year period and determine the importance of shared keywords. To further explain keyword centralities, we devised a Delta-C algorithm to show the trends of keywords over time. We found that degree centrality is useful for identifying important research themes in each period because it is mainly concerned with the number of connections. On the other hand, betweenness centrality is useful for determining the unique themes that emerge in each specific period.
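
Degree centrality on a keyword co-occurrence network, which the abstract ties to "the number of connections", can be sketched as below. The graph construction rule (two keywords are linked when they appear in the same paper), the function names, and the sample keyword lists are assumptions for illustration; the Delta-C algorithm is not reproduced here because the abstract does not define it.

```python
from collections import defaultdict
from itertools import combinations

def build_cooccurrence_graph(keyword_lists):
    """Undirected graph: two keywords are linked if they share a paper."""
    graph = defaultdict(set)
    for keywords in keyword_lists:
        for a, b in combinations(sorted(set(keywords)), 2):
            graph[a].add(b)
            graph[b].add(a)
    return graph

def degree_centrality(graph):
    """Degree of each node divided by the maximum possible degree (n - 1)."""
    n = len(graph)
    if n < 2:
        return {node: 0.0 for node in graph}
    return {node: len(neighbors) / (n - 1) for node, neighbors in graph.items()}

# Hypothetical author keywords for three papers.
papers = [
    ["port", "logistics", "container"],
    ["port", "shipping"],
    ["shipping", "freight"],
]
centrality = degree_centrality(build_cooccurrence_graph(papers))
```

Note that a keyword appearing only in single-keyword papers never enters this graph, so a real analysis would need to decide how to treat isolated keywords.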

A Bibliometric Study on Sustainable Development Goals (SDGs) Research Trends in Entrepreneurship (키워드 네트워크 분석을 활용한 창업분야 지속가능발전목표(SDGs) 연구동향 분석)

  • An, Seung Kwon;Choi, Min Jung
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship / v.18 no.2 / pp.21-34 / 2023
  • The purpose of this study is to examine the extent of Sustainable Development Goals (SDGs)-related research in the field of entrepreneurship globally since the adoption of the SDGs at the UN General Assembly, and to compare international and domestic research trends in order to determine the direction of SDGs-related research in entrepreneurship in Korea. Using three databases (Web of Science (WoS), KCI, and DBpia), SDGs-related studies in entrepreneurship were extracted with specific search terms. After data cleaning, a total of 356 international studies and 4 Korean studies were analyzed. Due to the limited number of domestic studies, the research trends were examined by conducting frequency analysis and keyword network analysis on international studies alone. Frequency analysis revealed that SDGs research in entrepreneurship primarily focused on sustainability-related terms and was conducted in conjunction with business models, innovation, entrepreneurship education, and strategies. Furthermore, yearly frequency analysis demonstrated an expansion of topics to encompass research on entrepreneurship and SDGs policies, the roles and capabilities of female entrepreneurs in SDGs implementation, energy start-ups and SDGs, directions for implementing SDGs in business schools and SDGs education, indicators for SDGs implementation and evaluation, and technologies for sustainability. The keyword network analysis identified central topics such as business, sustainability, SDGs, innovation, entrepreneurship, business models, and education, with research areas extending to entrepreneurship ecosystems, change and strategy, ethics, and climate. This study is significant in that it establishes a foundation for SDGs research in entrepreneurship, which is currently an underexplored area in Korea, by presenting emerging research trends related to SDGs in entrepreneurship.

Extracting User-Specific Advertising Keywords Based on Textual Data Mining from KakaoTalk (카카오톡에서의 텍스트 데이터 마이닝 기반의 사용자별 적합 광고 키워드 도출)

  • Yerim Jeon;Dayeong So;Jimin Lee;Eunjin (Jinny) Jo;Jihoon Moon
    • Proceedings of the Korea Information Processing Society Conference / 2023.05a / pp.368-369 / 2023
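  • Ad recommendation based on conversation data is attracting attention as an important technology in advertising marketing for delivering customized ads and maximizing marketing effectiveness. In this paper, we analyze text data generated in chat rooms of KakaoTalk, a mobile instant messenger, and propose appropriate advertising keywords for each conversation topic. To this end, we subdivide conversations by topic into beauty, food and beverage, and commerce, perform text preprocessing using Okt from KoNLPy, extract keyword frequencies, and present word clouds. In addition, we subdivide conversation topics based on Latent Dirichlet Allocation (LDA) and then analyze the conversation keywords per topic through labeling. Experimental results show that the conversation topics could be divided into online shopping, hair, beauty care, and food, and that appropriate advertising keywords could be proposed by using Word2Vec to derive keywords similar to specific words from the top keywords of each topic.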
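
The final step described above, deriving keywords similar to a given word from embedding vectors, can be sketched with plain cosine similarity. This is an illustration only: the paper uses a trained Word2Vec model, whereas the three-dimensional vectors below are made up, and the function names `cosine` and `most_similar` are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def most_similar(word, vectors, top_n=2):
    """Rank every other word by cosine similarity to `word`."""
    target = vectors[word]
    scored = [
        (other, cosine(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]

# Made-up toy embeddings; a real pipeline would load trained word vectors.
toy_vectors = {
    "perm": [0.9, 0.1, 0.0],
    "hair dye": [0.8, 0.2, 0.1],
    "coupon": [0.1, 0.9, 0.2],
    "delivery": [0.0, 0.8, 0.6],
}
suggestions = most_similar("perm", toy_vectors, top_n=1)
```

With these toy vectors the nearest neighbor of "perm" is "hair dye", the sort of related term that could be offered as an advertising keyword for a hair-related conversation topic.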

Analyzing Trends in Research Data Using Keyword Network Analysis: Focusing on SCOPUS DB (키워드 네트워크 분석을 활용한 연구데이터 분야 동향 분석 - SCOPUS DB를 중심으로 -)

  • Hyojin Geum;Suntae Kim
    • Journal of the Korean BIBLIA Society for library and Information Science / v.35 no.2 / pp.85-108 / 2024
  • This study aimed to analyze research trends in academic papers on research data from 2010 to 2024 in order to understand the state of research in this area over the past 15 years. To achieve this goal, keyword frequency analysis and network centrality analysis were conducted on 14,921 academic articles indexed in the Scopus DB. The keyword network analysis was performed with UCINET and divided by publication date into a first period (2010-2014), a second period (2015-2019), and a third period (2020-2024). It revealed the main keywords studied regardless of period, the keywords that attracted attention in each period, and the keywords that received less attention over time. The most active topic in research data-related research over the last 15 years was found to be data sharing, and most of the keywords with high Degree Centrality also have high Betweenness Centrality. The results of this study can be used as a basis for suggesting future research directions in the field of research data in Korea.
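
Betweenness Centrality, which the abstract compares with Degree Centrality, can be computed for a small unweighted keyword network with Brandes' algorithm. The sketch below is a plain-Python illustration, not a reproduction of the UCINET analysis; the function name and the toy network are hypothetical, and every neighbor is assumed to appear as a key in `graph`.

```python
from collections import deque

def betweenness_centrality(graph):
    """Unnormalized betweenness for an undirected, unweighted graph (Brandes).

    `graph` maps each node to the set of its neighbors.
    """
    score = {v: 0.0 for v in graph}
    for source in graph:
        # Breadth-first search that counts shortest paths from `source`.
        order = []
        preds = {v: [] for v in graph}
        sigma = {v: 0 for v in graph}
        dist = {v: -1 for v in graph}
        sigma[source] = 1
        dist[source] = 0
        queue = deque([source])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        # Back-propagate dependencies from the farthest nodes inward.
        delta = {v: 0.0 for v in graph}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != source:
                score[w] += delta[w]
    # Each undirected pair was counted once from each endpoint.
    return {v: s / 2 for v, s in score.items()}

# Toy keyword network (illustrative only).
graph = {
    "data sharing": {"repository", "open science", "metadata"},
    "repository": {"data sharing"},
    "open science": {"data sharing"},
    "metadata": {"data sharing", "fair"},
    "fair": {"metadata"},
}
bc = betweenness_centrality(graph)
```

In this toy network the best-connected keyword also lies on the most shortest paths, which is the pattern the abstract describes for most high-degree keywords.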

Dynamic recomposition of document category using user intention tree (사용자 의도 트리를 사용한 동적 카테고리 재구성)

  • Kim, Hyo-Lae;Jang, Young-Cheol;Lee, Chang-Hoon
    • The KIPS Transactions:PartB / v.8B no.6 / pp.657-668 / 2001
  • It is difficult to classify web documents according to the user's exact intention because existing document classification systems are based on word frequency counts for a single keyword. To address this shortcoming, we use a keyword, a query, and domain knowledge. As in explanation-based learning, the query is first analyzed with knowledge-based information, and structured user intention information is then extracted. We use this intention tree as user information and constraints in the course of existing word-frequency-based document classification, so that web documents can be classified in closer agreement with the user's intention. In classifying documents, structured user intention information helps retain documents and information that would be lost in a system using only single-keyword information. Our hybrid approach, which integrates user intention information with existing statistical and probabilistic methods, decides the direction and range of document categories more efficiently than the existing word frequency approach.

A Study on Keywords Extraction based on Semantic Analysis of Document (문서의 의미론적 분석에 기반한 키워드 추출에 관한 연구)

  • Song, Min-Kyu;Bae, Il-Ju;Lee, Soo-Hong;Park, Ji-Hyung
    • Proceedings of the Korea Intelligent Information System Society Conference / 2007.11a / pp.586-591 / 2007
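  • Systems that handle documents, such as knowledge management systems, information retrieval systems, and digital library systems, need to structure and store documents. The first priority in extracting the information contained in a document is selecting keywords. The most widely used algorithm in previous studies is the TF-IDF method, which combines TF (Term Frequency), a measure of how often a word is used, with IDF (Inverse Document Frequency). However, TF-IDF has the limitation that it does not reflect the meaning of the document. To compensate for this, this study uses three methods. The first is a document-structural technique that checks the position of words within the document and the use of words in specific parts such as the introduction and conclusion. The second designates specific phrases, such as emphatic and comparative expressions, as a controlled vocabulary. The last is a linguistic technique that analyzes the dictionary meaning of words and uses it as metadata. In this way, semantic analysis of the document is also performed during keyword extraction, which can improve the efficiency of keyword extraction.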
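
The document-structural technique mentioned above, giving extra weight to words used in parts such as the introduction and conclusion, could be sketched as a section-weighted term count. The weights, section names, and function name below are hypothetical; the abstract does not give concrete values or a scoring formula.

```python
from collections import Counter

# Hypothetical section weights; the abstract does not publish concrete values.
SECTION_WEIGHTS = {"introduction": 2.0, "body": 1.0, "conclusion": 2.0}

def structure_weighted_scores(sections, weights=SECTION_WEIGHTS):
    """Sum term counts per section, scaled by that section's weight."""
    scores = Counter()
    for name, tokens in sections.items():
        weight = weights.get(name, 1.0)
        for token, count in Counter(tokens).items():
            scores[token] += weight * count
    return scores

# Toy document split into sections (illustrative only).
sections = {
    "introduction": ["keyword", "extraction"],
    "body": ["corpus", "corpus", "corpus", "keyword"],
    "conclusion": ["extraction"],
}
scores = structure_weighted_scores(sections)
best = scores.most_common(1)[0][0]
```

Here "corpus" has the highest raw count, yet "extraction" scores highest because it appears in both the introduction and the conclusion, which shows how positional weighting can reorder a purely frequency-based ranking.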

A Method for Compound Noun Extraction to Improve Accuracy of Keyword Analysis of Social Big Data

  • Kim, Hyeon Gyu
    • Journal of the Korea Society of Computer and Information / v.26 no.8 / pp.55-63 / 2021
  • Since social big data often includes new words or proper nouns, statistical morphological analysis methods, which are based on the frequency of occurrence of each word, have been widely used to process them properly. However, these methods do not properly recognize compound nouns, which lowers the accuracy of keyword extraction. This paper presents a method to extract compound nouns in keyword analysis of social big data. The proposed method creates a candidate group of compound nouns by combining the words obtained through the morphological analysis step, and extracts compound nouns by examining their frequency of appearance in a given review. Two algorithms are proposed according to the method of constructing the candidate group, and the performance of each algorithm is expressed and compared using formulas. The comparison is verified through experiments on real data collected online, and the results also show that the proposed method is suitable for real-time processing.
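
The general idea described above, combining adjacent words from the morphological analysis step into candidates and keeping those that appear often enough, can be sketched as follows. This is not either of the paper's two algorithms, which the abstract does not detail; the function name, the thresholds `min_count` and `max_len`, and the sample tokens are assumptions.

```python
from collections import Counter

def extract_compound_nouns(token_lists, min_count=2, max_len=2):
    """Join adjacent tokens into candidates and keep the frequent ones.

    `token_lists` holds the per-sentence output of a morphological analyzer
    (assumed to contain nouns only). `min_count` and `max_len` are
    illustrative thresholds, not values from the paper.
    """
    candidates = Counter()
    for tokens in token_lists:
        for size in range(2, max_len + 1):
            for start in range(len(tokens) - size + 1):
                candidates[" ".join(tokens[start:start + size])] += 1
    return {c: n for c, n in candidates.items() if n >= min_count}

# Hypothetical analyzer output for three review sentences.
reviews = [
    ["battery", "life", "great"],
    ["battery", "life", "short"],
    ["great", "screen"],
]
compounds = extract_compound_nouns(reviews)
```

Only "battery life" survives the frequency filter in this toy input; raising `max_len` would also generate longer candidates at the cost of more counting work.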