• Title/Abstract/Keywords: Text frequency analysis

453 search results (processing time: 0.025 seconds)

Association Modeling on Keyword and Abstract Data in Korean Port Research

  • Yoon, Hee-Young;Kwak, Il-Youp
    • Journal of Korea Trade / Vol. 24, No. 5 / pp. 71-86 / 2020
  • Purpose - This study investigates research trends by searching for English keywords and abstracts in 1,511 Korean journal articles in the Korea Citation Index from the 2002-2019 period using the term "Port." The study aims to lay the foundation for a more balanced development of port research. Design/methodology - Using abstract and keyword data, we perform frequency analysis and word embedding (Word2vec). A t-SNE plot shows the main keywords extracted using the TextRank algorithm. To analyze which words were used in what context in our two nine-year subperiods (2002-2010 and 2010-2019), we use Scattertext and scaled F-scores. Findings - First, during the 18-year study period, port research has developed through the convergence of diverse academic fields, covering 102 subject areas and 219 journals. Second, our frequency analysis of 4,431 keywords in 1,511 papers shows that the words "Port" (60 times), "Port Competitiveness" (33 times), and "Port Authority" (29 times), among others, are attractive to most researchers. Third, a word embedding analysis identifies the words highly correlated with the top eight keywords and visually shows four different subject clusters in a t-SNE plot. Fourth, we use Scattertext to compare words used in the two research sub-periods. Originality/value - This study is the first to apply abstract and keyword analysis and various text mining techniques to Korean journal articles in port research and thus has important implications. Further in-depth studies should collect a greater variety of textual data and analyze and compare port studies from different countries.
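
The keyword frequency analysis described in this abstract can be illustrated with a minimal sketch. The input format (one keyword list per paper), the function name `keyword_frequencies`, and the toy data are assumptions made for illustration; they are not taken from the study, which does not state how its counts were computed.

```python
from collections import Counter


def keyword_frequencies(records):
    """Count how often each keyword appears across papers.

    `records` is a list of keyword lists, one list per paper
    (a hypothetical input format chosen for this sketch).
    """
    counts = Counter()
    for keywords in records:
        # Normalize case and surrounding whitespace so that
        # "Port " and "port" are counted as the same keyword.
        counts.update(k.strip().lower() for k in keywords if k.strip())
    return counts


# Toy data for illustration only; not the study's data.
papers = [
    ["Port", "Port Competitiveness"],
    ["port", "Port Authority"],
    ["Port Competitiveness", "Logistics"],
]
freq = keyword_frequencies(papers)
for keyword, count in freq.most_common(3):
    print(keyword, count)
```

Whether keywords should be case-folded or merged across spelling variants is a design choice that changes the counts, so any real analysis would need to state its normalization rule.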

Topic Analysis of Foreign Policy and Economic Cooperation: A Text Mining Approach

  • Jiaen Li;Youngjun Choi
    • Journal of Korea Trade / Vol. 26, No. 8 / pp. 37-57 / 2022
  • Purpose - International diplomacy is key for the cohesive economic growth of countries around the world. This study aims to identify the major topics discussed and make sense of word pairs used in sentences by Chinese senior leaders during their diplomatic visits. It also compares the key topics addressed during diplomatic visits to developed and developing countries. Design/methodology - We employed three methods: word frequency, co-word, and semantic network analysis. The text data were crawled from state and official visit news released by the Ministry of Foreign Affairs of the People's Republic of China regarding diplomatic visits undertaken from 2015-2019. Findings - The results show that economic and diplomatic relations featured most prominently during state and official visits. The discussion topics were classified according to nine centrality keywords, which were most central to the structure and had the maximum influence in China. Moreover, the results showed that China's diplomatic issues and strategies differ between developed and developing countries. The topics mentioned in developing countries were more diverse. Originality/value - Our study proposes an effective approach to identify key topics in Chinese diplomatic talks with other countries. Moreover, it shows that discussion topics differ for developed and developing countries. The findings of this research can help researchers conduct empirical studies on diplomatic relationships and extend our method to other countries. Additionally, they can significantly help key policymakers gain insights into negotiations and establish a good diplomatic relationship with China.
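
Co-word analysis, as named in this abstract, counts how often two words appear together in the same unit of text. The sketch below assumes that the unit is a sentence and that sentences are already tokenized; the function name `coword_counts` and the toy sentences are hypothetical and are not drawn from the study's corpus.

```python
from collections import Counter
from itertools import combinations


def coword_counts(sentences):
    """Count unordered word pairs that co-occur within one sentence."""
    pairs = Counter()
    for tokens in sentences:
        # De-duplicate and sort the tokens so each pair is counted
        # once per sentence and always stored in the same order.
        unique = sorted(set(tokens))
        pairs.update(combinations(unique, 2))
    return pairs


# Toy, pre-tokenized sentences (illustrative only).
sentences = [
    ["trade", "cooperation", "investment"],
    ["trade", "cooperation"],
    ["security", "cooperation"],
]
pairs = coword_counts(sentences)
for pair, count in pairs.most_common(3):
    print(pair, count)
```

The resulting pair counts can serve as edge weights in a semantic network, where centrality measures are then computed on the word graph.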

Exploring the Key Factors that Lead to Intentions to Use AI Fashion Curation Services through Big Data Analysis

  • Shin, Eunjung;Hwang, Ha Sung
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 16, No. 2 / pp. 676-691 / 2022
  • An increasing number of companies in the fashion industry are using AI curation services. The purpose of this study is to investigate perceptions of and intentions to use AI fashion curation services among customers by using text mining. To accomplish this goal, we collected a total of 34,190 online posts from two Korean portals, Naver and Daum. We conducted frequency analysis to identify the most frequently mentioned keywords using Textom. The analysis extracted "various," "good," "many," "right," and "new" at the highest frequency, indicating that consumers had positive perceptions of AI fashion curation services. In addition, we conducted a semantic network analysis with the top-50 most frequently used keywords, classifying customers' perceptions of AI fashion curation services into three groups: shopping, platform, and business profit. We also identified the factors that boost continuous use intentions: usability, usefulness, reliability, enjoyment, and personalization. We conclude this paper by discussing the theoretical and practical implications of these findings.

Material as a Key Element of Fashion Trend in 2010~2019 - Text Mining Analysis -

  • 장남경;김민정
    • 한국의류산업학회지 / Vol. 22, No. 5 / pp. 551-560 / 2020
  • Because fashion design responds quickly and sensitively to change, accurate forecasting of upcoming fashion trends is an important factor in the performance of fashion product planning. This study analyzed the major phenomena of fashion trends by introducing text mining and a big data analysis method. The research questions were as follows. What is the key term of the 2010SS~2019FW fashion trend? What are the terms that are highly relevant to the key trend term by year? Which terms relevant to the key trend term have shown high frequency in news articles during the same period? Data were collected from the 2010SS~2019FW Pre-Trend data of the leading trend information company in Korea and from 45,038 articles retrieved with the query "fashion+material" from the News Big Data System. Frequency, correlation coefficient, coefficient of variation, and mapping analyses were performed using R-3.5.1. Results showed that the fashion trend information was reflected in the consumer market. The term with the highest frequency in the 2010SS~2019FW fashion trend information was material. In the trend information, the terms most relevant to material by year were comfort, compact, look, casual, blend, functional, cotton, processing, metal and functional. In the news articles, functional, comfort, sports, leather, casual, eco-friendly, classic, padding, culture, and high-quality showed high frequency. Functional was the only fashion material term derived every year for 10 years. This study helps expand the scope and methods of fashion design research and improves the information analysis and forecasting capabilities of the fashion industry.
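
The coefficient of variation mentioned in this abstract is the standard deviation divided by the mean, which makes the year-to-year stability of term frequencies comparable across terms of different overall frequency. The study used R; the sketch below is in Python and assumes the sample standard deviation (the default of R's `sd()`), which the abstract does not confirm. The function name and the yearly counts are hypothetical.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Return the sample standard deviation divided by the mean."""
    avg = mean(values)
    if avg == 0:
        raise ValueError("coefficient of variation is undefined for a zero mean")
    return stdev(values) / avg


# Hypothetical yearly frequencies of one term (not the study's data).
yearly_counts = [10, 12, 8, 10]
cv = coefficient_of_variation(yearly_counts)
print(round(cv, 4))
```

A low value indicates a term that appears at a steady rate every year, while a high value indicates a term whose frequency fluctuates.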

Analysis of Mission, Vision and Core Values in Korean Tertiary General Hospitals Through Text Mining

  • 이지훈
    • 한국병원경영학회지 / Vol. 28, No. 2 / pp. 32-43 / 2023
  • Purposes: This research was conducted to identify the main features and trends of the mission, vision and core values of Korean tertiary general hospitals by using text mining. Methodology: For the study, 45 missions, 112 visions and 190 core values were collected from the homepages of 45 tertiary general hospitals in 2022 and analyzed using word frequency analysis and keyword co-occurrence analysis. Findings: In the tertiary general hospitals' missions, high-frequency words include 'health', 'humanity', 'medical treatment', 'education', 'research', 'happiness', 'love', 'best', and 'spirit', and with these words the missions mainly express the idea of contributing to humanity's health and happiness. In the case of vision, high-frequency words are 'hospital', 'medical treatment', 'research', 'lead', 'trust', 'centered', 'patient', 'best', and 'future'. These words reflect the definition and characteristics of a vision, such as the ideal future organization, goals and targets. According to the keyword co-occurrence analysis, the visions include content such as 'high-tech medical treatment', 'special care for patients', 'leading education and research', 'the highest trust with customer', and 'creative talents training'. Lastly, the high-frequency word pairs in core values are 'social distribution', 'innovation pursuit', and 'cooperation and harmony', which define standards of behavior for organizations. Practical Implication: To correct the problems of vision, mission and core values identified in the findings, firstly, Korean tertiary general hospitals need to use words in their mission that explain the organization's identity and differentiate it from others. Secondly, considering the strengthening role of hospitals in their communities and the importance of members in organizations, it is necessary to establish a vision that takes the community and members into account so that the vision can be put into effect. Thirdly, because there are no specific guidelines for establishing mission, vision and core values for healthcare organizations, the concepts and results of this research could be used when other organizations establish their mission, vision and core values.

Analysis of Adverse Drug Reaction Reports using Text Mining

  • 김현희;유기연
    • 한국임상약학회지 / Vol. 27, No. 4 / pp. 221-227 / 2017
  • Background: As the personalized healthcare industry has attracted much attention, big data analysis of healthcare data has become essential. Because much healthcare data, such as product labeling, biomedical literature and social media data, is unstructured, extracting meaningful information from unstructured text data is becoming important. In particular, text mining of adverse drug reaction (ADR) reports can provide signal information to predict and detect adverse drug reactions. There has been no study on text analysis of expert opinion in the Korea Adverse Event Reporting System (KAERS) database in Korea. Methods: The expert opinion text of the KAERS database provided by the Korea Institute of Drug Safety & Risk Management (KIDS-KD) was analyzed. To understand the whole text, word frequency analysis was performed, and to find important keywords in the text, TF-IDF weight analysis was performed. Keywords related to the important keywords were also identified by calculating correlation coefficients. Results: Among a total of 90,522 reports, 120 insulin ADR reports and 858 tramadol ADR reports were analyzed. ADRs such as dizziness, headache, vomiting, dyspepsia, and shock were ranked in order in the insulin data, while ADR symptoms such as vomiting, "어지러움" (the Korean word for dizziness), dizziness, dyspepsia and constipation were ranked in order in the tramadol data as the most frequently used keywords. Conclusion: Using text mining of the expert opinion in KIDS-KD, frequently mentioned ADRs and medications are easily recovered. Text mining in ADR research can play an important role in detecting signal information and predicting ADRs.
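
The TF-IDF weighting mentioned in this abstract can be sketched as follows. The abstract does not specify which TF-IDF variant was used, so this example assumes a common textbook form: term frequency as a within-document proportion, multiplied by the natural log of (number of documents / document frequency). The function name `tf_idf` and the toy reports are hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Compute TF-IDF weights for each tokenized document.

    Assumed variant: tf = count / document length,
    idf = ln(number of documents / document frequency).
    """
    n_docs = len(docs)
    doc_freq = Counter()
    for tokens in docs:
        # Count each term once per document.
        doc_freq.update(set(tokens))

    weights = []
    for tokens in docs:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Toy, pre-tokenized reports (illustrative only; not KAERS data).
reports = [
    ["dizziness", "vomiting", "dizziness"],
    ["vomiting", "headache"],
    ["vomiting", "constipation"],
]
weights = tf_idf(reports)
print(weights[0])
```

In this toy example a term that appears in every report receives a weight of zero, which is why TF-IDF highlights distinctive keywords rather than merely frequent ones.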

A Comparative Study between Ubiquitous City Comprehensive Plan and Ubiquitous City Plan - Focusing on U-Service Plan

  • 유지송;정다운;이미숙;민경주
    • Spatial Information Research / Vol. 23, No. 2 / pp. 83-93 / 2015
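  • In local governments that have recently established U-City plans, U-services are being implemented mainly as facility and city management services, while citizen-customized U-services remain only at the planning stage. This study therefore compared the U-service content of the U-City Comprehensive Plan and the U-City Plans through network text analysis and word frequency analysis, and presented implications for providing citizen-customized U-services in the future. The U-service plan content was extracted from the first and second U-City Comprehensive Plans and from the U-City Plans of four local governments to derive the main words, and network text analysis and word frequency analysis were performed on the derived words. Based on the results, the study drew implications for future U-City Comprehensive Plans, such as adding services that reflect the characteristics of each local government, providing policy and financial support, and developing citizen-customized U-services in diverse fields that reflect citizens' needs. These measures are also expected to raise citizens' awareness of U-City.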

Analysis of Unstructured Data on Detecting of New Drug Indication of Atorvastatin

  • 정휘수;강길원;최웅;박종혁;신광수;서영성
    • Journal of health informatics and statistics / Vol. 43, No. 4 / pp. 329-335 / 2018
  • Objectives: In recent years, there has been an increased need for a way to extract desired information from many medical publications at once. This study was conducted to confirm the usefulness of unstructured data analysis of previously published medical literature in searching for new indications. Methods: New indications were searched for through text mining, network analysis, and topic modeling analysis using 5,057 articles on atorvastatin, a treatment for hyperlipidemia, published from 1990 to 2017. Results: A total of 273 keywords were extracted. In the frequency analysis from text mining and in the network analysis, the existing indications of atorvastatin ranked at the top. The novel indications identified by Term Frequency-Inverse Document Frequency (TF-IDF) were atrial fibrillation, heart failure, breast cancer, rheumatoid arthritis, combined hyperlipidemia, arrhythmias, multiple sclerosis, non-alcoholic fatty liver disease, contrast-induced acute kidney injury and prostate cancer. Conclusions: Unstructured data analysis for discovering new indications from massive medical literature is expected to be used in the drug repositioning industry.