• Title/Summary/Keyword: Frequency based Text Analysis

Search Result 237, Processing Time 0.023 seconds

Comparisons of Airline Service Quality Using Social Network Analysis (소셜 네트워크 분석을 활용한 항공서비스 품질 비교)

  • Park, Ju-Hyeon;Lee, Hyun Cheol
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.42 no.3
    • /
    • pp.116-130
    • /
    • 2019
  • This study investigates passenger-authored online reviews of airline services using social network analysis to compare the differences in customer perceptions between full service carriers (FSCs) and low cost carriers (LCCs). While deriving words with high frequency and weight matrix based on the text analysis for FSCs and LCCs respectively, we analyze the semantic network (betweenness centrality, eigenvector centrality, degree centrality) to compare the degree of connection between words in online reviews of each airline types using the social network analysis. Then we compare the words with high frequency and the connection degree to gauge their influences in the network. Moreover, we group eight clusters for FSCs and LCCs using the convergence of iterated correlations (CONCOR) analysis. Using the resultant clusters, we match the clusters to dimensions of two types of service quality models ($Gr{\ddot{o}}nroos$, Brady & Cronin (B&C)) to compare the airline service quality and determine which model fits better. From the semantic network analysis, FSCs are mainly related to inflight service words and LCCs are primarily related to the ground service words. The CONCOR analysis reveals that FSCs are mainly related to the dimension of outcome quality in $Gr{\ddot{o}}nroos$ model, but evenly distributed to the dimensions in B&C model. On the other hand, LCCs are primarily related to the dimensions of process quality in both $Gr{\ddot{o}}nroos$ and B&C models. From the CONCOR analysis, we also observe that B&C model fits better than $Gr{\ddot{o}}nroos$ model for the airline service because the former model can capture passenger perceptions more specifically than the latter model can.

Derivation of Green Infrastructure Planning Factors for Reducing Particulate Matter - Using Text Mining - (미세먼지 저감을 위한 그린인프라 계획요소 도출 - 텍스트 마이닝을 활용하여 -)

  • Seok, Youngsun;Song, Kihwan;Han, Hyojoo;Lee, Junga
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.49 no.5
    • /
    • pp.79-96
    • /
    • 2021
  • Green infrastructure planning represents landscape planning measures to reduce particulate matter. This study aimed to derive factors that may be used in planning green infrastructure for particulate matter reduction using text mining techniques. A range of analyses were carried out by focusing on keywords such as 'particulate matter reduction plan' and 'green infrastructure planning elements'. The analyses included Term Frequency-Inverse Document Frequency (TF-IDF) analysis, centrality analysis, related word analysis, and topic modeling analysis. These analyses were carried out via text mining by collecting information on previous related research, policy reports, and laws. Initially, TF-IDF analysis results were used to classify major keywords relating to particulate matter and green infrastructure into three groups: (1) environmental issues (e.g., particulate matter, environment, carbon, and atmosphere), target spaces (e.g., urban, park, and local green space), and application methods (e.g., analysis, planning, evaluation, development, ecological aspect, policy management, technology, and resilience). Second, the centrality analysis results were found to be similar to those of TF-IDF; it was confirmed that the central connectors to the major keywords were 'Green New Deal' and 'Vacant land'. The results from the analysis of related words verified that planning green infrastructure for particulate matter reduction required planning forests and ventilation corridors. Additionally, moisture must be considered for microclimate control. It was also confirmed that utilizing vacant space, establishing mixed forests, introducing particulate matter reduction technology, and understanding the system may be important for the effective planning of green infrastructure. Topic analysis was used to classify the planning elements of green infrastructure based on ecological, technological, and social functions. The planning elements of ecological function were classified into morphological (e.g., urban forest, green space, wall greening) and functional aspects (e.g., climate control, carbon storage and absorption, provision of habitats, and biodiversity for wildlife). The planning elements of technical function were classified into various themes, including the disaster prevention functions of green infrastructure, buffer effects, stormwater management, water purification, and energy reduction. The planning elements of the social function were classified into themes such as community function, improving the health of users, and scenery improvement. These results suggest that green infrastructure planning for particulate matter reduction requires approaches related to key concepts, such as resilience and sustainability. In particular, there is a need to apply green infrastructure planning elements in order to reduce exposure to particulate matter.

The Prediction of Cryptocurrency on Using Text Mining and Deep Learning Techniques : Comparison of Korean and USA Market (텍스트 마이닝과 딥러닝을 활용한 암호화폐 가격 예측 : 한국과 미국시장 비교)

  • Won, Jonggwan;Hong, Taeho
    • Knowledge Management Research
    • /
    • v.22 no.2
    • /
    • pp.1-17
    • /
    • 2021
  • In this study, we predicted the bitcoin prices of Bithum and Coinbase, a leading exchange in Korea and USA, using ARIMA and Recurrent Neural Networks(RNNs). And we used news articles from each country to suggest a separated RNN model. The suggested model identifies the datasets based on the changing trend of prices in the training data, and then applies time series prediction technique(RNNs) to create multiple models. Then we used daily news data to create a term-based dictionary for each trend change point. We explored trend change points in the test data using the daily news keyword data of testset and term-based dictionary, and apply a matching model to produce prediction results. With this approach we obtained higher accuracy than the model which predicted price by applying just time series prediction technique. This study presents that the limitations of the time series prediction techniques could be overcome by exploring trend change points using news data and various time series prediction techniques with text mining techniques could be applied to improve the performance of the model in the further research.

The Characteristics and Improvement Directions of Regional Climate Change Adaptation Policies in accordance with Damage Cases (지자체 기후변화 적응 대책 특성 및 개선 방향)

  • Ahn, Yoonjung;Kang, Youngeun;Park, Chang Sug;Kim, Ho Gul
    • Journal of Environmental Impact Assessment
    • /
    • v.25 no.4
    • /
    • pp.296-306
    • /
    • 2016
  • There is a growing interest in establishing a regional climate change adaptation policy as the climate change impact in the region and local scale increases. This study focused on the analysis of 32 regions on its characteristics of local climate change adaptation plans. First, statistic program R was used for conducting cluster analysis based on the frequency and budgets of adaptation plan. Further, we analyzed damage frequency from newspapers regarding climate change impacts in eight categories which were caused by extreme weather events on 2,565 cases for 24 years. Lastly, the characteristics of climate change adaptation plan was compared with damage frequency patterns for evaluating the adequacy of climate change adaptation plan on each cluster. Four different clusters were created by cluster analysis. Most clusters clearly have their own characteristics on certain sectors. There was a high frequency of damage in 'disaster' and 'health' sectors. Climate change adaptation plan and budget also invested a lot on those sectors. However, when comparing the relative rate among regional governments, there was a difference between types of damage and climate change adaptation plan. We assumed that the difference could come from that each region established their adaptation plans based on not only the frequency of damage, but vulnerability assessment, and expert opinions as well. The result of study could contribute to policy making of climate change adaptation plan.

A Study on Space Consumption Behavior of Contemporary Consumers -Focusing on Analysis of Social Media Big Data- (현대 소비자의 공간소비행동에 관한 연구 -소셜미디어 데이터 분석을 중심으로-)

  • Ahn, Suh Young;Koh, Ae-Ran
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.44 no.5
    • /
    • pp.1019-1035
    • /
    • 2020
  • This study examines the millennial generation, who express themselves and share information on social media after experiencing constantly changing 'hot places' (places of interest) in contemporary cities, with the goal of analyzing space consumption behaviors. Data were collected via an Instagram crawler application developed with Python 3.4 administered to 19,262 posts using the term 'hot places' from November 1 and December 15, 2019. Issues were derived from a text mining technique using Textom 2.0; in addition, semantic network analysis using Ucinet6 and the NetDraw program were also conducted. The results are as follows. First, a frequency analysis of keywords for hot places indicated words frequently found in nouns were related to food, local names, SNS and timing. Words related to positive emotions felt in experience, and words related to behavior in hot places appeared in predicate. Based on importance, communication is the most important keyword and influenced all issues. Second, the results of visualization of semantic network analysis revealed four categories in the scope of the definition of "hot place": (1) culinary exploration, (2) atmosphere of cafés, (3) happy daily life of 'me' expressed in images, (4) emotional photos.

Social Media Bigdata Analysis Based on Information Security Keyword Using Text Mining (텍스트마이닝을 활용한 정보보호 키워드 기반 소셜미디어 빅데이터 분석)

  • Chung, JinMyeong;Park, YoungHo
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.27 no.5
    • /
    • pp.37-48
    • /
    • 2022
  • With development of Digital Technology, social issues are communicated through digital-based platform such as SNS and form public opinion. This study attempted to analyze big data from Twitter, a world-renowned social network service, and find out the public opinion. After collecting Twitter data based on 14 keywords for 1 year in 2021, analyzed the term-frequency and relationship among keyword documents with pearson correlation coefficient using Data-mining Technology. Furthermore, the 6 main topics that on the center of information security field in 2021 were derived through topic modeling using the LDA(Latent Dirichlet Allocation) technique. These results are expected to be used as basic data especially finding key agenda when establishing strategies for the next step related industries or establishing government policies.

Analysis of the influence of food-related social issues on corporate management performance using a portal search index

  • Yoon, Chaebeen;Hong, Seungjee;Kim, Sounghun
    • Korean Journal of Agricultural Science
    • /
    • v.46 no.4
    • /
    • pp.955-969
    • /
    • 2019
  • Analyzing on-line consumer responses is directly related to the management performance of food companies. Therefore, this study collected and analyzed data from an on-line portal site created by consumers about food companies with issues and examined the relationships between the data and the management performance. Through this process, we identified consumers' awareness of these companies obtained from big data analysis and analyzed the relationship between the results and the sales and stock prices of the companies through a time-series graph and correlation analysis. The results of this study were as follows. First, the result of the text mining analysis suggests that consumers respond more sensitively to negative issues than to positive issues. Second, the emotional analysis showed that companies' ethics issues (Enterprise 3 and 4) have a higher level of emotional continuity than that of food safety issues. It can be interpreted that the problem of ethical management has great influence on consumers' purchasing behavior. Finally, In the case of all negative food issues, the number of word frequency and emotional scores showed opposite trends. As a result of the correlation analysis, there was a correlation between word frequency and stock price in the case of all negative food issues and also between emotional scores and stock price. Recently, studies using big data analytics have been conducted in various fields. Therefore, based on this research, it is expected that studies using big data analytics will be done in the agricultural field.

Research Trend Analysis in Fashion Design Studies in Korea using Topic Modeling (토픽모델링을 이용한 국내 패션디자인 연구동향 분석)

  • Jang, Namkyung;Kim, Min-Jeong
    • Journal of Digital Convergence
    • /
    • v.15 no.6
    • /
    • pp.415-423
    • /
    • 2017
  • This study explored research trends by investigating articles published in the Journal of Korean Society of Fashion Design from 2001 through 2015. English key words and abstracts were analyzed using text mining and topic modeling techniques. The findings are as followings. By the text mining technique, 183 core terms, appeared more than 30 times, were derived from 7137 words used in total 338 articles' key words and abstracts. 'Fashion' and 'design' showed the highest frequency rate. After that, the well-received topic modeling technique, LDA, was applied to the collected data sets. Several distinct sub-research domains strongly tied with the previous fashion design field, except for topics such as fashion brand marketing and digital technology, were extracted. It was observed that there are the growing and declining trends in the research topics. Based on findings, implication, limitation, and future research questions were presented.

Social Perception of the Invention Education Center as seen in Big Data (빅데이터 분석을 통한 발명 교육 센터에 대한 사회적 인식)

  • Lee, Eun-Sang
    • Journal of the Korea Convergence Society
    • /
    • v.13 no.1
    • /
    • pp.71-80
    • /
    • 2022
  • The purpose of this study is to analyze the social perception of invention education center using big data analysis method. For this purpose, data from January 2014 to September 2021 were collected using the Textom website as a keyword searched for 'invention+education+center' in blogs, cafes, and news channels of NAVER and DAUM website. The collected data was refined using the Textom website, and text mining analysis and semantic network analysis were performed by the Textom website, Ucinet 6, and Netdraw programs. The collected data were subjected to a primary and secondary refinement process and 60 keywords were selected based on the word frequency. The selected key words were converted into matrix data and analyzed by semantic network analysis. As a result of text mining analysis, it was confirmed that 'student', 'operation', 'Korea Invention Promotion Association', and 'Korean Intellectual Property Office' were the meaningful keywords. As a result of semantic network analysis, five clusters could be identified: 'educational operation', 'invention contest', 'education process and progress', 'recruitment and support for business', and 'supervision and selection institution'. Through this study, it was possible to confirm various meaningful social perceptions of the general public in relation to invention education center on the internet. The results of this study will be used as basic data that provides meaningful implications for researchers and policy makers studying for invention education.

Analysis of trends in domestic research on addiction using text mining and CONCOR (텍스트마이닝과 CONCOR을 활용한 중독 관련 국내 연구 동향 분석)

  • Sol-Ji Lee;Ki-Hyok Youn
    • Journal of Internet of Things and Convergence
    • /
    • v.9 no.6
    • /
    • pp.99-110
    • /
    • 2023
  • This study analyzed 817 articles published in Korean professional journals over the past three years, from 2020 to 2022, using text mining techniques to identify trends in addiction research in Korea and explore development directions. The analysis results are as follows. First, as a result of the analysis of the top keywords, online addiction studies such as smartphones, games, Internet, gambling, and relationship addiction were prominent as the top keywords. Second, as a result of TF-IDF analysis, many addiction studies related to behavioral addiction such as smartphones, games, the Internet, and work addiction have been conducted over the past three years, and in particular, there are many studies on addiction problems such as smartphones, games, and the Internet that have not yet been clinically diagnosed as addiction problems. This is the same as the result of word frequency analysis, and it can be interpreted that recent studies have been remarkably conducted on more diverse addiction problems. Third, the 2-gram analysis shows that words that mainly correspond to behavioral addiction, such as smartphones, games, and the Internet, appear side by side with the keyword addiction, and among them, words paired with smartphones are mentioned a lot in research papers and are being studied. Fourth, as a result of the CONCOR analysis, there were five clusters: a study on universal addiction issues such as alcohol use disorders and the Internet, a study of recovery on drug and gambling addiction, a study on mobile devices and media addiction, a study on the latest trends related to behavioral addiction, and other addiction issues. Finally, based on the results of this study, a direction for future addiction-related research was suggested.