• Title/Summary/Keyword: Quantitative Text Analysis

Search Result 147, Processing Time 0.022 seconds

Who is to Blame for Infection?: Emotional Discourse in Editorial Articles during the Emerging Infectious Diseases Epidemics in Korea (감염병과 감정: 신종감염병에 관한 대중매체의 메시지와 공포, 분노 감정)

  • Kim, Jongwoo;Kang, Jiwoong
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.12
    • /
    • pp.816-827
    • /
    • 2021
  • The purpose of this study is to understand the relationship between fear and anger emotions in the discourse produced by the media during the period of major emerging infectious diseases (SARS, Swine Flu, MERS, and COVID-19) that occurred since 2000 in Korea. The researcher collected editorial articles of the major daily newspaper after a significant epidemic of new infectious diseases and analyzed them using the Extended Parallel Processing Model (EPPM) and text mining techniques. In all epidemic times, fear appears stronger than anger, but the smaller the fear, the greater the risk control message is produced. In detail, fear emerges strongly in the discourse of the risk of infectious diseases or the economic crisis. Anger appears strong when the government's quarantine failures, groups where group infections occurred, and concealing information about infectious diseases. In this process, anger is strongly expressed against the factors that threaten the safety of society. Anger is also an emotion that can justify strong quarantine, but it can be the basis for discourse on minority hate. In this respect, anger is a two-sided emotion, so it must be handled carefully in the media.

Identification of Employee Experience Factors and Their Influence on Job Satisfaction (직원경험 요인 파악 및 직무 만족도에 끼치는 영향력 분석)

  • Juhyeon Lee;So-Hyun Lee;Hee-Woong Kim
    • Information Systems Review
    • /
    • v.25 no.2
    • /
    • pp.181-203
    • /
    • 2023
  • With the fierce competition of companies for the attraction of outstanding individuals, job satisfaction of employees has been of importance. In this circumstance, many companies try to invest in job satisfaction improvement by finding employees' everyday experiences and difficulties. However, due to a lack of understanding of the employee experience, their investments are not paying off. This study examined the relationship between employee experience and job satisfaction using employee reviews and company ratings from Glassdoor, one of the largest employee communities worldwide. We use text mining techniques such as K-means clustering and LDA topic-based sentiment analysis to extract key experience factors by job level, and DistilBERT sentiment analysis to measure the sentiment score of each employee experience factor. The drawn employee experience factors and each sentiment score were analyzed quantitatively, and thereby relations between each employee experience factor and job satisfaction were analyzed. As a result, this study found that there is a significant difference between the workplace experiences of managers and general employees. In addition, employee experiences that affect job satisfaction also differed between positions, such as customer relationship and autonomy, which did not affect the satisfaction of managers. This study used text mining and quantitative modeling method based on theory of work adjustment so as to find and verify main factors of employee experience, and thus expanded research literature. In addition, the results of this study are applicable to the personnel management strategy for improving employees' job satisfaction, and are expected to improve corporate productivity ultimately.

An Analysis on the Vocabulary in the English-Translation Version of Donguibogam Using the Corpus-based Analysis (코퍼스 분석방법을 이용한 『동의보감(東醫寶鑑)』 영역본의 어휘 분석)

  • Jung, Ji-Hun;Kim, Dong-Ryul;Kim, Do-Hoon
    • The Journal of Korean Medical History
    • /
    • v.28 no.2
    • /
    • pp.37-45
    • /
    • 2015
  • Objectives : A quantitative analysis on the vocabulary in the English translation version of Donguibogam. Methods : This study quantitatively analyzed the English-translated texts of Donguibogam with the Corpus-based analysis, and compared the quantitative results analyzing the texts of original Donguibogam. Results : As the results from conducting the corpus analysis on the English-translation version of Donguibogam, it was found that the number of total words (Token) was about 1,207,376, and the all types of used words were about 20.495 and the TTR (Type/Token Rate) was 1.69. The accumulation rate reaching to the high-ranking 1000 words was 83.54%, and the accumulation rate reaching to the high-ranking 2000 words was 90.82%. As the words having the high-ranking frequency, the function words like 'the, and of, is' mainly appeared, and for the content words, the words like 'randix, qi, rhizoma and water' were appeared in multi frequencies. As the results from comparing them with the corpus analysis results of original version of Donguibogam, it was found that the TTR was higher in the English translation version than that of original version. The compositions of function words and contents words having high-ranking frequencies were similar between the English translation version and the original version of Donguibogam. The both versions were also similar in that their statements in the parts of 'Remedies' and 'Acupuncture' showed higher composition rate of contents words than the rate of function words. Conclusions : The vocabulary in the English translation version of Donguibogam showed that this book was a book keeping the complete form of sentence and an Korean medical book at the same time. Meanwhile, the English translation version of Donguibogam had some problems like the unification of vocabulary due to several translators, and the incomplete delivery of word's meanings from the Chinese character-culture area to the English-culture area, and these problems are considered as the matters to be considered in a work translating Korean old medical books in English.

An Analysis of Domestic Newspaper Articles on 5.18 using the Bigkinds System (빅카인즈를 활용한 5·18 관련 국내 기사 분석 연구)

  • Juhyeon Park;Hyunji Park;Youngbum Gim
    • Journal of the Korean Society for information Management
    • /
    • v.41 no.1
    • /
    • pp.107-132
    • /
    • 2024
  • This study attempted to analyze newspaper articles related to May 18 through frequency analysis and network analysis using news data related to May 18 for about 30 years from 1990 to 2022 at the Korea Press Foundation's Big Kinds. Specifically, quantitative change trends were examined by analyzing the amount of articles by period and region, and the connection structure between major keywords by the regime was explored through network analysis by regime using co-appearance keywords. As a result of the analysis, it was found that 2019 had the largest amount of coverage, which had many social issues in time, and the Jeolla-do region had the largest amount of coverage in the region. And as a result of network analysis, there were differences in words related to May 18 in news data according to the perception and policy of the regime toward May 18. As a result of synthesizing the analysis of May 18 news data, it was confirmed that May 18 was becoming a democratic movement over time regardless of region, but at the same time, the distortion of May 18 was not resolved.

A Corpus Analysis of British-American Children's Adventure Novels: Treasure Island (영미 아동 모험 소설에 관한 코퍼스 분석 연구: 『보물섬』을 중심으로)

  • Choi, Eunsaem;Jung, Chae Kwan
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.1
    • /
    • pp.333-342
    • /
    • 2021
  • In this study, we analyzed the vocabulary, lemmas, keywords, and n-grams in 『Treasure Island』 to identify certain linguistic features of this British-American children's adventure novel. The current study found that, contrary to the popular claim that frequently-used words are important and essential to a story, the set of frequently-used words in 『Treasure Island』 were mostly function words and proper nouns that were not directly related to the plot found in 『Treasure Island』. We also ascertained that a list of keywords using a statistical method making use of a corpus program was not good enough to surmise the story of 『Treasure Island』. However, we managed to extract 30 keywords through the first quantitative keyword analysis and then a second qualitative keyword analysis. We also carried out a series of n-gram analyses and were able to discover lexical bundles that were preferred and frequently used by the author of 『Treasure Island』. We hope that the results of this study will help spread this knowledge among British-American children's literature as well as to further put forward corpus stylistic theory.

Establishment of Strategy for Management of Technology Using Data Mining Technique (데이터 마이닝을 통한 기술경영 전략 수립에 관한 연구)

  • Lee, Junseok;Lee, Joonhyuck;Kim, Gabjo;Park, Sangsung;Jang, Dongsik
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.25 no.2
    • /
    • pp.126-132
    • /
    • 2015
  • Technology forecasting is about understanding a status of a specific technology in the future, based on the current data of the technology. It is useful when planning technology management strategies. These days, it is common for countries, companies, and researchers to establish R&D directions and strategies by utilizing experts' opinions. However, this qualitative method of technology forecasting is costly and time consuming since it requires to collect a variety of opinions and analysis from many experts. In order to deal with these limitations, quantitative method of technology forecasting is being studied to secure objective forecast result and help R&D decision making process. This paper suggests a methodology of technology forecasting based on quantitative analysis. The methodology consists of data collection, principal component analysis, and technology forecasting by logistic regression, which is one of the data mining techniques. In this research, patent documents related to autonomous vehicle are collected. Then, the texts from patent documents are extracted by text mining technique to construct an appropriate form for analysis. After principal component analysis, logistic regression is performed by using principal component score. On the basis of this result, it is possible to analyze R&D development situation and technology forecasting.

The Major Technology Distribution Analysis of Domestic Defense Companies in Naval Ships based on Patent Information Data (함정 분야 방산업체 주요 기술 분포 분석)

  • Kim, Jang-Eun
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.7
    • /
    • pp.625-637
    • /
    • 2020
  • In order to decide the naval ship weapon system acquisition for national policy/market economy activities, the decision makers can determine policy based on current technology level/concentration/utilization. For this, the decision makers apply the major common technology field analysis using patents data. As a method for collecting patent data, we can collect patent data of domestic mobile carriers through the Korea Intellectual Property Rights Information System of Korean Intellectual Property Office. As a result, we collected 14,964 patents/352 International Patent Classification(IPC) types. Based on these data, we performed three analysis processes (SNA, PCA, ARIMA, Text Mining) and got each result from extracting 58 IPC types of SNA and 7 IPC types of PCA. Based on the analysis results, we have confirmed that 7 IPC(B63B, H01M, F03D, B01D, H02K, B23K, H01H) types are the Major Common Technology Distribution of domestic Defense Companies.

Bibliometric Analysis on Studies of Korean Intangible Cultural Property Dance : Focusing on Events in the Seoul Area (한국무형문화재 춤 연구의 계량서지학적 분석 : 서울지역 종목을 중심으로)

  • Yoo, Ji-Young;Kim, Jee-Young;Baek, Hyun-Soon
    • Journal of Korea Entertainment Industry Association
    • /
    • v.13 no.4
    • /
    • pp.139-147
    • /
    • 2019
  • This study conducted bibliometric analysis on studies of Korean intangible cultural heritage dance in the Seoul area and it aimed to figure out the tendencies of that research. For this, a list of Korean intangible cultural heritage dance studies of 24 events was collected and analysis was conducted through the big data analysis solution of TEXTOM. Text mining was used as the method for analysis. Research results showed that first, most of the studies were conducted on the Bongsan Talchum and studies on teaching and learning methods were especially actively conducted. On the other hand, there were not many studies on Gut and the need for research vitalization in that area was confirmed. Second, in studies on Cheoyongmu events, the term'contemporary Cheoyongmu' was used frequently. This can be considered the use of meaningful terms with regard to intangible cultural heritage dance that has changed throughout history. At this, the vitalization of research that can reveal the typicality of dance is demanded from research of other events as well. Third, there was a notable amount of research that compared and analyzed dance styles with regard to the Munmyoilmu. This was seen as the result of discussions in the Korean dancing world regarding archetypal dance styles expanding into academic discussions. Therefore, it was revealed that academic discussions can connect to academic outcomes apart from whether the matter is right or wrong.

Evaluating Perceived Smartness of Product from Consumer's Point of View: The Concept and Measurement

  • Lee, Won-Jun
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.6 no.1
    • /
    • pp.149-158
    • /
    • 2019
  • Due to the rapid development of IT (information technology) and internet, products become smart and able to collect, process and produce information and can think of themselves to provide better service to consumers. However, research on the characteristics of smart product is still sparse. In this paper, we report the systemic development of a scale to measure the perceived product smartness associated with smart product. To develop product smartness scale, this study follows systemic scale development processes of item generation, item reduction, scale validation, reliability and validity test consequently. And, after acquiring a large amount of qualitative interview data asking the definition of smart product, we add a unique process to reduce the initial items using both a text mining method using 'r' s/w and traditional reliability and validity tests including factor analysis. Based on an initial qualitative inquiry and subsequent quantitative survey, an eight-factor scale of product smartness is developed. The eight factors are multi-functionality, human-like touch, ability to cooperate, autonomy, situatedness, network connectivity, integrity, and learning capability consequently. Results from Korean samples support the proposed measures of product smartness in terms of reliability, validity, and dimensionality. Implications and directions for further study are discussed. The developed scale offers important theoretical and pragmatic implications for researchers and practitioners.

A Comparative Study of Figure Skating Commentary on NBCSN and MBC's Coverage of 2018 Olympic Games (NBCSN과 MBC의 평창동계올림픽 피겨 스케이팅 해설에 대한 비교분석: 피겨 스케이팅 중계방송 해설의 개선방안에 대하여)

  • Song, Yung-Joo;Kim, Hana
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.8
    • /
    • pp.94-105
    • /
    • 2022
  • The purpose of this study is to suggest improvement plan for a commentary on figure skating in Korea from comparing to NBCSN and MBC's coverage of the 2018 Pyungchang Olympic Games employing both of quantitative and text analysis. Results indicate that NBCSN and MBC's commentary on figure skating have definitely different characteristics in terms of expertise and dramatizing ability. The commentator of MBC frequently used monotonous and repetitive emotional expression and provided incoherent information in very automatic way. Whereas, NBCSN's comments expressed very diverse way on introduction of players, explanation of technique and evaluation, especially on dramatizing ability to contextualize combining players' performance and background information.