• Title/Summary/Keyword: Quantitative Text Analysis

Search Result 147, Processing Time 0.032 seconds

Bankruptcy Prediction Modeling Using Qualitative Information Based on Big Data Analytics (빅데이터 기반의 정성 정보를 활용한 부도 예측 모형 구축)

  • Jo, Nam-ok;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.22 no.2
    • /
    • pp.33-56
    • /
    • 2016
  • Many researchers have focused on developing bankruptcy prediction models using modeling techniques, such as statistical methods including multiple discriminant analysis (MDA) and logit analysis or artificial intelligence techniques containing artificial neural networks (ANN), decision trees, and support vector machines (SVM), to secure enhanced performance. Most of the bankruptcy prediction models in academic studies have used financial ratios as main input variables. The bankruptcy of firms is associated with firm's financial states and the external economic situation. However, the inclusion of qualitative information, such as the economic atmosphere, has not been actively discussed despite the fact that exploiting only financial ratios has some drawbacks. Accounting information, such as financial ratios, is based on past data, and it is usually determined one year before bankruptcy. Thus, a time lag exists between the point of closing financial statements and the point of credit evaluation. In addition, financial ratios do not contain environmental factors, such as external economic situations. Therefore, using only financial ratios may be insufficient in constructing a bankruptcy prediction model, because they essentially reflect past corporate internal accounting information while neglecting recent information. Thus, qualitative information must be added to the conventional bankruptcy prediction model to supplement accounting information. Due to the lack of an analytic mechanism for obtaining and processing qualitative information from various information sources, previous studies have only used qualitative information. However, recently, big data analytics, such as text mining techniques, have been drawing much attention in academia and industry, with an increasing amount of unstructured text data available on the web. A few previous studies have sought to adopt big data analytics in business prediction modeling. Nevertheless, the use of qualitative information on the web for business prediction modeling is still deemed to be in the primary stage, restricted to limited applications, such as stock prediction and movie revenue prediction applications. Thus, it is necessary to apply big data analytics techniques, such as text mining, to various business prediction problems, including credit risk evaluation. Analytic methods are required for processing qualitative information represented in unstructured text form due to the complexity of managing and processing unstructured text data. This study proposes a bankruptcy prediction model for Korean small- and medium-sized construction firms using both quantitative information, such as financial ratios, and qualitative information acquired from economic news articles. The performance of the proposed method depends on how well information types are transformed from qualitative into quantitative information that is suitable for incorporating into the bankruptcy prediction model. We employ big data analytics techniques, especially text mining, as a mechanism for processing qualitative information. The sentiment index is provided at the industry level by extracting from a large amount of text data to quantify the external economic atmosphere represented in the media. The proposed method involves keyword-based sentiment analysis using a domain-specific sentiment lexicon to extract sentiment from economic news articles. The generated sentiment lexicon is designed to represent sentiment for the construction business by considering the relationship between the occurring term and the actual situation with respect to the economic condition of the industry rather than the inherent semantics of the term. The experimental results proved that incorporating qualitative information based on big data analytics into the traditional bankruptcy prediction model based on accounting information is effective for enhancing the predictive performance. The sentiment variable extracted from economic news articles had an impact on corporate bankruptcy. In particular, a negative sentiment variable improved the accuracy of corporate bankruptcy prediction because the corporate bankruptcy of construction firms is sensitive to poor economic conditions. The bankruptcy prediction model using qualitative information based on big data analytics contributes to the field, in that it reflects not only relatively recent information but also environmental factors, such as external economic conditions.

The Main Path Analysis of Korean Studies Using Text Mining: Based on SCOPUS Literature Containing 'Korea' as a Keyword (텍스트 마이닝을 활용한 한국학 주경로(Main Path) 분석: '한국'을 키워드로 포함하는 SCOPUS 문헌을 대상으로)

  • Kim, Hea-Jin
    • Journal of the Korean Society for information Management
    • /
    • v.37 no.3
    • /
    • pp.253-274
    • /
    • 2020
  • In this study, text mining and main path analysis (MPA) were applied to understand the origins and development paths of research areas that make up the mainstream of Korean studies. To this end, a quantitative analysis was attempted based on digital texts rather than the traditional humanities research methodology, and the main paths of Korean studies were extracted by collecting documents related to Korean studies including citation information using a citation database, and establishing a direct citation network. As a result of the main path analysis, two main path clusters (Korean ancient agricultural culture (history, culture, archeology) and Korean acquisition of English (linguistics)) were found in the key-route search for the Humanities field of Korean studies. In the field of Korean Studies Humanities and Social Sciences, four main path clusters were discovered: (1) Korea regional/spatial development, (2) Korean economic development (Economic aid/Soft power), (3) Korean industry (Political economics), and (4) population of Korea (Sex selection) & North Korean economy (Poverty, South-South cooperation).

A Comparative Study on the Types and its Importance of Trade Claims between China and the United States: Using Text Mining Techniques (중국과 미국의 무역클레임 유형과 중요도 비교 연구 : 텍스트 마이닝 기법을 활용하여)

  • Cheon Yu;Yun-Seop Hwang
    • Korea Trade Review
    • /
    • v.47 no.3
    • /
    • pp.177-190
    • /
    • 2022
  • This study is designed to identify the differences in the types and importance of trade claims at the national level. For analysis data, abstracts of arbitration and court judgments published on the website of the United Nations Commission on International Trade Law are collected and used. The target countries are China and the United States, with 102 cases from China and 59 cases from the United States. By applying topic modeling techniques to the collection decisions of China and the United States, trade claims are categorized, and the importance of each type is identified using the network centrality index derived through semantic network analysis. The analysis results are as follows. First, the main types of trade claims were the same for both the United States and China: product nonconformity, delivery issues, and payments. However, in China, the order of product nonconformity > delivery issues > payments was important, and in the United States, payments > product nonconformity > delivery issues were found to be important. This study is significant in that it presents a strategic trade claim management plan using a quantitative methodology.

Speech Rate Variation in Synchronous Speech (동시발화에 나타나는 발화 속도 변이 분석)

  • Kim, Miran;Nam, Hosung
    • Phonetics and Speech Sciences
    • /
    • v.4 no.4
    • /
    • pp.19-27
    • /
    • 2012
  • When two speakers read a text together, the produced speech has been shown to reduce a high degree of variability (e.g., pause duration and placement, and speech rate). This paper provides a quantitative analysis of speech rate variation exhibited in synchronous speech by examining the global and local patterns in two dialects of Mandarin Chinese (Taiwan and Shanghai). We analyzed the speech data in terms of mean speech rate and the reference of "Just Noticeable difference (JND)" within a subject and across subjects. Our findings show that speakers show lower and less variable speech rates when they read a text synchronously than when they read alone. This global pattern is observed consistently across speakers and dialects maintaining the unique local variation patterns of speech rate for each dialect. We conclude that paired speakers lower their speech rates and decrease the variability in order to ensure the synchrony of their speech.

The Trend and Tasks of Meister High School Research: Network Text Analysis and Content Analysis (마이스터고 연구의 동향과 과제: 네트워크 텍스트 분석 및 내용분석)

  • Bae, Sang Hoon;Jang, Chang Seong;Lee, Tae Hee;Cho, Sung Bum
    • Journal of vocational education research
    • /
    • v.33 no.3
    • /
    • pp.83-104
    • /
    • 2014
  • The study examined the trends of research on Meister high schools in Korea. The study also investigated differences of research interests between the university faculty and graduate students who are the future researchers in this field. A total of 56 research articles were analyzed using the network text analysis method and the content analysis. The results showed that 56% of all studies was done to reveal the distinguishable characteristics of Meister students and teachers compared to their counterpart in vocational schools. 17.6% of studies were about school curriculum, while 14.0% of studies were on school organization and operation. Only 12.3% of studies were conducted to evaluate school performance. Quantitative studies outnumbered qualitative ones. Based on the results, this study suggested implications for policies and future research on meister high school.

Analysis of Research Trends on Archival Information Services Using Text Mining (텍스트마이닝을 활용한 국내외 기록서비스 연구동향 분석)

  • Seohee Park;Hye-Eun Lee
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.24 no.1
    • /
    • pp.89-109
    • /
    • 2024
  • The study analyzed the research trends of domestic and international record information services from 2003 to 2022. A total of 136 academic papers registered in the Korea Citation Index (KCI) and 74 from the Library, Information Science & Technology Abstracts (LISTA) were examined by quantitative and qualitative content analysis to understand the research status of 20 years from various angles, such as publication year, research type, researcher type, subject, and purpose. Frequency analysis, co-occurrence frequency analysis, centrality analysis, and topic modeling were performed by applying text mining techniques. Results showed that domestic papers demonstrated a research flow focused on specific institutions or records, and user-centered satisfaction surveys and content-centered studies were conducted. Moreover, foreign papers confirmed various evaluation-oriented and information provision studies, such as data, resources, and collections, along with the research trend focusing on the relationship between archivists and users. The management of information resources was identified as a common topic in both domestic and foreign papers, but it is possible to identify that domestic research focuses on maintaining the quality of domestic information resources, while foreign research focuses on the storage and retrieval of information.

An Analysis of Causes of Marine Incidents at sea Using Big Data Technique (빅데이터 기법을 활용한 항해 중 준해양사고 발생원인 분석에 관한 연구)

  • Kang, Suk-Young;Kim, Ki-Sun;Kim, Hong-Beom;Rho, Beom-Seok
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.24 no.4
    • /
    • pp.408-414
    • /
    • 2018
  • Various studies have been conducted to reduce marine accidents. However, research on marine incidents is only marginal. There are many reports of marine incidents, but the main content of existing studies has been qualitative, which makes quantitative analysis difficult. However, quantitative analysis of marine accidents is necessary to reduce marine incidents. The purpose of this paper is to analyze marine incident data quantitatively by applying big data techniques to predict marine incident trends and reduce marine accident. To accomplish this, about 10,000 marine incident reports were prepared in a unified format through pre-processing. Using this preprocessed data, we first derived major keywords for the Marine incidents at sea using text mining techniques. Secondly, time series and cluster analysis were applied to major keywords. Trends for possible marine incidents were predicted. The results confirmed that it is possible to use quantified data and statistical analysis to address this topic. Also, we have confirmed that it is possible to provide information on preventive measures by grasping objective tendencies for marine incidents that may occur in the future through big data techniques.

An Analysis Scheme Design of Customer Spending Pattern using Text Mining (텍스트 마이닝을 이용한 소비자 소비패턴 분석 기법 설계)

  • Jeong, Eun-Hee;Lee, Byung-Kwan
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.11 no.2
    • /
    • pp.181-188
    • /
    • 2018
  • In this paper, we propose an analysis scheme of customer spending pattern using text mining. In proposed consumption pattern analysis scheme, first we analyze user's rating similarity using Pearson correlation, second we analyze user's review similarity using TF-IDF cosine similarity, third we analyze the consistency of the rating and review using Sendiwordnet. And we select the nearest neighbors using rating similarity and review similarity, and provide the recommended list that is proper with consumption pattern. The precision of recommended list are 0.79 for the Pearson correlation, 0.73 for the TF-IDF, and 0.82 for the proposed consumption pattern. That is, the proposed consumption pattern analysis scheme can more accurately analyze consumption pattern because it uses both quantitative rating and qualitative reviews of consumers.

Trends in the Study of Nursing Professionals in Korea: A Convergence Study of Text Network Analysis and Topic Modeling (국내 간호전문직관 연구 주제 동향: 텍스트네트워크분석과 토픽모델링의 융합)

  • Park, Chan-Sook
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.9
    • /
    • pp.295-305
    • /
    • 2021
  • The purpose of this study is to explore the trend of nursing professional research topics published domestically through quantitative content analysis. The research method performed procedures for collecting academic papers, refining and extracting words, and data analysis. A text network was developed by collecting 351 papers and extracting words from the abstract, and network analysis and topic modeling were performed. The core-topics were nurses, nursing professionalism, nursing students, nursing care, professional self-concept, health care professionals, satisfaction, clinical competence, and self-efficacy. Through topic modeling, topic groups of nurse's professionalism, nursing students' professionalism, nursing professional identity, and nursing competency were identified. Over time, core-topics remained unchanged, but topics such as role conflict and ethical values in the 1990s, self-leadership and socialization in the 2000s, and clinical practice stress and support systems in the 2010s have emerged. In conclusion, it is necessary to facilitate multidimensional interventional research to improve nursing professionalism of clinical nurses and nursing students.