• Title/Summary/Keyword: TextMining

Search Result 1,563, Processing Time 0.024 seconds

Financial Footnote Analysis for Financial Ratio Predictions based on Text-Mining Techniques (재무제표 주석의 텍스트 분석 통한 재무 비율 예측 향상 연구)

  • Choe, Hyoung-Gyu;Lee, Sang-Yong Tom
    • Knowledge Management Research
    • /
    • v.21 no.2
    • /
    • pp.177-196
    • /
    • 2020
  • Since the adoption of K-IFRS(Korean International Financial Reporting Standards), the amount of financial footnotes has been increased. However, due to the stereotypical phrase and the lack of conciseness, deriving the core information from footnotes is not really easy yet. To propose a solution for this problem, this study tried financial footnote analysis for financial ratio predictions based on text-mining techniques. Using the financial statements data from 2013 to 2018, we tried to predict the earning per share (EPS) of the following quarter. We found that measured prediction errors were significantly reduced when text-mined footnotes data were jointly used. We believe this result came from the fact that discretionary financial figures, which were hardly predicted with quantitative financial data, were more correlated with footnotes texts.

Analysis of 'Better Class' Characteristics and Patterns from College Lecture Evaluation by Longitudinal Big Data

  • Nam, Min-Woo;Cho, Eun-Soon
    • International Journal of Contents
    • /
    • v.15 no.3
    • /
    • pp.7-12
    • /
    • 2019
  • The purpose of this study was to analyze characteristics and patterns of 'better class' by using the longitudinal text mining big data analysis technique from subjective lecture evaluation comments. First, this study classified upper 30% classes to deduce certain characteristics and patterns from every five-year subjective text data for 10 years. A total of 47,177courses (100%) from spring semester 2005 to fall semester 2014 were analyzed from a university at a metropolitan city in the mid area of South Korea. This study extracted meaningful words such as good, course, professor, appreciation, lecture, interesting, useful, know, easy, improvement, progress, teaching material, passion, and concern from the order of frequency 2005-2009. The other set of words were class, appreciation, professor, good, course, interesting, understanding, useful, help, student, effort, thinking, not difficult, explanation, lecture, hard, pleasant, easy, study, examination, like, various, fun, and knowledge 2010-2014. This study suggests that the characteristics and patterns of 'better class' at college, should be analyzed according to different academic code such as liberal arts, fine arts, social science, engineering, math and science, and etc.

A Study on De-Identification Methods to Create a Basis for Safety Report Text Mining Analysis (항공안전 보고 데이터 텍스트 분석 기반 조성을 위한 비식별 처리 기술 적용 연구)

  • Hwang, Do-bin;Kim, Young-gon;Sim, Yeong-min
    • Journal of the Korean Society for Aviation and Aeronautics
    • /
    • v.29 no.4
    • /
    • pp.160-165
    • /
    • 2021
  • In order to identify and analyze potential aviation safety hazards, analysis of aviation safety report data must be preceded. Therefore, in consideration of the provisions of the Aviation Safety Act and the recommendations of ICAO Doc 9859 SMM Edition 4th, personal information in the reporting data and sensitive information of the reporter, etc. It identifies the scope of de-identification targets and suggests a method for applying de-identification processing technology to personal and sensitive information including unstructured text data.

Analysis of AI Digital Textbook Keywords Using Text Mining (텍스트 마이닝을 활용한 AI 디지털교과서 키워드 분석)

  • Junhong Min;Mi Ryang Kim
    • Journal of Information Technology Services
    • /
    • v.23 no.5
    • /
    • pp.87-105
    • /
    • 2024
  • This study aims to explore the potential issues and challenges associated with the development, introduction, utilization, and stabilization of AI digital textbook, as well as to identify tasks necessary to address these challenges. We collected and analyzed data from domestic news articles and previous research literature related to "AI digital textbook" to derive key keywords using a comprehensive text analysis approach with Bigkinds and Textom. Through Bigkinds, we conducted keyword trend analysis, associated word analysis, and relationship analysis. Using Textom, we performed keyword frequency analysis, N-gram analysis, TF-IDF(Term Frequency-Inverse Document Frequency) analysis, and network analysis. This approach allowed us to identify the main issues related to the development, implementation, utilization of AI digital textbook and explore the necessary tasks to address these challenges.

Analysis of the Yearbook from the Korea Meteorological Administration using a text-mining agorithm (텍스트 마이닝 알고리즘을 이용한 기상청 기상연감 자료 분석)

  • Sun, Hyunseok;Lim, Changwon;Lee, YungSeop
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.4
    • /
    • pp.603-613
    • /
    • 2017
  • Many people have recently posted about personal interests on social media. The development of the Internet and computer technology has enabled the storage of digital forms of documents that has resulted in an explosion of the amount of textual data generated; subsequently there is an increased demand for technology to create valuable information from a large number of documents. A text mining technique is often used since text-based data is mostly composed of unstructured forms that are not suitable for the application of statistical analysis or data mining techniques. This study analyzed the Meteorological Yearbook data of the Korea Meteorological Administration (KMA) with a text mining technique. First, a term dictionary was constructed through preprocessing and a term-document matrix was generated. This term dictionary was then used to calculate the annual frequency of term, and observe the change in relative frequency for frequently appearing words. We also used regression analysis to identify terms with increasing and decreasing trends. We analyzed the trends in the Meteorological Yearbook of the KMA and analyzed trends of weather related news, weather status, and status of work trends that the KMA focused on. This study is to provide useful information that can help analyze and improve the meteorological services and reflect meteorological policy.

Using Text-mining Method to Identify Research Trends of Freshwater Exotic Species in Korea (텍스트마이닝 (text-mining) 기법을 이용한 국내 담수외래종 연구동향 파악)

  • Do, Yuno;Ko, Eui-Jeong;Kim, Young-Min;Kim, Hyo-Gyeom;Joo, Gea-Jae;Kim, Ji Yoon;Kim, Hyun-Woo
    • Korean Journal of Ecology and Environment
    • /
    • v.48 no.3
    • /
    • pp.195-202
    • /
    • 2015
  • We identified research trends for freshwater exotic species in South Korea using text mining methods in conjunction with bibliometric analysis. We searched scientific and common names of freshwater exotic species as searching keywords including 1 mammal species, 3 amphibian-reptile species, 11 fish species, 2 aquatic plant species. A total of 245 articles including research articles and abstracts of conference proceedings published by 56 academic societies and institutes were collected from scientific article databases. The search keywords used were the common names for the exotic species. The $20^{th}$ century (1900's) saw the number of articles increase; however, during the early $21^{st}$ century (2000's) the number of published articles decreased slowly. The number of articles focusing on physiological and embryological research was significantly greater than taxonomic and ecological studies. Rainbow trout and Nile tilapia were the main research topic, specifically physiological and embryological research associated with the aquaculture of these species. Ecological studies were only conducted on the distribution and effect of large-mouth bass and nutria. The ecological risk associated with freshwater exotic species has been expressed yet the scientific information might be insufficient to remove doubt about ecological issues as expressed by interested by individuals and policy makers due to bias in research topics with respect to freshwater exotic species. The research topics of freshwater exotic species would have to diversify to effectively manage freshwater exotic species.

A Trend Analysis and Policy proposal for the Work Permit System through Text Mining: Focusing on Text Mining and Social Network analysis (텍스트마이닝을 통한 고용허가제 트렌드 분석과 정책 제안 : 텍스트마이닝과 소셜네트워크 분석을 중심으로)

  • Ha, Jae-Been;Lee, Do-Eun
    • Journal of Convergence for Information Technology
    • /
    • v.11 no.9
    • /
    • pp.17-27
    • /
    • 2021
  • The aim of this research was to identify the issue of the work permit system and consciousness of the people on the system, and to suggest some ideas on the government policies on it. To achieve the aim of research, this research used text mining based on social data. This research collected 1,453,272 texts from 6,217 units of online documents which contained 'work permit system' from January to December, 2020 using Textom, and did text-mining and social network analysis. This research extracted 100 key words frequently mentioned from the analyses of data top-level key word frequency, and degree centrality analysis, and constituted job problem, importance of policy process, competitiveness in the respect of industries, and improvement of living conditions of foreign workers as major key words. In addition, through semantic network analysis, this research figured out major awareness like 'employment policy', and various kinds of ambient awareness like 'international cooperation', 'workers' human rights', 'law', 'recruitment of foreigners', 'corporate competitiveness', 'immigrant culture' and 'foreign workforce management'. Finally, this research suggested some ideas worth considering in establishing government policies on the work permit system and doing related researches.

Research Trends on Emotional Labor in Korea using text mining (텍스트마이닝을 활용한 감정노동 연구 동향 분석)

  • Cho, Kyoung-Won;Han, Na-Young
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.26 no.6
    • /
    • pp.119-133
    • /
    • 2021
  • Research has been conducted in many fields to identify research trends using text mining, but in the field of emotional labor, no research has been conducted using text mining to identify research trends. This study uses text mining to deeply analyze 1,465 papers at the Korea Citation Index (KCI) from 2004 to 2019 containing the subject word 'emotional labor' to understand the trend of emotional labor researches. Topics were extracted by LDA analysis, and IDM analysis was performed to confirm the proportion and similarity of the topics. Through these methods, an integrated analysis of topics was conducted considering the usefulness of topics with high similarity. The research topics are divided into 11 categories in descending order: stress of emotional labor (12.2%), emotional labor and social support (12.0%), customer service workers' emotional labor (10.9%), emotional labor and resilience (10.2%), emotional labor strategy (9.2%), call center counselor's emotional labor (9.1%), results of emotional labor (9.0%), emotional labor and job exhaustion (7.9%), emotional intelligence (7.1%), preliminary care service workers' emotional labor (6.6%), emotional labor and organizational culture (5.9%). Through topic modeling and trend analysis, the research trend of emotional labor and the academic progress are analyzed to present the direction of emotional labor research, and it is expected that a practical strategy for emotional labor can be established.

Using Text Mining for the Analysis of Research Trends Related to Laws Under the Ministry of Oceans and Fisheries (텍스트 마이닝을 활용한 해양수산부 법률 관련 연구동향 분석연구)

  • Hwang, Kyu Won;Lee, Moon Suk;Yun, So Ra
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.28 no.4
    • /
    • pp.549-566
    • /
    • 2022
  • Recently, artificial intelligence (AI) technology has progressed rapidly, and industries using this technology are significantly increasing. Further, analysis research using text mining, which is an application of artificial intelligence, is being actively developed in the field of social science research. About 125 laws, including joint laws, have been enacted under the Ministry of Oceans and Fisheries in various sectors including marine environment, fisheries, ships, fishing villages, ports, etc. Research on the laws under the Ministry of Oceans and Fisheries has been progressively conducted, and is steadily increasing quantitatively. In this study, the domestic research trends were analyzed through text mining, targeting the research papers related to laws of the Ministry of Oceans and Fisheries. As part of this research method, first, topic modeling which is a type of text mining was performed to identify potential topics. Second, co-occurrence network analysis was performed, focusing on the keywords in the research papers dealing with specific laws to derive the key themes covered. Finally, author network analysis was performed to explore social networks among authors. The results showed that key topics have been changed by period, and subjects were explored by targeting Ship Safety Law, Marine Environment Management Law, Fisheries Law, etc. Furthermore, in this study, core researchers were selected based on author network analysis, and the tendency for joint research performed by authors was identified. Through this study, changes in the topics for research related to the laws of the Ministry of Oceans and Fisheries were identified up to date, and it is expected that future research topics will be further diversified, and there will be growth of quantitative and qualitative research in the field of oceans and fisheries.

Self-Evolving Expert Systems based on Fuzzy Neural Network and RDB Inference Engine

  • Kim, Jin-Sung
    • Journal of Intelligence and Information Systems
    • /
    • v.9 no.2
    • /
    • pp.19-38
    • /
    • 2003
  • In this research, we propose the mechanism to develop self-evolving expert systems (SEES) based on data mining (DM), fuzzy neural networks (FNN), and relational database (RDB)-driven forward/backward inference engine. Most researchers had tried to develop a text-oriented knowledge base (KB) and inference engine (IE). However, this approach had some limitations such as 1) automatic rule extraction, 2) manipulation of ambiguousness in knowledge, 3) expandability of knowledge base, and 4) speed of inference. To overcome these limitations, knowledge engineers had tried to develop an automatic knowledge extraction mechanism. As a result, the adaptability of the expert systems was improved. Nonetheless, they didn't suggest a hybrid and generalized solution to develop self-evolving expert systems. To this purpose, we propose an automatic knowledge acquisition and composite inference mechanism based on DM, FNN, and RDB-driven inference engine. Our proposed mechanism has five advantages. First, it can extract and reduce the specific domain knowledge from incomplete database by using data mining technology. Second, our proposed mechanism can manipulate the ambiguousness in knowledge by using fuzzy membership functions. Third, it can construct the relational knowledge base and expand the knowledge base unlimitedly with RDBMS (relational database management systems) module. Fourth, our proposed hybrid data mining mechanism can reflect both association rule-based logical inference and complicate fuzzy relationships. Fifth, RDB-driven forward and backward inference time is shorter than the traditional text-oriented inference time.

  • PDF