• Title/Summary/Keyword: Text-Mining

Search Result 1,498, Processing Time 0.028 seconds

An analysis of the signaling effect of FOMC statements (미 연준 통화정책방향 의결문의 시그널링 효과 분석)

  • Woo, Shinwook;Chang, Youngjae
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.3
    • /
    • pp.321-334
    • /
    • 2020
  • The US Federal Reserve (Fed) has decided to cut interest rates. When we look at the expression of the FOMC statements at the time of policy change period we can understand that Fed has been communicating with markets through a change of word selection. However, there is a criticism that the method of analyzing the expression of the decision sentence through the context can be subjective and limited in qualitative analysis. In this paper, we evaluate the signaling effect of FOMC statements based on previous research. We analyze decision making characteristics from the viewpoint of text mining and try to predict future policy trend changes by capturing changes in expressions between statements. For this purpose, a decision tree and neural network models are used. As a result of the analysis, it can be judged that the discrepancy indicators between statements could be used to predict the policy change in the future and that the US Federal Reserve has systematically implemented policy signaling through the policy statements.

Measurement of Classes Complexity in the Object-Oriented Analysis Phase (객체지향 분석 단계에서의 클래스 복잡도 측정)

  • Kim, Yu-Kyung;Park, Jai-Nyun
    • Journal of KIISE:Software and Applications
    • /
    • v.28 no.10
    • /
    • pp.720-731
    • /
    • 2001
  • Complexity metrics have been developed for the structured paradigm of software development are not suitable for use with the object-oriented(OO) paradigm, because they do not support key object-oriented concepts such as inheritance, polymorphism. message passing and encapsulation. There are many researches on OO software metrics such as program complexity or design metrics. But metrics measuring the complexity of classes at the OO analysis phase are needed because they provide earlier feedback to the development project. and earlier feedback means more effective developing and less costly maintenance. In this paper, we propose the new metrics to measure the complexity of analysis classes which draw out in the analysis based on RUP(Rational Unified Process). By the collaboration complexity, is denoted by CC, we mean the maximum number of the collaborations can be achieved with each of the collaborator and determine the potential complexity. And the interface complexity, is denoted by IC, shows the difficulty related to understand the interface of collaborators each other. We verify theoretically the suggested metrics for Weyuker's nine properties. Moreover, we show the computation results for analysis classes of the system which automatically respond to questions of the user using the text mining technique. As a result of the comparison of CC and CBO and WMC suggested by Chidamber and Kemerer, the class that have highly the proposed metric value maintain the high complexity at the design phase too. And the complexity can be represented by CC and IC more than CBO and WMC. We can expect that our metrics may provide us the earlier feedback and hence possible to predict the efforts, costs and time required to remainder processes. As a result, we expect to develop the cost-effective OO software by reviewing the complexity of analysis classes in the first stage of SDLC(Software Development Life Cycle).

  • PDF

Stock Price Prediction Using Sentiment Analysis: from "Stock Discussion Room" in Naver (SNS감성 분석을 이용한 주가 방향성 예측: 네이버 주식토론방 데이터를 이용하여)

  • Kim, Myeongjin;Ryu, Jihye;Cha, Dongho;Sim, Min Kyu
    • The Journal of Society for e-Business Studies
    • /
    • v.25 no.4
    • /
    • pp.61-75
    • /
    • 2020
  • The scope of data for understanding or predicting stock prices has been continuously widened from traditional structured format data to unstructured data. This study investigates whether commentary data collected from SNS may affect future stock prices. From "Stock Discussion Room" in Naver, we collect 20 stocks' commentary data for six months, and test whether this data have prediction power with respect to one-hour ahead price direction and price range. Deep neural network such as LSTM and CNN methods are employed to model the predictive relationship. Among the 20 stocks, we find that future price direction can be predicted with higher than the accuracy of 50% in 13 stocks. Also, the future price range can be predicted with higher than the accuracy of 50% in 16 stocks. This study validate that the investors' sentiment reflected in SNS community such as Naver's "Stock Discussion Room" may affect the demand and supply of stocks, thus driving the stock prices.

A Study on the School Library Research Trends Using Topic Modeling (토픽모델링을 활용한 학교도서관 연구동향 분석)

  • Jung, Young-Joo;Kim, Hea-Jin
    • Journal of Korean Library and Information Science Society
    • /
    • v.51 no.3
    • /
    • pp.103-121
    • /
    • 2020
  • This study aimed to analyze the research trends of school libraries from 1990 to July 2020. To this end, LDA topic modeling analysis was conducted to the domestic article abstracts related to school libraries. The total number of documents is 498 papers published by the four major domestic journals in Library and Information Science. The log-likelihood estimate criterion was used to determine the number of topics for topic modeling. As a result of the study, 27 topics were discovered, then, theory were categorized by eight subject areas: general, institutional system, building/equipment, operation/management, data organization, service, education, and others. The most popular research was library utilization classes (T27) and Information Utilization (T2). More than 20 studies were found in each evaluation index development (T13), school librarian placement (T24), learning information media utilization (T3), community public library (T7), library cooperation (T9), library use (T17), library research (T11), reading education (T4), collection development (T5), and education effects/teaching methods (T18).

Trend of Research and Industry-Related Analysis in Data Quality Using Time Series Network Analysis (시계열 네트워크분석을 통한 데이터품질 연구경향 및 산업연관 분석)

  • Jang, Kyoung-Ae;Lee, Kwang-Suk;Kim, Woo-Je
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.5 no.6
    • /
    • pp.295-306
    • /
    • 2016
  • The purpose of this paper is both to analyze research trends and to predict industrial flows using the meta-data from the previous studies on data quality. There have been many attempts to analyze the research trends in various fields till lately. However, analysis of previous studies on data quality has produced poor results because of its vast scope and data. Therefore, in this paper, we used a text mining, social network analysis for time series network analysis to analyze the vast scope and data of data quality collected from a Web of Science index database of papers published in the international data quality-field journals for 10 years. The analysis results are as follows: Decreases in Mathematical & Computational Biology, Chemistry, Health Care Sciences & Services, Biochemistry & Molecular Biology, Biochemistry & Molecular Biology, and Medical Information Science. Increases, on the contrary, in Environmental Sciences, Water Resources, Geology, and Instruments & Instrumentation. In addition, the social network analysis results show that the subjects which have the high centrality are analysis, algorithm, and network, and also, image, model, sensor, and optimization are increasing subjects in the data quality field. Furthermore, the industrial connection analysis result on data quality shows that there is high correlation between technique, industry, health, infrastructure, and customer service. And it predicted that the Environmental Sciences, Biotechnology, and Health Industry will be continuously developed. This paper will be useful for people, not only who are in the data quality industry field, but also the researchers who analyze research patterns and find out the industry connection on data quality.

A comparative study of domestic and international research trends of mathematics education through topic modeling (토픽모델링을 활용한 국내외 수학교육 연구 동향 비교 연구)

  • Shin, Dongjo
    • The Mathematical Education
    • /
    • v.59 no.1
    • /
    • pp.63-80
    • /
    • 2020
  • This study analyzed 3,114 articles published in KCI journals and 1,636 articles published in SSCI journals from 2000 to 2019 in order to compare domestic and international research trends of mathematics education using a topic modeling method. Results indicated that there were 16 similar research topics in domestic and international mathematics education journals: algebra/algebraic thinking, fraction, function/representation, statistics, geometry, problem-solving, model/modeling, proof, achievement effect/difference, affective factor, preservice teacher, teaching practice, textbook/curriculum, task analysis, assessment, and theory. Also, there were 7 distinct research topics in domestic and international mathematics education journals. Topics such as affective/cognitive domain and research trends, mathematics concept, class activity, number/operation, creativity/STEAM, proportional reasoning, and college/technology were identified from the domestic journals, whereas discourse/interaction, professional development, identity/equity, child thinking, semiotics/embodied cognition, intervention effect, and design/technology were the topics identified from the international journals. The topic related to preservice teacher was the most frequently addressed topic in both domestic and international research. The topic related to in-service teachers' professional development was the second most popular topic in international research, whereas it was not identified in domestic research. Domestic research in mathematics education tended to pay attention to the topics concerned with the mathematical competency, but it focused more on problem-solving and creativity/STEAM than other mathematical competencies. Rather, international research highlighted the topic related to equity and social justice.

Technology Planning through Technology Roadmap: Application of Patent Citation Network (기술로드맵을 통한 기술기획: 특허인용네트워크의 활용)

  • Jeong, Yu-Jin;Yoon, Byung-Un
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.12 no.11
    • /
    • pp.5227-5237
    • /
    • 2011
  • Technology roadmap is a powerful tool that considers relationships of technology, product and market and referred as a supporting technology strategy and planning. There are numerous studies that have attempted to develop technology roadmap and case studies on specific technology areas. However, a number of studies have been dependant on brainstorming and discussion of expert group, delphi technique as qualitative analysis rather than systemic and quantitative analysis. To overcome the limitation, patent analysis considered as quite quantitative analysis is employed in this paper. Therefore, this paper proposes new technology roadmapping based on patent citation network considering technology life cycle and suggests planning for undeveloped technology but considered as promising. At first, patent data and citation information are collected and patent citation network is developed on the basis of collected patent information. Secondly, we investigate a stage of technology in the life cycle by considering patent application year and the technology life cycle, and duration of technology development is estimated. In addition, subsequent technologies are grouped as nodes of a super-level technology to show the evolution of the technology for the period. Finally, a technology roadmap is drawn by linking these technology nodes in a technology layer and estimating the duration of development time. Based on technology roadmap, technology planning is conducted to identify undeveloped technology through text mining and this paper suggests characteristics of technology that needs to be developed in the future. In order to illustrate the process of the proposed approach, technology for hydrogen storage is selected in this paper.

A Study of Intelligent Recommendation System based on Naive Bayes Text Classification and Collaborative Filtering (나이브베이즈 분류모델과 협업필터링 기반 지능형 학술논문 추천시스템 연구)

  • Lee, Sang-Gi;Lee, Byeong-Seop;Bak, Byeong-Yong;Hwang, Hye-Kyong
    • Journal of Information Management
    • /
    • v.41 no.4
    • /
    • pp.227-249
    • /
    • 2010
  • Scholarly information has increased tremendously according to the development of IT, especially the Internet. However, simultaneously, people have to spend more time and exert more effort because of information overload. There have been many research efforts in the field of expert systems, data mining, and information retrieval, concerning a system that recommends user-expected information items through presumption. Recently, the hybrid system combining a content-based recommendation system and collaborative filtering or combining recommendation systems in other domains has been developed. In this paper we resolved the problem of the current recommendation system and suggested a new system combining collaborative filtering and Naive Bayes Classification. In this way, we resolved the over-specialization problem through collaborative filtering and lack of assessment information or recommendation of new contents through Naive Bayes Classification. For verification, we applied the new model in NDSL's paper service of KISTI, especially papers from journals about Sitology and Electronics, and witnessed high satisfaction from 4 experimental participants.

A Study on the Privacy Awareness through Bigdata Analysis (빅데이터 분석을 통한 프라이버시 인식에 관한 연구)

  • Lee, Song-Yi;Kim, Sung-Won;Lee, Hwan-Soo
    • Journal of Digital Convergence
    • /
    • v.17 no.10
    • /
    • pp.49-58
    • /
    • 2019
  • In the era of the 4th industrial revolution, the development of information technology brought various benefits, but it also increased social interest in privacy issues. As the possibility of personal privacy violation by big data increases, academic discussion about privacy management has begun to be active. While the traditional view of privacy has been defined at various levels as the basic human rights, most of the recent research trends are mainly concerned only with the information privacy of online privacy protection. This limited discussion can distort the theoretical concept and the actual perception, making the academic and social consensus of the concept of privacy more difficult. In this study, we analyze the privacy concept that is exposed on the internet based on 12,000 news data of the portal site for the past one year and compare the difference between the theoretical concept and the socially accepted concept. This empirical approach is expected to provide an understanding of the changing concept of privacy and a research direction for the conceptualization of privacy for current situations.

A Topic Analysis of College Education Using Big Data of News Articles (뉴스 빅데이터를 통해 검토한 대학교육의 토픽 분석)

  • Yang, Ji-Yeon;Koo, Jeong-Ho
    • Journal of Digital Convergence
    • /
    • v.19 no.12
    • /
    • pp.11-20
    • /
    • 2021
  • This study extracts topics related to university education through newspaper articles and analyzes the characteristics of each topic and the reporting patterns of each newspaper. The 9 topics were discovered using LDA. Topic 1 and Topic 3 are related to university support projects for education, but Topic 3 is focused on local universities. Topic 2 is about university education after COVID-19, Topic 4 teaching-learning methods, Topic 5 government policies, Topic 6 the high school education contribution university support projects, Topic 7 the university education vision, Topic 8 internationalization, and Topic 9 the entrance exam. The Chosun Ilbo, Kyunghyang, and Hankyoreh reported a lot of articles associated to lectures after COVID-19, government policies, and comments on university education. Relevant articles since 2016 have been analyzed by newspaper type and before/after COVID-19 through which differences in the topics were studied and discussed. These findings would suggest a basic policy guideline for university education and imply that the positive and negative effects of the media need to be considered.