• Title/Summary/Keyword: text mining technique

Search Result 222, Processing Time 0.023 seconds

Group-wise Keyword Extraction of the External Audit using Text Mining and Association Rules (텍스트마이닝과 연관규칙을 이용한 외부감사 실시내용의 그룹별 핵심어 추출)

  • Seong, Yoonseok;Lee, Donghee;Jung, Uk
    • Journal of Korean Society for Quality Management
    • /
    • v.50 no.1
    • /
    • pp.77-89
    • /
    • 2022
  • Purpose: In order to improve the audit quality of a company, an in-depth analysis is required to categorize the audit report in the form of a text document containing the details of the external audit. This study introduces a systematic methodology to extract keywords for each group that determines the differences between groups such as 'audit plan' and 'interim audit' using audit reports collected in the form of text documents. Methods: The first step of the proposed methodology is to preprocess the document through text mining. In the second step, the documents are classified into groups using machine learning techniques and based on this, important vocabularies that have a dominant influence on the performance of classification are extracted. In the third step, the association rules for each group's documents are found. In the last step, the final keywords for each group representing the characteristics of each group are extracted by comparing the important vocabulary for classification with the important vocabulary representing the association rules of each group. Results: This study quantitatively calculates the importance value of the vocabulary used in the audit report based on machine learning rather than the qualitative research method such as the existing literature search, expert evaluation, and Delphi technique. From the case study of this study, it was found that the extracted keywords describe the characteristics of each group well. Conclusion: This study is meaningful in that it has laid the foundation for quantitatively conducting follow-up studies related to key vocabulary in each stage of auditing.

Performance analysis of volleyball games using the social network and text mining techniques (사회네트워크분석과 텍스트마이닝을 이용한 배구 경기력 분석)

  • Kang, Byounguk;Huh, Mankyu;Choi, Seungbae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.3
    • /
    • pp.619-630
    • /
    • 2015
  • The purpose of this study is to provide basic information to develop a game strategy plan of a team in a future by identifying the patterns of attack and pass of national men's professional volleyball teams and extracting core key words related with volleyball game performance to evaluate game performance using 'social network analysis' and 'text mining'. As for the analysis result of 'social network analysis' with the whole data, group '0' (6 players) and group '1' (11 players) were partitioned. A point of view the degree centrality and betweenness centrality in 'social network analysis' results, we can know that the group '1' more active game performance than the group '0'. The significant result for two group (win and loss) obtained by 'text mining' according to two groups ('0' and '1') obtained by 'social network analysis' showed significant difference (p-value: 0.001). As for clustering of each network, group '0' had the tendency to score points through set player D and E. In group '1', the player K had the tendency to fail if he attack through 'dig'; players C and D have a good performance through 'set' play.

An exploratory study for the development of a education framework for supporting children's development in the convergence of "art activity" and "language activity": Focused on Text mining method ('미술'과 '언어' 활동 융합형의 아동 발달지원 교육 프레임워크 개발을 위한 탐색적 연구: 텍스트 마이닝을 중심으로)

  • Park, Yunmi;Kim, Sijeong
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.3
    • /
    • pp.297-304
    • /
    • 2021
  • This study aims not only to access the visual thought-oriented approach that has been implemented in established art therapy and education but also to integrate language education and therapeutic approach to support the development of school-age children. Thus, text mining technique was applied to search for areas where different areas of language and art can be integrated. This research was conducted in accordance with the procedure of basic research, preliminary DB construction, text screening, DB pre-processing and confirmation, stop-words removing, text mining analysis and the deduction about the convergent areas. These results demonstrated that this study draws convergence areas related to regional, communication, and learning functions, areas related to problem solving and sensory organs, areas related to art and intelligence, areas related to information and communication, areas related to home and disability, topics, conceptualization, peer-related areas, integration, reorganization, attitudes. In conclusion, this study is meaningful in that it established a framework for designing an activity-centered convergence program of art and language in the future and attempted a holistic approach to support child development.

Analysis of Use Behavior of Urban Park Users Expressing Depression on Social Media Using Text Mining Technique (텍스트 마이닝 기법을 활용한 SNS 상에서 우울감을 언급한 도시공원 이용자의 이용행태 분석)

  • Oh, Jiyeon;Nam, Seongwoo;Lee, Peter Sang-Hoon
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.6
    • /
    • pp.319-328
    • /
    • 2022
  • The purpose of this study was to investigate the relationship between depression due to the COVID-19 pandemic and park use behaviors using on line posts. During the period of the pandemic prevention activities, text data containing both 'park' and 'depression' were collected from blogs and cafes in the search engine of Naver and Daum, then analyzed using Text Mining and Social Network techniques. As a result, the main usage behaviors of park users who mentioned depression were 'look', 'stroll(walk)' and 'eat'. Other types of behaviors were connected centering around 'look', one of the communication behaviors. Also, from CONCOR analysis, as the cluster referred from communication behavior and dynamic behavior was formed as a single behavior type, it was considered park users with depression perceived the park as the space for communication and physical activities. As the spread of COVID-19 caused the restriction of communication activities, the users might consider parks as one of the solutions. In addition, it was considered that passive usage behaviors have prevailed rather than active ones due to the depression. Resulting outcomes would be useful to plan helpful urban park for citizens. It is necessary to further analyze the park use behavior of users in relation to the period of before/after the COVID-19 pandemic and the existence/nonexistence of depression.

Item-Based Collaborative Filtering Recommendation Technique Using Product Review Sentiment Analysis (상품 리뷰 감성분석을 이용한 아이템 기반 협업 필터링 추천 기법)

  • Yun, So-Young;Yoon, Sung-Dae
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.8
    • /
    • pp.970-977
    • /
    • 2020
  • The collaborative filtering recommendation technique has been the most widely used since the beginning of e-commerce companies introducing the recommendation system. As the online purchase of products or contents became an ordinary thing, however, recommendation simply applying purchasers' ratings led to the problem of low accuracy in recommendation. To improve the accuracy of recommendation, in this paper suggests the method of collaborative filtering that analyses product reviews and uses them as a weighted value. The proposed method refines product reviews with text mining to extract features and conducts sentiment analysis to draw a sentiment score. In order to recommend better items to user, sentiment weight is used to calculate the predicted values. The experiment results show that higher accuracy can be gained in the proposed method than the traditional collaborative filtering.

Study on Designing and Implementing Online Customer Analysis System based on Relational and Multi-dimensional Model (관계형 다차원모델에 기반한 온라인 고객리뷰 분석시스템의 설계 및 구현)

  • Kim, Keun-Hyung;Song, Wang-Chul
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.4
    • /
    • pp.76-85
    • /
    • 2012
  • Through opinion mining, we can analyze the degree of positive or negative sentiments that customers feel about important entities or attributes in online customer reviews. But, the limit of the opinion mining techniques is to provide only simple functions in analyzing the reviews. In this paper, we proposed novel techniques that can analyze the online customer reviews multi-dimensionally. The novel technique is to modify the existing OLAP techniques so that they can be applied to text data. The novel technique, that is, multi-dimensional analytic model consists of noun, adjective and document axes which are converted into four relational tables in relational database. The multi-dimensional analysis model would be new framework which can converge the existing opinion mining, information summarization and clustering algorithms. In this paper, we implemented the multi-dimensional analysis model and algorithms. we recognized that the system would enable us to analyze the online customer reviews more complexly.

Marketing Strategies for the Korean High Speed Electric Multiple Unit (HEMU train) (동력분산형 고속철도 마케팅 전략 수립)

  • Kim, Yeon Kyu
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.34 no.1
    • /
    • pp.329-332
    • /
    • 2014
  • The Korean High Speed Electric Multiple Unit (HEMU) train system is soon being applicable to practical use. This new technology is expected not only to reshape the domestic market but also to be exported to overseas markets for high-speed train system. This study aims to prospect demands on the HEMU train technology and to formulate marketing strategies using a text-mining technique, therefore, providing a foundation for successful commercialization of the HEMU train system.

Review of Trends in Wind Energy Research Publications in Journal of the Korean Solar Energy Society (태양에너지학회 논문집의 풍력에너지 연구동향 분석)

  • Kim, Hyun-Goo
    • Journal of the Korean Solar Energy Society
    • /
    • v.40 no.4
    • /
    • pp.1-11
    • /
    • 2020
  • The Journal of the Korean Solar Energy Society is the first journal in South Korea that adopts wind energy as one of its subjects. Since 2000, more than 140 papers on wind energy have been published in the journal, which accounts for 8.5% of the total publication. However, in recent years, the number of published papers on wind energy has been decreasing steadily, and a reason for this decline is the significant dependence on a few specific institutions and authors. In this study, wind energy subjects were classified using the frequency analysis of the subject words extracted from the title, keywords, and abstract of wind energy papers using the text mining technique. In addition, the Korea Citation Index was used to perform quantitative level evaluation by subject and institution and to analyze the trends and characteristics of the wind energy field. Therefore, it was identified that in terms of the number of publications and citations, the main subject areas were resource/micrositing and policy/potential.

Prevention through Design (PtD) of integrating accident precursors in BIM

  • Chang, Soowon;Oh, Heung Jin;Lee, JeeHee
    • International conference on construction engineering and project management
    • /
    • 2022.06a
    • /
    • pp.94-102
    • /
    • 2022
  • Construction workers are engaged in many activities that may expose them to serious hazards, such as falling, unguarded machinery, or being struck by heavy construction equipment. Despite extensive research in building information modeling (BIM) for safety management, current approaches, detecting safety issues after design completion, may limit the opportunities to prevent predictable and potential accidents when decisions of building materials and systems are made. In this respect, this research proposes a proactive approach to detecting safety issues from the early design phase. This research aims to explore accident precursors and integrate them into BIM for tracking safety hazards during the design development process. Accident precursors can be identified from construction incident reports published by OSHA using a text mining technique. Through BIM-integrated accident precursors, construction safety hazards can be identified during the design phase. The results will contribute to supporting a successful transition from the design stage to the construction stage that considers a safe construction workplace. This will advance the body of knowledge about construction safety management by elucidating a hypothesis that safety hazards can be detected during the design phase involving decisions about materials, building elements, and equipment. In addition, the proactive approach will help the Architecture, Engineering and Construction (AEC) industry eliminate occupational safety hazards before near-miss situations appear on construction sites.

  • PDF

The proposition of cosine net confidence in association rule mining (연관 규칙 마이닝에서의 코사인 순수 신뢰도의 제안)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.1
    • /
    • pp.97-106
    • /
    • 2014
  • The development of big data technology was to more accurately predict diversified contemporary society and to more efficiently operate it, and to enable impossible technique in the past. This technology can be utilized in various fields such as the social science, economics, politics, cultural sector, and science technology at the national level. It is a prerequisite to find valuable information by data mining techniques in order to analyze big data. Data mining techniques associated with big data involve text mining, opinion mining, cluster analysis, association rule mining, and so on. The most widely used data mining technique is to explore association rules. This technique has been used to find the relationship between each set of items based on the association thresholds such as support, confidence, lift, similarity measures, etc.This paper proposed cosine net confidence as association thresholds, and checked the conditions of interestingness measure proposed by Piatetsky-Shapiro, and examined various characteristics. The comparative studies with basic confidence and cosine similarity, and cosine net confidence were shown by numerical example. The results showed that cosine net confidence are better than basic confidence and cosine similarity because of the relevant direction.