• Title/Summary/Keyword: Text Mining Method

Search Result 448, Processing Time 0.025 seconds

A Basic Study on the Application of Text-Maining Method for Qualitative Evaluation through Barrier Free Certification in School Facilities (학교시설의 장애물 없는 생활환경(Barrier Free) 인증 사례를 통한 정성평가 텍스트마이닝 기법 적용에 관한 기초연구)

  • Yun, Pyeong-Se;Lee, Jong-Kuk
    • The Journal of Sustainable Design and Educational Environment Research
    • /
    • v.19 no.1
    • /
    • pp.25-35
    • /
    • 2020
  • Since the introduction and operation of BF certification, a total of 6,432 certificates has been issued until February 2020, of which educational research facilities make 1,091 cases (754 preliminary certification, 337 main certification) out of 6,237 buildings, acquiring BF certification of about 20%. Qualitative evaluation is conducted with focus on the three items of BF-certified building evaluation index, which are medium facilities, internal facilities, and sanitary facilities, and major keywords are the deducted through the Text Mining analysis of the derived results. As a result, problems with access paths occurred in the case of the facilities, and assessment indicators for users were found to be necessary among the assessment of the steps of the internal facilities. Finally, we could see that sanitation facilities needed to improve toilets installed in residential development facilities. Based on the results obtained, the study seeks to suggest directions for improving the evaluation index required for BF-certified school facilities.

A Study on Prediction of Patent Registration using Text Mining (텍스트 마이닝을 이용한 특허 등록 예측에 관한 연구)

  • Koo, Jung-Min;Park, Sang-Sung;Shin, Young-Geon;Jung, Won-Kyo;Jang, Dong-Sik
    • Proceedings of the KAIS Fall Conference
    • /
    • 2009.05a
    • /
    • pp.325-328
    • /
    • 2009
  • Recently, as importance of knowledge property right is rising, a patent is being issue. A patent is exclusive rights of knowledge or technique, and it must be registered for approval of rights. Therefore, prediction of patent registration can be important information for company or individuals which gain profit using a patent. In this paper, we proposed a method for prediction of patent registration using text mining and a algorithm for constructing database.

  • PDF

Research Trends Analysis of Information Security using Text Mining (텍스트마이닝을 이용한 정보보호 연구동향 분석)

  • Kim, Taekyung;Kim, Changsik
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.14 no.2
    • /
    • pp.19-25
    • /
    • 2018
  • With the development of IT technology, various services such as artificial intelligence and autonomous vehicles are being introduced, and many changes are taking place in our lives. However, if secure security is not provided, it will cause many risks, so the information security becomes more important. In this paper, we analyzed the research trends of main themes of information security over time. In order to conduct the research, 'Information Security' was searched in the Web of Science database. Using the abstracts of theses published from 1991 to 2016, we derived main research topics through topic modeling and time series regression analysis. The topic modeling results showed that the research topics were Information technology, system access, attack, threat, risk management, network type, security management, security awareness, certification level, information protection organization, security policy, access control, personal information, security investment, computing environment, investment cost, system structure, authentication method, user behavior, encryption. The time series regression results indicated that all the topics were hot topics.

A Study on Classifications of Useful Customer Reviews by Applying Text Mining Approach (텍스트 마이닝을 활용한 고객 리뷰의 유용성 지수 개선에 관한 연구)

  • Lee, Hong Joo
    • Journal of Information Technology Services
    • /
    • v.14 no.4
    • /
    • pp.159-169
    • /
    • 2015
  • Customer reviews are one of the important sources for purchase decision makings in online stores. Online stores have tried to provide useful reviews in product pages to customers. To assess the usefulness of customer reviews before other users have voted enough on the reviews, diverse aspects of reviews were utilized in prevous studies. Style and semantic information were utilized in many studies. This study aims to test diverse alogrithms and datasets for identifying a proper classification method and threshold to classify useful reviews. In particular, most researches utilized ratio type helpfulness index as Amazon.com used. However, there is another type of usefulness index utilized in TripAdviser.com or Yelp.com, count type helpfulness index. There was no proper threshold to classify useful reviews yet for count type helpfulness index. This study used reivews and their usefulness votes on restaurnats from Yelp.com to devise diverse datasets and applied text mining approaches to classify useful reviews. Random Forest, SVM, and GLMNET showed the greater values of accuracy than other approaches.

Toward Sentiment Analysis Based on Deep Learning with Keyword Detection in a Financial Report (재무 보고서의 키워드 검출 기반 딥러닝 감성분석 기법)

  • Jo, Dongsik;Kim, Daewhan;Shin, Yoojin
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.5
    • /
    • pp.670-673
    • /
    • 2020
  • Recent advances in artificial intelligence have allowed for easier sentiment analysis (e.g. positive or negative forecast) of documents such as a finance reports. In this paper, we investigate a method to apply text mining techniques to extract in the financial report using deep learning, and propose an accounting model for the effects of sentiment values in financial information. For sentiment analysis with keyword detection in the financial report, we suggest the input layer with extracted keywords, hidden layers by learned weights, and the output layer in terms of sentiment scores. Our approaches can help more effective strategy for potential investors as a professional guideline using sentiment values.

Changes in Specialty Coffee Consumption Post-pandemic

  • Lim, Miri;Ryu, Gihwan
    • International journal of advanced smart convergence
    • /
    • v.11 no.3
    • /
    • pp.157-161
    • /
    • 2022
  • The coffee industry continues to grow steadily due to the spread of coffee and changes in consumer awareness. Once upon a time, instant coffee was common, People today have distinct personal preferences As consumption needs for favorite foods are segmented, ways to enjoy coffee are diversifying. This study was conducted through analysis of consumption changes for specialty coffee as a changed issue of COVID-19 The goal is to present a vision for the future of the specialty coffee industry. As a research method, text mining through big data analysis was conducted to extract and analyze factors affecting the change in specialty coffee consumption. As a result of the study, we judged that specialty coffee is consumed by using a drip tool that allows you to easily enjoy coffee at home after Corona 19. Therefore, hand drips used in home cafes were found to play a central role in the change in specialty coffee consumption.

A Study on City Brand Evaluation Method Using Text Mining : Focused on News Media (텍스트 마이닝 기법을 활용한 도시 브랜드 평가방법론 연구 : 뉴스미디어를 중심으로)

  • Yoon, Seungsik;Shin, Minchul;Kang, Juyoung
    • Journal of Information Technology Services
    • /
    • v.18 no.1
    • /
    • pp.153-171
    • /
    • 2019
  • Competition among cities has become fierce with decentralization and globalization, and each city tries to establish a brand image of the city to build its competitiveness and implement its policies based on it. At this time, surveys, expert interviews, etc. are commonly used to establish city brands. These methods are difficult to establish as sampling methods an empirical component, the biggest component of a city brand. In this paper, therefore, based on the precedent research's urban brand measurement and components, the words representing each city image property were extracted and relocated to five indicators to form the evaluation index. The constructed indicators have been validated through the review of three experts. Through the index, we analyzed the brands of four cities, Ulsan, Incheon, Yeosu, and Gyeongju, and identified the factors by using Topic Modeling and Word Cloud. This methodology is expected to reduce costs and monitor timely in identifying and analyzing urban brand images in the future.

A Study on Educational Data Mining for Public Data Portal through Topic Modeling Method with Latent Dirichlet Allocation (LDA기반 토픽모델링을 활용한 공공데이터 기반의 교육용 데이터마이닝 연구)

  • Seungki Shin
    • Journal of The Korean Association of Information Education
    • /
    • v.26 no.5
    • /
    • pp.439-448
    • /
    • 2022
  • This study aims to search for education-related datasets provided by public data portals and examine what data types are constructed through classification using topic modeling methods. Regarding the data of the public data portal, 3,072 cases of file data in the education field were collected based on the classification system. Text mining analysis was performed using the LDA-based topic modeling method with stopword processing and data pre-processing for each dataset. Program information and student-supporting notifications were usually provided in the pre-classified dataset for education from the data portal. On the other hand, the characteristics of educational programs and supporting information for the disabled, parents, the elderly, and children through the perspective of lifelong education were generally indicated in the dataset collected by searching for education. The results of data analysis through this study show that providing sufficient educational information through the public data portal would be better to help the students' data science-based decision-making and problem-solving skills.

A Recognition Method for Korean Spatial Background in Historical Novels (한국어 역사 소설에서 공간적 배경 인식 기법)

  • Kim, Seo-Hee;Kim, Seung-Hoon
    • Journal of Information Technology Services
    • /
    • v.15 no.1
    • /
    • pp.245-253
    • /
    • 2016
  • Background in a novel is most important elements with characters and events, and means time, place and situation that characters appeared. Among the background, spatial background can help conveys topic of a novel. So, it may be helpful for choosing a novel that readers want to read. In this paper, we are targeting Korean historical novels. In case of English text, It can be recognize spatial background easily because it use upper and lower case and words used with the spatial information such as Bank, University and City. But, in case Korean text, it is difficult to recognize that spatial background because there is few information about usage of letter. In the previous studies, they use machine learning or dictionaries and rules to recognize about spatial information in text such as news and text messages. In this paper, we build a nation dictionaries that refer to information such as 'Korean history' and 'Google maps.' We Also propose a method for recognizing spatial background based on patterns of postposition in Korean sentences comparing to previous works. We are grasp using of postposition with spatial background because Korean characteristics. And we propose a method based on result of morpheme analyze and frequency in a novel text for raising accuracy about recognizing spatial background. The recognized spatial background can help readers to grasp the atmosphere of a novel and to understand the events and atmosphere through recognition of the spatial background of the scene that characters appeared.

HTML Text Extraction Using Frequency Analysis (빈도 분석을 이용한 HTML 텍스트 추출)

  • Kim, Jin-Hwan;Kim, Eun-Gyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.9
    • /
    • pp.1135-1143
    • /
    • 2021
  • Recently, text collection using a web crawler for big data analysis has been frequently performed. However, in order to collect only the necessary text from a web page that is complexly composed of numerous tags and texts, there is a cumbersome requirement to specify HTML tags and style attributes that contain the text required for big data analysis in the web crawler. In this paper, we proposed a method of extracting text using the frequency of text appearing in web pages without specifying HTML tags and style attributes. In the proposed method, the text was extracted from the DOM tree of all collected web pages, the frequency of appearance of the text was analyzed, and the main text was extracted by excluding the text with high frequency of appearance. Through this study, the superiority of the proposed method was verified.