• Title/Summary/Keyword: topic

Search Result 4,611, Processing Time 0.027 seconds

Comments Classification System using Topic Signature (Topic Signature를 이용한 댓글 분류 시스템)

  • Bae, Min-Young;Cha, Jeong-Won
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.12
    • /
    • pp.774-779
    • /
    • 2008
  • In this work, we describe comments classification system using topic signature. Topic signature is widely used for selecting feature in document classification and summarization. Comments are short and have so many word spacing errors, special characters. We firstly convert comments into 7-gram. We consider the 7-gram as sentence. We convert the 7-gram into 3-gram. We consider the 3-gram as word. We select key feature using topic signature and classify new inputs by the Naive Bayesian method. From the result of experiments, we can see that the proposed method is outstanding over the previous methods.

A Comparison of Ontology Languages: Focusing on W3C OWL and ISO Topic Maps (온톨로지 언어의 비교 연구: W3C OWL과 ISO 토픽맵을 중심으로)

  • Oh, Sam-Gyun
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.15 no.2
    • /
    • pp.71-96
    • /
    • 2004
  • The purpose of this study is to describe major concepts related to W3C OWL and ISO Topic Maps and to provide the result of comparison and analysis regarding semantic expression power between two ontology languages. This paper is comprised of the following parts: 1) describing URI and namespace concepts that are fundamental building block of effective ontology construction; 2) offering detailed explanation of major Topic Map concepts such as topics, associations, and occurrences; 3) providing how to accomplish the second purpose of cataloging(grouping related items when displaying the search result) using Topic Map; and 4) finally explaining the difference between two ontology languages in terms of semantic expression power.

  • PDF

Research of Patent Technology Trends in Textile Materials: Text Mining Methodology Using DETM & STM (섬유소재 분야 특허 기술 동향 분석: DETM & STM 텍스트마이닝 방법론 활용)

  • Lee, Hyun Sang;Jo, Bo Geun;Oh, Se Hwan;Ha, Sung Ho
    • The Journal of Information Systems
    • /
    • v.30 no.3
    • /
    • pp.201-216
    • /
    • 2021
  • Purpose The purpose of this study is to analyze the trend of patent technology in textile materials using text mining methodology based on Dynamic Embedded Topic Model and Structural Topic Model. It is expected that this study will have positive impact on revitalizing and developing textile materials industry as finding out technology trends. Design/methodology/approach The data used in this study is 866 domestic patent text data in textile material from 1974 to 2020. In order to analyze technology trends from various aspect, Dynamic Embedded Topic Model and Structural Topic Model mechanism were used. The word embedding technique used in DETM is the GloVe technique. For Stable learning of topic modeling, amortized variational inference was performed based on the Recurrent Neural Network. Findings As a result of this analysis, it was found that 'manufacture' topics had the largest share among the six topics. Keyword trend analysis found the fact that natural and nanotechnology have recently been attracting attention. The metadata analysis results showed that manufacture technologies could have a high probability of patent registration in entire time series, but the analysis results in recent years showed that the trend of elasticity and safety technology is increasing.

Research Trend on Diabetes Mobile Applications: Text Network Analysis and Topic Modeling (당뇨병 모바일 앱 관련 연구동향: 텍스트 네트워크 분석 및 토픽 모델링)

  • Park, Seungmi;Kwak, Eunju;Kim, Youngji
    • Journal of Korean Biological Nursing Science
    • /
    • v.23 no.3
    • /
    • pp.170-179
    • /
    • 2021
  • Purpose: The aim of this study was to identify core keywords and topic groups in the 'Diabetes mellitus and mobile applications' field of research for better understanding research trends in the past 20 years. Methods: This study was a text-mining and topic modeling study including four steps such as 'collecting abstracts', 'extracting and cleaning semantic morphemes', 'building a co-occurrence matrix', and 'analyzing network features and clustering topic groups'. Results: A total of 789 papers published between 2002 and 2021 were found in databases (Springer). Among them, 435 words were extracted from 118 articles selected according to the conditions: 'analyzed by text network analysis and topic modeling'. The core keywords were 'self-management', 'intervention', 'health', 'support', 'technique' and 'system'. Through the topic modeling analysis, four themes were derived: 'intervention', 'blood glucose level control', 'self-management' and 'mobile health'. The main topic of this study was 'self-management'. Conclusion: While more recent work has investigated mobile applications, the highest feature was related to self-management in the diabetes care and prevention. Nursing interventions utilizing mobile application are expected to not only effective and powerful glycemic control and self-management tools, but can be also used for patient-driven lifestyle modification.

Seasonal analysis of Beach-related Issues using Local Newspaper Articles and Topic Modeling (지역신문기사 자료와 토픽모델링을 이용한 해변 관련 계절별 현안분석)

  • Yoo, Mu-Sang;Jeong, Su-Yeon;Kim, Geon-Hu;Sohn, Chul
    • Journal of the Korean Regional Science Association
    • /
    • v.34 no.4
    • /
    • pp.19-34
    • /
    • 2018
  • The purpose of this study is to analyze the seasonal issues using the local newspaper articles with the keyword beach from 2004 to 2017. Topic modeling and Time series regression analysis based on open source programs were performed for analysis. Topic modeling results showed 35 topics in spring, 47 topics in summer, 36 topics in autumn and 35 topics in winter. The common themes were 'beaches', 'festivals and events', 'accident and environmental issues', 'tourism', 'development and sale', 'administration and policy' and 'weather'. Time series regression analysis showed in the spring, 5 Hot-Topics and 2 Cold-Topic were found out of the 35 topics. In the summer, 6 Hot-Topics and 3 Cold-Topic were found out of the 47 topics. In the autumn, 4 Hot-Topics and 3 Cold-Topic were found out of the 36 topics. In the winter, 3 Hot-Topics and 3 Cold-Topic were found out of the 35 topics. And for each season, topics that do not fall into the Hot-Topic and Cold-Topic are classified as Neutral-Topic. In this study if seasonal uses are different such as beaches are deemed that seasonal topic modeling for analysis of regional issues will yield more useful results and enable detailed diagnosis.

Ontology Language based on Topic Maps for Semantic Web Service (시맨틱 웹 서비스를 위한 Topic Maps 기반의 온톨로지 언어)

  • 황윤영;유정연;유소연;이규철
    • Proceedings of the CALSEC Conference
    • /
    • 2003.09a
    • /
    • pp.191-196
    • /
    • 2003
  • The Semantic web service is able to intelligently discover, execute, composite and monitor the Web Service. It constructs the ontology on Web Service and describes the Semantic Markup in the machine-readable form. The currently developing technologies of the Semantic Web Service discovery are DAML-S matchmaker in Carnegie Mellon University, Process Handbook in MIT and etc. In this paper, we propose the ontology language based on Topic Maps that supports the benefits and solves the problems of the Semantic Web Service discovery technologies .

  • PDF

Define the Ontology and Query Language Based on Topic Maps for Service (TM-S : 서비스를 위한 Topic Maps기반의 온톨로지 및 질의 언어 설계)

  • 유정연;이규철
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2004.04b
    • /
    • pp.109-111
    • /
    • 2004
  • 대표적인 시맨틱 웹 서비스 발견 기술은 OWL-S와 MIT의 Process Handbook이 있다. 그러나. OWL-S는 개발 초기 단계이기 때문에, 아직 효과적인 웹 서비스 발견을 제공하기에는 몇 가지 제약 조건을 가지고 있다. 예를 들어. 정보 전송을 위한 제악 조건과 실행에 따른 상태 변환 정보를 정의하고 있지 않다. 또한. 사용자가 원하는 프로세스들의 시맨틱 정보들을 정의하고 있지 않다. 반면, MIT Process Handbook은 OWL-S와 같이 서비스 모델에 대한 상세한 정보들을 정의하고 있지 않아, 서비스 작성에 필요한 서비스들을 찾기가 어렵다. 그러므로, 본 논문에서는 Topic Maps 기반의 TM-S(Topic Maps for Service)를 제안하였다.

  • PDF

A Study on Research Trends in the Smart Farm Field using Topic Modeling and Semantic Network Analysis (토픽모델링과 언어네트워크분석을 활용한 스마트팜 연구 동향 분석)

  • Oh, Juyeon;Lee, Joonmyeong;Hong, Euiki
    • Journal of Digital Convergence
    • /
    • v.20 no.2
    • /
    • pp.203-215
    • /
    • 2022
  • The study is to investigate research trends and knowledge structures in the Smart Farm field. To achieve the research purpose, keywords and the relationship among keywords were analyzed targeting 104 Korean academic journals related to the Smart Farm in KCI(Korea Citation Index), and topics were analyzed using the LDA Topic Modeling technique. As a result of the analysis, the main keywords in the Korean Smart Farm-related research field were 'environment', 'system', 'use', 'technology', 'cultivation', etc. The results of Degree, Betweenness, and Eigenvector Centrality were presented. There were 7 topics, such as 'Introduction analysis of Smart Farm', 'Eco-friendly Smart Farm and economic efficiency of Smart Farm', 'Smart Farm platform design', 'Smart Farm production optimization', 'Smart Farm ecosystem', 'Smart Farm system implementation', and 'Government policy for Smart Farm' in the results of Topic Modeling. This study will be expected to serve as basic data for policy development necessary to advance Korean Smart Farm research in the future by examining research trends related to Korean Smart Farm.

A Study on the Trends of Construction Safety Accident in Unstructured Text Using Topic Modeling (비정형 텍스트 기반의 토픽 모델링을 이용한 건설 안전사고 동향 분석)

  • Lee, Sang-Gyu
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.19 no.10
    • /
    • pp.176-182
    • /
    • 2018
  • In order to understand and track the trends of construction safety accident, this study shows the topic trends in the construction safety accident with LDA(Latent Dirichlet Allocation)-based topic modeling method for data analytics. Especially, it performs to figure out the main issue of construction safety accident with unstructured data analysis based on the topic modeling rather than a variety of structured data analysis for preventing to safety accident in construction industry. To apply this methodology, I randomly collected to 540 news article data about construction accident from January 2017 to February 2018. Based on the unstructured data with the LDA-based topic modeling, I found the 10 topics and identified key issues through 10 keyword in each 10 topics. I forecasted the topic issue related to construction safety accident based on analysis of time-series trends about the news data from January 2017 to February 2018. With this method, this research gives a hint about ways of using unstructured news article data to anticipate safety policy and research field and to respond to construction accident safety issues in the future.

Combining Ego-centric Network Analysis and Dynamic Citation Network Analysis to Topic Modeling for Characterizing Research Trends (자아 중심 네트워크 분석과 동적 인용 네트워크를 활용한 토픽모델링 기반 연구동향 분석에 관한 연구)

  • Yu, So-Young
    • Journal of the Korean Society for information Management
    • /
    • v.32 no.1
    • /
    • pp.153-169
    • /
    • 2015
  • The combined approach of using ego-centric network analysis and dynamic citation network analysis for refining the result of LDA-based topic modeling was suggested and examined in this study. Tow datasets were constructed by collecting Web of Science bibliographic records of White LED and topic modeling was performed by setting a different number of topics on each dataset. The multi-assigned top keywords of each topic were re-assigned to one specific topic by applying an ego-centric network analysis algorithm. It was found that the topical cohesion of the result of topic modeling with the number of topic corresponding to the lowest value of perplexity to the dataset extracted by SPLC network analysis was the strongest with the best values of internal clustering evaluation indices. Furthermore, it demonstrates the possibility of developing the suggested approach as a method of multi-faceted research trend detection.