• 제목/요약/키워드: topic mining

검색결과 503건 처리시간 0.032초

메타데이터 마이닝 시스템의 관리효율성의 제고전략 (Strategy to Improve the Management Efficiency of Meta Data Mining System)

  • 윤용운
    • 한국경영과학회:학술대회논문집
    • /
    • 한국경영과학회/대한산업공학회 2005년도 춘계공동학술대회 발표논문
    • /
    • pp.276-279
    • /
    • 2005
  • Many large organizations that have allocated resources to Data Administration(DA) have DA-context meta data mining. Also meta data is an interesting topic in the data warehouse world. This conceptual view gradually cleared up, and recently we have been talking more confidently about the back-room and front-room meta data. We describe the processes and problems that characterize the general architecture of s meta data mining system to do improve management efficiency that require further research and development.

  • PDF

텍스트 마이닝을 활용한 황해 관련 연구동향 분석연구 (Analysis of Research Trends in Relation to the Yellow Sea using Text Mining)

  • 황규원;김진경;강승구;강길모
    • 해양환경안전학회지
    • /
    • 제29권7호
    • /
    • pp.724-739
    • /
    • 2023
  • 황해는 지정학적으로 한국, 중국, 북한 사이 해역에 위치하고 있으며, 최근 해양공간 이용이 확대되어 사회적·경제적 가치가 증가하고 있다. 또한 기후변화로 인한 해양환경 변화, 대기오염물질 이동 등 한·중 공동 대응 및 협력의 필요성이 증가되고 있다. 본 연구에서는 황해(Yellow Sea) 키워드의 연구논문을 대상으로 핵심주제(Topic)을 도출하고, 저자 네트워크 분석을 수행하여 연구동향을 탐색하였다. 연구대상으로 1984년부터 2021년 사이에 게재된 Web of Science DataBase의 황해 관련 연구논문을 추출하고, 한중 어업협정, 해양환경공동조사 등 한국과 중국의 주요 이벤트를 중심으로 4개의 시기로 구분하였다. 연구방법으로 텍스트 마이닝(Text Mining)의 일종인 토픽모델링(Topic Modeling)을 활용하여 Topic을 도출하였다. 또한 저자 네트워크를 분석하여 해당 분야의 주요 연구 그룹(Community)과 연구자 및 연구기관의 영향력을 파악하고 시사점을 제시하였다. 분석결과 황해 연구논문의 핵심주제는 1기 퇴적물, 해양생물, 2기 산성화, 미세먼지, 3기 수산양식, 지진, 4기 탄소요인, 해양생태계 등으로 변화하였고, 시기별로 핵심 연구자를 중심의 연구자 그룹이 증가하였다. 연구결과를 토대로 황해 관련 연구 동향과 주요 연구자 및 연구기관을 파악함으로써 향후 한국과 중국 간의 황해 연구협력에 기여하고자 한다.

Improvement of recommendation system using attribute-based opinion mining of online customer reviews

  • Misun Lee;Hyunchul Ahn
    • 한국컴퓨터정보학회논문지
    • /
    • 제28권12호
    • /
    • pp.259-266
    • /
    • 2023
  • 본 논문에서는 속성기반 오피니언 마이닝(ABOM)을 적용한 협업 필터링의 정확도 성능을 개선할 수 있는 알고리즘을 제안한다. 실험을 위해 국내 스마트폰 사용자의 스마트폰 앱에 대한 총 1,227건의 온라인 소비자 리뷰 데이터가 분석에 사용되었다. KKMA(꼬꼬마)분석기를 이용하여 형태소 분석 및 KOSAC를 사용하여 감성어 분석 후 LDA 토픽 모델링을 사용하여 속성 추출한 가중치 값을 부여한 리뷰별로 토픽 모델링 결과를 이용하여 협업필터링의 평점과 감성스코어의 평점을 합산한 평균값 정확도 오차를 계산한 통계모형 성능 평가인 MAE, MAPE, RMSE를 사용하였다. 실험을 통해 추천 알고리즘 중 전통적인 협업필터링과 LDA 속성 추출과 감성분석을 결합한 속성기반 오피니언 마이닝(Aspect-Based Opinion Mining, ABOM) 기법을 결합하여 온라인 고객의 앱 평점(APP_Score) 대한 정확도를 예측하였다. 분석 결과 전통적인 협업필터링을 구현한 평점의 정확도 보다 속성기반 오피니언 마이닝 CF를 적용한 평점의 예측 정확도가 더 우수한 것으로 나타났다.

K 패션에 대한 글로벌 미디어 보도 경향 분석 -다이내믹 토픽 모델링(Dynamic Topic Modeling)의 적용- (Analysis of Global Media Reporting Trends for K-fashion -Applying Dynamic Topic Modeling-)

  • 안효선;김지영
    • 한국의류학회지
    • /
    • 제46권6호
    • /
    • pp.1004-1022
    • /
    • 2022
  • This study seeks to investigate K-fashion's external image by examining the trends in global media reporting. It applies Dynamic Topic Modeling (DTM), which captures the evolution of topics in a sequentially organized corpus of documents, and consists of text preprocessing, the determination of the number of topics, and a timeseries analysis of the probability distribution of words within topics. The data set comprised 551 online media articles on 'Korean fashion' or 'K-fashion' published on Google News between 2010 and 2021. The analysis identifies seven topics: 'brand look and style,' 'lifestyle,' 'traditional style,' 'Seoul Fashion Week (SFW) event,' 'model size,' 'K-pop,' and 'fashion market,' as well as annual topic proportion trends. It also explores annual word changes within the topic and indicates increasing and decreasing word patterns. In most topics, the probability distribution of the word 'brand' is confirmed to be on the increase, while 'digital,' 'platform,' and 'virtual' have been newly created in the 'SFW event' topic. Moreover, this study confirms the transition of each K-fashion topic over the past 12 years, along with various factors related to Hallyu content, traditional culture, government support, and digital technology innovation.

Research trends in the Korean Journal of Women Health Nursing from 2011 to 2021: a quantitative content analysis

  • Ju-Hee Nho;Sookkyoung Park
    • 여성건강간호학회지
    • /
    • 제29권2호
    • /
    • pp.128-136
    • /
    • 2023
  • Purpose: Topic modeling is a text mining technique that extracts concepts from textual data and uncovers semantic structures and potential knowledge frameworks within context. This study aimed to identify major keywords and network structures for each major topic to discern research trends in women's health nursing published in the Korean Journal of Women Health Nursing (KJWHN) using text network analysis and topic modeling. Methods: The study targeted papers with English abstracts among 373 articles published in KJWHN from January 2011 to December 2021. Text network analysis and topic modeling were employed, and the analysis consisted of five steps: (1) data collection, (2) word extraction and refinement, (3) extraction of keywords and creation of networks, (4) network centrality analysis and key topic selection, and (5) topic modeling. Results: Six major keywords, each corresponding to a topic, were extracted through topic modeling analysis: "gynecologic neoplasms," "menopausal health," "health behavior," "infertility," "women's health in transition," and "nursing education for women." Conclusion: The latent topics from the target studies primarily focused on the health of women across all age groups. Research related to women's health is evolving with changing times and warrants further progress in the future. Future research on women's health nursing should explore various topics that reflect changes in social trends, and research methods should be diversified accordingly.

Biomedical Ontologies and Text Mining for Biomedicine and Healthcare: A Survey

  • Yoo, Ill-Hoi;Song, Min
    • Journal of Computing Science and Engineering
    • /
    • 제2권2호
    • /
    • pp.109-136
    • /
    • 2008
  • In this survey paper, we discuss biomedical ontologies and major text mining techniques applied to biomedicine and healthcare. Biomedical ontologies such as UMLS are currently being adopted in text mining approaches because they provide domain knowledge for text mining approaches. In addition, biomedical ontologies enable us to resolve many linguistic problems when text mining approaches handle biomedical literature. As the first example of text mining, document clustering is surveyed. Because a document set is normally multiple topic, text mining approaches use document clustering as a preprocessing step to group similar documents. Additionally, document clustering is able to inform the biomedical literature searches required for the practice of evidence-based medicine. We introduce Swanson's UnDiscovered Public Knowledge (UDPK) model to generate biomedical hypotheses from biomedical literature such as MEDLINE by discovering novel connections among logically-related biomedical concepts. Another important area of text mining is document classification. Document classification is a valuable tool for biomedical tasks that involve large amounts of text. We survey well-known classification techniques in biomedicine. As the last example of text mining in biomedicine and healthcare, we survey information extraction. Information extraction is the process of scanning text for information relevant to some interest, including extracting entities, relations, and events. We also address techniques and issues of evaluating text mining applications in biomedicine and healthcare.

텍스트마이닝을 활용한 Covid-19 기간 동안의 항공산업 관련 키워드 트렌드 분석 (Keyword trends analysis related to the aviation industry during the Covid-19 period using text mining)

  • 최동현;송보미;박다현;이성우
    • 한국산업정보학회논문지
    • /
    • 제27권2호
    • /
    • pp.115-128
    • /
    • 2022
  • 본 연구는 Covid-19 팬데믹이 항공산업에 미친 영향과 동향을 살펴보고자 국내 뉴스 기사 데이터를 활용하여 키워드 트렌드 분석을 진행하였다. 데이터 수집을 위하여 Covid-19 발생 기준으로 전, 후 각 6개월의 기간을 나누어 '항공사' 키워드를 중심으로 관련 기사들을 추출하였다. 이후 기간별 동시 출현 빈도를 파악한 후 LDA 기법을 이용하여 토픽 모델링을 진행하였으며, Covid-19의 진행 동향과 토픽 패턴과의 관계 분석을 통해 상황에 따른 주요 토픽을 도출하였다. 이러한 결과를 활용하여 Covid-19와 같이 범세계적으로 영향을 주는 전염병이 발생할 경우 그 추이에 따라 항공산업에 미치는 영향을 예측할 수 있는 기초자료로 활용될 수 있을 것으로 기대된다.

Analyzing the Major Issues of the 4th Industrial Revolution

  • Jeon, Jeonghwan;Suh, Yongyoon
    • Asian Journal of Innovation and Policy
    • /
    • 제6권3호
    • /
    • pp.262-273
    • /
    • 2017
  • Recently, the attention to the $4^{th}$ Industrial Revolution has been increasing. In the $4^{th}$ Industrial Revolution era, the boundaries between physical space, digital space, and biological space are becoming blurred because of the active convergence between various fields. There are many issues about the $4^{th}$ Industrial Revolution such as artificial intelligence, Internet of things, big data, and cyber physical system. To cope with the $4^{th}$ Industrial Revolution, an accurate analysis and technology planning need to be undertaken from a broad point of view. However, there is little research on the analysis of the major issues about the 4th Industrial Revolution. Accordingly, this study aims to analyse these major issues. Data mining such as topic modelling method is used for this analysis. This study is expected to be helpful for the researcher and policy maker of the 4th Industrial Revolution.

토픽모델링을 활용한 4차 산업혁명의 주요 이슈 분석

  • 전정환;서용윤
    • 한국기술혁신학회:학술대회논문집
    • /
    • 한국기술혁신학회 2017년도 추계학술대회 논문집
    • /
    • pp.1321-1321
    • /
    • 2017
  • Recently the attention to the 4th industrial revolution has been increasing. In the 4th industrial revolution era, the boundaries of physical space, digital space, and biological space are becoming blurred since the active convergence between various fields There are a lot of issues on the 4th industrial revolution such as artificial intelligence, internet of thing, big data, and cyber physical system. Accordingly, this study aims to analyse the main issues of the 4th industrial revolution. Data mining such as topic modelling method is used for the analysis. This study is expected to be helpful for the researcher and policy maker of the 4th industrial revolution.

  • PDF

텍스트 마이닝을 이용한 시대별 청소년 문제 토픽 분석 (Topic Analysis on the Adolescent Problem Using Text Mining)

  • 조경원;조주연
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2018년도 추계학술대회
    • /
    • pp.203-204
    • /
    • 2018
  • This research was conducted to identify adolescent problems in internet articles. This research defines adolescent problems as diverse issues related to adolescents and examine how it was dealt in the media to find out how different categories and the aspect of adolescent problems are changing by time. The result of the research was that in 1990's, education policy and family were mainly dealt with when it came to adolescent problems. As the era is changing, adolescent problems were far diversified compared to the past, and each problems are dealt with similar importance. This research is significant in that it does not only examine the social trend adolescent problems but also expand the range of adolescent counselling and utilizes quantitative analysis in considering diversity to provide new information.

  • PDF