• Title/Summary/Keyword: 잠재 디리클레 할당

Search Result 43, Processing Time 0.023 seconds

A Content-based TV Program Recommendation System Using Age and Plots (연령 및 프로그램 줄거리를 활용한 콘텐츠 기반 TV 프로그램 추천 시스템)

  • Bang, Hanbyul;Lee, HyeWoo;Lee, Jee-Hyong
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2015.01a
    • /
    • pp.51-54
    • /
    • 2015
  • 추천 시스템의 대표적인 연구 중 하나인 콘텐츠 기반 추천 시스템 연구는 TV 프로그램이나 영화의 줄거리, 장르, 리뷰 등의 콘텐츠의 메타데이터를 이용한다. 그러나 이러한 연구들은 콘텐츠 관련 정보에만 의존할 뿐, 시청자의 프로파일과 콘텐츠의 정보를 함께 고려하지 않는다. 본 논문에서는 시청자의 프로파일 중 연령과 콘텐츠의 정보인 프로그램의 줄거리를 활용한 TV 프로그램 추천 시스템을 제안한다. 본 추천 시스템은 시청자를 연령에 따라 분류한 후, LDA 알고리즘을 이용하여 시청자의 시청 TV 프로그램의 줄거리를 분류된 나이에 따라 각각의 줄거리 토픽 모델로 생성한다. 이를 기준으로 시청자가 원하는 시간대에 방송되는 프로그램들의 줄거리 토픽벡터와 시청자의 선호도 토픽벡터의 유사도를 비교해 가장 유사도가 높은 TV 프로그램을 시청자에게 추천하는 방식이다. 본 논문에서는 연구의 효용성을 검증하기 위해 줄거리만을 사용한 경우와 줄거리와 연령을 동시에 활용한 경우를 비교 실험하였다. 실험을 통해 프로그램의 줄거리만을 사용한 경우보다 연령을 동시에 활용한 경우의 추천 시스템 성능이 개선된 것을 확인할 수 있었다.

  • PDF

Automatic Malware Detection Rule Generation and Verification System (악성코드 침입탐지시스템 탐지규칙 자동생성 및 검증시스템)

  • Kim, Sungho;Lee, Suchul
    • Journal of Internet Computing and Services
    • /
    • v.20 no.2
    • /
    • pp.9-19
    • /
    • 2019
  • Service and users over the Internet are increasing rapidly. Cyber attacks are also increasing. As a result, information leakage and financial damage are occurring. Government, public agencies, and companies are using security systems that use signature-based detection rules to respond to known malicious codes. However, it takes a long time to generate and validate signature-based detection rules. In this paper, we propose and develop signature based detection rule generation and verification systems using the signature extraction scheme developed based on the LDA(latent Dirichlet allocation) algorithm and the traffic analysis technique. Experimental results show that detection rules are generated and verified much more quickly than before.

Analysis of sustainable fashion research trends using topic modeling (토픽 모델링을 이용한 지속가능패션 연구 동향 분석)

  • Lee, Hana
    • The Research Journal of the Costume Culture
    • /
    • v.29 no.4
    • /
    • pp.538-553
    • /
    • 2021
  • As interest in the sustainable fashion industry continues to increase along with climate issues, it is necessary to identify research trends in sustainable fashion and seek new development directions. Therefore, this study aims to analyze research trends on sustainable fashion. For this purpose, related papers were collected from the KCI (Korean Citation Index) and Scopus, and 340 articles were used for the study. The collected data went through data transformation, data preprocessing, topic modeling analysis, core topic derivation, and visualization through a Python algorithm. A total of eight topics were obtained from the comprehensive analysis: consumer clothing consumption behavior and environment, upcycle product development, product types by environmental approach, ESG business activities, materials and material development, process-based approach, lifestyle and consumer experience, and brand strategy. Topics were related to consumption, production, and education of sustainable fashion, respectively. KCI analysis results and Scopus analysis results derived eight topics but showed differences from the comprehensive analysis results. This study provides primary data for exploring various themes of sustainable fashion. It is significant in that the data were analyzed based on probability using a research method that excluded the subjective value of the researcher. It is recommended that follow-up studies be conducted to examine social trends.

Analysis of speech in game marketing video using text mining techniques (텍스트 마이닝 기법을 이용한 게임 마케팅 비디오에서의 스피치 분석)

  • Lee, Yeokyung;Kim, Jaejik
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.1
    • /
    • pp.147-159
    • /
    • 2022
  • Nowadays, various social media platforms are widely spread and people closely use such platforms in daily life. By doing so, social influencers with a large number of subscribers, views, and comments have huge impact in our society. Following this trend, many companies are actively using influencers for marketing purpose to promote their products and services. In this study, we extract the speeches of influencers from videos for game marketing and analyze them using various text mining techniques. In the analysis, we distinguish game videos leading to successful marketing and failed marketing, and we explore and compare the linguistic features of the influencers for successful and failed marketings.

Extracting User-Specific Advertising Keywords Based on Textual Data Mining from KakaoTalk (카카오톡에서의 텍스트 데이터 마이닝 기반의 사용자별 적합 광고 키워드 도출 )

  • Yerim Jeon;Dayeong So;Jimin Lee;Eunjin (Jinny) Jo;Jihoon Moon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.05a
    • /
    • pp.368-369
    • /
    • 2023
  • 대화 데이터 기반 광고 추천은 광고 마케팅에서 고객 맞춤형 광고 제공, 마케팅 효과 극대화 등을 위한 중요한 기술로 주목받고 있다. 본 논문에서는 모바일 인스턴스 메신저인 카카오톡 대화창에서 발생한 텍스트 데이터를 기반으로 대화 내용을 분석하여 대화 주제별 적절한 광고 키워드를 제안한다. 이를 위해 주제별 대화 내용을 미용, 식음료, 상거래로 세분하고 KoNLPy 의 Okt 를 이용하여 텍스트 전처리를 수행하고 키워드별로 빈도수를 뽑아 워드 클라우드를 제시한다. 또한, 잠재 디리클레 할당(Latent Dirichlet Allocation, LDA)을 기반으로 대화 주제를 세분화한 뒤 라벨링을 통해 주제별 대화 키워드를 분석한다. 실험 결과, 대화 주제를 온라인 쇼핑, 헤어, 뷰티 관리, 음식으로 나눌 수 있었으며, 토픽별 상위 키워드를 Word2Vec 을 통해 특정 단어와 유사한 키워드를 도출하여 적절한 광고 키워드를 제시할 수 있었다.

Automatic Product Review Helpfulness Estimation based on Review Information Types (상품평의 정보 분류에 기반한 자동 상품평 유용성 평가)

  • Kim, Munhyong;Shin, Hyopil
    • Journal of KIISE
    • /
    • v.43 no.9
    • /
    • pp.983-997
    • /
    • 2016
  • Many available online product reviews for any given product makes it difficult for a consumer to locate the helpful reviews. The purpose of this study was to investigate automatic helpfulness evaluation of online product reviews according to review information types based on the target of information. The underlying assumption was that consumers find reviews containing specific information related to the product itself or the reliability of reviewers more helpful than peripheral information, such as shipping or customer service. Therefore, each sentence was categorized by given information types, which reduced the semantic space of review sentences. Subsequently, we extracted specific information from sentences by using a topic-based representation of the sentences and a clustering algorithm. Review ranking experiments indicated more effective results than other comparable approaches.

Analysis on Status and Trends of SIAM Journal Papers using Text Mining (텍스트마이닝 기법을 활용한 미국산업응용수학 학회지의 연구 현황 및 동향 분석)

  • Kim, Sung-Yeun
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.7
    • /
    • pp.212-222
    • /
    • 2020
  • The purpose of this study is to understand the current status and trends of the research studies published by the Society for Industrial and Applied Mathematics which is a leader in the field of industrial mathematics around the world. To perform this purpose, titles and abstracts were collected from 6,255 research articles between 2016 and 2019, and the R program was used to analyze the topic modeling model with LDA techniques and a regression model. As the results of analyses, first, a variety of studies have been studied in the fields of industrial mathematics, such as algebra, discrete mathematics, geometry, topological mathematics, probability and statistics. Second, it was found that the ascending research subjects were fluid mechanics, graph theory, and stochastic differential equations, and the descending research subjects were computational theory and classical geometry. The results of the study, based on the understanding of the overall flows and changes of the intellectual structure in the fields of industrial mathematics, are expected to provide researchers in the field with implications of the future direction of research and how to build an industrial mathematics curriculum that reflects the zeitgeist in the field of education.

Research Trend Analysis in Fashion Design Studies in Korea using Topic Modeling (토픽모델링을 이용한 국내 패션디자인 연구동향 분석)

  • Jang, Namkyung;Kim, Min-Jeong
    • Journal of Digital Convergence
    • /
    • v.15 no.6
    • /
    • pp.415-423
    • /
    • 2017
  • This study explored research trends by investigating articles published in the Journal of Korean Society of Fashion Design from 2001 through 2015. English key words and abstracts were analyzed using text mining and topic modeling techniques. The findings are as followings. By the text mining technique, 183 core terms, appeared more than 30 times, were derived from 7137 words used in total 338 articles' key words and abstracts. 'Fashion' and 'design' showed the highest frequency rate. After that, the well-received topic modeling technique, LDA, was applied to the collected data sets. Several distinct sub-research domains strongly tied with the previous fashion design field, except for topics such as fashion brand marketing and digital technology, were extracted. It was observed that there are the growing and declining trends in the research topics. Based on findings, implication, limitation, and future research questions were presented.

Topic Modeling on Fine Dust Issues Using LDA Analysis (LDA 기법을 이용한 미세먼지 이슈의 토픽모델링 분석)

  • Yoon, soonuk;Kim, Minchul
    • Journal of Energy Engineering
    • /
    • v.29 no.2
    • /
    • pp.23-29
    • /
    • 2020
  • In this study, the last 10 years of news data on fine dust was collected and 80 topics are selected through LDA analysis. As a result, weather-related information made up the main words for the topic, and we can see that fine dust becomes a big issue below 10 degrees Celsius. The frequency of exposure to the media and the maximum concentration of fine dust are correlated with positive. Topics related to fine dust reduction measures and the government's comprehensive measures over the past decade, topics related to products such as air purifiers related to fine dust, topics related to policies protecting vulnerable people from fine dust, and topics on fine dust reduction through R&D were found to be major topics. Measures against fine dust as a social issue can be seen to be closely related to the government's policy.

Convergence Study on Research Topics for Thyroid Cancer in Korea (국내 갑상선암 논문 토픽에 대한 융합연구)

  • Yang, Ji-Yeon
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.2
    • /
    • pp.75-81
    • /
    • 2019
  • The purpose of this study was to perform a convergence study for the investigation of the trend of research topics related to thyroid cancer in Korea. We collected related research papers from DBpia and employed LDA-based topic model. In result, we identified four research topics, each of which concerns "Surgery", "Disease aggressiveness", "Survival analysis", and "Well-being of patients". With multinomial logistic regression, we found significant time trend, where "Surgery"-related topic was popular before 2000, topics regarding "Disease aggressiveness" and "Survival analysis" were frequently addressed in the 2000s, and "Survival analysis" and especially "Well-being of patients" have been pursued since 2010. The findings would serve as a reference guide for research directions. Future work may examine whether the recent change in research topics is observed in other diseases.