• Title/Summary/Keyword: Text-Mining

Search Result 1,510, Processing Time 0.024 seconds

A Study on Correlation Analysis of One-Person Housing Space Design Convergence Contents by Using Social Network Analysis (소셜 네트워크 분석 방법론을 활용한 1인 주거공간디자인 융합콘텐츠 상관관계 분석)

  • Park, Eun Soo;Kim, Ji Eun
    • Korea Science and Art Forum
    • /
    • v.34
    • /
    • pp.133-148
    • /
    • 2018
  • Korea's housing structure is predicted that one-person housing will be the most common type of housing in Korea. Therefore, this study intends to derive contents for designing a one-person housing space considering the life of a rapidly increasing one-person householder. For this purpose, this study objectively derives the social, economic and cultural influencing factors of one-person households through big data analysis, and analyzed the correlation between contents using social network analysis methodology. In this paper, 60 core contents related to one person housing space were derived by applying big data analysis methodology. And through social network analysis, the most influential contents were derived from the space editing and space composition categories. This means that the residential space is an important part of the design idea that can flexibly respond to changes in the user's life. Based on this study, future research will focus on the concept and design methodology of one-person housing space.

Big Data Application for Judgment on Consumer's Awareness of the Trademark (상표의 소비자 인식 판단을 위한 빅데이터 활용 방안)

  • You, Hyun-Woo;Lee, Hwan-soo
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.6 no.8
    • /
    • pp.399-408
    • /
    • 2016
  • As entering the Big Data age, utilization of Big Data is also increasing in the intellectual property sector. Meanwhile, the purpose of a trademark which distinguishes the source of the goods essentially is to enable the public to recognize the goods. Big Data technologies which is recently becoming a issue can be used as a tool to judge consumer's awareness of the trademark. It was difficult for judgment of trademark awareness through traditional ways. As a new way, survey methodology has bee received attention, and it was applied to the field of trademark law. However, various problems such as cost, time, objectivity, and fairness were observed. In order to overcome theses limitations, this study proposes new way utilizing big data analytics for judgment on consumer's awareness of the trademark. This new way will not only contribute to enhancing the objectivity of judging trademark awareness but also utilized to support for related legal judgments.

Stock Market Prediction Using Sentiment on YouTube Channels (유튜브 주식채널의 감성을 활용한 코스피 수익률 등락 예측)

  • Su-Ji, Cho;Cheol-Won Yang;Ki-Kwang Lee
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.46 no.2
    • /
    • pp.102-108
    • /
    • 2023
  • Recently in Korea, YouTube stock channels increased rapidly due to the high social interest in the stock market during the COVID-19 period. Accordingly, the role of new media channels such as YouTube is attracting attention in the process of generating and disseminating market information. Nevertheless, prior studies on the market forecasting power of YouTube stock channels remain insignificant. In this study, the market forecasting power of the information from the YouTube stock channel was examined and compared with traditional news media. To measure information from each YouTube stock channel and news media, positive and negative opinions were extracted. As a result of the analysis, opinion in channels operated by media outlets were found to be leading indicators of KOSPI market returns among YouTube stock channels. The prediction accuracy by using logistic regression model show 74%. On the other hand, Sampro TV, a popular YouTube stock channel, and the traditional news media simply reported the market situation of the day or instead showed a tendency to lag behind the market. This study is differentiated from previous studies in that it verified the market predictive power of the information provided by the YouTube stock channel, which has recently shown a growing trend in Korea. In the future, the results of advanced analysis can be confirmed by expanding the research results for individual stocks.

Visualizing Unstructured Data using a Big Data Analytical Tool R Language (빅데이터 분석 도구 R 언어를 이용한 비정형 데이터 시각화)

  • Nam, Soo-Tai;Chen, Jinhui;Shin, Seong-Yoon;Jin, Chan-Yong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.151-154
    • /
    • 2021
  • Big data analysis is the process of discovering meaningful new correlations, patterns, and trends in large volumes of data stored in data stores and creating new value. Thus, most big data analysis technology methods include data mining, machine learning, natural language processing, and pattern recognition used in existing statistical computer science. Also, using the R language, a big data tool, we can express analysis results through various visualization functions using pre-processing text data. The data used in this study was analyzed for 21 papers in the March 2021 among the journals of the Korea Institute of Information and Communication Engineering. In the final analysis results, the most frequently mentioned keyword was "Data", which ranked first 305 times. Therefore, based on the results of the analysis, the limitations of the study and theoretical implications are suggested.

  • PDF

Visualizing Article Material using a Big Data Analytical Tool R Language (빅데이터 분석 도구 R 언어를 이용한 논문 데이터 시각화)

  • Nam, Soo-Tai;Shin, Seong-Yoon;Jin, Chan-Yong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.326-327
    • /
    • 2021
  • Newly, big data utilization has been widely interested in a wide variety of industrial fields. Big data analysis is the process of discovering meaningful new correlations, patterns, and trends in large volumes of data stored in data stores and creating new value. Thus, most big data analysis technology methods include data mining, machine learning, natural language processing, and pattern recognition used in existing statistical computer science. Also, using the R language, a big data tool, we can express analysis results through various visualization functions using pre-processing text data. The data used in this study were analyzed for 29 papers in a specific journal. In the final analysis results, the most frequently mentioned keyword was "Research", which ranked first 743 times. Therefore, based on the results of the analysis, the limitations of the study and theoretical implications are suggested.

  • PDF

Development of Social Data Collection and Loading Engine-based Reliability analysis System Against Infectious Disease Pandemic (감염병 위기 대응을 위한 소셜 데이터 수집 및 적재 엔진 기반 신뢰도 분석 시스템 개발)

  • Doo Young Jung;Sang-Jun Lee;MIN KYUNG IL;Seogsong Jeong;HyunWook Han
    • The Journal of Bigdata
    • /
    • v.7 no.2
    • /
    • pp.103-111
    • /
    • 2022
  • There are many institutions, organizations, and sites related to responding to infectious diseases, but as the pandemic situation such as COVID-19 continues for years, there are many changes in the initial and current aspects, and accordingly, policies and response systems are evolving. As a result, regional gaps arise, and various problems are scattered due to trust, distrust, and implementation of policies. Therefore, in the process of analyzing social data including information transmission, Twitter data, one of the major social media platforms containing inaccurate information from unknown sources, was developed to prevent facts in advance. Based on social data, which is unstructured data, an algorithm that can automatically detect infectious disease threats is developed to create an objective basis for responding to the infectious disease crisis to solidify international competitiveness in related fields.

A Study of Information Literacy Curriculum Using Topic Modeling (토픽모델링을 활용한 정보활용교육 연구주제 분석 및 교육내용 제안)

  • Jihye, Yun;Yoo Kyung, Jeong
    • Journal of the Korean Society for information Management
    • /
    • v.39 no.4
    • /
    • pp.1-21
    • /
    • 2022
  • The aim of this study is to identify the research topics and suggest an information literacy curriculum by analyzing research articles on information literacy. For this purpose, we applied the topic modeling technique to 97 scientific articles and identified the core contents of information literacy education, such as media literacy, information literacy instruction, and the use of information resources. Based on the analysis results, we suggested an information literacy curriculum by considering the Big 6 model, information literacy standards of American Association of School Library, and Association of College and Research Libraries's information literacy competencies. This study is significant in that it considered 'use of information resources' and 'information ethics' to suggest information literacy education.

Consumers' perceptions of dietary supplements before and after the COVID-19 pandemic based on big data

  • Eunjung Lee;Hyo Sun Jung;Jin A Jang
    • Journal of Nutrition and Health
    • /
    • v.56 no.3
    • /
    • pp.330-347
    • /
    • 2023
  • Purpose: This study identified words closely associated with the keyword "dietary supplement" (DS) using big data in Korean social media and investigated consumer perceptions and trends related to DSs before (2019) and after the coronavirus disease 2019 (COVID-19) pandemic (2021). Methods: A total of 37,313 keywords were found for the 2019 period, and 35,336 keywords were found for the 2021 period using blogs and cafes on Daum and Naver. Results were derived by text mining, semantic networking, network visualization analysis, and sentiment analysis. Results: The DS-related keywords that frequently appeared before and after COVID-19 were "recommend", "vitamin", "health", "children", "multiple", and "lactobacillus". "Calcium", "lutein", "skin", and "immunity" also had high frequency-inverse document frequency (TF-IDF) values. These keywords imply a keen interest in DSs among Korean consumers. Big data results also reflected social phenomena related to DSs; for example, "baby" and "pregnant woman" had lower TD-IDF values after the pandemic, suggesting lower marriage and birth rates but higher values for "joint", indicating reduced physical activity. A network centered on vitamins and health care was produced by semantic network analysis in 2019. In 2021, values were highest for deficiency and need, indicating that individuals were searching for DSs after the COVID-19 pandemic due to a lack an awareness of the need for adequate nutrient intake. Before the pandemic, DSs and vitamins were associated with healthcare and life cycle-related topics, such as pregnancy, but after the COVID-19 pandemic, consumer interests changed to disease prevention and treatment. Conclusion: This study provides meaningful clues regarding consumer perceptions and trends related to DSs before and after the COVID-19 pandemic and fundamental data on the effect of the pandemic on consumer interest in dietary supplements.

A study of changes in user experience and service evaluation - Topic modeling of Netflix app reviews (사용자 경험과 서비스 평가의 변화에 관한 연구 - 넷플릭스 앱 리뷰 토픽 모델링을 통해)

  • Seon Yeong Yu;Mi Jin Noh;Yang Sok Kim;Mu Moung Cho Han
    • Smart Media Journal
    • /
    • v.12 no.6
    • /
    • pp.27-34
    • /
    • 2023
  • As Netflix usage has increased due to the COVID-19 pandemic, users' experiences with the service have also increased. Therefore, this study aims to conduct topic modeling analysis based on Netflix review data to explore the changes in Netflix user experience and service before and after the COVID-19 pandemic. We collected Netflix app review data from the Google Play Store using the Google Play Scraper library, and used topic modeling to examine keyword differences between app reviews before and after the pandemic. The analysis revealed four main topics: Netflix app features, Netflix content, Netflix service usage, and Netflix overall reviews. After the pandemic, when user experience increased, users tended to use more diverse and detailed keywords in their reviews. By using Netflix review data to analyze users' opinions, this study shows the changes in user experience of Netflix services before and after the pandemic, which can be used as a guide to strengthen competitiveness in the competitive OTT market.

Keyword Extraction through Text Mining and Open Source Software Category Classification based on Machine Learning Algorithms (텍스트 마이닝을 통한 키워드 추출과 머신러닝 기반의 오픈소스 소프트웨어 주제 분류)

  • Lee, Ye-Seul;Back, Seung-Chan;Joe, Yong-Joon;Shin, Dong-Myung
    • Journal of Software Assessment and Valuation
    • /
    • v.14 no.2
    • /
    • pp.1-9
    • /
    • 2018
  • The proportion of users and companies using open source continues to grow. The size of open source software market is growing rapidly not only in foreign countries but also in Korea. However, compared to the continuous development of open source software, there is little research on open source software subject classification, and the classification system of software is not specified either. At present, the user uses a method of directly inputting or tagging the subject, and there is a misclassification and hassle as a result. Research on open source software classification can also be used as a basis for open source software evaluation, recommendation, and filtering. Therefore, in this study, we propose a method to classify open source software by using machine learning model and propose performance comparison by machine learning model.