• Title/Summary/Keyword: Topic Data

Search Result 1,572, Processing Time 0.024 seconds

User Review Mining: An Approach for Software Requirements Evolution

  • Lee, Jee Young
    • International journal of advanced smart convergence
    • /
    • v.9 no.4
    • /
    • pp.124-131
    • /
    • 2020
  • As users of internet-based software applications increase, functional and non-functional problems for software applications are quickly exposed to user reviews. These user reviews are an important source of information for software improvement. User review mining has become an important topic of intelligent software engineering. This study proposes a user review mining method for software improvement. User review data collected by crawling on the app review page is analyzed to check user satisfaction. It analyzes the sentiment of positive and negative that users feel with a machine learning method. And it analyzes user requirement issues through topic analysis based on structural topic modeling. The user review mining process proposed in this study conducted a case study with the a non-face-to-face video conferencing app. Software improvement through user review mining contributes to the user lock-in effect and extending the life cycle of the software. The results of this study will contribute to providing insight on improvement not only for developers, but also for service operators and marketing.

Spatial analysis based on topic modeling using foreign tourist review data: Case of Daegu (외국인 관광객 리뷰데이터를 활용한 토픽모델링 기반의 공간분석: 대구광역시를 사례로)

  • Jung, Ji-Woo;Kim, Seo-Yun;Kim, Hyeon-Yu;Yoon, Ju-Hyeok;Jang, Won-Jun;Kim, Keun-Wook
    • Journal of Digital Convergence
    • /
    • v.19 no.8
    • /
    • pp.33-42
    • /
    • 2021
  • As smartphone-based tourism platforms have become active, policy establishment and service enhancement using review data are being made in various fields. In the case of the preceding studies using tourism review data, most of the studies centered on domestic tourists were conducted, and in the case of foreign tourist studies, studies were conducted only on data collected in some languages and text mining techniques. In this study, 3,515 review data written by foreigners were collected by designating the "Daegu attractions" keyword through the online review site. And LDA-based topic modeling was performed to derive tourism topics. The spatial approach through global and local spatial autocorrelation analysis for each topic can be said to be different from previous studies. As a result of the analysis, it was confirmed that there is a global spatial autocorrelation, and that tourist destinations mainly visited by foreigners are concentrated locally. In addition, hot spots have been drawn around Jung-gu in most of the topics. Based on the analysis results, it is expected to be used as a basic research for spatial analysis based on local government foreign tourism policy establishment and topic modeling. And The limitations of this study were also presented.

Study on Potential Topics of the MyData and Data Transactions Using LDA Topic Modeling (국내 마이데이터 태동과 데이터 거래에 관한 잠재적 주제 분석)

  • Cho, Ji Yeon;Lee, Bong Gyou
    • Journal of Digital Convergence
    • /
    • v.20 no.3
    • /
    • pp.221-229
    • /
    • 2022
  • With the recent full-fledged MyData service, interest in the use of personal data is increasing. However, studies on MyData are still in the early stages, focusing on legal and institutional discussions, and studies from a comprehensive perspective are insufficient. Therefore, this study aimed at finding the potential topics formed by social discussions by analyzing news data from 2018 to the present. News data analysis using LDA topic modeling were conducted and 6 potential topics including digital transformation in finance, scope of Mydata business license, amendments and data-related laws, safe use of big data, data economy promotion policy and strategy of the financial industry were derived. This study has significance in that it comprehensively viewed the issues that emerged with the MyData and deriving gaps in previous discussion. Future research is expected to identify changes after the launch of MyData service and provide specific implications through research by specific industries.

Topic Analysis Using Big Data Related to 'Blockchain usage': Focused on Newspaper Articles ('블록체인 활용' 관련 빅데이터를 활용한 토픽 분석: 신문기사를 중심으로)

  • Kim, Sungae;Jun, Soojin
    • Journal of Industrial Convergence
    • /
    • v.18 no.1
    • /
    • pp.73-78
    • /
    • 2020
  • To analyze the main topics related to the use of blockchain technology, the Topic Modeling Technique was applied to the 'Blockchain Technology Utilization' big data shown in newspaper articles. To this end, from 2013 to 2019, when newspaper articles on the use of blockchain technology first appeared, the topics were extracted from 21 newspapers and analyzed by time to 15,537 articles. As a result of the analysis, articles related to the utilization of blockchain technology have increased exponentially since 2015 and focused on IT_science and economics. Key words related to cryptocurrency, bitcoin and virtual currency were weighted high, although they differed depending on time. Blockchain technology, which had focused on financial transactions, gradually expanded to big data, Internet of Things and artificial intelligence. As a result, changes in corporate topics were also made together to expand into various fields at banks for financial transactions, focusing on large and global companies. The study showed how these topics were changing, along with the main topics in newspaper articles related to the use of blockchain technology.

WV-BTM: A Technique on Improving Accuracy of Topic Model for Short Texts in SNS (WV-BTM: SNS 단문의 주제 분석을 위한 토픽 모델 정확도 개선 기법)

  • Song, Ae-Rin;Park, Young-Ho
    • Journal of Digital Contents Society
    • /
    • v.19 no.1
    • /
    • pp.51-58
    • /
    • 2018
  • As the amount of users and data of NS explosively increased, research based on SNS Big data became active. In social mining, Latent Dirichlet Allocation(LDA), which is a typical topic model technique, is used to identify the similarity of each text from non-classified large-volume SNS text big data and to extract trends therefrom. However, LDA has the limitation that it is difficult to deduce a high-level topic due to the semantic sparsity of non-frequent word occurrence in the short sentence data. The BTM study improved the limitations of this LDA through a combination of two words. However, BTM also has a limitation that it is impossible to calculate the weight considering the relation with each subject because it is influenced more by the high frequency word among the combined words. In this paper, we propose a technique to improve the accuracy of existing BTM by reflecting semantic relation between words.

Rural Tourism Image and Major Activity Space in Gochang County Shown in Social Data - Focusing on the Keyword 'Gochang-gun Travel' - (소셜데이터에 나타난 고창군의 농촌관광 이미지와 주요 활동공간 - '고창군 여행' 키워드를 중심으로 -)

  • Kim, Young-Jin;Son, Gwangryul;Lee, Dongchae;Son, Yong-hoon
    • Journal of Korean Society of Rural Planning
    • /
    • v.27 no.3
    • /
    • pp.103-116
    • /
    • 2021
  • In this study, the characteristics of rural tourism image perceived by urban residents were analyzed through text analysis of blog data. In order to examine the images related to rural tourism, blog data written with the keyword "Gochang-gun travel" was used. LDA topic analysis, one of the text mining techniques, was used for the analysis. In the tourism image of Gochang-gun, 9 topics were derived, and 112 major places appeared. This was divided into 3 main activities and 5 object spaces through the review of keywords and the original text of blog data. As a result of the analysis, the traditional main resources of the region, Seonun mountain, Seonun temple, and Gochang-eup fortress, formed topic. On the other hand, world heritage such as dolmen and Ungok wetland did not appear as topic. In particular, the farms operated by the private sector form individual topics, and the theme farm can be seen as an important resource for tourism in Gochang-gun. Also, through the distribution of place keywords, it was possible to understand the characteristics of travel by region and the usage behavior of visitors. In the case of Gochang-gun, there was a phenomenon in which visitors were biased by region. This seems to be the result of Gochang-gun seeking to vitalize local tourism focusing on natural, ecological, and scenic resources. It is necessary to establish a plan for balanced regional development and develop other types of tourism resources. This study is different in that it identified the types and characteristics of rural tourism images in the region perceived by visitors, and the status of tourism at the regional level.

Comparison of policy perceptions between national R&D projects and standing committees using topic modeling analysis : focusing on the ICT field (토픽모델링 분석을 활용한 국가연구개발사업과제와 국회 상임위원회 사이의 정책 인식 비교 : ICT 분야를 중심으로)

  • Song, Byoungki;Kim, Sangung
    • Journal of Industrial Convergence
    • /
    • v.20 no.7
    • /
    • pp.1-11
    • /
    • 2022
  • In this paper, numerical values are derived using topic modeling among data-based evaluation methodologies discussed by various research institutes. In addition, we will focus on the ICT field to see if there is a difference in policy perception between the national R&D project and standing committee. First, we create model for classifying ICT documents by learning R&D project data using HAN model. And we perform LDA topic modeling analysis on ICT documents classified by applying the model, compare the distribution with the topics derived from the R&D project data and proceedings of standing committees. Specifically, a total of 26 topics were derived. Also, R&D project data had professionally topics, and the standing committee-discuss relatively social and popular issues. As the difference in perception can be numerically confirmed, it can be used as a basic study on indicators that can be used for future policy or project evaluation.

Digital Epidemiology: Use of Digital Data Collected for Non-epidemiological Purposes in Epidemiological Studies

  • Park, Hyeoun-Ae;Jung, Hyesil;On, Jeongah;Park, Seul Ki;Kang, Hannah
    • Healthcare Informatics Research
    • /
    • v.24 no.4
    • /
    • pp.253-262
    • /
    • 2018
  • Objectives: We reviewed digital epidemiological studies to characterize how researchers are using digital data by topic domain, study purpose, data source, and analytic method. Methods: We reviewed research articles published within the last decade that used digital data to answer epidemiological research questions. Data were abstracted from these articles using a data collection tool that we developed. Finally, we summarized the characteristics of the digital epidemiological studies. Results: We identified six main topic domains: infectious diseases (58.7%), non-communicable diseases (29.4%), mental health and substance use (8.3%), general population behavior (4.6%), environmental, dietary, and lifestyle (4.6%), and vital status (0.9%). We identified four categories for the study purpose: description (22.9%), exploration (34.9%), explanation (27.5%), and prediction and control (14.7%). We identified eight categories for the data sources: web search query (52.3%), social media posts (31.2%), web portal posts (11.9%), webpage access logs (7.3%), images (7.3%), mobile phone network data (1.8%), global positioning system data (1.8%), and others (2.8%). Of these, 50.5% used correlation analyses, 41.3% regression analyses, 25.6% machine learning, and 19.3% descriptive analyses. Conclusions: Digital data collected for non-epidemiological purposes are being used to study health phenomena in a variety of topic domains. Digital epidemiology requires access to large datasets and advanced analytics. Ensuring open access is clearly at odds with the desire to have as little personal data as possible in these large datasets to protect privacy. Establishment of data cooperatives with restricted access may be a solution to this dilemma.

Approaches to Studying Low Birth Rate in Korea: A Critical Review (우리나라 저출산 관련 연구 동향 분석)

  • Na, Yu-Mi;Kim, Mi-Kyung
    • Korean Journal of Human Ecology
    • /
    • v.19 no.5
    • /
    • pp.817-833
    • /
    • 2010
  • This study was dedicated to searching better course of low birth rate study in Korea by carefully analyzing past and present low birth rate researches. For this 179 studies(101 master thesis and 78 journal articles) from 1991 to 2009 were analyzed. Next, using SPSS Win 12.0, the research type, topic, participants, data collection and method of data analysis were compared to the studies' years of publication. The most frequently applied research approach, topic, sampling method, data collection procedure and data analysis method in the research was found to be a literature study, solution and prevention of low birth rate related policy, literature study, literacy analysis. In conclusion, low birth rate studies should become more diversified in terms of types of the research, data collection method, and data analysis. Additionally, research topics should become more realistic and specified. Moreover, research results should be verified before they are applied to the policy.

A Study on Contributor to Sports Development Big Data Research Using Oral Records

  • Byun, Jisun
    • Journal of Multimedia Information System
    • /
    • v.8 no.4
    • /
    • pp.301-308
    • /
    • 2021
  • The purpose of this study is to analyze the oral records of sports development contributors to explore the direction of big data research on sports development contributors in the future. To this end, the audio file produced in the interview with Lee00, a sports development contributor, was converted into text. The major themes were extracted by analyzing these oral records. The sub-themes were extracted in chronological order. Keywords were extracted by analyzing sub-themes. And the extracted keywords are searched in Google search engine to find related topics and to use them. A Google search for the topic 'Mt. Inwang' extracted from the oral archives of Lee00, a contributor to the development of sports, finds newspaper articles about President Moon Jae-in's climbing Mt. Inwang and opening up Mt. Bukhan. In addition, articles about Mt. Inwang and mountain climbers that the narrator In-jeong Lee speaks are searched for. Through these articles, you can Deriving the theme of the museum exhibition, Collection of museum exhibits, Use as climbing education material.