• Title/Summary/Keyword: Big Data Pattern Analysis

A Study on Characteristics of Eco-friendly Behaviors using Big Data: Focusing on the Customer Sales Data of Green Card (빅 데이터를 활용한 친환경행동 특성에 관한 연구: 대용량 그린카드 거래데이터를 중심으로)

  • Lim, Mi Sun;Kim, Jinhwa;Byeon, Hyeonsu
    • Journal of Digital Convergence / v.14 no.1 / pp.151-161 / 2016
  • As part of a policy to address climate change and pollution, the government introduced a green credit card scheme in July 2011 to motivate pro-environmental behaviors. It is important to identify specific ways to facilitate pro-environmental behaviors using data on consumer behavior patterns. This study analyzes a total of fifty-seven thousand green credit card customer purchase history records collected over the three months from January to March 2015. The analysis proceeds in three steps: first, the profiles of purchasing customers are analyzed; second, association analysis is applied to examine purchase associations within green product purchasing networks; third, the parameters that affect customers' pro-environmental behavior and customer characteristics are estimated. The results show that loyal customers are 30 to 40 years old with incomes of 30 million to 40 million won, and that they live mainly in Daegu, Gyeonggi, and Seoul.
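
The association analysis step mentioned in this abstract can be illustrated with a minimal sketch. It assumes each transaction is a list of purchase categories and scores item pairs by support, confidence, and lift; the function name `pair_rules`, the threshold, and the basket contents are hypothetical and not taken from the study.

```python
from collections import Counter
from itertools import combinations


def pair_rules(transactions, min_support=0.2):
    """Return (antecedent, consequent, support, confidence, lift) for item pairs."""
    n = len(transactions)
    item_counts = Counter()
    pair_counts = Counter()
    for basket in transactions:
        items = sorted(set(basket))  # de-duplicate and fix the pair order
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n
        if support < min_support:
            continue
        # Each frequent pair yields two directed rules: a -> b and b -> a.
        for x, y in ((a, b), (b, a)):
            confidence = count / item_counts[x]
            lift = confidence / (item_counts[y] / n)
            rules.append((x, y, support, confidence, lift))
    return rules


# Hypothetical baskets; the category names are invented for illustration.
baskets = [
    ["public_transit", "eco_label_goods"],
    ["public_transit", "eco_label_goods", "organic_food"],
    ["public_transit"],
    ["organic_food"],
]
for rule in pair_rules(baskets, min_support=0.5):
    print(rule)
```

A lift above 1 indicates that the two categories are bought together more often than independence would predict, which is the kind of purchase association the abstract refers to.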

Development of Customer Sentiment Pattern Map for Webtoon Content Recommendation (웹툰 콘텐츠 추천을 위한 소비자 감성 패턴 맵 개발)

  • Lee, Junsik;Park, Do-Hyung
    • Journal of Intelligence and Information Systems / v.25 no.4 / pp.67-88 / 2019
  • Webtoon is a Korean-style digital comics platform that distributes comics content produced using the characteristic elements of the Internet in a form that can be consumed online. With the recent rapid growth of the webtoon industry and the exponential increase in the supply of webtoon content, the need for effective webtoon content recommendation measures is growing. Webtoons are digital content products that combine pictorial, literary, and digital elements. They therefore stimulate consumer sentiment by entertaining readers and leading them to engage and empathize with the situations the webtoons present. In this context, the sentiment that webtoons evoke in consumers can be expected to serve as an important criterion for consumers' choice of webtoons. However, there is a lack of research on improving webtoon recommendation performance by utilizing consumer sentiment. This study aims to develop consumer sentiment pattern maps that can support effective recommendation of webtoon content, focusing on consumer sentiments that have not been fully discussed previously. Metadata and consumer sentiment data were collected for 200 works serviced on the Korean webtoon platform 'Naver Webtoon'. After excluding works that did not meet the purpose of the analysis, 488 sentiment terms were collected for 127 works. Next, similar or duplicate terms were combined or abstracted following a bottom-up approach. As a result, we built a webtoon-specific sentiment index reduced to a total of 63 emotive adjectives. By performing exploratory factor analysis on this sentiment index, we derived three important dimensions for classifying webtoon types. The exploratory factor analysis was performed through Principal Component Analysis (PCA) with varimax factor rotation. The three dimensions were named 'Immersion', 'Touch', and 'Irritant', respectively. 
Based on these dimensions, K-Means clustering was performed and the webtoons were classified into four types, named 'Snack', 'Drama', 'Irritant', and 'Romance'. For each type of webtoon, we drew webtoon-sentiment 2-mode network graphs and examined the characteristics of the sentiment pattern appearing in each type. In addition, through profiling analysis, we derived meaningful strategic implications for each type of webtoon. First, the 'Snack' cluster is a collection of webtoons that are fast-paced and highly entertaining. Many consumers are interested in these webtoons, but they do not rate them highly. Consumers also mostly use simple expressions of sentiment when talking about these webtoons. Webtoons belonging to 'Snack' are expected to appeal to modern people who want to consume content easily and quickly during short travel times, such as commutes. Second, webtoons belonging to 'Drama' are expected to evoke realistic and everyday sentiments rather than exaggerated and light comic ones. When consumers talk online about webtoons belonging to the 'Drama' cluster, they express a variety of sentiments. It is appropriate to establish an OSMU (One Source Multi-Use) strategy to extend these webtoons to other content such as movies and TV series. Third, the sentiment pattern map of 'Irritant' shows the sentiments that discourage customer interest by stimulating discomfort. Webtoons that evoke these sentiments struggle to attract public attention. Artists should pay attention to these sentiments, which cause discomfort to consumers, when creating webtoons. Finally, webtoons belonging to 'Romance' do not evoke a variety of consumer sentiments, but they are interpreted as touching consumers. They are expected to be consumed as 'healing content' targeted at consumers with high levels of stress or mental fatigue in their lives. 
The results of this study are meaningful in that they identify the applicability of consumer sentiment to the recommendation and classification of webtoons and provide guidelines that help members of the webtoon ecosystem better understand consumers and formulate strategies.
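
The clustering step can be sketched as follows. This is a minimal illustration of K-Means (Lloyd's algorithm) applied to three-dimensional factor scores, not the authors' implementation: the factor analysis itself is omitted, the scores are invented, the initial centroids are passed explicitly to keep the run deterministic, and three clusters are used for brevity where the study reports four.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, init_centroids, max_iters=100):
    """Lloyd's algorithm: alternate assignment and centroid update until stable."""
    centroids = [tuple(c) for c in init_centroids]
    labels = None
    for _ in range(max_iters):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(len(centroids)),
                key=lambda c: squared_distance(pt, centroids[c]))
            for pt in points
        ]
        if new_labels == labels:
            break  # assignments no longer change
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(len(centroids)):
            members = [pt for pt, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(col) / len(members)
                                     for col in zip(*members))
    return labels, centroids


# Hypothetical factor scores in the order (Immersion, Touch, Irritant).
scores = [
    (2.0, 0.1, 0.0), (1.8, -0.1, 0.1),
    (0.0, 2.1, 0.1), (0.1, 1.9, -0.1),
    (0.0, 0.1, 2.0), (-0.1, 0.0, 2.2),
]
labels, centroids = kmeans(scores, [scores[0], scores[2], scores[4]])
print(labels)
```

In practice the initial centroids are usually chosen at random or with a seeding scheme such as k-means++, and the number of clusters is selected by inspecting the resulting groups.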

A Study on the Use of General Social Welfare Facilities for the Planning of Integrated Care Center - Focused on four social welfare facilities in Southern Gyeonggi-do (통합돌봄센터 계획을 위한 고령인구의 종합사회복지관 이용실태 연구 - 경기도 남부 4개 사회복지관을 대상으로)

  • Han, Eunbee;Zhang, Jinxiang;Kwon, Soonjung
    • Journal of The Korea Institute of Healthcare Architecture / v.26 no.2 / pp.71-79 / 2020
  • Purpose: The purpose of this study is to derive basic data on the desirable location and functions of integrated care centers. Methods: Surveys, questionnaires, and statistical analysis are the main research methods of this study. To collect data on the utilization patterns and preferred functions of senior citizens, the researchers visited four social welfare facilities located in southern Gyeonggi Province. A total of 403 questionnaires were gathered from the four facilities and analyzed using Microsoft Excel. Results: First, compared to other services, healthcare services were preferred by many older people in the social welfare facilities. This suggests that integrated care centers should provide healthcare services for older people rather than services for children or people with disabilities. Second, integrated care centers should be established within walking distance of elderly people. Where this is difficult, introducing a shuttle bus for older people is desirable, especially for large care centers. Implications: This study shows that, for a small community, a small facility with community care is more desirable than a large facility in terms of friendliness, convenience, economy, and so on. However, welfare and healthcare services need to be combined even in small centers.

Open Market Sales Trend Analysis System Using Online Shopping Mall Data (온라인 쇼핑몰 데이터를 활용한 판매동향 분석 시스템)

  • Cha, Seung-yeon;Kim, Kang-ryeol;Shrestha, Labina;Kim, Yeong-ju;Choi, Jongmyung
    • Journal of Internet of Things and Convergence / v.5 no.2 / pp.7-13 / 2019
  • As the development of the Internet has spurred online shopping, consumers are shifting from traditional face-to-face purchasing to online purchasing. Many sellers have entered online shopping malls, and competition among them is intense. Sellers therefore need to establish rational marketing strategies by analyzing consumer purchase patterns and product sales trends. In this paper, we analyzed consumers' purchasing patterns by examining, by time period, the product price, rating, and sales quantity of competitors selling the same product in open shopping malls. In addition, the collected information was visualized in charts so that the sales trends of the company and its competitors could be easily compared. With this system, sellers can predict sales volume from the analyzed purchasing patterns and set a reasonable product price based on the sales trend.

Evaluation of Geographic Indices Describing Health Care Utilization

  • Kim, Agnus M.;Park, Jong Heon;Kang, Sungchan;Kim, Yoon
    • Journal of Preventive Medicine and Public Health / v.50 no.1 / pp.29-37 / 2017
  • Objectives: The accurate measurement of geographic patterns of health care utilization is a prerequisite for the study of geographic variations in health care utilization. While several measures have been developed to measure how accurately geographic units reflect the health care utilization patterns of residents, they have been only applied to hospitalization and need further evaluation. This study aimed to evaluate geographic indices describing health care utilization. Methods: We measured the utilization rate and four health care utilization indices (localization index, outflow index, inflow index, and net patient flow) for eight major procedures (coronary artery bypass graft surgery, percutaneous transluminal coronary angioplasty, surgery after hip fracture, knee replacement surgery, caesarean sections, hysterectomy, computed tomography scans, and magnetic resonance imaging scans) according to three levels of geographic units in Korea. Data were obtained from the National Health Insurance database in Korea. We evaluated the associations among the health care utilization indices and the utilization rates. Results: In higher-level geographic units, the localization index tended to be high, while the inflow index and outflow index were lower. The indices showed different patterns depending on the procedure. A strong negative correlation between the localization index and the outflow index was observed for all procedures. Net patient flow showed a moderate positive correlation with the localization index and the inflow index. Conclusions: Health care utilization indices can be used as a proxy to describe the utilization pattern of a procedure in a geographic unit.
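
The four indices named in this abstract can be illustrated with a short sketch. The definitions used here are commonly used ones and are an assumption, since the abstract does not state its formulas: the localization index is the share of residents' procedures performed in their own region, the outflow index is its complement, the inflow index is the share of a region's procedures performed on non-residents, and net patient flow is the ratio of inflow to outflow. The function name and the counts are hypothetical.

```python
def utilization_indices(flows):
    """flows[r][p] = procedures for residents of region r performed in region p."""
    regions = list(flows)
    result = {}
    for r in regions:
        by_residents = sum(flows[r].values())                  # used by residents of r
        in_region = sum(flows[o].get(r, 0) for o in regions)   # performed inside r
        local = flows[r].get(r, 0)
        outflow = by_residents - local   # residents treated elsewhere
        inflow = in_region - local       # non-residents treated in r
        result[r] = {
            "localization": local / by_residents if by_residents else 0.0,
            "outflow": outflow / by_residents if by_residents else 0.0,
            "inflow": inflow / in_region if in_region else 0.0,
            # Assumed convention: infinite when no resident leaves the region.
            "net_patient_flow": inflow / outflow if outflow else float("inf"),
        }
    return result


# Hypothetical counts for two regions.
flows = {
    "A": {"A": 80, "B": 20},
    "B": {"A": 10, "B": 90},
}
for region, values in utilization_indices(flows).items():
    print(region, values)
```

Under these assumed definitions, region B keeps 90% of its residents' procedures and receives twice as many patients as it loses, so its net patient flow is 2.0.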

Epidemiology of PAH in Korea: An Analysis of the National Health Insurance Data, 2002-2018

  • Albert Youngwoo Jang;Hyeok-Hee Lee;Hokyou Lee;Hyeon Chang Kim;Wook-Jin Chung
    • Korean Circulation Journal / v.53 no.5 / pp.313-327 / 2023
  • Background and Objectives: Pulmonary arterial hypertension (PAH) is a rare but fatal disease. Recent advances in PAH-specific drugs have improved its outcomes, although the healthcare burden of novel therapeutics may lead to a discrepancy in outcomes between developing and developed countries. We analyzed how the epidemiology and clinical features of PAH have changed through the rapidly advancing healthcare infrastructure in South Korea. Methods: PAH was defined according to a newly devised 3-component algorithm. Using a nationwide health insurance claims database, we delineated annual trends in the prevalence, incidence, medication prescription pattern, and 5-year survival of PAH in Korea. Cumulative survival and potential predictors of mortality were also assessed among 2,151 incident PAH cases. Results: Between 2002 or 2004 and 2018, the prevalence and incidence of PAH increased 75-fold (0.4 to 29.9 per million people) and 12-fold (0.5 to 6.3 per million person-years), respectively. The proportion of patients on combination PAH-specific drug therapy has also steadily increased up to 29.0% in 2018. Among 2,151 incident PAH cases (median [interquartile range] age, 50 [37-62] years; 67.2% female), the 5-year survival rate and median survival duration were 71.8% and 13.1 years, respectively. Independent predictors of mortality were age, sex, etiology of PAH, diabetes, dyslipidemia, and chronic kidney disease. Conclusions: This nationwide study showed that the prevalence and incidence of PAH have grown rapidly in Korea since the early 2000s. The use of combination therapy has also increased, and the 5-year survival rate of PAH in Korea was similar to those in western countries.

Investigating Dynamic Mutation Process of Issues Using Unstructured Text Analysis (비정형 텍스트 분석을 활용한 이슈의 동적 변이과정 고찰)

  • Lim, Myungsu;Kim, Namgyu
    • Journal of Intelligence and Information Systems / v.22 no.1 / pp.1-18 / 2016
  • Owing to the extensive use of Web media and the development of the IT industry, a large amount of data has been generated, shared, and stored. Nowadays, various types of unstructured data such as image, sound, video, and text are distributed through Web media. Therefore, many attempts have been made in recent years to discover new value through an analysis of these unstructured data. Among these types of unstructured data, text is recognized as the most representative method for users to express and share their opinions on the Web. In this sense, demand for obtaining new insights through text analysis is steadily increasing. Accordingly, text mining is increasingly being used for different purposes in various fields. In particular, issue tracking is being widely studied not only in the academic world but also in industries because it can be used to extract various issues from text such as news and SNS (Social Network Services) and to analyze the trends of these issues. Conventionally, issue tracking is used to identify major issues sustained over a long period of time through topic modeling and to analyze the detailed distribution of documents involved in each issue. However, because conventional issue tracking assumes that the content composing each issue does not change throughout the entire tracking period, it cannot represent the dynamic mutation process of detailed issues that can be created, merged, divided, and deleted between these periods. Moreover, because only keywords that appear consistently throughout the entire period can be derived as issue keywords, concrete issue keywords such as "nuclear test" and "separated families" may be concealed by more general issue keywords such as "North Korea" in an analysis over a long period of time. This implies that many meaningful but short-lived issues cannot be discovered by conventional issue tracking. 
Note that detailed keywords are preferable to general keywords because the former can be clues for providing actionable strategies. To overcome these limitations, we performed an independent analysis on the documents of each detailed period. We generated an issue flow diagram based on the similarity of each issue between two consecutive periods. The issue transition pattern among categories was analyzed by using the category information of each document. In this study, we then applied the proposed methodology to a real case of 53,739 news articles. We derived an issue flow diagram from the articles. We then proposed the following useful application scenarios for the issue flow diagram presented in the experiment section. First, we can identify an issue that actively appears during a certain period and promptly disappears in the next period. Second, the preceding and following issues of a particular issue can be easily discovered from the issue flow diagram. This implies that our methodology can be used to discover the association between inter-period issues. Finally, an interesting pattern of one-way and two-way transitions was discovered by analyzing the transition patterns of issues through category analysis. Thus, we discovered that a pair of mutually similar categories induces two-way transitions. In contrast, one-way transitions can be recognized as an indicator that issues in a certain category tend to be influenced by other issues in another category. For practical application of the proposed methodology, high-quality word and stop word dictionaries need to be constructed. In addition, not only the number of documents but also additional meta-information such as the read counts, written time, and comments of documents should be analyzed. A rigorous performance evaluation or validation of the proposed methodology should be performed in future works.
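
The idea of linking issues across consecutive periods by similarity can be sketched as follows. This is an illustrative assumption rather than the authors' procedure: each issue is represented as a keyword-to-weight dictionary (for example, from a per-period topic model), similarity is measured with cosine similarity, and the threshold, keywords, and weights are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two sparse keyword-weight dictionaries."""
    dot = sum(w * b.get(k, 0.0) for k, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def issue_flow(periods, threshold=0.3):
    """Link issue i of period t to issue j of period t+1 when they are similar."""
    edges = []
    for t in range(len(periods) - 1):
        for i, src in enumerate(periods[t]):
            for j, dst in enumerate(periods[t + 1]):
                sim = cosine(src, dst)
                if sim >= threshold:
                    edges.append(((t, i), (t + 1, j), round(sim, 3)))
    return edges


# Hypothetical issues for two consecutive periods.
periods = [
    [{"north_korea": 0.6, "nuclear_test": 0.4}, {"election": 1.0}],
    [{"north_korea": 0.5, "separated_families": 0.5}, {"economy": 1.0}],
]
for edge in issue_flow(periods):
    print(edge)
```

In this toy input the first issue continues into the next period with changed detail keywords, while the "election" issue has no outgoing edge, which corresponds to an issue that appears in one period and disappears in the next.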

Spatio-Temporal Patterns of a Public Bike Sharing System in Seoul - Focusing on Yeouido District - (서울시 공공자전거 공유시스템(PBSS)의 시공간적 이용 패턴 분석 - 서울시 여의도동을 중심으로 -)

  • Yun, Seung-yong;Min, Kyung-hun;Ko, Ha-jung
    • Journal of the Korean Institute of Landscape Architecture / v.48 no.1 / pp.1-14 / 2020
  • Various policies and studies regarding the use of Public Bike Sharing Systems (PBSS) and Programs (PBSP) have been conducted worldwide as the number of such systems and programs has increased. Although the use of PBSS in everyday life has generated various phenomena and demands, most research and policies in South Korea have focused on commuting. This study aimed to understand the various demands for PBSS by classifying usage patterns and analyzing their features, using 2018 PBSS usage data from the Yeouido district. The rental stations were classified into three types based on weekday/weekend usage rates. Yeouido's PBSS usage accounted for 4.3% of total usage in Seoul Metropolitan City, while its rental stations accounted for 2% of all rental stations in Seoul's urban areas. Rental stations with higher weekday utilization rates showed high utilization in all four seasons and were mainly located in work and residential areas. Other stations showed usage concentrated in spring (April-May) and autumn (September-October) and were located close to the entrances of nearby parks. In addition, at stations with high weekend utilization, renting and returning were often concentrated at certain stations, in contrast to the pattern at stations with high weekday usage. Therefore, PBSS management and programs should reflect varied usage demands rather than operate uniformly. This study is meaningful in that it provides basic data for effective PBSS operation by monitoring PBSS demand in spatio-temporal terms.
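
A classification of stations by weekday/weekend usage rates might look like the sketch below. The abstract does not give the classification rule, so everything here is an assumption: usage is normalized to rentals per day, the weekday share of that per-day usage is compared against hypothetical cut-offs of 0.6 and 0.4, and the type names and station figures are invented.

```python
def classify_station(weekday_rentals, weekend_rentals,
                     weekday_days=5, weekend_days=2,
                     high=0.6, low=0.4):
    """Label a station by the weekday share of its per-day rentals."""
    per_weekday = weekday_rentals / weekday_days
    per_weekend_day = weekend_rentals / weekend_days
    total = per_weekday + per_weekend_day
    if total == 0:
        return "unused"
    weekday_share = per_weekday / total
    if weekday_share >= high:
        return "weekday-dominant"
    if weekday_share <= low:
        return "weekend-dominant"
    return "balanced"


# Hypothetical weekly totals: (weekday rentals, weekend rentals).
stations = {
    "office_block": (500, 80),
    "park_entrance": (200, 300),
    "residential": (250, 100),
}
for name, (weekday, weekend) in stations.items():
    print(name, classify_station(weekday, weekend))
```

Normalizing by the number of days matters because a week has five weekdays and two weekend days; comparing raw totals would label almost every station as weekday-dominant.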

Analysis of Surface Water Temperature Fluctuation and Empirical Orthogonal Function in Cheonsu Bay, Korea

  • Hyo-Sang Choo;Jin-Young Lee;Kyeung-Ho Han;Dong-Sun Kim
    • Journal of the Korean Society of Marine Environment & Safety / v.29 no.3 / pp.255-269 / 2023
  • From the south to the north of the bay, surface water temperature increases in spring and summer but decreases in autumn and winter. Due to shallow water depth, freshwater outflow, and weak currents, the water temperature in the central to northern part of the bay is strongly affected by the coastal land and air temperature and fluctuates widely. Water temperature variations are large along the north-east coast of the bay but small along the south-west coast. The difference between water temperature and air temperature is greater in winter, and greater in the south-central part of the bay than along the northern to eastern coast, where sea dikes are located. From the south to the north of the bay, the range of water temperature fluctuation and the phase increase. When fresh water is released from a sea dike, the surrounding water temperature decreases and then rises, or rises and then falls. The first mode of the empirical orthogonal function (EOF) represents the seasonal variation of water temperature. The second mode represents the variability of the water temperature gradient in the east-west and north-south directions of the bay. In the first mode, the maximum and the minimum appear in autumn and summer, respectively, consistent with the seasonal distribution of surface water temperature variance. In the second mode, the phases along the Seosan-Boryeong coast and the east coast of Anmyeon Island are opposite, with the deep center of the bay as the boundary. Periodic fluctuations of the first-mode time coefficient are dominated by one-day and half-day cycles. Its daily fluctuation pattern is similar to the air temperature variation. Sea conditions and topographical characteristics, excluding air temperature, are the factors contributing to the variation of the second-mode time coefficient.
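
The first EOF mode can be illustrated with a small sketch. It assumes the data are a time-by-station table of temperatures, removes each station's time mean, forms the covariance matrix between stations, and extracts the leading eigenvector by power iteration. The function name and the readings are hypothetical, and a real analysis would normally use a library eigen-decomposition or SVD and retain several modes.

```python
import math


def first_eof(data, iters=200):
    """Return (spatial pattern, time coefficients, explained variance fraction)."""
    n_t, n_s = len(data), len(data[0])
    means = [sum(row[j] for row in data) / n_t for j in range(n_s)]
    anomalies = [[row[j] - means[j] for j in range(n_s)] for row in data]
    # Covariance between stations, computed from the anomalies.
    cov = [[sum(a[i] * a[j] for a in anomalies) / (n_t - 1)
            for j in range(n_s)] for i in range(n_s)]

    # Power iteration; assumes the start vector is not orthogonal to mode 1.
    v = [1.0] * n_s
    for _ in range(iters):
        w = [sum(cov[i][j] * v[j] for j in range(n_s)) for i in range(n_s)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]

    cov_v = [sum(cov[i][j] * v[j] for j in range(n_s)) for i in range(n_s)]
    eigenvalue = sum(v[i] * cov_v[i] for i in range(n_s))
    explained = eigenvalue / sum(cov[i][i] for i in range(n_s))
    # Time coefficients: projection of each time step onto the pattern.
    coefficients = [sum(a[j] * v[j] for j in range(n_s)) for a in anomalies]
    return v, coefficients, explained


# Hypothetical readings: rows are time steps, columns are two stations.
temps = [[10.0, 20.0], [12.0, 24.0], [14.0, 28.0], [12.0, 24.0]]
pattern, coefficients, explained = first_eof(temps)
print(pattern, coefficients, explained)
```

In this toy input the second station varies exactly twice as much as the first, so a single mode explains all of the variance; the spatial pattern gives the relative amplitude at each station and the time coefficients give the shared variation over time.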

Optimizing Urban Construction and Demolition Waste Management System Based on 4D-GIS and Internet Plus

  • Wang, Huiyue;Zhang, Tingning;Duan, Huabo;Zheng, Lina;Wang, Xiaohua;Wang, Jiayuan
    • International conference on construction engineering and project management / 2017.10a / pp.321-327 / 2017
  • China is experiencing urbanization at a speed and scale unprecedented in human history. The continuing growth of China's big cities, in both land area and population, has already created great challenges for the country's urban planning and construction activities, such as the continuous increase in construction and demolition (C&D) waste. Therefore, how to characterize cities' construction activities, and in particular how to dynamically quantify the flows of building materials and construction debris, has become a pressing problem in alleviating the current shortage of resources and achieving sustainable urban development. Accordingly, this study employs 4D-GIS (four-dimensional Geographic Information System) and Internet Plus to offer a new approach to accurate yet dynamic C&D waste management. The study established a spatio-temporal pattern and material metabolism evolution model to characterize the geo-distribution of C&D waste by combining material flow analysis (MFA) and 4D-GIS. In addition, the study developed a mobile application (APP) for C&D waste trading and information management, which could help stakeholders obtain useful information more effectively. Moreover, a cloud database was built into the APP to disclose the flows of C&D waste at the regional level using monitoring information from vehicles. To summarize, these findings could provide basic data and management methods for the supply and reverse supply of building materials. The methodologies are also applicable to C&D waste management and beyond.
