• Title/Summary/Keyword: data analytics

Search Result 556, Processing Time 0.03 seconds

Development for establishing Big Data-based alley commercial area (빅데이터 기반 골목상권 영역설정 방법론 개발)

  • Hwang, Dong-Hyun;Ko, Kyeong-Seok;Park, Sang-June;Kim, Wan-Su
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.11 no.6
    • /
    • pp.784-792
    • /
    • 2018
  • In this study, we designed the area except the development market and the traditional market, where large scale shops were concentrated by realizing the real estate center of the alley commercial area. In addition, we have developed an area setting method for the alley area where reliability and rationality can be ensured by utilizing the actual data such as the business statistics, the survey data of the business, and the store business DB, which are managed by the local government or the state. The alley commercial areas were classified into five groups according to density. It is thought that users can distinguish the commercial areas from dense commercial areas to the commercial areas in order to utilize various commercial areas.

Recent Research Trend Analysis for the Journal of Society of Korea Industrial and Systems Engineering Using Topic Modeling (토픽모델링을 활용한 한국산업경영시스템학회지의 최근 연구주제 분석)

  • Dong Joon Park;Pyung Hoi Koo;Hyung Sool Oh;Min Yoon
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.46 no.3
    • /
    • pp.170-185
    • /
    • 2023
  • The advent of big data has brought about the need for analytics. Natural language processing (NLP), a field of big data, has received a lot of attention. Topic modeling among NLP is widely applied to identify key topics in various academic journals. The Korean Society of Industrial and Systems Engineering (KSIE) has published academic journals since 1978. To enhance its status, it is imperative to recognize the diversity of research domains. We have already discovered eight major research topics for papers published by KSIE from 1978 to 1999. As a follow-up study, we aim to identify major topics of research papers published in KSIE from 2000 to 2022. We performed topic modeling on 1,742 research papers during this period by using LDA and BERTopic which has recently attracted attention. BERTopic outperformed LDA by providing a set of coherent topic keywords that can effectively distinguish 36 topics found out this study. In terms of visualization techniques, pyLDAvis presented better two-dimensional scatter plots for the intertopic distance map than BERTopic. However, BERTopic provided much more diverse visualization methods to explore the relevance of 36 topics. BERTopic was also able to classify hot and cold topics by presenting 'topic over time' graphs that can identify topic trends over time.

Demand Forecasting Model for Bike Relocation of Sharing Stations (공유자전거 따릉이 재배치를 위한 실시간 수요예측 모델 연구)

  • Yoosin Kim
    • Journal of Internet Computing and Services
    • /
    • v.24 no.5
    • /
    • pp.107-120
    • /
    • 2023
  • The public bicycle of Seoul, Ttareungyi, was launched at October 2015 to reduce traffic and carbon emissions in downtown Seoul and now, 2023 Oct, the cumulative number of user is upto 4 million and the number of bike is about 43,000 with about 2700 stations. However, super growth of Ttareungyi has caused the several problems, especially demand/supply mismatch, and thus the Seoul citizen has been complained about out of stock. In this point, this study conducted a real time demand forecasting model to prevent stock out bike at stations. To develop the model, the research team gathered the rental·return transaction data of 20,000 bikes in whole 1600 stations for 2019 year and then analyzed bike usage, user behavior, bike stations, and so on. The forecasting model using machine learning is developed to predict the amount of rental/return on each bike station every hour through daily learning with the recent 90 days data with the weather information. The model is validated with MAE and RMSE of bike stations, and tested as a prototype service on the Seoul Bike Management System(Mobile App) for the relocation team of Seoul City.

Which is the More Important Factor for Users' Adopting the Serious Games for Health? Effectiveness or Safety (건강 기능성 게임의 확산을 위한 유통 전략 연구: 유효성과 안전성에 대한 사용자 인식을 중심으로)

  • Yong-Young Kim
    • Journal of Industrial Convergence
    • /
    • v.21 no.9
    • /
    • pp.23-32
    • /
    • 2023
  • Interest in Serious Games for Healthcare (SGHs) that can improve health through games is increasing. Digital Therapeutics (DTx) is a treatment that must be approved for effectiveness and safety, so it should follow the traditional drug distribution method, but SGHs are wellness products that are more flexible in terms of adoption and diffusion than DTx. SGHs are effective because it can provide customized services through continuous monitoring and feedback. When SGHs are applied to cognitive impairment treatment or behavioral correction, malfunctions and side effects are minor. This study developed research model based on the Valence Framework, gathered data from 142 undergraduates, and demonstrated that only the perceived benefits have a statistically significant positive (+) effect on SGHs acceptance intentions. Based on these results, this study suggests that SGHs companies should promote benefits in accepting SGHs for general users and they need for a distribution and analytics platform strategy based on a data-driven approach.

The Perception of Gorpcore Look Using Big Data (빅 데이터를 활용한 고프코어 룩에 대한 인식)

  • Ji-Woo Kim;Jeong-Mee Kim
    • Journal of the Korea Fashion and Costume Design Association
    • /
    • v.25 no.4
    • /
    • pp.77-92
    • /
    • 2023
  • The purpose of this study is to investigate the public perception of Gorpcore through Big Aata analytics. The study was conducted based on the collection of Big Data on the word 'Gorpcore' through Textom from July 24, 2017 to March 31, 2023. As a result, 63,386 words were collected from a total of 18,879 posts, and the top 50 words were determined based on frequency of appearance. Based on the collected words, centrality measures and CONCOR algorithm were performed in Ucinet 6. The research results are as follows. 1) The frequency of appearance was high in the order of 'Gorpcore look', 'fashion', 'coordination', 'clothes', 'outdoor', 'Musinsa', 'look', 'trend', 'brand' and 'ahjussi (middle-aged old man in Korean)'. These words had high TF-IDF scores, which leads to the conclusion that these are key words that are recognized as important. 2) Network centrality shows that 'Gorpcore look', 'fashion', 'outdoor', 'coordination', 'clothes', 'trend', 'look' and 'style' have a high correlation with other words. Through this, it was found that the public thinks it is important to create a variety of fashions by styling high-performance outdoor wear and casual wear, and that they are highly interested in clothes and in brands leading the Gorpcore trend. 3) As a result of the CONCOR algorithm, four significant groups were formed. The words that appear in each group are as follows. Group 1 - 'outdoor', 'Gorp', 'Normcore', 'hiking', 'functionality', 'new', 'sports', 'casual wear', 'activity', 'generation', 'collaboration'. Group 2 - 'fashion', 'trend', 'look', 'brand', 'style', 'shoes', 'ugly', 'item', 'trend', 'product', 'Salomon', 'padded jacket', 'stylishness', 'utilization', 'Winter', 'street', 'design', 'retro', 'popular', 'styling'. Group 3 - 'Gorpcore look', 'coordination', 'Musinsa', 'windbreaker', 'recommendation', 'Arcteryx', 'pants', 'man'. Group 4 - 'clothes' 'ahjussi', 'jacket', 'launching', 'spring', 'The North Face', 'collection', 'utility', 'jumper'. As a result, it can be seen that the Gorpcore is also regarded as a part of outdoor, fashion, coordination, and casual wear.

Assoication Rule Analysis between lifestyle risk behaviors and multimorbidity: Findings from KHANES (국민건강영양조사 자료를 활용한 라이프스타일 위험요인과 다중이환간의 연관관계분석)

  • Hyun-Ju Lee;Sungmin Myoung
    • The Journal of Korean Society for School & Community Health Education
    • /
    • v.25 no.1
    • /
    • pp.29-41
    • /
    • 2024
  • Objectives: This study used an efficient data mining algorithm to explore association rules between the lifestyle risk behaviors and multimorbidity (having more than one chronic disease) in Korean adults. Methods: We used data from the 8th Korean National Health and Nutrition Examination Survey(2019-2020) for 7,609 adults aged ≥19 years. This study was undertaken where 6 lifestyle risk behaviors and 11 morbidities were analyzed using R and Rstudio for the ARM. Results: Among 117 association rules, combinations of hypertension, dyslipidemia and diabetes, hypertension were important role in inadequate sleep, physical inactivity and inadequate weight. Conclusion: The findings of this study are significant because they demonstrate the importance of lifestyle risk factors and the role of multiple chronic diseases using big data analytics such as association rule mining. We recommend developing selective and focused health education programs, such as exercise programs to address physical inactivity, dietary interventions to address inadequate weight, and mental health education programs to address inadequate sleep.

Development and Application of Dynamic Visualization Model for Spatial Big Data (공간 빅데이터를 위한 동태적 시각화 모형의 개발과 적용)

  • KIM, Dong-han;KIM, David
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.21 no.1
    • /
    • pp.57-70
    • /
    • 2018
  • The advancement and the spread of information and communication technology (ICT) changes the way we live and act. Computer and ICT devices become smaller and invisible, and they are now virtually everywhere in the world. Many socio-economic activities are now subject to the use of computer and ICT devices although we don't really recognize it. Various socio economic activities supported by digital devices leave digital records, and a myriad of these records becomes what we call'big data'. Big data differ from conventional data we have collected and managed in that it holds more detailed information of socio-economic activities. Thus, they offer not only new insight for our society and but also new opportunity for policy analysis. However, the use of big data requires development of new methods and tools as well as consideration of institutional issues such as privacy. The goals of this research are twofold. Firstly, it aims to understand the opportunities and challenges of using big data for planning support. Big data indeed is a big sum of microscopic and dynamic data, and this challenges conventional analytical methods and planning support tools. Secondly, it seeks to suggest ways of visualizing such spatial big data for planning support. In this regards, this study attempts to develop a dynamic visualization model and conducts an experimental case study with mobile phone big data for the Jeju island. Since the off-the-shelf commercial software for the analysis of spatial big data is not yet commonly available, the roles of open source software and computer programming are important. This research presents a pilot model of dynamic visualization for spatial big data, as well as results from them. Then, the study concludes with future studies and implications to promote the use of spatial big data in urban planning field.

Optimization Model for the Mixing Ratio of Coatings Based on the Design of Experiments Using Big Data Analysis (빅데이터 분석을 활용한 실험계획법 기반의 코팅제 배합비율 최적화 모형)

  • Noh, Seong Yeo;Kim, Young-Jin
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.3 no.10
    • /
    • pp.383-392
    • /
    • 2014
  • The research for coatings is one of the most popular and active research in the polymer industry. For the coatings, electronics industry, medical and optical fields are growing more important. In particular, the trend is the increasing of the technical requirements for the performance and accuracy of the coatings by the development of automotive and electronic parts. In addition, the industry has a need of more intelligent and automated system in the industry is increasing by introduction of the IoT and big data analysis based on the environmental information and the context information. In this paper, we propose an optimization model for the design of experiments based coating formulation data objects using the Internet technologies and big data analytics. In this paper, the coating formulation was calculated based on the best data analysis is based on the experimental design, modify the operator with respect to the error caused based on the coating formulation used in the actual production site data and the corrected result data. Further optimization model to correct the reference value by leveraging big data analysis and Internet of things technology only existing coating formulation is applied as the reference data using a manufacturing environment and context information retrieval in color and quality, the most important factor in maintaining and was derived. Based on data obtained from an experiment and analysis is improving the accuracy of the combination data and making it possible to give a LOT shorter working hours per data. Also the data shortens the production time due to the reduction in the delivery time per treatment and It can contribute to cost reduction or the like defect rate reduced. Further, it is possible to obtain a standard data in the manufacturing process for the various models.

A Study on Big Data Based Method of Patient Care Analysis (빅데이터 기반 환자 간병 방법 분석 연구)

  • Park, Ji-Hun;Hwang, Seung-Yeon;Yun, Bum-Sik;Choe, Su-Gil;Lee, Don-Hee;Kim, Jeong-Joon;Moon, Jin-Yong;Park, Kyung-won
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.20 no.3
    • /
    • pp.163-170
    • /
    • 2020
  • With the development of information and communication technologies, the growing volume of data is increasing exponentially, raising interest in big data. As technologies related to big data have developed, big data is being collected, stored, processed, analyzed, and utilized in many fields. Big data analytics in the health care sector, in particular, is receiving much attention because they can also have a huge social and economic impact. It is predicted that it will be able to use Big Data technology to analyze patients' diagnostic data and reduce the amount of money that is spent on simple hospital care. Therefore, in this thesis, patient data is analyzed to present to patients who are unable to go to the hospital or caregivers who do not have medical expertise with close care guidelines. First, the collected patient data is stored in HDFS and the data is processed and classified using R, a big data processing and analysis tool, in the Hadoop environment. Visualize to a web server using R Shiny, which is used to implement various functions of R on the web.

Clustering of Smart Meter Big Data Based on KNIME Analytic Platform (KNIME 분석 플랫폼 기반 스마트 미터 빅 데이터 클러스터링)

  • Kim, Yong-Gil;Moon, Kyung-Il
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.20 no.2
    • /
    • pp.13-20
    • /
    • 2020
  • One of the major issues surrounding big data is the availability of massive time-based or telemetry data. Now, the appearance of low cost capture and storage devices has become possible to get very detailed time data to be used for further analysis. Thus, we can use these time data to get more knowledge about the underlying system or to predict future events with higher accuracy. In particular, it is very important to define custom tailored contract offers for many households and businesses having smart meter records and predict the future electricity usage to protect the electricity companies from power shortage or power surplus. It is required to identify a few groups with common electricity behavior to make it worth the creation of customized contract offers. This study suggests big data transformation as a side effect and clustering technique to understand the electricity usage pattern by using the open data related to smart meter and KNIME which is an open source platform for data analytics, providing a user-friendly graphical workbench for the entire analysis process. While the big data components are not open source, they are also available for a trial if required. After importing, cleaning and transforming the smart meter big data, it is possible to interpret each meter data in terms of electricity usage behavior through a dynamic time warping method.