• Title/Summary/Keyword: 빅 데이터 패턴 분석

Search Result 195, Processing Time 0.028 seconds

Personal Recommendation Service Design Through Big Data Analysis on Science Technology Information Service Platform (과학기술정보 서비스 플랫폼에서의 빅데이터 분석을 통한 개인화 추천서비스 설계)

  • Kim, Dou-Gyun
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.28 no.4
    • /
    • pp.501-518
    • /
    • 2017
  • Reducing the time it takes for researchers to acquire knowledge and introduce them into research activities can be regarded as an indispensable factor in improving the productivity of research. The purpose of this research is to cluster the information usage patterns of KOSEN users and to suggest optimization method of personalized recommendation service algorithm for grouped users. Based on user research activities and usage information, after identifying appropriate services and contents, we applied a Spark based big data analysis technology to derive a personal recommendation algorithm. Individual recommendation algorithms can save time to search for user information and can help to find appropriate information.

Prediction Model for Unpaid Customers Using Big Data (빅 데이터 기반의 체납 수용가 예측 모델)

  • Jeong, Jaean;Lee, Kyouhwan;Jung, Hoekyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.7
    • /
    • pp.827-833
    • /
    • 2020
  • In this paper, to reduce the unpaid rate of local governments, the internal data elements affecting the arrears in Water-INFOS are searched through interviews with meter readers in certain local governments. Candidate data affecting arrears from national statistical data were derived. The influence of the independent variable on the dependent variable was sampled by examining the disorder of the dependent variable in the data set called information gain. We also evaluated the higher prediction rates of decision tree and logistic regression using n-fold cross-validation. The results confirmed that the decision tree can find more accurate customer payment patterns than logistic regression. In the process of developing an analysis algorithm model using machine learning, the optimal values of two environmental variables, the minimum number of data and the maximum purity, which directly affect the complexity and accuracy of the decision tree, are derived to improve the accuracy of the algorithm.

Spatial Impact Assessment of Heat Wave on River Water Quality using Big Data (빅데이터를 이용한 폭염과 하천수질의 공간적 영향 평가)

  • Lee, Jiwan;Lim, Hyeokjin;Shin, Hyungjin;Kim, Seongjoon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.87-87
    • /
    • 2021
  • 이상기후 현상으로 기후변화가 사회와 경제에 미치는 영향이 뚜렷한 추세로 변화되고 있다. 현재 기후변화에 관련된 연구는 사회 시스템에서 위험관리를 위해 기온과 강수량에 따라 다양한 분야에 미치는 영향에 대한 연구를 중점으로 이뤄지고 있다. 본 연구는 여름철 폭염에 의한 기후변화가 하천수질에 미치는 영향을 평가하기 위한 것으로, 우리나라 기상청 91개의 기상관측소에서 일일온도 33℃ 이상의 이벤트를 대상으로 환경부 수질관측망 918개에 대한 14개의 하천수질인자인 DO, BOD, COD, TOC, DOC, TN, DTN, NH4-N, NO2-N, NO3-N, TP, DTP, PO4-P, Chl-a를 분석하였다. 이를 우리나라 117개 중권역별 하천수질과 폭염강도와 지속시간을 나타내는 폭염 지수를 산정하여 분석하였다. 폭염 관련 뉴스 데이터는 2013년부터 2019년까지 Python 기반 뉴스 크롤러를 이용해 폭염 취약지수(Heat Wave Vulnerability Index, HWVI)를 기준으로 분류하여 키워드를 수집하였으며 HWVI 중 '기후노출' 키워드와 관련된 기사는 총 22,514건으로 69.9%로 수집되었다. 공간적 영향 평가를 위해 Getis-Ord Gi*를 이용하여 폭염지수와 하천수질인자간 핫스팟 분석을 실시하고 폭염관련 빅데이터가 하천수질에 미치는 영향을 평가하였다. 폭염지수는 낙동강유역 하류에 대해 Chl-a, TN, TP 항목에서 높은 밀도를 보였다. 분석대상지역 내 폭염이 발생한 확률과 반경 밖에서 발생할 확률의 우도비를 분석하기 위해 SaTScan을 이용한 공간검색통계분석을 실시하였다. 분석결과 폭염지수와 DO의 공간상관성이 높은 것으로 나타났다.

  • PDF

Using Mobile Phone Data, Analyzing Floating Population Near University Areas in Daegu, South Korea, before and after Covid-19 - with a focus on Comparisons with Seoul (통신사 빅데이터를 활용한 코로나 전염병 전후 대구 대학가 유동인구 분석 - 서울과의 비교를 중심으로)

  • Kim, Jae-Hun;Son, Ji-Hoon;Park, Han-Woo
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.3
    • /
    • pp.62-70
    • /
    • 2022
  • This study investigates the temporal structure and movement of floating people near university areas in Daegu metropolitan city, South Korea, before and after Covid-19. In order to determine Daegu's position, the current study compares Daegu and Seoul. The floating population is used as an index to reveal people's various activities in the area known as the local business district, which surrounds the university campus. The information was provided by mobile phone manufacturers. A municipal authority managed a public website where mobile data was made available. Several statistical and visualization techniques were used after the data pre-processing steps. As a result, the floating population fluctuation patterns in both cities in the first half of 2019 and 2020 were comparable. When the Covid-19 diffusion rate in Daegu stabilized in the second half of 2020, the floating population in Daegu increased slightly over the previous year, while the population in Seoul decreased due to the second wave of Covid-19.

Implementation of User Recommendation System based on Video Contents Story Analysis and Viewing Pattern Analysis (영상 스토리 분석과 시청 패턴 분석 기반의 추천 시스템 구현)

  • Lee, Hyoun-Sup;Kim, Minyoung;Lee, Ji-Hoon;Kim, Jin-Deog
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.12
    • /
    • pp.1567-1573
    • /
    • 2020
  • The development of Internet technology has brought the era of one-man media. An individual produces content on user own and uploads it to related online services, and many users watch the content of online services using devices that allow them to use the Internet. Currently, most users find and watch content they want through search functions provided by existing online services. These features are provided based on information entered by the user who uploaded the content. In an environment where content needs to be retrieved based on these limited word data, user unwanted information is presented to users in the search results. To solve this problem, in this paper, the system actively analyzes the video in the online service, and presents a way to extract and reflect the characteristics held by the video. The research was conducted to extract morphemes based on the story content based on the voice data of a video and analyze them with big data technology.

Determination of coagulant input rate in water purification plant using K-means algorithm and GBR algorithm (K-means 알고리즘과 GBR 알고리즘을 이용한 정수장 응집제 투입률 결정 기법)

  • Kim, Jinyoung;Kang, Bokseon;Jung, Hoekyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.6
    • /
    • pp.792-798
    • /
    • 2021
  • In this paper, an algorithm for determining the coagulant input rate in the drug-injection tank during the process of the water purification plant was derived through big data analysis and prediction based on artificial intelligence. In addition, analysis of big data technology and AI algorithm application methods and existing academic and technical data were reviewed to analyze and review application cases in similar fields. Through this, the goal was to develop an algorithm for determining the coagulant input rate and to present the optimal input rate through autonomous driving simulator and pilot operation of the coagulant input process. Through this study, the coagulant injection rate, which is an output variable, is determined based on various input variables, and it is developed to simulate the relationship pattern between the input variable and the output variable and apply the learned pattern to the decision-making pattern of water plant operating workers.

Social Safety Systems through Big Data Analysis of Public Data (공공 데이터의 빅데이터 분석을 통한 사회 안전망 시스템)

  • Lee, Sun Yui;Jung, Jun Hee;Cha, Gyeong Hyeon;Son, Ki Jun;Kim, Sang Ji;Kim, Jin Young
    • Journal of Satellite, Information and Communications
    • /
    • v.10 no.4
    • /
    • pp.77-82
    • /
    • 2015
  • This paper proposed an accident prediction model in order to prevent accidents in mountain areas using a big data analysis. Data of accidents in mountain areas are shown as graphs. We have analyzed cases: the number of accidents per year, day of week, time of day to find patterns of the negligent accident in mountain areas. The proposed prediction model consists of weighted variables of the accident in mountain through visualized big data analysis. The model of danger index performance is demonstrated by showing accident-prone areas with weighted variables.

A Meta-Analysis of Influencing Collagen Intake on Skin Utilizing Big Data (빅데이터 분석을 활용한 콜라겐 섭취가 피부에 미치는 영향에 관한 메타분석)

  • Jin, Chan-Yong;Yu, Ok-Kyeong;Nam, Soo-Tai
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.20 no.11
    • /
    • pp.2033-2038
    • /
    • 2016
  • Big data analysis, in the large amount of data stored as the data warehouse which it refers the process of discovering meaningful new correlations, patterns, trends and creating new values. The important issue of a meta-analysis is not the significance test, the effect size of the predictor variable on the criterion variable. We reviewed a total of 236 samples among 6 studies published on the topic related Collagen intake on skin between 2000 and 2016 in Korea. The results of the study are summarized as follows. First, we concluded that the path between before and after of Sebum (SB) had the largest effect size of (r = .416) Therefore, the effect of the Collagen intake intervention showed an explanatory power of 17 (%) about. Next, the path between before and after of Moisture (MS) had the higher the effect size of (r = .318). Thus, we present the theoretical and practical implications of these results.

Estimating the Method of the Number of Visitors of Water-friendly Park Using GPS Location Information (GPS 위치정보를 활용한 친수공원 이용객 수 추정방법 연구)

  • Kim, Seong-Jun;Kim, Tae-Jeong;Kim, Chang-Sung
    • Ecology and Resilient Infrastructure
    • /
    • v.7 no.3
    • /
    • pp.171-180
    • /
    • 2020
  • With the increase in industrialization and urbanization, scarcity of space for leisure life has become an important issue. Opportunities such as natural scenery and ecological experiences provided by waterfront spaces around streams are fundamental factors in the development of the community and creation of a hydrophilic park. In the past, on-site surveys have been conducted using human resources to quantify the number of river visitors, but the accuracy of the results was not sufficient owing to limitations in expenses, manpower, space, and time. In this study, to overcome this problem, we estimated the number of visitors using the location information related to hydrophilic parks. The study areas were Samrak Ecological Park and Daejeo Ecological Park located downstream of the Nakdong River. We compared and analyzed the pattern of the visitors by using the large communication data and the visiting pattern based on GPS location information. The GPS location information is based on Google Popular Times and Kakao visitor data. When the GPS location data were used, the pattern for weekday and weekend visitors was clearer than when the large communication data were used. Therefore, it is expected to be similar to the result of GPS location information if the number of visitors is extracted under the condition of precision of pCELL size and residence time of 30 minutes or more when using future communication big data. In addition, if revisions such as the Personal Information Protection Act are made to extract more accurate data, by estimating the number of visitors based on GPS data, more accurate indicators of the number of visitors can be derived.

Verification of firefighters' heuristics through big data analysis (빅데이터 분석을 통한 소방관의 경험법칙 검증 및 화재예방 활용)

  • Park, Sohyun;Park, Jeong-Hoon;Shin, Eun-Ji;Shin, Dongil
    • Journal of the Korean Institute of Gas
    • /
    • v.24 no.2
    • /
    • pp.50-55
    • /
    • 2020
  • The heuristics accumulated in the field activities of firefighters were reviewed through big data analysis of fire occurrences in Gyeonggi-do and researched to be utilized for proper fire prevention activities according to time, day, and target through quantitative modeling. Empirical rules with high sympathy were collected through direct interviews with firefighters. Among them, the rule of thumb that "Friday is the most fire-prone" is considered to be the most important in terms of fire monitoring and prediction. A big data comparison analysis was conducted, including the number of fires and damages that occurred in Gyeonggi-do in 2018. Furthermore, fire occurrence patterns by region, day of the week, time of day, and building type were derived. Regarding empirical rules that have been validated through research, relatively inexperienced firefighters also can make decisions by relying on refined quantitative predictive modeling and empirical rules including local government and time-based factors that reflect big fire occurrence data.