• Title/Abstract/Keywords: Public bigdata

Designing the Optimal Urban Distribution Network using GIS : Case of Milk Industry in Ulaanbaatar Mongolia (GIS를 이용한 최적 도심 유통 네트워크 설계 : 몽골 울란바타르 내 우유 산업 사례)

  • Enkhtuya, Daariimaa;Shin, KwangSup
    • The Journal of Bigdata / v.4 no.2 / pp.159-173 / 2019
  • Last-mile delivery optimization plays a key role in urban supply chain operation, because the last mile is the most expensive, time-consuming, and complicated part of the whole delivery process. The urban consolidation center (UCC) is regarded as a significant asset for meeting customer demand in last-mile delivery service. The key benefit of a UCC is that it improves the load balance of vehicles and reduces the total travel distance by finding better routes with well-organized multi-leg vehicle journeys in the urban area. This paper presents a model that uses multiple scenario analysis integrated with mathematical optimization techniques in a Geographic Information System (GIS). The model aims to find the best solution for a distribution network consisting of a DC and UCCs, and it is applied to the case of Ulaanbaatar, Mongolia. The proposed methodology integrates two sub-models: a location-allocation model and a vehicle routing problem. The multiple scenarios, devised by selecting UCC locations, are compared in terms of both general performance and delivery patterns. Quantitative metrics such as the economic value of capital cost, operating cost, and the balance of resource use are adopted to support better decisions. The results may help managers or public authorities who need to design a distribution network that optimizes last-mile delivery service using UCCs within an urban area.

Artificial Intelligence Algorithms, Model-Based Social Data Collection and Content Exploration (소셜데이터 분석 및 인공지능 알고리즘 기반 범죄 수사 기법 연구)

  • An, Dong-Uk;Leem, Choon Seong
    • The Journal of Bigdata / v.4 no.2 / pp.23-34 / 2019
  • Recently, crime that exploits digital platforms has been increasing continuously: about 140,000 cases occurred in 2015 and about 150,000 in 2016. Old-fashioned investigation techniques are therefore considered limited in handling such online crimes. The manual online searches and cognitive investigation methods that investigators widely use today are not enough to cope proactively with rapidly changing civil crimes. In addition, the fact that content is posted to unspecified users of social media makes investigation more difficult. Considering the characteristics of the online media where infringement crimes occur, this study suggests site-based collection and the Open API among web content collection methods. Because illegal content is published and deleted quickly, and new words and variants emerge rapidly and in many forms, it is difficult to recognize them promptly with dictionary-based morphological analysis that relies on manually registered entries. To solve this problem, we propose a tokenizing method that applies WPM (Word Piece Model) to the existing dictionary-based morphological analysis, as a data preprocessing step for quickly recognizing and responding to illegal content posted in online infringement crimes. In the data analysis, the optimal precision is verified with a vote-based ensemble method that uses supervised classification models for the investigation of illegal content. This study applies a classification algorithm model centered on illegal multilevel business cases to proactively recognize crimes that harm the public economy, and presents an empirical study for dealing effectively with social data collection and content investigation.
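
The vote-based ensemble mentioned in this abstract can be illustrated with a minimal sketch of hard majority voting. The three keyword rules below are hypothetical stand-ins for trained supervised classifiers, and the labels and sample post are invented for illustration; the abstract does not say which models or which voting rule the authors used.

```python
from collections import Counter


def majority_vote(labels):
    """Return the label chosen by the largest number of classifiers (hard voting)."""
    return Counter(labels).most_common(1)[0][0]


# Hypothetical stand-ins for trained classifiers: each maps a text to a label.
def clf_keyword(text):
    return "illegal" if "guaranteed profit" in text else "legal"


def clf_recruit(text):
    return "illegal" if "recruit" in text else "legal"


def clf_length(text):
    return "illegal" if len(text.split()) > 40 else "legal"


CLASSIFIERS = [clf_keyword, clf_recruit, clf_length]


def ensemble_predict(text, classifiers=CLASSIFIERS):
    """Collect one vote per classifier and return the majority label."""
    return majority_vote([clf(text) for clf in classifiers])


post = "guaranteed profit if you recruit three friends"
print(ensemble_predict(post))  # two of the three rules vote "illegal"
```

Using an odd number of voters, as here, avoids ties for a two-label problem.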

A Study on the Prediction for Apartment Sales Price: Focusing on the Basic Property, Economy, Education, Culture and Transportation Properties in S city, Gyeonggi-do (아파트 매매가격 예측에 관한 연구: 경기도 S시 아파트 기본속성과 경제·교육·문화·교통 속성을 중심으로)

  • Kim, Seonghun;Lee, Jung-Mok;Lee, Hyang-Seob;Yu, Su-Han;Shin, WooJin;Yu, Jong-Pil
    • The Journal of Bigdata / v.5 no.1 / pp.109-124 / 2020
  • In Korea, despite much interest in real estate, it is not easy to predict prices, because apartments are both residential spaces and investment assets. The key factors affecting apartment prices vary widely, and there are also regional characteristics. This study was conducted to derive the factors and characteristics that affect apartment sale prices in S City, Gyeonggi-do. It is generally assumed that better subway accessibility leads to higher apartment sale prices. In S City, however, prices were slightly lower closer to Line 1, whereas higher accessibility to the Shinbundang Line was associated with higher prices. The five-year average of government bonds was inversely related to price, while the M2 balance was proportional to price. The floor area ratio and the total number of parking lots had a great influence on price, and the presence of department stores and discount marts within 1.5 km was the most important factor among the cultural attributes.

A Study on the Forecasting Trend of Apartment Prices: Focusing on Government Policy, Economy, Supply and Demand Characteristics (아파트 매매가 추이 예측에 관한 연구: 정부 정책, 경제, 수요·공급 속성을 중심으로)

  • Lee, Jung-Mok;Choi, Su An;Yu, Su-Han;Kim, Seonghun;Kim, Tae-Jun;Yu, Jong-Pil
    • The Journal of Bigdata / v.6 no.1 / pp.91-113 / 2021
  • Despite the weight of real estate in the Korean asset market, market trends are not easy to predict; apartments in particular are hard to predict because they are both residential spaces and investment assets. The factors affecting apartment prices vary, and regional characteristics should also be considered. This study compares the factors and characteristics that affect apartment prices in Seoul as a whole, in the three Gangnam districts, and in the Nowon, Dobong, Gangbuk, Geumcheon, Gwanak, and Guro districts, and assesses the possibility of price prediction on that basis. The analysis used machine learning algorithms such as neural networks, CHAID, linear regression, and random forests. The most important factor affecting the average selling price of all apartments in Seoul was government policy, and easing policies, such as the easing of transaction regulations and of financial regulations, were highly influential. In the three Gangnam districts the policy influence was low, and in Gangnam-gu housing supply was the most important factor. In the six mid-to-lower-level districts, by contrast, government policies acted as important variables, and these districts were commonly influenced by financial regulatory policies.

Prediction Model of Real Estate ROI with the LSTM Model based on AI and Bigdata

  • Lee, Jeong-hyun;Kim, Hoo-bin;Shim, Gyo-eon
    • International journal of advanced smart convergence / v.11 no.1 / pp.19-27 / 2022
  • Across the world, 'housing' comprises a significant portion of wealth and assets. For this reason, fluctuations in real estate prices are highly sensitive issues for individual households. In Korea, housing prices have steadily increased over the years, and thus many Koreans view the real estate market as an effective channel for their investments. However, if one purchases a real estate property for the purpose of investing, several risks are involved when prices begin to fluctuate. The purpose of this study is to design a real estate price 'return rate' prediction model to help mitigate the risks involved with real estate investments and promote reasonable real estate purchases. Various approaches are explored to develop a model capable of predicting real estate prices based on an understanding of the immovability of the real estate market. This study employs the LSTM method, which is based on artificial intelligence and deep learning, to predict real estate prices and validate the model. LSTM networks are based on recurrent neural networks (RNN) but add cell states (which act as a type of conveyor belt) to the hidden states. LSTM networks are able to obtain cell states and hidden states in a recursive manner. Data on the actual trading prices of apartments in autonomous districts between January 2006 and December 2019 are collected from the Actual Trading Price Disclosure System of the Ministry of Land, Infrastructure and Transport (MOLIT). Additionally, basic data on apartments and commercial buildings are collected from the Public Data Portal and Seoul Metropolitan Government's data portal. The collected actual trading price data are scaled to monthly average trading amounts, and each data entry is pre-processed according to address to produce 168 data entries. An LSTM model for return rate prediction is prepared based on a time series dataset where the training period is set as April 2015 to August 2017 (29 months), the validation period as September 2017 to September 2018 (13 months), and the test period as December 2018 to December 2019 (13 months). The results of the return rate prediction study are as follows. First, the model achieved a prediction similarity level of almost 76%. This level was confirmed after the time series data had been collected and the final prediction model prepared. All in all, the results demonstrate the reliability of the LSTM-based model for return rate prediction.
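
The month counts quoted in this abstract can be checked with a short calculation. The sketch below uses only the period boundaries stated above; the dictionary keys and the helper name `months_inclusive` are illustrative choices, not part of the paper.

```python
from datetime import date


def months_inclusive(start, end):
    """Count calendar months from start to end, including both end months."""
    return (end.year - start.year) * 12 + (end.month - start.month) + 1


# Period boundaries as stated in the abstract (the day of month is irrelevant here).
PERIODS = {
    "collected": (date(2006, 1, 1), date(2019, 12, 1)),
    "training": (date(2015, 4, 1), date(2017, 8, 1)),
    "validation": (date(2017, 9, 1), date(2018, 9, 1)),
    "test": (date(2018, 12, 1), date(2019, 12, 1)),
}

month_counts = {name: months_inclusive(s, e) for name, (s, e) in PERIODS.items()}

for name, count in month_counts.items():
    print(f"{name}: {count} months")
```

The collection window of January 2006 to December 2019 spans 168 months, which matches the 168 monthly data entries mentioned in the abstract.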

Smart City Mobility and Road Innovation: A Study of Complete Street Adoption and Consideration Factors using the Delphi Method (스마트시티 모빌리티와 도로혁신: 델파이 기법을 활용한 완전도로 도입 및 고려 요인에 관한 연구)

  • Dong-Geon Kim;Se-Yeon Cheon;Ju-Young Kang
    • The Journal of Bigdata / v.8 no.2 / pp.233-248 / 2023
  • In the process of building the future of smart cities, innovation in mobility and road infrastructure is one of the most important topics. In particular, with the proliferation of autonomous vehicles and various types of mobility on the road, such as electric bicycles, electric kickboards, and electric wheels, roads have a variety of actors to accommodate, including traditional cars and pedestrians, and conflicts between them need to be resolved. Complete streets, a term coined in the United States in 2003, refers to the design and operation of roads that consider the equitable safety and convenience of all road users, including pedestrians, bicyclists, public transportation users, personal mobility (PM) users, and automobile drivers. Currently, many cities overseas are implementing complete streets, and research is being actively conducted to institutionalize them. However, there is a lack of research and discussion on complete streets in Korea. Therefore, this study aims to formalize the main factors to be considered in the design of complete streets by collecting and analyzing the opinions of academic and practitioner experts through the Delphi method. A total of three Delphi surveys were conducted, collecting free responses from experts through the first open-ended survey and organizing them into keywords to create the second and third closed-ended surveys. The second and third rounds of the survey consisted of a total of 52 questions, and 34 items out of 52 were selected as the final factors.

Linguistic Features Discrimination for Social Issue Risk Classification (사회적 이슈 리스크 유형 분류를 위한 어휘 자질 선별)

  • Oh, Hyo-Jung;Yun, Bo-Hyun;Kim, Chan-Young
    • KIPS Transactions on Software and Data Engineering / v.5 no.11 / pp.541-548 / 2016
  • The use of social media is already essential as a source of information for listening to users' various opinions and for monitoring. We define social 'risks' as issues that have a negative influence on public opinion in social media. This paper aims to discriminate among various linguistic features and reveal their effects in building an automatic classification model of social risks. In particular, we adopt a word embedding technique to represent linguistic clues in risk sentences. As a preliminary experiment to analyze the characteristics of individual features, we revise errors in the automatic linguistic analysis. As a result, the most important feature is NE (Named Entity) information, and the best condition is the combination of basic linguistic features, word embedding, and word clusters within core predicates. Experimental results under real conditions in social big data, including linguistic analysis errors, show precision of 92.08% and 85.84% for the frequent risk categories set and the full test set, respectively.

Building Linked Big Data for Stroke in Korea: Linkage of Stroke Registry and National Health Insurance Claims Data

  • Kim, Tae Jung;Lee, Ji Sung;Kim, Ji-Woo;Oh, Mi Sun;Mo, Heejung;Lee, Chan-Hyuk;Jeong, Han-Young;Jung, Keun-Hwa;Lim, Jae-Sung;Ko, Sang-Bae;Yu, Kyung-Ho;Lee, Byung-Chul;Yoon, Byung-Woo
    • Journal of Korean Medical Science / v.33 no.53 / pp.343.1-343.8 / 2018
  • Background: Linkage of public healthcare data is useful in stroke research because patients may visit different sectors of the health system before, during, and after stroke. Therefore, we aimed to establish high-quality big data on stroke in Korea by linking acute stroke registry and national health claim databases. Methods: Acute stroke patients (n = 65,311) with claim data suitable for linkage were included in the Clinical Research Center for Stroke (CRCS) registry during 2006-2014. We linked the CRCS registry with national health claim databases in the Health Insurance Review and Assessment Service (HIRA). Linkage was performed using 6 common variables: birth date, gender, provider identification, receiving year and number, and statement serial number in the benefit claim statement. For matched records, linkage accuracy was evaluated using differences between hospital visiting date in the CRCS registry and the commencement date for health insurance care in HIRA. Results: Of 65,311 CRCS cases, 64,634 were matched to HIRA cases (match rate, 99.0%). The proportion of true matches was 94.4% (n = 61,017) in the matched data. Among true matches (mean age 66.4 years; men 58.4%), the median National Institutes of Health Stroke Scale score was 3 (interquartile range 1-7). When comparing baseline characteristics between true matches and false matches, no substantial difference was observed for any variable. Conclusion: We could establish big data on stroke by linking CRCS registry and HIRA records, using claims data without personal identifiers. We plan to conduct national stroke research and improve stroke care using the linked big database.
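
The linkage step described in this abstract, matching records on six shared variables, can be sketched as an exact-match join on a composite key. The field names and toy records below are hypothetical and do not reflect the actual CRCS or HIRA schemas; the two percentages are recomputed from the counts reported in the abstract.

```python
# Hypothetical field names for the six common variables listed in the abstract.
KEY_FIELDS = (
    "birth_date", "gender", "provider_id",
    "receiving_year", "receiving_number", "statement_serial",
)


def link_key(record):
    """Build the composite key used to match a registry case to a claim."""
    return tuple(record[field] for field in KEY_FIELDS)


def link_records(registry, claims):
    """Return (registry_record, claim_record) pairs whose six key fields agree."""
    claims_by_key = {link_key(c): c for c in claims}
    return [
        (r, claims_by_key[link_key(r)])
        for r in registry
        if link_key(r) in claims_by_key
    ]


def percent(part, whole):
    """Percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Toy data, invented for illustration only.
base = {
    "birth_date": "1950-01-01", "gender": "M", "provider_id": "H001",
    "receiving_year": 2010, "receiving_number": "000123",
    "statement_serial": "0001",
}
registry = [
    dict(base, case_id="R1"),
    dict(base, case_id="R2", statement_serial="0002"),
]
claims = [dict(base, claim_id="C1")]

pairs = link_records(registry, claims)
print(len(pairs), "toy registry case(s) matched")

# Rates recomputed from the counts reported in the abstract.
print("match rate:", percent(64634, 65311))
print("true-match proportion:", percent(61017, 64634))
```

Evaluating match accuracy, which the authors did by comparing visit dates across the two sources, is outside the scope of this sketch.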

Exploratory study on the based on big data for fire prevention of multiple shops (다중이용업소의 화재예방을 위한 빅데이터 기반의 탐색적 연구)

  • Jeon, Byungkwan;Lee, Sung-Won
    • Journal of Industrial Convergence / v.16 no.4 / pp.27-32 / 2018
  • To prevent fires in multi-use businesses, where heavy loss of life is a concern, this research takes an exploratory approach to measures that can reduce such fires, based on statistical data from the National Fire Information Center of the National Fire Protection Agency and from the Fire Defense Administration. Using the preventive firefighting administrative statistics of the Fire and Disaster Management Agency, we focused the analysis on the effectiveness of fire safety education for multi-use businesses. From the results, we infer that fire safety education is not effective in actual fire prevention. Drawing on previous research, we found that the cause is insufficient management of fire safety education, and we derive an improvement plan for developing a systematic fire safety education system.

COVID-19 Discourse and Social Welfare Intervention through Online News Big Data: Focusing on the Elderly Living Alone (온라인 뉴스 빅데이터를 통한 코로나 19 담론과 사회복지 개입방안: 독거노인을 중심으로)

  • Yeo, Jiyoung
    • 한국노년학 / v.41 no.3 / pp.353-371 / 2021
  • The purpose of this study is to provide clues for social welfare policy making by revealing the discourse on social intervention and response, based on big data about the elderly living alone during the COVID-19 situation. Keyword analysis, network analysis, and topic analysis were used to explore how news media have portrayed the challenges facing older individuals and how central and local governments, as well as private organizations, have responded to them. The results are as follows. First, networks (degree, closeness, betweenness) were formed around region, delivery, society, support, and vulnerability, suggesting an increased demand for economic assistance and social support as well as for stronger service delivery systems. Second, the key topics derived included "establishing public delivery systems", "establishing local networks", "managing the care gap", "establishing a private economic support system", and "establishing a service organization system". Based on these results, the study presents a discourse on the organic role of government, communities, and the private sector, and suggests policy and practical implications by proposing a discussion on how to intervene for the elderly living alone in disaster situations such as COVID-19.