• Title/Abstract/Keywords: Data prediction model

Search results: 5,375 (processing time: 0.035 s)

Prediction Model of Real Estate Transaction Price with the LSTM Model based on AI and Bigdata

  • Lee, Jeong-hyun;Kim, Hoo-bin;Shim, Gyo-eon
    • International Journal of Advanced Culture Technology
    • /
    • Vol. 10, No. 1
    • /
    • pp.274-283
    • /
    • 2022
  • Korea faces a number of difficulties arising from rising housing prices. Because housing accounts for the lion's share of personal assets, fluctuating housing prices are expected to cause many problems. The purpose of this study is to build a housing price prediction model that helps prevent such risks and encourages reasonable real estate purchases. The study made many attempts to understand real estate instability and to create an appropriate housing price prediction model. It predicted and validated housing prices using LSTM, a type of artificial intelligence deep learning technology. LSTM is a network that adds a cell state, which acts as a conveyor belt, to the hidden state of the conventional RNN, and computes the cell state and hidden state recursively. Real sale prices of apartments in autonomous districts from January 2006 to December 2019 were collected through the Ministry of Land, Infrastructure, and Transport's real sale price open system, and basic apartment and commercial district information was collected through the Public Data Portal and Seoul Metropolitan City Data. The collected real sale price data were scaled by monthly average sale price, and a total of 168 data points were organized by preprocessing the data by address. To predict prices, the LSTM was implemented with a training period of 29 months (April 2015 to August 2017), a validation period of 13 months (September 2017 to September 2018), and a test period of 13 months (December 2018 to December 2019), following the time series data set. The study obtained a prediction similarity of 76 percent. We sought to design a prediction model of real estate transaction prices with an LSTM model based on AI and big data. The final prediction model was built from the collected time series data and showed that a model with 76 percent similarity can be achieved. This supports the reliability of predicting the rate of return with the LSTM method.
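
The period lengths quoted in the abstract can be checked with a short sketch. It only enumerates calendar months; the helper name `month_range` is hypothetical and nothing here reproduces the authors' preprocessing or model.

```python
def month_range(start, end):
    """Return (year, month) tuples from start to end, both inclusive."""
    months = []
    year, month = start
    while (year, month) <= end:
        months.append((year, month))
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return months

# Periods as stated in the abstract.
all_months = month_range((2006, 1), (2019, 12))   # full collection window
train = month_range((2015, 4), (2017, 8))         # training period
validation = month_range((2017, 9), (2018, 9))    # validation period
test = month_range((2018, 12), (2019, 12))        # test period

print(len(all_months), len(train), len(validation), len(test))
# prints "168 29 13 13"
```

The counts match the abstract: 168 monthly data points overall and 29, 13, and 13 months for training, validation, and testing. The stated periods leave October and November 2018 out of both the validation and test sets.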

A Study on the Predictability of Hospital's Future Cash Flow Information

  • 문영전;양동현
    • 한국병원경영학회지
    • /
    • Vol. 11, No. 3
    • /
    • pp.19-41
    • /
    • 2006
  • The objective of this study was to design a model that predicts the future cash flow of hospitals and, on the basis of that model, to support sound hospital management. The five cash flow measurement variables discussed in the financial accrual part were used and defined as NI, NIDPR, CFO, CFAI, and CC. To measure cash flow, B/S-related variables, P/L-related variables, and financial-ratio-related variables were utilized. Cash flow models were designed, and the martingale model and the market model were used to assess the prediction ability of the five cash flow models. MAE and MER were used to compare the relative prediction ability of the cash flow prediction models and the simple market model. To demonstrate the superiority of the cash flow prediction model, cross-section data from 32 regional public hospitals were combined with 4-year time series data, and a pooled cross-sectional time series regression model was estimated by GLS. The analysis proceeded in three steps. First, each cash flow prediction model, the martingale model, and the market model were built, and MAE and MER were estimated. Second, a difference test was conducted on the MAE and MER of the cash flow prediction models. Third, after ranking the prediction errors of the cash flow model, the martingale model, and the market model by size, a Friedman test was applied to compare prediction ability. The results were as follows: when a t-test was conducted to compare prediction ability among the models, the prediction error of the cash flow model was smaller than that of the martingale and market models, and the difference in prediction error was significant, so the cash flow model was judged superior to the other models. These results are conducive to presenting a suitable prediction model of future cash flow for hospitals and can provide valuable information for hospital policy decisions. The research is useful for estimating (1) a hospital's benefit, solvency, and capital supply ability for the replacement of fixed equipment; (2) a hospital's liquidity, solvency, and financial ability; and (3) evaluation ability in hospital management. Further research should sample all hospitals, construct advanced cash flow models by dimension and establishment type, and study a unified model that relates the individual cash flow models.
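
As a rough illustration of the benchmark comparison described above, the sketch below scores a martingale forecast (next period's value is predicted to equal the current value) with MAE. The cash flow figures are hypothetical, and the abstract's MER measure is left out because its exact definition is not given.

```python
def martingale_forecast(series):
    """Martingale benchmark: the forecast for period t+1 is the value at t."""
    return series[:-1]

def mean_absolute_error(actual, predicted):
    """Average absolute gap between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

# Hypothetical annual cash flows for one hospital (illustrative only).
cash_flow = [100.0, 110.0, 105.0, 120.0]

actual = cash_flow[1:]                     # periods that can be forecast
naive = martingale_forecast(cash_flow)     # [100.0, 110.0, 105.0]
mae = mean_absolute_error(actual, naive)
print(mae)  # prints "10.0"
```

A candidate cash flow model would be scored with the same function on the same periods, and a lower MAE than the martingale benchmark would indicate better prediction ability.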

Defect Type Prediction Method in Manufacturing Process Using Data Mining Technique

  • 변성규;강창욱;심성보
    • 산업경영시스템학회지
    • /
    • Vol. 27, No. 2
    • /
    • pp.10-16
    • /
    • 2004
  • Data mining is the exploration and analysis, by automatic or semiautomatic means, of large quantities of data in order to discover meaningful patterns and rules. This paper uses a data mining technique to predict defect types in a manufacturing process. The purpose of this paper is to model the recognition of defect type patterns and the prediction of each defect type before it occurs in the manufacturing process. The proposed model consists of data handling, defect type analysis, and defect type prediction stages. The performance measurement shows that the proposed model has higher prediction accuracy than a logistic regression model.

A Study on the Conceptual Design for the Real-time Wind Power Prediction System in Jeju

  • 이영미;유명숙;최홍석;김용준;서영준
    • 전기학회논문지
    • /
    • Vol. 59, No. 12
    • /
    • pp.2202-2211
    • /
    • 2010
  • The wind power prediction system is composed of a meteorological forecasting module, a wind power output calculation module, and an HMI (Human Machine Interface) visualization system. The final output of the system is a short-term (6 hr ahead) and mid-term (48 hr ahead) wind power prediction value. The meteorological forecasting module for wind speed and direction combines a physical and a statistical model. The WRF (Weather Research and Forecasting) model, a three-dimensional numerical weather model, is used as the physical model, and the GFS (Global Forecast System) model is used for initial condition forecasting. Terrain data at 100 m resolution is used to improve accuracy. In addition, the physical model was optimized using historical weather data for Jeju. The mid-term prediction value from the physical model is used in the statistical method for short-term prediction. The final power prediction is calculated by an optimal adjustment between currently observed data and data predicted from the power curve model. The final wind power prediction value is provided to customers through the HMI visualization system. The aim of this study is to further improve the accuracy of this prediction system and to develop a practical system for power system operation and the energy market in the Smart Grid.

Interpretation of Data Mining Prediction Model Using Decision Tree

  • Kang, Hyuncheol;Han, Sang-Tae;Choi, Jong-Ho
    • Communications for Statistical Applications and Methods
    • /
    • Vol. 7, No. 3
    • /
    • pp.937-943
    • /
    • 2000
  • Data mining usually deals with massive undesigned data containing many variables whose characteristics and association rules are unknown, so the results of an analysis are not easy to interpret. This paper shows, using two real examples, that a decision tree can be very useful in interpreting a data mining prediction model.

Optimization of SWAN Wave Model to Improve the Accuracy of Winter Storm Wave Prediction in the East Sea

  • Son, Bongkyo;Do, Kideok
    • 한국해양공학회지
    • /
    • Vol. 35, No. 4
    • /
    • pp.273-286
    • /
    • 2021
  • In recent years, as human casualties and property damage caused by hazardous waves have increased in the East Sea, precise wave prediction skills have become necessary. In this study, the Simulating WAves Nearshore (SWAN) third-generation numerical wave model was calibrated and optimized to enhance the accuracy of winter storm wave prediction in the East Sea. We used Source Term 6 (ST6) and physical observations from a large-scale experiment conducted in Australia and compared its results to Komen's formula, the default in SWAN. As input wind data, we used the Korean Meteorological Agency's (KMA's) operational meteorological model, the Regional Data Assimilation and Prediction System (RDAPS); the European Centre for Medium Range Weather Forecasts' newest 5th generation re-analysis data (ERA5); and the Japanese Meteorological Agency's (JMA's) meso-scale forecasting data. We analyzed the accuracy of each model's results by comparing them to observation data. For quantitative assessment, observed wave data for 6 locations from KMA and the Korea Hydrographic and Oceanographic Agency (KHOA) were used, and statistical analysis was conducted to assess model accuracy. The ST6 models had a smaller root mean square error and a higher correlation coefficient than the default model in significant wave height prediction. For peak wave period simulation, however, the results were inconsistent across models and locations. Among simulations with different wind data, the simulation using ERA5 as input wind data showed the most accurate results overall but underestimated wave height in high wave events compared to the simulations using RDAPS and the JMA meso-scale model. The results also showed that the spatial resolution of the wind data plays a more significant role in predicting high wave events. Nevertheless, the numerical model optimized in this study showed some limitations in predicting high waves that rise rapidly because of meteorological events. This suggests that further research is necessary to enhance the accuracy of wave prediction in various climate conditions, such as extreme weather.
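
The two accuracy measures named above, root mean square error and the correlation coefficient, can be sketched as follows. The wave heights are hypothetical values chosen for illustration, not data from the study.

```python
import math

def rmse(observed, simulated):
    """Root mean square error between observed and simulated values."""
    n = len(observed)
    return math.sqrt(sum((o - s) ** 2 for o, s in zip(observed, simulated)) / n)

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)

# Hypothetical significant wave heights in metres (illustrative only).
observed = [1.0, 2.0, 3.0, 4.0]
simulated = [1.5, 2.5, 3.5, 4.5]   # same shape, constant +0.5 m bias

error = rmse(observed, simulated)
corr = pearson_r(observed, simulated)
print(round(error, 3), round(corr, 3))  # prints "0.5 1.0"
```

The example shows why both measures are reported together: a simulation with a constant bias can correlate perfectly with observations while still carrying a nonzero RMSE.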

Relationships Between the Characteristics of the Business Data Set and Forecasting Accuracy of Prediction Models

  • 이원하;최종욱
    • 지능정보연구
    • /
    • Vol. 4, No. 1
    • /
    • pp.133-147
    • /
    • 1998
  • Recently, many researchers have sought deterministic equations, based on chaos theory or fractal theory, that can accurately predict future events. The theory says that some events that seem random but are internally deterministic can be accurately predicted by fractal equations. In contrast to conventional methods such as the AR, MA, or ARIMA model, the fractal equation attempts to discover a deterministic order inherent in a time series data set. In discovering deterministic order, researchers have found that neural networks are much more effective than conventional statistical models. Even though the prediction accuracy of a network can differ depending on its topological structure and modifications of the algorithms, many researchers have asserted that neural network systems outperform other systems because of the non-linear behaviour of the network models, mechanisms of massive parallel processing, and generalization capability based on adaptive learning. However, a recent survey shows that the prediction accuracy of forecasting models can be determined by the model structure and data structures. In experiments based on actual economic data sets, the prediction accuracy of the neural network model was found to be similar to that of the conventional forecasting model. In particular, for the data set that is deterministically chaotic, the AR model, a conventional statistical model, was not significantly different from the MLP model, a neural network model. This result shows that the forecasting model appropriate to a prediction task should be selected based on the characteristics of the time series data set. The characteristics of the data set were analyzed by fractal analysis, measurement of the Hurst index, and measurement of Lyapunov exponents. In conclusion, no significant difference was found between a conventional forecasting model and a typical neural network model in forecasting future events for time series data that is deterministically chaotic.
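
To make "deterministically chaotic" and the Lyapunov exponent concrete, the sketch below uses the logistic map, a standard textbook example that is an assumption here and not a data set from the paper. It estimates the exponent as the average log of the map's local stretching rate; a positive value indicates chaos.

```python
import math

def logistic_map(x, r=4.0):
    """One step of the logistic map x -> r * x * (1 - x)."""
    return r * x * (1.0 - x)

def lyapunov_exponent(x0=0.2, r=4.0, n_steps=5000, n_transient=100):
    """Estimate the Lyapunov exponent as the mean of log|f'(x)| along an orbit."""
    x = x0
    for _ in range(n_transient):          # discard the initial transient
        x = logistic_map(x, r)
    total = 0.0
    for _ in range(n_steps):
        total += math.log(abs(r * (1.0 - 2.0 * x)))   # f'(x) = r * (1 - 2x)
        x = logistic_map(x, r)
    return total / n_steps

estimate = lyapunov_exponent()
print(f"estimated Lyapunov exponent: {estimate:.3f}")
```

For r = 4 the exact exponent is ln 2 (about 0.693), so the estimate should land close to that value. The series looks random yet is fully determined by its starting point, which is the kind of data the abstract describes.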

En-route Ground Speed Prediction and Posterior Inference Using Generative Model

  • 백현진;이금진
    • 한국항공운항학회지
    • /
    • Vol. 27, No. 4
    • /
    • pp.27-36
    • /
    • 2019
  • Accurate trajectory prediction is key to the safe and efficient operation of aircraft. One way to improve trajectory prediction accuracy is to develop a model for aircraft ground speed prediction. This paper proposes a generative model for posterior aircraft ground speed prediction. The proposed method fits a Gaussian Mixture Model (GMM) to historical aircraft speed data, and the model is then used to generate a probabilistic speed profile of the aircraft. The performance of the proposed method is demonstrated with real traffic data from the Incheon Flight Information Region (FIR).

Artificial-Neural-Network-based Night Crime Prediction Model Considering Environmental Factors

  • Lee, Juwon;Jeong, Yongwook;Jung, Sungwon
    • Architectural research
    • /
    • Vol. 24, No. 1
    • /
    • pp.1-11
    • /
    • 2022
  • As the occurrence of a crime is dependent on different factors, their correlations are beyond the ordinary cognitive range. Owing to this limitation, systems face difficulty in correlating various factors, thereby requiring the assistance of artificial intelligence (AI). Therefore, AI has become indispensable for crime prediction. Crimes can cause severe and irrevocable damage to a society. Recently, big data has been introduced for developing highly accurate models for crime prediction. Prediction of night crimes should be given significant consideration, because crimes primarily occur at night, when the spatiotemporal characteristics become vulnerable to crimes. Many environmental factors that influence crime rate are applied for crime prediction, and their influence on crime rate may differ based on temporal characteristics and the nature of the crime. This study aims to identify the environmental factors that influence sex and theft crimes occurring at night and proposes an artificial neural network (ANN) model to predict sex and theft crimes at night in random areas. The crime data of A district in Seoul for 12 years (2004-2015) was used, and environmental factors that influence sex and theft crimes were derived through multiple regression analysis. Two types of crime prediction models were developed: Type A, using all environmental factors as input data, and Type B, using only the significant factors (obtained from regression analysis) as input data. The Type B model was more accurate than Type A, by 3.26% for theft crimes and 9.47% for sex crimes.

Defect Severity-based Defect Prediction Model using CL

  • Lee, Na-Young;Kwon, Ki-Tae
    • 한국컴퓨터정보학회논문지
    • /
    • Vol. 23, No. 9
    • /
    • pp.81-86
    • /
    • 2018
  • Software defect severity is very important in projects with limited historical data and in new projects. However, general software defect prediction has difficulty collecting label information for the training set, and cross-project defect prediction requires a large amount of data. In this paper, an unclassified data set with defect severity is clustered according to the distribution ratio, and a defect severity-based prediction model is proposed by way of labeling. The proposed model applies CLAMI to JM1 and PC4, the defect severity-based NASA datasets with the least ambiguity, and is evaluated by ACC against the original data. In the experiments, the proposed model improved on existing defect severity-based prediction models by 0.15 (15%) on JM1 and 0.12 (12%) on PC4.