• Title/Summary/Keyword: data based model

Search Result 20,519, Processing Time 0.049 seconds

Using Structural Changes to support the Neural Networks based on Data Mining Classifiers: Application to the U.S. Treasury bill rates

  • Oh, Kyong-Joo
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.57-72
    • /
    • 2003
  • This article provides integrated neural network models for the interest rate forecasting using change-point detection. The model is composed of three phases. The first phase is to detect successive structural changes in interest rate dataset. The second phase is to forecast change-point group with data mining classifiers. The final phase is to forecast the interest rate with BPN. Based on this structure, we propose three integrated neural network models in terms of data mining classifier: (1) multivariate discriminant analysis (MDA)-supported neural network model, (2) case based reasoning (CBR)-supported neural network model and (3) backpropagation neural networks (BPN)-supported neural network model. Subsequently, we compare these models with a neural network model alone and, in addition, determine which of three classifiers (MDA, CBR and BPN) can perform better. For interest rate forecasting, this study then examines the predictability of integrated neural network models to represent the structural change.

  • PDF

A Study on the Application of Spatial Big Data from Social Networking Service for the Operation of Activity-Based Traffic Model (활동기반 교통모형 분석자료 구축을 위한 소셜네트워크 공간빅데이터 활용방안 연구)

  • Kim, Seung-Hyun;Kim, Joo-Young;Lee, Seung-Jae
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.15 no.4
    • /
    • pp.44-53
    • /
    • 2016
  • The era of Big Data has come and the importance of Big Data has been rapidly growing. The part of transportation, the Four-Step Travel Demand Model(FSTDM), a traditional Trip-Based Model(TBM) reaches its limit. In recent years, a traffic demand forecasting method using the Activity-Based Model(ABM) emerged as a new paradigm. Given that transportation means the spatial movement of people and goods in a certain period of time, transportation could be very closely associated with spatial data. So, I mined Spatial Big Data from SNS. After that, I analyzed the character of these data from SNS and test the reliability of the data through compared with the attributes of TBM. Finally, I built a database from SNS for the operation of ABM and manipulate an ABM simulator, then I consider the result. Through this research, I was successfully able to create a spatial database from SNS and I found possibilities to overcome technical limitations on using Spatial Big Data in the transportation planning process. Moreover, it was an opportunity to seek ways of further research development.

Reliability analysis of piles based on proof vertical static load test

  • Dong, Xiaole;Tan, Xiaohui;Lin, Xin;Zhang, Xuejuan;Hou, Xiaoliang;Wu, Daoxiang
    • Geomechanics and Engineering
    • /
    • v.29 no.5
    • /
    • pp.487-496
    • /
    • 2022
  • Most of the pile's vertical static load tests in construction sites are the proof load tests, which is difficult to accurately estimate the ultimate bearing capacity and analyze the reliability of piles. Therefore, a reliability analysis method based on the proof load-settlement (Q-s) data is proposed in this study. In this proposed method, a simple ultimate limit state function based on the hyperbolic model is established, where the random variables of reliability analysis include the model factor of the ultimate bearing capacity and the fitting parameters of the hyperbolic model. The model factor M = RuR / RuP is calculated based on the available destructive Q-s data, where the real value of the ultimate bearing capacity (RuR) is obtained by the complete destructive Q-s data; the predicted value of the ultimate bearing capacity (RuP) is obtained by the proof Q-s data, a part of the available destructive Q-s data, that before the predetermined load determined by the pile test report. The results demonstrate that the proposed method can easy and effectively perform the reliability analysis based on the proof Q-s data.

Harmonization of IFC 3D Building Model Standards and ISO/STEP AP202 Drawing Standards for 2D Shape Data Representation (IFC 3차원 건축모델표준과 ISO/STEP AP202도면표준의 2차원 형상정보 연계방안)

  • Won, Ji-Sun;Lim, Kyoung-Il;Kim, Seong-Sig
    • Korean Journal of Computational Design and Engineering
    • /
    • v.11 no.6
    • /
    • pp.429-439
    • /
    • 2006
  • The purpose of this study is to support the integration from current 2D drawing-based design to future 3D model-based design. In this paper, an important theme is the combination between the STEP-based 2D drawing standards (i.e., AP202) and the IFC-based 3D building model standards. To achieve the purpose, two methodologies are proposed as follows: the development of IFC extension model for the 2D shape data representation by harmonizing ISO/STEP AP202; and the development of mapping solution between IFC 2D extension model and KOSDIC by constructing the exchange scenario for 2D shape data representation. It is expected that the proposed IFC2X2 2D extension model and mapping solution will offer the basis of development of the integrated standards model in AEC industry.

Recovery the Missing Streamflow Data on River Basin Based on the Deep Neural Network Model

  • Le, Xuan-Hien;Lee, Giha
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2019.05a
    • /
    • pp.156-156
    • /
    • 2019
  • In this study, a gated recurrent unit (GRU) network is constructed based on a deep neural network (DNN) with the aim of restoring the missing daily flow data in river basins. Lai Chau hydrological station is located upstream of the Da river basin (Vietnam) is selected as the target station for this study. Input data of the model are data on observed daily flow for 24 years from 1961 to 1984 (before Hoa Binh dam was built) at 5 hydrological stations, in which 4 gauge stations in the basin downstream and restoring - target station (Lai Chau). The total available data is divided into sections for different purposes. The data set of 23 years (1961-1983) was employed for training and validation purposes, with corresponding rates of 80% for training and 20% for validation respectively. Another data set of one year (1984) was used for the testing purpose to objectively verify the performance and accuracy of the model. Though only a modest amount of input data is required and furthermore the Lai Chau hydrological station is located upstream of the Da River, the calculated results based on the suggested model are in satisfactory agreement with observed data, the Nash - Sutcliffe efficiency (NSE) is higher than 95%. The finding of this study illustrated the outstanding performance of the GRU network model in recovering the missing flow data at Lai Chau station. As a result, DNN models, as well as GRU network models, have great potential for application within the field of hydrology and hydraulics.

  • PDF

Analysis on the Propulsive Performance of Full Scale Ship (실선의 추진성능 해석기법에 관한 연구)

  • Yang, Seung-Il;Kim, Eun-Chan
    • 한국기계연구소 소보
    • /
    • s.9
    • /
    • pp.183-191
    • /
    • 1982
  • This report describes the analysis method of the full-scale propulsive performance by using the data of model test and the full-scale speed trial. The model test data were analyzed by the computer program "PPTT" based on "1978 ITTC Performance Prediction Method for Single Screw Ships." Also the full-scale speed trial data were analyzed by the computer program "SSTT" based on the newly proposed “SRS-KIMM Standard Method of Speed Trial Analysis." An analysis of model and full-scale test data was carried out for a 60.000 DWT Bulk Carrier and the correlation between model and full-scale ship was stuied.

  • PDF

Vacant House Prediction and Important Features Exploration through Artificial Intelligence: In Case of Gunsan (인공지능 기반 빈집 추정 및 주요 특성 분석)

  • Lim, Gyoo Gun;Noh, Jong Hwa;Lee, Hyun Tae;Ahn, Jae Ik
    • Journal of Information Technology Services
    • /
    • v.21 no.3
    • /
    • pp.63-72
    • /
    • 2022
  • The extinction crisis of local cities, caused by a population density increase phenomenon in capital regions, directly causes the increase of vacant houses in local cities. According to population and housing census, Gunsan-si has continuously shown increasing trend of vacant houses during 2015 to 2019. In particular, since Gunsan-si is the city which suffers from doughnut effect and industrial decline, problems regrading to vacant house seems to exacerbate. This study aims to provide a foundation of a system which can predict and deal with the building that has high risk of becoming vacant house through implementing a data driven vacant house prediction machine learning model. Methodologically, this study analyzes three types of machine learning model by differing the data components. First model is trained based on building register, individual declared land value, house price and socioeconomic data and second model is trained with the same data as first model but with additional POI(Point of Interest) data. Finally, third model is trained with same data as the second model but with excluding water usage and electricity usage data. As a result, second model shows the best performance based on F1-score. Random Forest, Gradient Boosting Machine, XGBoost and LightGBM which are tree ensemble series, show the best performance as a whole. Additionally, the complexity of the model can be reduced through eliminating independent variables that have correlation coefficient between the variables and vacant house status lower than the 0.1 based on absolute value. Finally, this study suggests XGBoost and LightGBM based machine learning model, which can handle missing values, as final vacant house prediction model.

Comparative Study of Citizen Science and Expert Based Survey Data Using the Species Distribution Model of Rana uenoi (큰산개구리(Rana uenoi ) 종분포모형을 활용한 시민과학 및 전문가 기반 조사자료의 비교연구)

  • Woncheol Lee;Jeongwoo Yoo;Paikho Rho
    • Journal of Environmental Science International
    • /
    • v.32 no.6
    • /
    • pp.429-440
    • /
    • 2023
  • Quantitative habitat model is established with species occurrence and spatial abundance data, which were usually acquired by professional field ecologists and citizen scientists. The importance of citizen science data is increasing, but the quality of these data needs to be evaluated. This study aims to identify and compare both expert-based data and citizen science data based on the performance power of quantitative models derived from both data sets. A Maximum Entropy (MaxENT) model was developed using eight environmental variables, including climate, topography, landcover and distance to forest edge. The AUC values derived from the MaxENT model were 0.842 and 0.809, respectively, indicating a high level of explanatory power. All environmental variables has similar values for both data sets, except for the distance to forest edge and rice paddy, which was relatively higher for expert-based survey data than that of the citizen science data as the distances increased. This result suggests that habitat model derived from expert-based survey data shows more ecological niche including wider ranges from forest edges and isolated habitat patches of rice paddy. This is presumably because citizen scientists focuses on direct observation methods, whereas professional field surveys investigate a wider variety of methods.

Generating Training Dataset of Machine Learning Model for Context-Awareness in a Health Status Notification Service (사용자 건강 상태알림 서비스의 상황인지를 위한 기계학습 모델의 학습 데이터 생성 방법)

  • Mun, Jong Hyeok;Choi, Jong Sun;Choi, Jae Young
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.1
    • /
    • pp.25-32
    • /
    • 2020
  • In the context-aware system, rule-based AI technology has been used in the abstraction process for getting context information. However, the rules are complicated by the diversification of user requirements for the service and also data usage is increased. Therefore, there are some technical limitations to maintain rule-based models and to process unstructured data. To overcome these limitations, many studies have applied machine learning techniques to Context-aware systems. In order to utilize this machine learning-based model in the context-aware system, a management process of periodically injecting training data is required. In the previous study on the machine learning based context awareness system, a series of management processes such as the generation and provision of learning data for operating several machine learning models were considered, but the method was limited to the applied system. In this paper, we propose a training data generating method of a machine learning model to extend the machine learning based context-aware system. The proposed method define the training data generating model that can reflect the requirements of the machine learning models and generate the training data for each machine learning model. In the experiment, the training data generating model is defined based on the training data generating schema of the cardiac status analysis model for older in health status notification service, and the training data is generated by applying the model defined in the real environment of the software. In addition, it shows the process of comparing the accuracy by learning the training data generated in the machine learning model, and applied to verify the validity of the generated learning data.

Analysis Model Evaluation based on IoT Data and Machine Learning Algorithm for Prediction of Acer Mono Sap Liquid Water

  • Lee, Han Sung;Jung, Se Hoon
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.10
    • /
    • pp.1286-1295
    • /
    • 2020
  • It has been increasingly difficult to predict the amounts of Acer mono sap to be collected due to droughts and cold waves caused by recent climate changes with few studies conducted on the prediction of its collection volume. This study thus set out to propose a Big Data prediction system based on meteorological information for the collection of Acer mono sap. The proposed system would analyze collected data and provide managers with a statistical chart of prediction values regarding climate factors to affect the amounts of Acer mono sap to be collected, thus enabling efficient work. It was designed based on Hadoop for data collection, treatment and analysis. The study also analyzed and proposed an optimal prediction model for climate conditions to influence the volume of Acer mono sap to be collected by applying a multiple regression analysis model based on Hadoop and Mahout.