• Title/Summary/Keyword: multiple regression techniques

Search Result 256, Processing Time 0.028 seconds

A Study on the Fuel Economy based on the Driving Patterns for Passenger Car in the Metropolitan Area (승용차 도심 주행패턴에 의한 연비 성능 분석)

  • 정남훈;이우택;선우명호
    • Transactions of the Korean Society of Automotive Engineers
    • /
    • v.11 no.1
    • /
    • pp.25-31
    • /
    • 2003
  • There are a lot of factors influencing on the automobile fuel economy such as average speed, average acceleration, acceleration sum per kilometer, and so on. In this study, various driving data were recorded during road tests. The accumulated road test mileage in Seoul metropolitan area is around 1,300 kilometers. The data were analyzed by multivariate statistical techniques including correlation analysis, principal component analysis, and multiple linear regression analysis. The analyzed results show that the average trip time per kilometer is one of the most important factors to fuel consumption and the increase of the average speed is desirable for reducing emissions and fuel consumption.

Quantitative Structure Activity Relationship Prediction of Oral Bioavailabilities Using Support Vector Machine

  • Fatemi, Mohammad Hossein;Fadaei, Fatemeh
    • Journal of the Korean Chemical Society
    • /
    • v.58 no.6
    • /
    • pp.543-552
    • /
    • 2014
  • A quantitative structure activity relationship (QSAR) study is performed for modeling and prediction of oral bioavailabilities of 216 diverse set of drugs. After calculation and screening of molecular descriptors, linear and nonlinear models were developed by using multiple linear regression (MLR), artificial neural network (ANN), support vector machine (SVM) and random forest (RF) techniques. Comparison between statistical parameters of these models indicates the suitability of SVM over other models. The root mean square errors of SVM model were 5.933 and 4.934 for training and test sets, respectively. Robustness and reliability of the developed SVM model was evaluated by performing of leave many out cross validation test, which produces the statistic of $Q^2_{SVM}=0.603$ and SPRESS = 7.902. Moreover, the chemical applicability domains of model were determined via leverage approach. The results of this study revealed the applicability of QSAR approach by using SVM in prediction of oral bioavailability of drugs.

Outward Testing Procedure for the Identification of Multiple Outliers (다수 이상치 인식(認識)을 위한 외향성 검정 절차)

  • Yum, Joon-Keun;Kim, Jong-Woo
    • Journal of Korean Society for Quality Management
    • /
    • v.24 no.3
    • /
    • pp.50-64
    • /
    • 1996
  • This article is concerned with procedures for detecting multiple y outliers in linear regression. The outward-testing procedure, which is controled by the initial subset and the minimum residuals, is suggested by two phases. The performance of this procedure is compared with others by Monte Carlo techniques and found to be superior. The procedure, however, fails in detecting y outliers that are on high-leverage cases in Phase 1. Thus, we proposed ELMS algorithm for a set of suspect observations, in Phase 1. In Phase 2, the proposed testing is conducted using the studentized residuals to see which of the suspect cases are outliers. Several examples are analyzed.

  • PDF

Statistical analysis of hazen-williams C and influencing factors in multi-regional water supply system (광역상수도 유속계수와 영향인자에 관한 통계적 분석)

  • Kim, Bumjun;Kim, Gilho;Kim, Hung soo
    • Journal of Korea Water Resources Association
    • /
    • v.49 no.5
    • /
    • pp.399-410
    • /
    • 2016
  • In case of the application of Hazen-Williams C for design, operation or maintenance of water supply system, field situations always should be reflected on the factors. In this study, the relationships between C factors and influencing factors are analyzed using statistical techniques with 174 measured C factor data collected in periodic inspection for safety diagnosis in multi-regional water supply systems. To analyze their relationships, cross analysis, one-way ANOVA, correlation analysis were conducted. Analysis results showed that C factors had high correlations with both of elapsed year and pipe diameter and were relatively highly affected by coating material among influencing factors with the categorical type. On the other hand, elapsed year, pipe diameter and water type were meaningful influencing factors according to the results of multiple regression analysis. The Cluster analysis revealed that C factors had a tendency of being fundamentally classified on the basis of the elapsed year of about 20 years and the pipe diameter of 1500mm. Although C factors were generally greatly affected by elapsed year, size of pipe diameter relatively had an large influence on values of them in case of large diameter pipes. Lastly, It can be suggested that C factor estimation formulas using multiple regression analysis and clustering analysis in this study, can be applied as decision standards of C factor in multi-regional water supply systems.

Development of the Deterioration Models for the Port Structures by the Multiple Regression Analysis and Markov Chain (다중 회귀분석 및 Markov Chain을 통한 항만시설물의 상태열화모델 개발)

  • Cha, Kyunghwa;Kim, Sung-Wook;Kim, Jung Hoon;Park, Mi-Yun;Kong, Jung Sik
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.28 no.3
    • /
    • pp.229-239
    • /
    • 2015
  • In light of the significant increase in the quantities of goods transported and the development of the shipping industry, the frequency of usage of port structures has increased; yet, the government's budget for the shipping & port of SOC has been reduced. Port structures require systematically effective maintenance and management trends that address their growing frequency of usage. In order to construct a productive maintenance system, it is essential to develop deterioration models of port structures that consider various characteristics, such as location, type, use, constructed level, and state of maintenance. Processes for developing such deterioration models include examining factors that cause the structures to deteriorate, collecting data on deteriorating structures, and deciding methods of estimation. The techniques used for developing the deterioration models are multiple regression analysis and Markov chain theory. Multiple regression analysis can reflect changes over time and Markov chain theory can apply status changes based on a probabilistic method. Along with these processes, the deterioration models of open-type and gravity-type wharfs were suggested.

Modelling on the Carbonation Rate Prediction of Non-Transport Underground Infrastructures Using Deep Neural Network (심층신경망을 이용한 비운송 지중구조물의 탄산화속도 예측 모델링)

  • Youn, Byong-Don
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.22 no.4
    • /
    • pp.220-227
    • /
    • 2021
  • PCT (Power Cable Tunnel) and UT (Utility Tunnel), which are non-transport underground infrastructures, are mostly RC (Reinforced Concrete) structures, and their durability decreases due to the deterioration caused by carbonation over time. In particular, since the rate of carbonation varies by use and region, a predictive model based on actual carbonation data is required for individual maintenance. In this study, a carbonation prediction model was developed for non-transport underground infrastructures, such as PCT and UT. A carbonation prediction model was developed using multiple regression analysis and deep neural network techniques based on the actual data obtained from a safety inspection. The structures, region, measurement location, construction method, measurement member, and concrete strength were selected as independent variables to determine the dependent variable carbonation rate coefficient in multiple regression analysis. The adjusted coefficient of determination (Ra2) of the multiple regression model was found to be 0.67. The coefficient of determination (R2) of the model for predicting the carbonation of non-transport underground infrastructures using a deep neural network was 0.82, which was superior to the comparative prediction model. These results are expected to help determine the optimal timing for repair on carbonation and preventive maintenance methodology for PCT and UT.

Thermal Error Measurement and Modeling Techniques for the 5 Degree of Freedom(DOF) Spindle Unit Drifts in CNC Machine Tools (CNC 공작기계 스핀들 유닛의 5자유도 열변형 오차측정 및 모델링 기술)

  • Park, Hui-Jae;Lee, Seok-Won;Gwon, Hyeok-Dong
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.24 no.5 s.176
    • /
    • pp.1343-1351
    • /
    • 2000
  • Thermally induced errors have been significant factors affecting the machine tool accuracy. In this paper, the spindle thermal error has been focused, where the 5 degree of freedom thermal error components are considered. An effective measurement system has been devised for the 5 DOF thermal errors, consisting of gap sensors and thermocouples around the micro-computer interfaced environment. Several thermal error modeling techniques are also implemented for the thermal error prediction: multiple linear regression, neural network and system identification methods, etc. The performance of the thermal error modeling techniques is evaluated and compared, giving the system identification method as the optimum model having the least deviation. The developed system for the thermal error measurement and modeling was practically applied to a CNC machining center, and the spindle thermal errors were effectively compensated around the micro computer-machine tool interfaced networks. The machine tool accuracy was improved about 4-5 times typically.

Determination of Relationship between Thermal and Mechanical Properties of Wood Material

  • Ozcan, Cemal;Korkmaz, Mustafa
    • Journal of the Korean Wood Science and Technology
    • /
    • v.47 no.4
    • /
    • pp.408-417
    • /
    • 2019
  • Non-destructive test techniques are becoming increasingly important for assessment and maintenance. These techniques are very useful for assessment of materials such as wood, whose performance can vary considerably depending on the conditions of use. It is possible to estimate some mechanical properties of a material by determining the movement of energy through the material with the help of these techniques. In this study, it was investigated whether the wood material could be tested nondestructively by the heat energy produced by a source. The correlations between the thermal conductivity and mechanical properties of Scots pine (Pinus sylvestris L.) and sessile oak (Quercus petraea L.) woods were investigated. The thermal conductivity (TC), density, modulus of rupture (MOR), compression strength (CS), and modulus of elasticity (MOE) values of samples were measured according to the related standards and these values were correlated with each other. The linear and multiple regression tests were employed to determine the correlation between thermal conductivity and mechanical properties. The results showed that there is a very strong correlation between thermal conductivity and both density and MOR values. However, the correlations between TC and both MOE and CS were moderate. The results of this study suggest that the thermal conductivity value can be used to estimate the density and some mechanical properties of wood.

Assessment through Statistical Methods of Water Quality Parameters(WQPs) in the Han River in Korea

  • Kim, Jae Hyoun
    • Journal of Environmental Health Sciences
    • /
    • v.41 no.2
    • /
    • pp.90-101
    • /
    • 2015
  • Objective: This study was conducted to develop a chemical oxygen demand (COD) regression model using water quality monitoring data (January, 2014) obtained from the Han River auto-monitoring stations. Methods: Surface water quality data at 198 sampling stations along the six major areas were assembled and analyzed to determine the spatial distribution and clustering of monitoring stations based on 18 WQPs and regression modeling using selected parameters. Statistical techniques, including combined genetic algorithm-multiple linear regression (GA-MLR), cluster analysis (CA) and principal component analysis (PCA) were used to build a COD model using water quality data. Results: A best GA-MLR model facilitated computing the WQPs for a 5-descriptor COD model with satisfactory statistical results ($r^2=92.64$,$Q{^2}_{LOO}=91.45$,$Q{^2}_{Ext}=88.17$). This approach includes variable selection of the WQPs in order to find the most important factors affecting water quality. Additionally, ordination techniques like PCA and CA were used to classify monitoring stations. The biplot based on the first two principal components (PCs) of the PCA model identified three distinct groups of stations, but also differs with respect to the correlation with WQPs, which enables better interpretation of the water quality characteristics at particular stations as of January 2014. Conclusion: This data analysis procedure appears to provide an efficient means of modelling water quality by interpreting and defining its most essential variables, such as TOC and BOD. The water parameters selected in a COD model as most important in contributing to environmental health and water pollution can be utilized for the application of water quality management strategies. At present, the river is under threat of anthropogenic disturbances during festival periods, especially at upstream areas.

A Brief Empirical Verification Using Multiple Regression Analysis on the Measurement Results of Seaport Efficiency of AHP/DEA-AR (다중회귀분석을 이용한 AHP/DEA-AR 항만효율성 측정결과의 실증적 검증소고)

  • Park, Ro-kyung
    • Journal of Korea Port Economic Association
    • /
    • v.32 no.4
    • /
    • pp.73-87
    • /
    • 2016
  • The purpose of this study is to investigate the empirical results of Analytic Hierarchy Process/Data Envelopment Analysis-Assurance Region(AHP/DEA-AR) by using multiple regression analysis during the period of 2009-2012 with 5 inputs (number of gantry cranes, number of berth, berth length, terminal yard, and mean depth) and 2 outputs (container TEU, and number of direct calling shipping companies). Assurance Region(AR) is the most important tool to measure the efficiency of seaports, because individual seaports are characterized in terms of inputs and outputs. Traditional AHP and multiple regression analysis techniques have been used for measuring the AR. However, few previous studies exist in the field of seaport efficiency measurement. The main empirical results of this study are as follows. First, the efficiency ranking comparison between the two models (AHP/DEA-AR and multiple regression) using the Wilcoxon signed-rank test and Mann-Whitney signed-rank sum test were matched with the average level of 84.5 % and 96.3% respectively. When data for four years are used, the ratios of the significant probability are decreased to 61.4% and 92.5%. The policy implication of this study is that the policy planners of Korean port should introduce AHP/DEA-AR and multiple regression analysis when they measure the seaport efficiency and consider the port investment for enhancing the efficiency of inputs and outputs. The next study will deal with the subjects introducing the Fuzzy method, non-radial DEA, and the mixed analysis between AHP/DEA-AR and multiple regression analysis.