• Title/Abstract/Keywords: multiple regression techniques

Comparative Application of Various Machine Learning Techniques for Lithology Predictions

  • 정진아;박은규
    • 한국지하수토양환경학회지:지하수토양환경 / Vol. 21, No. 3 / pp. 21-34 / 2016
  • In the present study, we compared various machine learning techniques for predicting subsurface structures from multiple sources of secondary information (i.e., well-logging data). The machine learning techniques employed are Naive Bayes classification (NB), artificial neural network (ANN), support vector machine (SVM), and logistic regression classification (LR). As alternative models, a conventional hidden Markov model (HMM) and a modified hidden Markov model (mHMM) are used, in which additional information on the transition probability between primary properties is incorporated into the predictions. For the comparisons, 16 boreholes consisting of four different materials are synthesized, which show directional non-stationarity in the upward and downward directions. Furthermore, two types of secondary information that are statistically related to each material are generated. The comparative analysis across various case studies shows that the accuracy of the techniques degrades with the inclusion of additive errors and with a small amount of training data. For the HMM predictions, the conventional HMM shows accuracy similar to that of the models that do not rely on transition probability. However, the mHMM consistently shows the highest prediction accuracy among the test cases, which can be attributed to the consideration of geological nature in the training of the model.
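
To illustrate how transition probabilities between materials can enter a prediction, here is a minimal sketch of standard Viterbi decoding for a conventional HMM. It is not the authors' mHMM. The material names, the discretized log readings ("low"/"high"), and all probabilities are hypothetical values chosen for illustration.

```python
import math


def safe_log(p):
    """Logarithm that maps zero probability to negative infinity."""
    return math.log(p) if p > 0 else float("-inf")


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return the most likely hidden-state sequence for the observations."""
    # best[t][s]: best log-probability of any path ending in state s at step t
    best = [{s: safe_log(start_p[s]) + safe_log(emit_p[s][observations[0]])
             for s in states}]
    back = [{}]  # back[t][s]: predecessor of s on that best path
    for t in range(1, len(observations)):
        best.append({})
        back.append({})
        for s in states:
            prev, score = max(
                ((p, best[t - 1][p] + safe_log(trans_p[p][s])) for p in states),
                key=lambda item: item[1],
            )
            best[t][s] = score + safe_log(emit_p[s][observations[t]])
            back[t][s] = prev
    # Trace the best path backwards from the most likely final state.
    path = [max(states, key=lambda s: best[-1][s])]
    for t in range(len(observations) - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path


# Hypothetical two-material borehole with one discretized well-log reading.
states = ["sand", "clay"]
start_p = {"sand": 0.5, "clay": 0.5}
trans_p = {"sand": {"sand": 0.9, "clay": 0.1},
           "clay": {"sand": 0.1, "clay": 0.9}}
emit_p = {"sand": {"low": 0.8, "high": 0.2},
          "clay": {"low": 0.2, "high": 0.8}}
log_readings = ["low", "low", "low", "high", "high", "high"]

decoded = viterbi(log_readings, states, start_p, trans_p, emit_p)
print(decoded)
```

Classifiers such as NB or LR would label each depth independently, whereas the transition matrix here makes a change of material between adjacent depths less likely than staying in the same material.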

An Evaluation of the Statistical Techniques Used in the 1995-2007 Editions of the Korea Institute of Oriental Medicine

  • 강경원;강병갑;고미미;신선화;최선미
    • 한국한의학연구원논문집 / Vol. 13, No. 2 (Serial No. 20) / pp. 121-125 / 2007
  • Background and Purpose: The purpose of this study was to investigate what kinds of statistical techniques have been used to analyze data from oriental medicine research. Methods: 135 original articles that used statistical techniques in their data analysis were selected from the articles published in The Journal of Korea Institute of Oriental Medicine (JKIOM) between 1995 and 2007. Results: Among the 135 articles, 59 used descriptive statistics while 76 used inferential statistics for data analysis. Among those 76 articles, the two-sample t-test (33 articles), analysis of variance (29 articles), regression (9 articles), chi-square test (5 articles), nonparametric tests (4 articles), Fisher's exact test (3 articles), and other tests (9 articles) were chosen to analyze the data. SAS and SPSS (82.50%) were the most commonly used statistical software packages. Nonparametric tests were used in 4 articles (6.97%) of 67 articles and parametric tests in 63 articles (93.03%) of 67 articles. Among the 29 articles that used analysis of variance, Duncan (8 articles), Dunnett (4 articles), Bonferroni (4 articles), Tukey (3 articles), and Scheffé (1 article) procedures were used for multiple comparison; 9 articles did not carry out a multiple comparison. Conclusions: The range of statistical packages and statistical analyses used has so far been limited. Advanced statistical analyses were rarely used in oriental medicine research.

Statistical Models of Air Temperatures in Seoul

  • 김학열;김운수
    • 한국조경학회지 / Vol. 31, No. 3 / pp. 74-82 / 2003
  • Under the assumption that the temperature at a location is closely related to the land use characteristics around that location, this study assesses the impact of urban land use patterns on air temperature. To investigate the relationship, GIS techniques and statistical analyses are used after spatially connecting urban land use data for the Seoul Metropolitan Area with atmospheric data observed at Automatic Weather Stations (AWS). The research method is as follows: (1) to identify the land use factors that matter for temperature, simple linear regressions on urban land use characteristics are conducted for a specific time period (pilot study); (2) to build a final model, multiple regressions are carried out with those factors; and (3) to verify that the final model can be applied to explain temperature variations beyond that period, the model is applied to 5 different time periods: 1999 as a whole; summer in 1999; 1998 as a whole; summer in 1998; August in 1998. The simple linear regression models in the pilot study show that transportation facilities area and open space area are very influential on urban air temperature variations, explaining 66 and 61 percent of the variations, respectively. However, the other land use variables (residential, commercial, and mixed land use) are found to have a weak or insignificant relationship to air temperature. A multiple linear regression with the two important variables from the pilot study is estimated, and the model explains 75 percent of the variability in air temperatures with the correct signs of the regression coefficients. Thus, it is empirically shown that an increase in open space and a decrease in transportation facilities area can lead to a decrease in air temperature.
After the final model is applied to the 5 different time periods, the estimated models explain 68 ∼ 75 percent of the variations in the temperatures, with significant regression coefficients for all explanatory variables. This result suggests that an air temperature model for one specific time period could be a good model for other time periods near that period. The important implications of this result for lessening high air temperatures are: (1) to expand and conserve open space and (2) to control transportation-related factors such as transportation facilities area, road pavement, and traffic congestion.
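
As a rough illustration of the final modeling step described above, the sketch below fits a multiple linear regression with two predictors by solving the normal equations and reports the share of variance explained (R²). The data are made up for the example and are not from the study. The variable names (`transport_share`, `open_space_share`) and the helpers `solve_linear_system` and `fit_ols` are hypothetical.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b_i] for row, b_i in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_ols(rows, y):
    """Least-squares fit of y on the predictors; returns (coefficients, R^2)."""
    design = [[1.0] + list(r) for r in rows]  # prepend the intercept column
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    beta = solve_linear_system(xtx, xty)
    fitted = [sum(b * v for b, v in zip(beta, d)) for d in design]
    mean_y = sum(y) / len(y)
    ss_res = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    return beta, 1.0 - ss_res / ss_tot


# Made-up observations: (transport_share, open_space_share) around each station.
land_use = [(0.10, 0.40), (0.20, 0.30), (0.30, 0.35),
            (0.15, 0.50), (0.40, 0.10), (0.25, 0.20)]
temps = [24.25, 24.86, 25.17, 24.07, 26.31, 25.39]  # made-up temperatures (°C)

beta, r2 = fit_ols(land_use, temps)
print("intercept, transport, open space:", beta)
print("R^2:", r2)
```

With these illustrative numbers the transport coefficient comes out positive and the open-space coefficient negative, which is the sign pattern the abstract calls "correct".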

Modeling of Suspended Solids and Sea Surface Salinity in Hong Kong using Aqua/MODIS Satellite Images

  • Wong, Man-Sing;Lee, Kwon-Ho;Kim, Young-Joon;Nichol, Janet Elizabeth;Li, Zhangqing;Emerson, Nick
    • 대한원격탐사학회지 / Vol. 23, No. 3 / pp. 161-169 / 2007
  • A study was conducted in Hong Kong with the aim of deriving an algorithm for the retrieval of suspended sediment (SS) and sea surface salinity (SSS) concentrations from Aqua/MODIS level 1B reflectance data with 250 m and 500 m spatial resolutions. 'In-situ' measurements of SS and SSS were also compared with coincident MODIS spectral reflectance measurements over the ocean surface. This is the first study of SSS modeling in Southeast Asia using earth observation satellite images. Three analysis techniques, namely multiple regression, linear regression, and principal component analysis (PCA), were applied to the MODIS data and the 'in-situ' measurement datasets of SS and SSS. The correlation coefficients for each analysis method show that multiple regression on the 500 m spatial resolution MODIS images gives the best results, with $R^2$ = 0.82 for SS and $R^2$ = 0.81 for SSS. The Root Mean Square Errors (RMSE) between satellite and 'in-situ' data are 0.92 mg/L for SS and 1.63 psu for SSS, respectively. These results suggest that 500 m spatial resolution MODIS data are suitable for water quality modeling in the study area. Furthermore, applying these models to MODIS images of the Hong Kong and Pearl River Delta (PRD) Region accurately reproduces the spatial distribution of high turbidity with realistic SS concentrations.
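
For readers unfamiliar with the error metric quoted above, a minimal sketch of the RMSE between model-derived and in-situ values follows. The function name `rmse` and the sample values are illustrative only and are not taken from the study.

```python
import math


def rmse(predicted, observed):
    """Root mean square error between paired predictions and observations."""
    if len(predicted) != len(observed) or not predicted:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - o) ** 2 for p, o in zip(predicted, observed)]
    return math.sqrt(sum(squared) / len(squared))


# Made-up suspended-sediment values in mg/L (satellite-derived vs. in-situ).
satellite_ss = [5.0, 7.5, 10.0, 12.0]
in_situ_ss = [4.0, 8.5, 9.0, 13.0]
print(rmse(satellite_ss, in_situ_ss))
```

The result is in the same unit as the inputs, which is why the abstract reports it in mg/L for SS and psu for SSS.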

Mathematical Optimization Techniques in Drug Product Design and Process Analysis. Optimization Techniques in Tablet Design

  • 김용배
    • 약학회지 / Vol. 18, No. 1 / pp. 49-58 / 1974
  • A tablet product design problem was structured as a constrained optimization problem and subsequently solved by multiple regression analysis and the Lagrangian method of optimization. Aluminum flufenamate was the drug chosen, and microcrystalline cellulose and starch were the binder and disintegrant, respectively. The effect of the binder and disintegrant concentrations on tablet hardness, friability, volume, in vitro release rate, and urinary excretion rate of the drug in human subjects was recorded. Since a reasonably rapid release rate of drug is generally an important objective in the design of a solid dosage form, optimization of this parameter was used to study the applicability of constrained optimization to a pharmaceutical product design problem. In addition to finding optimal solutions, the application of sensitivity analysis studies to such problems was also illustrated. It would appear that prediction of the in vivo $t_{50\%}$ response from a knowledge of the in vitro $t_{50\%}$ response can be made fairly accurately for the tablet system used in this study.
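
The constrained-optimization setup described above can be written in generic textbook form as follows. This is only a sketch and not the paper's actual model. Every symbol is illustrative: $x_1, x_2$ stand for the binder and disintegrant concentrations, $f$ for the regression-fitted response to be optimized (for example the release rate), $g_i$ for the regression-fitted responses that are constrained (such as hardness or friability), and $c_i$ for their required values.

```latex
\begin{aligned}
&\text{optimize } f(x_1, x_2)
  \quad \text{subject to } g_i(x_1, x_2) = c_i,\; i = 1, \dots, m, \\
&\mathcal{L}(x_1, x_2, \lambda)
  = f(x_1, x_2) - \sum_{i=1}^{m} \lambda_i \bigl( g_i(x_1, x_2) - c_i \bigr), \\
&\frac{\partial \mathcal{L}}{\partial x_j} = 0 \;\; (j = 1, 2),
  \qquad
  \frac{\partial \mathcal{L}}{\partial \lambda_i} = 0 \;\; (i = 1, \dots, m).
\end{aligned}
```

Solving the stationarity conditions gives candidate optima, and each multiplier $\lambda_i$ measures how sensitive the optimal response is to a small change in the constraint level $c_i$.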

Influence of parents' parenting values and beliefs on preschoolers' problem behaviors

  • 이은주;민하영
    • 한국생활과학회지 / Vol. 15, No. 4 / pp. 541-549 / 2006
  • The purpose of this study is to clarify that parents' values and beliefs in bringing up their children are closely related to their children's problem behaviors. The subjects are 267 preschoolers attending kindergarten in the Daegu area. The statistical techniques used are two-way ANOVA, the Scheffé test, Pearson's correlation, and regression. The results of this study are as follows: (1) Problem behaviors of preschoolers are significantly related to parents' values. Preschoolers whose parents have a higher level of values have a lower level of problem behaviors. (2) Problem behaviors of preschoolers are significantly related to parents' beliefs. Preschoolers whose parents have a higher level of beliefs have a higher level of problem behaviors. (3) The multiple regression analysis shows that parents' parenting values and beliefs are crucially predictive of preschoolers' problem behaviors. In particular, parents' parenting beliefs are more relevant to preschoolers' problem behaviors than parents' parenting values are.

The assets investment of urban households and related factors

  • 손주영;이연숙
    • 가족자원경영과 정책 / Vol. 3, No. 1 / pp. 67-82 / 1999
  • The purpose of this study was to examine the assets investment behavior of urban households and to find the factors affecting it. The data were obtained from 442 households living in Seoul. The statistical techniques used for this study included descriptive statistics, logistic regression, and multiple regression. The major findings were as follows: First, the ownership of household assets was affected by age, education, total income, total asset, the number of income sources, income stability, financial expectations, past financial experiences, and the job of the household head. Second, the amount of household assets was affected by the husband's age, education, unearned income, total asset, income stability, expectations of the future, past financial experiences, and the job of the household head. Third, the ratios of household assets were affected by age, education, unearned income, family size, the number of income sources, and the job of the household head. The findings of this study can be used by financial counseling and planning practitioners and in education.

A Study on the Fault Process and Equipment Analysis of Plastic Ball Grid Array Manufacturing Using Data-Mining Techniques

  • Sim, Hyun Sik
    • Journal of Information Processing Systems / Vol. 16, No. 6 / pp. 1271-1280 / 2020
  • The yield and quality of a micromanufacturing process are important management factors. In real-world situations, it is difficult to achieve a high yield from a manufacturing process because the products are produced through multiple nanoscale manufacturing processes. Therefore, it is necessary to identify the processes and equipment that lead to low yields. This paper proposes an analytical method to identify the processes and equipment that cause a defect in the plastic ball grid array (PBGA) during the manufacturing process using logistic regression and stepwise variable selection. The proposed method was tested with the lot trace records of a real work site. The records included the sequence of equipment that the lot had passed through and the number of faults of each type in the lot. We demonstrated that the test results reflect the real situation in a PBGA manufacturing process, and the major equipment parameters were then controlled to confirm the improvement in yield; the yield improved by approximately 20%.

A Study on the Development of a Technique to Predict Missing Travel Speed Collected by Taxi Probe

  • 윤병조
    • 대한토목학회논문집 / Vol. 31, No. 1D / pp. 43-50 / 2011
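  • A link travel speed monitoring system based on taxi probes is one of the core subsystems of intelligent transportation systems (ITS). Link travel speeds collected through the taxi probe technique are widely used to monitor traffic conditions on urban road networks and to provide travel time information. However, because the taxi probe technique has a small sample size, missing states occur in which data are not collected in real time when traffic congestion makes the link travel time longer than the data collection interval. These missing states span anywhere from a single time period to multiple time periods, and existing single-period prediction techniques cannot predict the state over multiple time periods. A technique for predicting real-time link travel speeds under multi-period missing states is therefore required. In this study, a technique was developed that predicts travel speeds for both single and multiple time periods while overcoming the limitations of existing single-period prediction techniques. The developed model is based on nonparametric regression (NPR) and, despite predicting over multiple time periods, showed better accuracy than existing single-period prediction techniques.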
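
The abstract does not say which form of nonparametric regression the model uses. As one common possibility, the sketch below uses k-nearest-neighbor regression over historical speed patterns and fills several consecutive missing periods by feeding each prediction back in as input. Both choices are assumptions made for illustration, and the function names and speed values are hypothetical.

```python
def knn_predict_speed(history, recent, k=3):
    """Predict the next speed as the mean outcome of the k most similar patterns.

    history: list of (pattern, next_speed) pairs from past observations
    recent:  the most recent speeds, same length as each pattern
    """
    def distance(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

    neighbors = sorted(history, key=lambda item: distance(item[0], recent))[:k]
    return sum(speed for _, speed in neighbors) / len(neighbors)


def fill_missing(history, recent, steps, k=3):
    """Predict several consecutive missing periods (assumed iterative scheme)."""
    window = list(recent)
    predictions = []
    for _ in range(steps):
        nxt = knn_predict_speed(history, window, k)
        predictions.append(nxt)
        window = window[1:] + [nxt]  # slide the window over the prediction
    return predictions


# Made-up link speeds in km/h: (previous two periods) -> following period.
history = [((40, 38), 36), ((30, 25), 22), ((41, 39), 37),
           ((20, 18), 15), ((39, 37), 35)]
filled = fill_missing(history, recent=[40, 38], steps=2)
print(filled)
```

A single-period predictor corresponds to `steps=1`. The multi-period case reuses the same lookup, so its accuracy depends on how well earlier predictions stand in for the missing observations.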

A Study on the Fuel Economy based on the Driving Patterns for Passenger Car in the Metropolitan Area

  • 정남훈;이우택;선우명호
    • 한국자동차공학회논문집 / Vol. 11, No. 1 / pp. 25-31 / 2003
  • Many factors influence automobile fuel economy, such as average speed, average acceleration, and the acceleration sum per kilometer. In this study, various driving data were recorded during road tests. The accumulated road test mileage in the Seoul metropolitan area is around 1,300 kilometers. The data were analyzed by multivariate statistical techniques including correlation analysis, principal component analysis, and multiple linear regression analysis. The results show that the average trip time per kilometer is one of the most important factors for fuel consumption and that increasing the average speed is desirable for reducing emissions and fuel consumption.