• 제목/요약/키워드: regression analysis method

검색결과 4,587건 처리시간 0.033초

지역빈도해석 및 다중회귀분석을 이용한 산악형 강수해석 (Orographic Precipitation Analysis with Regional Frequency Analysis and Multiple Linear Regression)

  • 윤혜선;엄명진;조원철;허준행
    • 한국수자원학회논문집
    • /
    • 제42권6호
    • /
    • pp.465-480
    • /
    • 2009
  • 본 연구에서는 다중회귀분석을 이용하여 산악효과를 야기하는 지형인자와 강수와의 관계를 파악하였다. 섬 전체가 산악지형인 제주도의 연평균강수량과 지수홍수법으로 산출한 확률강우량을 강수자료로 사용하여 산악효과를 야기하는 지형인자로 선정한 고도, 위 경도와 회귀모형을 구성하였다. 회귀분석 결과 연평균강수량과 고도와의 선형관계가 확률강우량에서도 동일하게 나타났으며, 고도이외에 위도, 경도를 각각 추가인자로 고려할 경우 강우량과 더욱 강한 상관성을 보였다. 또한, 고도와 위도, 경도를 모두 고려한 회귀모형을 이용한 지형공간분석 결과 제주도의 실제 강수특성과 마찬가지로 남동부로 편중된 강수형태를 보여 모형의 적합성을 증명하였다. 그러나 지속시간 및 재현기간과 무관하게 높은 고도에서 회귀식의 유효성이 감소하므로, 높은 고도에서의 추가적인 산악효과인자의 강수량에 대한 영향이 존재될 것으로 판단되므로 추후 연구가 필요하다.

On Bootstrapping; Bartlett Adjusted Empirical Likelihood Ratio Statistic in Regression Analysis

  • Woochul Kim;Duk-Hyun Ko;Keewon Lee
    • Journal of the Korean Statistical Society
    • /
    • 제25권2호
    • /
    • pp.205-216
    • /
    • 1996
  • The bootstrap calibration method for empirical likelihood is considered to make a confidence region for the regression coefficients. Asymptotic properties are studied regarding the coverage probability. Small sample simulation results reveal that the bootstrap calibration works quite well.

  • PDF

Mixed Effects Kernel Binomial Regression

  • Hwang, Chang-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • 제19권4호
    • /
    • pp.1327-1334
    • /
    • 2008
  • Mixed effect binomial regression models are widely used for analysis of correlated count data in which the response is the result of a series of one of two possible disjoint outcomes. In this paper, we consider kernel extensions with nonparametric fixed effects and parametric random effects. The estimation is through the penalized likelihood method based on kernel trick, and our focus is on the efficient computation and the effective hyperparameter selection. For the selection of hyperparameters, cross-validation techniques are employed. Examples illustrating usage and features of the proposed method are provided.

  • PDF

연관규칙을 이용한 근골격계 질환 예방 - 다변량 로지스틱 회귀분석의 결과를 기반으로 - (Preventing the Musculoskeletal Disorders using Association Rule - Based on Result of Multiple Logistic Regression -)

  • 박승헌;이석환
    • 대한안전경영과학회지
    • /
    • 제9권4호
    • /
    • pp.29-38
    • /
    • 2007
  • We adapted association rules of data mining in order to investigate the relation among the factors of musculoskeletal disorders and proposed the method of preventing the musculoskeletal disorders associated with multiple logistic regression in previous study. This multiple logistic regression was difficult to establish the method of preventing musculoskeletal disorders in case factors can't be managed by worker himself, i.e., age, gender, marital status. In order to solve this problem, we devised association rules of factors of musculoskeletal disorders and proposed the interactive method of preventing the musculoskeletal disorders, by applying association rules with the result of multiple logistic regression in previous study. The result of correlation analysis showed that prevention method of one part also prevents musculoskeletal disorders of other parts of body.

A modified partial least squares regression for the analysis of gene expression data with survival information

  • Lee, So-Yoon;Huh, Myung-Hoe;Park, Mira
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권5호
    • /
    • pp.1151-1160
    • /
    • 2014
  • In DNA microarray studies, the number of genes far exceeds the number of samples and the gene expression measures are highly correlated. Partial least squares regression (PLSR) is one of the popular methods for dimensional reduction and known to be useful for the classifications of microarray data by several studies. In this study, we suggest a modified version of the partial least squares regression to analyze gene expression data with survival information. The method is designed as a new gene selection method using PLSR with an iterative procedure of imputing censored survival time. Mean square error of prediction criterion is used to determine the dimension of the model. To visualize the data, plot for variables superimposed with samples are used. The method is applied to two microarray data sets, both containing survival time. The results show that the proposed method works well for interpreting gene expression microarray data.

Analyzing Operation Deviation in the Deasphalting Process Using Multivariate Statistics Analysis Method

  • Park, Joo-Hwang;Kim, Jong-Soo;Kim, Tai-Suk
    • 한국멀티미디어학회논문지
    • /
    • 제17권7호
    • /
    • pp.858-865
    • /
    • 2014
  • In the case of system like MES, various sensors collect the data in real time and save it as a big data to monitor the process. However, if there is big data mining in distributed computing system, whole processing process can be improved. In this paper, system to analyze the cause of operation deviation was built using the big data which has been collected from deasphalting process at the two different plants. By applying multivariate statistical analysis to the big data which has been collected through MES(Manufacturing Execution System), main cause of operation deviation was analyzed. We present the example of analyzing the operation deviation of deasphalting process using the big data which collected from MES by using multivariate statistics analysis method. As a result of regression analysis of the forward stepwise method, regression equation has been found which can explain 52% increase of performance compare to existing model. Through this suggested method, the existing petrochemical process can be replaced which is manual analysis method and has the risk of being subjective according to the tester. The new method can provide the objective analysis method based on numbers and statistic.

최근 3년간(2007~2009년) 여성건강간호학회지의 상관분석과 회귀분석 통계활용 논문 분석 (Analysis of the Correlation and Regression Analysis Studies from the Korean Journal of Women Health Nursing over the Past Three Years (2007~2009))

  • 이은주;이은희;김증임;강희선;오현이;전은미;천숙희
    • 여성건강간호학회지
    • /
    • 제17권2호
    • /
    • pp.187-194
    • /
    • 2011
  • Purpose: This study investigated the statistical methods and the results had reported correlation/regression analysis in the studies of Korean Journal of Women Health Nursing (KJWHN). Methods: We reviewed 45 studies using correlation/regression analysis for the suitability of the statistical methods and the research purposes, the criteria for analysis of figures, tables and charts had published in the KJWHN from vol 13 (1) in 2007 to vol 15 (4) in 2009. Results: Forty three studies were fitted to their statistical methodology and their research purposes. Eleven studies considered the minimum sample size. Fourteen regression studies used multiple regression and 12 studies used forward method for variable entry. Only one study among the 17 regression studies accomplished scatter plots and residuals examination. Sixteen studies in correlation studies and six studies in regression studies showed some errors in either the title, variables, category of figures, tables and charts. In the regression study, all reported $R^2$ and ${\beta}$ values except one. Conclusion: It was found that there were still statistical errors or articulation errors in the statistical analysis. All reviewers need to be reviewed more closely for detecting errors not only during reviewing process of the manuscript but also periodic publication for the quality of this academic journal.

A Study on Improving the predict accuracy rate of Hybrid Model Technique Using Error Pattern Modeling : Using Logistic Regression and Discriminant Analysis

  • Cho, Yong-Jun;Hur, Joon
    • Journal of the Korean Data and Information Science Society
    • /
    • 제17권2호
    • /
    • pp.269-278
    • /
    • 2006
  • This paper presents the new hybrid data mining technique using error pattern, modeling of improving classification accuracy. The proposed method improves classification accuracy by combining two different supervised learning methods. The main algorithm generates error pattern modeling between the two supervised learning methods(ex: Neural Networks, Decision Tree, Logistic Regression and so on.) The Proposed modeling method has been applied to the simulation of 10,000 data sets generated by Normal and exponential random distribution. The simulation results show that the performance of proposed method is superior to the existing methods like Logistic regression and Discriminant analysis.

  • PDF

반응표면을 활용한 통계적 모멘트 추정 방법과 신뢰도해석에 적용 (RS-based method for estimating statistical moments and its application to reliability analysis)

  • 허재성;곽병만
    • 대한기계학회:학술대회논문집
    • /
    • 대한기계학회 2004년도 추계학술대회
    • /
    • pp.852-857
    • /
    • 2004
  • A new and efficient method for estimating the statistical moments of a system performance function has been developed. The method consists of two steps: (1) An approximate response surface is generated by a quadratic regression model, and (2) the statistical moments of the regression model are then calculated by experimental design techniques proposed by Seo and $Kwak^{(4)}$. In this approach, the size of experimental region affects the accuracy of the statistical moments. Therefore, the region size should be selected suitably. The D-optimal design and the central composite design are adopted over the selected experimental region for the regression model. Finally, the Pearson system is adopted to decide the distribution type of the system performance function and to analyze structural reliability.

  • PDF

여행비용접근법을 통한 대구 우방타워랜드의 편익가치 측정 (Estimation of Users대 Benefit Value for Woobang Tower Land in Taegu Using Travel Cost Method)

  • 김수봉;심애경;권기찬
    • 한국환경과학회지
    • /
    • 제10권3호
    • /
    • pp.173-178
    • /
    • 2001
  • The aim of this paper is to evaluate users benefit values of theme park using Travel Cost Method with special reference to Woobang Tower Land in Taegu for the estimation of economic values. This research is mainly based on questionnaire survey of 100 users of the theme park. Socio-economic factors such as income, year of education, annual income, age and money(travel cost) are analysed from 5 residential areas of the respondents. Multiple regression analysis was used for the evaluation of annual number of park visitings based on the analysis. The regression model shows NV = $\alpha$+$\beta_1$TC+$\beta_2$INC+$\beta_3$EDU+$\beta_4$AGE (NV : Annual Number of Visitings, TC : Travel Cost, INC : Annual Income, EDU : Years of Education, AGE : Age). Regarding to visitors demand curve based on the equation showed that annual economic values of Woobang Tower Land was estimated as 50billion Korean Won.

  • PDF