• Title/Summary/Keyword: REGRESSION ANALYSIS

Learning system for Regression Analysis using Multimedia and Statistical Software (멀티미디어와 통계 소프트웨어를 활용한 회귀분석 학습 시스템)

  • 안기수;허문열
    • The Korean Journal of Applied Statistics / v.11 no.2 / pp.389-401 / 1998
  • This paper introduces CybeRClass (Cyber Regression Class), a system that uses animation and voice to teach regression analysis. The structure of the system makes it possible to extend it to multivariate analysis methods such as discriminant analysis and cluster analysis. Multimedia ToolBook serves as the multimedia tool, and Xlisp-Stat is used for statistical computation and statistical graphics.

An Application of a New Two-Way Regression Model for Rating Curves (수위-유량관계식에 새로운 양방향 회귀모형의 적용)

  • Lee, Chang-Hae
    • Journal of Korea Water Resources Association / v.41 no.1 / pp.17-25 / 2008
  • Whether existing rating curves are used in practice or new ones are derived, the characteristics of regression analysis are often neglected. For example, a discharge rating curve, which is established from a regression of observed water levels (H) on observed flow rates (Q), is sometimes used to estimate the design water level corresponding to a simulated design flood runoff. However, in conventional regression analysis, which is derived from the vertical errors between the observed data and the regression line, the regression equation changes if the independent and dependent variables are interchanged. Thus, regression equations should not be applied inversely. To avoid this problem, a new two-way variable least-squares regression analysis is proposed. The new method was applied to the rating curves of five water level stations on the main stream of the Nakdong River. A comparison of three regression models, namely regression of Q versus H (model 1), H versus Q (model 2), and two-way (model 3), showed that the new method can reduce inadvertent mistakes when applied in practice.
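
The asymmetry this abstract describes can be checked numerically. The sketch below is not the paper's two-way method; it only fits ordinary least squares in both directions to show that inverting one fitted line does not reproduce the other. The stage and discharge values are hypothetical, and a straight line is assumed purely for simplicity.

```python
from statistics import mean


def ols_fit(x, y):
    """Return (intercept, slope) of the least-squares line y = a + b*x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


# Hypothetical observations: water level H (m) and flow rate Q (m^3/s).
h = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5]
q = [12.0, 20.0, 26.0, 40.0, 47.0, 65.0]

a_qh, b_qh = ols_fit(h, q)  # regression of Q on H (minimizes errors in Q)
a_hq, b_hq = ols_fit(q, h)  # regression of H on Q (minimizes errors in H)

# Algebraically inverting Q = a + b*H gives a line in H with slope 1/b.
inverted_slope = 1.0 / b_qh

# The product of the two fitted slopes equals r^2, so the two lines
# coincide only when the data are perfectly correlated.
r_squared = b_qh * b_hq

print(f"slope of H on Q, fitted directly:   {b_hq:.5f}")
print(f"slope of H on Q, by inverting Q(H): {inverted_slope:.5f}")
print(f"r^2 = {r_squared:.5f}")
```

Because the product of the two slopes is r², the directly fitted slope is always smaller in magnitude than the inverted one unless the points lie exactly on a line.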

Note on classification and regression tree analysis (분류와 회귀나무분석에 관한 소고)

  • 임용빈;오만숙
    • Journal of Korean Society for Quality Management / v.30 no.1 / pp.152-161 / 2002
  • The analysis of large data sets with hundreds of thousands of observations and thousands of independent variables is a formidable computational task. A less parametric method, capable of identifying important independent variables and their interactions, is the tree-structured approach to regression and classification. It gives a graphical and often illuminating way of looking at data in classification and regression problems. In this paper, we review and summarize the methodology used to construct a tree, multiple trees, and the sequential strategy for identifying active compounds in large chemical databases.

DC Motor Control using Regression Equation and PID Controller (회귀방정식과 PID제어기에 의한 DC모터 제어)

  • 서기영;이수흠;문상필;이내일;최종수
    • Proceedings of the Korea Institute of Convergence Signal Processing / 2000.08a / pp.129-132 / 2000
  • We propose a new method for optimized auto-tuning of the PID controller, which is used for process control in various fields. First, the initial values for the DC motor are determined by the Ziegler-Nichols method. Then, after the PID controller parameters are learned from the input vectors of the multiple regression analysis, new K, L, T values are given to the multiple regression model, and the multiple regression analysis program finds the optimized PID controller parameters.
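
As a rough illustration of the initialization step, the sketch below applies the classic Ziegler-Nichols open-loop (step-response) tuning rules. It assumes that K, L, and T in the abstract denote the process gain, dead time, and time constant of a first-order-plus-dead-time model; the abstract does not state this, and the function name and numeric values are hypothetical. The regression-based refinement proposed in the paper is not reproduced here.

```python
def ziegler_nichols_pid(K, L, T):
    """Classic Ziegler-Nichols step-response rules for a PID controller.

    Assumed meaning of the arguments (not confirmed by the abstract):
    K = process gain, L = dead time, T = time constant.
    """
    if K == 0 or L <= 0:
        raise ValueError("K must be nonzero and L must be positive")
    kp = 1.2 * T / (K * L)  # proportional gain
    ti = 2.0 * L            # integral time
    td = 0.5 * L            # derivative time
    return {"Kp": kp, "Ti": ti, "Td": td, "Ki": kp / ti, "Kd": kp * td}


# Hypothetical plant parameters, for illustration only.
gains = ziegler_nichols_pid(K=2.0, L=0.5, T=4.0)
print(gains)
```

These rules give only a starting point; tuned values would normally be refined afterwards, which is the role the abstract assigns to the regression model.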

Statistical notes for clinical researchers: simple linear regression 3 - residual analysis

  • Kim, Hae-Young
    • Restorative Dentistry and Endodontics / v.44 no.1 / pp.11.1-11.8 / 2019
  • In the previous sections, simple linear regression (SLR) 1 and 2, we developed an SLR model and evaluated its predictability. To obtain the best-fitting line, the intercept and slope were calculated using the least squares method. The predictability of the model was assessed by the proportion of explained variability in the total variation of the response variable. In this section, we discuss the four basic assumptions of regression models that justify the estimated regression model, and the residual analysis used to check them.
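
The quantities named in this abstract can be sketched in a few lines: a least-squares intercept and slope, the proportion of explained variability (R²), and the residuals that a residual analysis would then inspect. The data are hypothetical, and the diagnostic plots themselves are left out.

```python
from statistics import mean


def fit_slr(x, y):
    """Least-squares intercept and slope for y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


# Hypothetical data, for illustration only.
x = [1, 2, 3, 4, 5, 6]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 11.9]

intercept, slope = fit_slr(x, y)
fitted = [intercept + slope * xi for xi in x]
residuals = [yi - fi for yi, fi in zip(y, fitted)]

# Proportion of the total variation in y explained by the model.
ss_res = sum(e ** 2 for e in residuals)
ss_tot = sum((yi - mean(y)) ** 2 for yi in y)
r_squared = 1.0 - ss_res / ss_tot

# Built-in properties of a least-squares fit with an intercept:
# the residuals sum to zero and are uncorrelated with x.
print(f"sum of residuals:  {sum(residuals):.2e}")
print(f"sum of x*residual: {sum(e * xi for e, xi in zip(residuals, x)):.2e}")
print(f"R^2 = {r_squared:.4f}")
```

Those two sums are zero by construction, so they cannot reveal a violated assumption; checks such as plotting residuals against fitted values are what carry the diagnostic information.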

A Study on Factors Affecting the Use of Ambulatory Physician Services (의사방문수 결정요인 분석)

  • 박현애;송건용
    • Health Policy and Management / v.4 no.2 / pp.58-76 / 1994
  • In order to study the factors affecting the use of ambulatory physician services, Andersen's model of health utilization was modified by adding a health behavior component and examined with three different approaches: a multiple regression model, a logistic regression model, and a LISREL model. For the multiple regression, the dependent variable was the reported number of illness-related visits to a physician during the past year, and the independent variables were various variables measuring the predisposing factor, enabling factor, need factor, and health behavior. For the logistic regression, the dependent variable was visit or no visit to a physician during the past year, and the independent variables were the same as in the multiple regression analysis. For LISREL, five endogenous variables (health utilization, predisposing factor, enabling factor, need factor, and health behavior) and 20 exogenous variables measuring the five endogenous variables were used. According to the multiple regression analysis, the significant explanatory variables for health utilization were chronic illness, health status, and perceived health status for the need factor; residence, sex, age, marital status, and education for the predisposing factor; and health insurance and usual source of medical care for the enabling factor. In the logistic regression analysis, health status, chronic illness, residence, marital status, education, drinking, and use of health aid were found to be significant explanatory variables. In the LISREL analysis, the need factor affected utilization most, followed by the predisposing factor, enabling factor, and health behavior. For the LISREL model, the significant observed variables for each theoretical variable were age, education, and residence for the predisposing factor; health status, chronic illness, and perceived health status for the need factor; medical insurance for the enabling factor; and doing any kind of health behavior for the health behavior factor.

Analysing the Effects of Regional Factors on the Regional Variation of Obesity Rates Using the Geographically Weighted Regression (공간분석을 이용한 지역별 비만율에 영향을 미치는 요인분석)

  • Kim, Da Yang;Kwak, Jin-Mi;Seo, Eun-Won;Lee, Kwang-Soo
    • Health Policy and Management / v.26 no.4 / pp.271-278 / 2016
  • Background: This study aimed to analyze the relationship between regional obesity rates and regional variables. Methods: Data were collected from the Korean Statistical Information Service (KOSIS) and Community Health Survey in 2012. The units of analysis were administrative districts such as city, county, and district. The dependent variable was the age-sex adjusted regional obesity rate. The independent variables were selected to represent four aspects of regions: health behaviour factor, psychological factor, socio-economic factor, and physical environment factor. Along with the traditional ordinary least squares (OLS) regression analysis model, this study applied geographically weighted regression (GWR) analysis to calculate the regression coefficients for each region. Results: The OLS results showed that regional obesity rates differed significantly with high-risk drinking, walking, depression, and financial independence. The GWR results showed that the size of the regression coefficients of the independent variables differed by region. Conclusion: Our results can provide useful information for health policy makers. Regional characteristics should be considered when allocating health resources and developing health-related programs.
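
The core idea of GWR, one regression per location with nearby observations weighted more heavily, can be sketched as follows. This is a simplified illustration and not the study's model: it assumes a single predictor, a Gaussian distance kernel, and a fixed bandwidth, and the coordinates and values are hypothetical.

```python
import math


def weighted_slr(x, y, w):
    """Weighted least-squares fit of y = a + b*x; returns (a, b)."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    b = sxy / sxx
    return my - b * mx, b


def gwr_coefficients(coords, x, y, bandwidth):
    """Fit one weighted regression per location with Gaussian kernel weights."""
    results = []
    for cx, cy in coords:
        w = [
            math.exp(-((px - cx) ** 2 + (py - cy) ** 2) / (2 * bandwidth ** 2))
            for px, py in coords
        ]
        results.append(weighted_slr(x, y, w))
    return results


# Hypothetical regions along a west-east line; the effect of x on y is
# weaker in the four western regions than in the four eastern ones.
coords = [(float(i), 0.0) for i in range(8)]
x = [1.0, 2.0, 3.0, 4.0, 1.0, 2.0, 3.0, 4.0]
y = [1.0, 2.0, 3.0, 4.0, 3.0, 6.0, 9.0, 12.0]

local = gwr_coefficients(coords, x, y, bandwidth=1.0)
for (cx, _), (a, b) in zip(coords, local):
    print(f"region at x={cx:.0f}: intercept={a:.3f}, slope={b:.3f}")
```

A single global OLS fit would report one slope for all eight regions, whereas the local fits recover a smaller slope in the west and a larger one in the east.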

FUZZY REGRESSION ANALYSIS WITH NON-SYMMETRIC FUZZY COEFFICIENTS BASED ON QUADRATIC PROGRAMMING APPROACH

  • Lee, Haekwan;Hideo Tanaka
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 1998.06a / pp.63-68 / 1998
  • This paper proposes fuzzy regression analysis with non-symmetric fuzzy coefficients. By assuming non-symmetric triangular fuzzy coefficients and applying the quadratic programming formulation, the center of the obtained fuzzy regression model attains more central tendency than the one with symmetric triangular fuzzy coefficients. For a data set composed of crisp inputs and fuzzy outputs, two approximation models, called an upper approximation model and a lower approximation model, are considered as the regression models. We also propose an integrated quadratic programming problem in which the upper approximation model always includes the lower approximation model at any threshold level, under the assumption that the two approximation models have the same centers. The sensitivities of the weight coefficients in the proposed quadratic programming approaches are investigated using real data.

Analysis of Client Propensity in Cyber Counseling Using Bayesian Variable Selection

  • Pi, Su-Young
    • International Journal of Fuzzy Logic and Intelligent Systems / v.6 no.4 / pp.277-281 / 2006
  • Cyber counseling, one of the most compatible types of consultation for the information society, enables people to reveal their mental agonies and private problems anonymously, since it does not require a face-to-face interview between a counsellor and a client. However, few cyber counseling centers provide high-quality and trustworthy service, although the number of cyber counseling centers has increased greatly. This paper therefore aims to enable appropriate consultation for each client by analyzing client propensity using Bayesian variable selection. Bayesian variable selection is superior to the stepwise regression analysis method in finding a regression model. The stepwise regression analysis method, which has generally been used to analyze individual propensity in linear regression models, is not efficient because its own defects make it hard to select a proper model. In this paper, based on the case database of current cyber counseling centers on the web, we analyze clients' propensities using Bayesian variable selection to enable individually targeted counseling and to activate cyber counseling programs.

Use of big data for estimation of impacts of meteorological variables on environmental radiation dose on Ulleung Island, Republic of Korea

  • Joo, Han Young;Kim, Jae Wook;Jeong, So Yun;Kim, Young Seo;Moon, Joo Hyun
    • Nuclear Engineering and Technology / v.53 no.12 / pp.4189-4200 / 2021
  • In this study, the relationship between the environmental radiation dose rate and meteorological variables was investigated using multiple regression analysis and big data on those variables. The environmental radiation dose rate and 36 different meteorological variables were measured on Ulleung Island, Republic of Korea, from 2011 to 2015. Not all meteorological variables were used in the regression analysis, because different meteorological variables significantly affect the environmental radiation dose rate during different periods, and the degree of influence changes with time. By applying Pearson correlation analysis and stepwise selection methods to the big dataset, the major meteorological variables influencing the environmental radiation dose rate were identified and then used as the independent variables of the regression model. Subsequently, multiple regression models were developed for the monthly datasets and for the dataset of the entire period.