• Title/Abstract/Keywords: regression analysis.

Learning System for Regression Analysis Using Multimedia and Statistical Software

  • 안기수;허문열
    • 응용통계연구 / Vol. 11, No. 2 / pp. 389-401 / 1998
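  • This paper introduces CybeRClass (Cyber Regression Class), a multimedia-based learning system for regression analysis. CybeRClass teaches regression analysis using audio and animation. The system is designed so that it can also support learning multivariate analyses such as cluster analysis and discriminant analysis. Multimedia ToolBook was used as the multimedia tool, and Xlisp-Stat, an object-oriented statistical language, was used for statistical computation and statistical graphics.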

An Application of a New Two-Way Regression Model for Rating Curves

  • 이창해
    • 한국수자원학회논문집 / Vol. 41, No. 1 / pp. 17-25 / 2008
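  • When stage-discharge relationships (rating curves) are derived and applied in practice, the properties of regression analysis are often overlooked. For example, a rating curve obtained by regressing observed discharge on observed stage is sometimes used in practice to convert a design flood discharge, simulated by a flood model, into a design flood stage. However, conventional regression analysis is derived from residuals measured as vertical distances between the observations and the regression line, so exchanging the independent and dependent variables yields a different regression equation, and the curve must not be applied in reverse. To resolve this problem, this study proposes a new least-squares regression algorithm in which the variables of the regression equation can be interchanged. The new method was applied to the rating curves at five stage gauging stations on the main stream of the Nakdong River basin. Three regression equations were derived: from stage to discharge (model 1), from discharge to stage (model 2), and two-way (model 3). By comparing these rating curves, the study presents a new method that can reduce misapplication in practice.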
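
The asymmetry described in this abstract can be illustrated with a minimal sketch. It does not implement the paper's two-way model, whose algorithm is not given here. It only shows that the ordinary least-squares line for discharge on stage, when algebraically inverted, differs from the line fitted for stage on discharge. The data values, the straight-line form, and the function names are assumptions made for illustration.

```python
import math


def ols_fit(x, y):
    """Ordinary least-squares line y = a + b*x (minimizes vertical residuals in y)."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx
    return my - b * mx, b


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length samples."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    return sxy / math.sqrt(sxx * syy)


# Hypothetical paired observations (made up, not data from the paper).
stage = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5]
discharge = [12.0, 20.0, 26.0, 40.0, 47.0, 62.0]

a1, b1 = ols_fit(stage, discharge)   # stage -> discharge
a2, b2 = ols_fit(discharge, stage)   # discharge -> stage

# Inverting the first line gives stage = (discharge - a1) / b1, with slope 1 / b1.
inverted_slope = 1.0 / b1
r = pearson_r(stage, discharge)

print("slope of inverted stage->discharge fit:", inverted_slope)
print("slope of direct discharge->stage fit:  ", b2)
print("product of the two OLS slopes:", b1 * b2, " r squared:", r ** 2)
```

The product of the two fitted slopes equals the squared correlation, so the inverted line coincides with the directly fitted one only when the points are perfectly collinear.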

Note on Classification and Regression Tree Analysis

  • 임용빈;오만숙
    • 품질경영학회지 / Vol. 30, No. 1 / pp. 152-161 / 2002
  • The analysis of large data sets with hundreds of thousands of observations and thousands of independent variables is a formidable computational task. A less parametric method, capable of identifying important independent variables and their interactions, is a tree-structured approach to regression and classification. It gives a graphical and often illuminating way of looking at data in classification and regression problems. In this paper, we review and summarize the methodology used to construct a tree, multiple trees, and the sequential strategy for identifying active compounds in large chemical databases.

DC Motor Control Using Regression Equation and PID Controller

  • 서기영;이수흠;문상필;이내일;최종수
    • 융합신호처리학회 학술대회논문집 / 한국신호처리시스템학회 2000년도 하계종합학술대회논문집 / pp. 129-132 / 2000
  • We propose a new method for optimized auto-tuning of the PID controller, which is used for process control in various fields. First, the initial values for the DC motor are determined by the Ziegler-Nichols method. Then, after the PID controller parameters are learned from the input vectors by multiple regression analysis, new K, L, and T values are given to the multiple regression model, and the optimized PID controller parameters are found by the multiple regression analysis program.

Statistical notes for clinical researchers: simple linear regression 3 - residual analysis

  • Kim, Hae-Young
    • Restorative Dentistry and Endodontics / Vol. 44, No. 1 / pp. 11.1-11.8 / 2019
  • In the previous sections, simple linear regression (SLR) 1 and 2, we developed an SLR model and evaluated its predictability. To obtain the best-fitting line, the intercept and slope were calculated using the least squares method. Predictability of the model was assessed by the proportion of explained variability in the total variation of the response variable. In this section, we discuss four basic assumptions of regression models that justify the estimated regression model, and the residual analysis used to check them.
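
As a minimal sketch of the quantities this abstract mentions, the code below fits a line by least squares, computes the proportion of explained variability (R squared), and returns the residuals that a residual analysis would inspect. The data and the function name `fit_slr` are made up for illustration. The abstract does not enumerate the four assumptions; the ones commonly listed for linear regression are linearity, independence, normality, and equal variance of the errors, and the sketch only produces the residuals, it does not test those assumptions.

```python
def fit_slr(x, y):
    """Fit y = intercept + slope*x by least squares; return fit, residuals, R^2."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    fitted = [intercept + slope * xi for xi in x]
    residuals = [yi - fi for yi, fi in zip(y, fitted)]
    sst = sum((yi - my) ** 2 for yi in y)   # total variation of the response
    sse = sum(e ** 2 for e in residuals)    # unexplained (residual) variation
    r_squared = 1.0 - sse / sst
    return slope, intercept, residuals, r_squared


# Hypothetical sample (made up for illustration).
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1, 11.9]

slope, intercept, residuals, r_squared = fit_slr(x, y)

# Residuals paired with x are what a residual-versus-predictor plot would show.
for xi, ei in zip(x, residuals):
    print(f"x={xi:.1f}  residual={ei:+.4f}")
print("R squared:", r_squared)
```

By construction, least-squares residuals sum to zero and are uncorrelated with the predictor, so a residual check looks for patterns beyond those two properties, such as curvature or a spread that changes with the fitted values.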

A Study on Factors Affecting the Use of Ambulatory Physician Services

  • 박현애;송건용
    • 보건행정학회지 / Vol. 4, No. 2 / pp. 58-76 / 1994
  • To study factors affecting the use of ambulatory physician services, Andersen's model for health utilization was modified by adding a health behavior component and examined with three approaches: the multiple regression model, the logistic regression model, and the LISREL model. For the multiple regression, the dependent variable was the number of reported illness-related visits to a physician during the past year, and the independent variables were various variables measuring the predisposing factor, enabling factor, need factor, and health behavior. For the logistic regression, the dependent variable was visit or no visit to a physician during the past year, and the independent variables were the same as in the multiple regression analysis. For LISREL, five endogenous variables (health utilization, predisposing factor, enabling factor, need factor, and health behavior) and 20 exogenous variables measuring the five endogenous variables were used. According to the multiple regression analysis, chronic illness, health status, and perceived health status of the need factor; residence, sex, age, marital status, and education of the predisposing factor; and health insurance and usual source of medical care of the enabling factor were the significant explanatory variables for health utilization. In the logistic regression analysis, health status, chronic illness, residence, marital status, education, drinking, and use of health aid were found to be significant explanatory variables. In the LISREL model, the need factor affected utilization most, followed by the predisposing factor, enabling factor, and health behavior. For the LISREL model, age, education, and residence for the predisposing factor; health status, chronic illness, and perceived health status for the need factor; medical insurance for the enabling factor; and doing any kind of health behavior for the health behavior factor were found to be the significant observed variables for each theoretical variable.

Analysing the Effects of Regional Factors on the Regional Variation of Obesity Rates Using the Geographically Weighted Regression

  • 김다양;곽진미;서은원;이광수
    • 보건행정학회지 / Vol. 26, No. 4 / pp. 271-278 / 2016
  • Background: This study aimed to analyze the relationship between regional obesity rates and regional variables. Methods: Data were collected from the Korean Statistical Information Service (KOSIS) and the 2012 Community Health Survey. The units of analysis were administrative districts such as cities, counties, and districts. The dependent variable was the age-sex adjusted regional obesity rate. The independent variables were selected to represent four aspects of regions: health behaviour, psychological, socio-economic, and physical environment factors. Along with the traditional ordinary least squares (OLS) regression model, this study applied geographically weighted regression (GWR) analysis to calculate regression coefficients for each region. Results: The OLS results showed significant differences in regional obesity rates according to high-risk drinking, walking, depression, and financial independence. The GWR results showed that the size of the regression coefficients of the independent variables differed by region. Conclusion: Our results can provide useful information for health policy makers. Regional characteristics should be considered when allocating health resources and developing health-related programs.
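
The idea of estimating separate coefficients for each region can be sketched as a locally weighted least-squares fit. The version below is a simplified illustration and not the study's model. It assumes a single predictor, a Gaussian distance kernel, and a fixed bandwidth chosen by hand, whereas GWR in practice uses several predictors and a calibrated bandwidth. The coordinates, data values, and the function name `gwr_local_coefficients` are hypothetical.

```python
import math


def gwr_local_coefficients(coords, x, y, bandwidth):
    """For each location, fit y = a + b*x by weighted least squares, with
    Gaussian weights that decay with distance from that location."""
    results = []
    for ui, vi in coords:
        # Nearby observations get weights near 1, distant ones near 0.
        w = [
            math.exp(-0.5 * (math.hypot(ui - uj, vi - vj) / bandwidth) ** 2)
            for uj, vj in coords
        ]
        sw = sum(w)
        mx = sum(wj * xj for wj, xj in zip(w, x)) / sw
        my = sum(wj * yj for wj, yj in zip(w, y)) / sw
        sxx = sum(wj * (xj - mx) ** 2 for wj, xj in zip(w, x))
        sxy = sum(wj * (xj - mx) * (yj - my) for wj, xj, yj in zip(w, x, y))
        slope = sxy / sxx
        results.append((my - slope * mx, slope))
    return results


# Hypothetical regions laid out west to east; the true slope grows eastward.
coords = [(float(i), 0.0) for i in range(8)]
x = [1.0, 3.0, 2.0, 4.0, 1.0, 3.0, 2.0, 4.0]
y = [(1.0 + 0.5 * i) * xi for i, xi in enumerate(x)]

local = gwr_local_coefficients(coords, x, y, bandwidth=1.5)
wide = gwr_local_coefficients(coords, x, y, bandwidth=1e6)

for (u, _), (a, b) in zip(coords, local):
    print(f"location u={u:.0f}: local intercept={a:.3f}, local slope={b:.3f}")
```

With a very large bandwidth all weights are nearly equal, so every local fit collapses to the single global OLS line. A smaller bandwidth lets the coefficients vary across locations, which is the contrast between OLS and GWR that the abstract draws.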

FUZZY REGRESSION ANALYSIS WITH NON-SYMMETRIC FUZZY COEFFICIENTS BASED ON QUADRATIC PROGRAMMING APPROACH

  • Lee, Haekwan;Hideo Tanaka
    • 한국지능시스템학회:학술대회논문집 / 한국퍼지및지능시스템학회 1998년도 The Third Asian Fuzzy Systems Symposium / pp. 63-68 / 1998
  • This paper proposes fuzzy regression analysis with non-symmetric fuzzy coefficients. By assuming non-symmetric triangular fuzzy coefficients and applying a quadratic programming formulation, the center of the obtained fuzzy regression model attains a more central tendency than the one with symmetric triangular fuzzy coefficients. For a data set composed of crisp inputs and fuzzy outputs, two approximation models, an upper approximation model and a lower approximation model, are considered as the regression models. We also propose an integrated quadratic programming problem in which the upper approximation model always includes the lower approximation model at any threshold level, under the assumption that the two approximation models have the same centers. The sensitivities of the weight coefficients in the proposed quadratic programming approaches are investigated using real data.

Analysis of Client Propensity in Cyber Counseling Using Bayesian Variable Selection

  • Pi, Su-Young
    • International Journal of Fuzzy Logic and Intelligent Systems / Vol. 6, No. 4 / pp. 277-281 / 2006
  • Cyber counseling, one of the most suitable types of consultation for the information society, enables people to reveal their mental agonies and private problems anonymously, since it does not require a face-to-face interview between a counsellor and a client. However, few cyber counseling centers provide high-quality and trustworthy service, although the number of cyber counseling centers has increased greatly. Therefore, this paper aims to enable appropriate consultation for each client by analyzing client propensity using Bayesian variable selection. Bayesian variable selection is superior to the stepwise regression analysis method in finding a regression model. The stepwise regression analysis method, which has generally been used to analyze individual propensity in linear regression models, is not efficient because its inherent defects make it hard to select a proper model. In this paper, based on the case database of current cyber counseling centers on the web, we analyze clients' propensities using Bayesian variable selection to enable individually targeted counseling and to activate cyber counseling programs.

Use of big data for estimation of impacts of meteorological variables on environmental radiation dose on Ulleung Island, Republic of Korea

  • Joo, Han Young;Kim, Jae Wook;Jeong, So Yun;Kim, Young Seo;Moon, Joo Hyun
    • Nuclear Engineering and Technology / Vol. 53, No. 12 / pp. 4189-4200 / 2021
  • In this study, the relationship between the environmental radiation dose rate and meteorological variables was investigated with multiple regression analysis and big data of those variables. The environmental radiation dose rate and 36 different meteorological variables were measured on Ulleung Island, Republic of Korea, from 2011 to 2015. Not all meteorological variables were used in the regression analysis because the different meteorological variables significantly affect the environmental radiation dose rate during different periods, and the degree of influence changes with time. By applying the Pearson correlation analysis and stepwise selection methods to the big dataset, the major meteorological variables influencing the environmental radiation dose rate were identified, which were then used as the independent variables for the regression model. Subsequently, multiple regression models for the monthly datasets and dataset of the entire period were developed.