• Title/Summary/Keyword: two variable linear regression analysis

검색결과 68건 처리시간 0.027초

영아의 기질과 교사의 놀이 관련 특성이 2세반 영아의 상상놀이에미치는 영향 (Effects of Toddler Temperament and Teacher's Play-Related Characteristics on Imaginative Play in Two-Year-Old Classrooms)

  • 유애형;신나리
    • 한국보육지원학회지
    • /
    • 제20권2호
    • /
    • pp.83-103
    • /
    • 2024
  • Objective: This study aimed to investigate the effects of children's characteristics and childcare teachers' attributes on the frequency and level of imaginative play in two-year-old classrooms. Methods: The study involved 191 toddlers, their mothers, and 32 teachers from childcare centers. Toddler characteristics encompassed temperament along with demographic variables such as gender and age. Teacher' attributes related to play included playfulness, play-support belief, and interactions with toddlers. Data analysis was conducted using SPSS 22.0 and HLM 8.2 software, employing basic analysis, hierarchical linear analysis, and hierarchical regression analysis. Results: First, as toddlers' age increased, both the frequency and level of their imaginative play increased. Second, individual-level model analysis revealed a positive effect of toddlers' extroversion on the level of imaginative play. Third, the class-level model results indicated that teachers' emotions had a negative effect, whereas their encouragement positively influenced the level of imaginative play. Conclusion/Implications: The significance of this study lies in its utilization of a multilayered model analysis, which offers a more robust examination of variable influences by accounting for hierarchical data structures.

품질경쟁력 우수기업의 특성분석 (A Characteristic Analysis for Quality Competitiveness Excellent Company)

  • 박동준;윤예분;강인선;유은재;김호균;윤민
    • 산업경영시스템학회지
    • /
    • 제42권3호
    • /
    • pp.95-108
    • /
    • 2019
  • Quality management has become an pervasive philosophy in most sectors of business. Specific movements such as statistical quality control, quality circle, total quality management, and quality management system have become embedded in business organizations. Only the companies with competitive edge can survive in the competition in global market. KSA(Korean Standards Association) established in 1962 has launched all kinds of quality education, quality standard certification service for business, and KNQA(Korean National Quality Award) system. This article considers quality competitiveness excellent company award among KNQA. We performed a statistical analysis of audit data for quality competitiveness excellent company for three years, from 2015 to 2017. By using ANOVA and two sample t-tests, the average scores of 13 evaluation fields were significantly different depending on company size and type. We proposed ways to improve the current hall of fame system. We discovered that the average scores of 13 evaluation fields in the audit data according to years and hall of fame status were not significantly different. We also showed linear relationships among 13 evaluation fields by correlation analysis and obtained an estimated linear regression equation : Business Performance, which is a comprehensive index, as a dependent variable was significantly related to Customer Focus and Product Liability as regressor variables among 13 evaluation fields by regression analysis.

재가 뇌졸중환자의 주간재활간호 프로그램 서비스 요구조사 (The Need for Rehabilitation Day Care Program Service of Stroke Survivors)

  • 정성희;서문자
    • 재활간호학회지
    • /
    • 제2권1호
    • /
    • pp.29-44
    • /
    • 1999
  • This study was carried out to obtain basic data required to plan and develop Rehabilitation Day Care Program for the stroke Survivors at home in Korea. The subjects comprised of 118 stroke survivors who discharged from 4 hospitals in Seoul during the past 2 years. The data were collected from August 3, 1998 to September 18, 1998, through interviews with questionnaires about general characteristics, activities of dally living, depression and service need of rehabilitation day care program at the outpatient clinics by trained nursing graduates. Data were analyzed with descriptive analysis, Pearson's correlation analysis, and Stepwise multiple linear regression analysis using SPSS/WIN program. The results obtained are as follows ; 1. The mean score of the general need of rehabilitation day care program of stroke survivors was 2.78(range 1-4). The highest need among the service categories of the rehabilitation day care program was self-care and restorative activities category, and health services referral category, recreation category, psychosocial activities category in order. The needs of each category are as follows ; 1) In the health services referral category, the need for speech therapy was highest, followed by the need for physical therapy and occupational therapy. 2) In the psychosocial activities category, the need for self-help group was highest. 3) In the self-care and restorative activities category, the need for bathing was highest, followed by bowel training, and ambulation training. 4) The need for the recreation category was 2.62. 2. Among the need for the effect related to the utilization of day care program, the need for survivors' physical and psychological well-being was highest and was followed by the need for caregiver's physical and psychological wellbeing. Pearson's correlation analysis revealed following results ; 1. The need for rehabilitation day care program service displayed a correlation with the level of education, ADL, and the level of depression, and a reverse correlation with age. 2. The need for the effect related to the utilization of rehabilitation day program displayed a correlation with the level of education, ADL, and the level of depression. The stepwise multiple linear regression analysis revealed following results : 1. For the need for rehabilitation day care program service, 28.4% of the variance was initially explained by one variable, level of depression. The level of depression plus two variables, survivors' age and ADL, explained 34.2% of the variance in the need for rehabilitation day care program service. 2. For the need for the effect related to the utilization of rehabilitation day care program, 12.4% of the variance was initially explained by one variable, level of depression. The level of depression plus one variable, level of education, explained 20.4% of the variance in the need for the effect related to the utilization of rehabilitation day care program. In conclusion, above characteristics should be considered when we are planning to develop stroke survivors' rehabilitation day care program.

  • PDF

Patent Keyword Analysis using Gamma Regression Model and Visualization

  • Jun, Sunghae
    • 한국컴퓨터정보학회논문지
    • /
    • 제27권8호
    • /
    • pp.143-149
    • /
    • 2022
  • 특허문서는 연구 개발된 기술에 대한 상세한 결과를 포함하고 있기 때문에 효과적인 기술분석을 위한 다양한 특허분석 방법에 대한 연구가 진행되고 있다. 특히 통계학과 머신러닝 알고리즘에 의한 정량적인 특허분석에 대한 연구가 최근 활발하게 이루어지고 있다. 정량적 특허분석에서 가장 많이 사용되는 특허 데이터는 기술 키워드이다. 기술 키워드 데이터를 분석하는 기존의 방법은 대부분 음의 무한대부터 양의 무한대까지 실수 공간 전체를 확률변수의 값으로 갖는 가우시안 확률분포에 기반한 모형이었다. 본 논문에서는 이론적으로 0부터 양의 무한대까지의 값을 갖는 특허 키워드의 빈도 데이터를 분석하기 위하여 감마 확률분포를 활용한 모형을 제안한다. 또한 감마 회귀모형의 회귀방정식을 결정하기 위하여 키워드 간의 기술 연관성을 시각화하는 2-모드 네트워크를 구축한다. 제안 방법과 기존의 가우시안 기반의 분석모형 간의 성능평가를 위하여 실제 특허 데이터를 수집하여 분석한다.

로지스틱 회귀분석을 이용한 인제군 산사태지역의 위험도 평가 (Landslide Risk Assessment in Inje Using Logistic Regression Model)

  • 이환길;김기홍
    • 한국측량학회지
    • /
    • 제30권3호
    • /
    • pp.313-321
    • /
    • 2012
  • 우리나라는 국토의 70%가 산지로 이루어져 있고 연평균 강우량의 대부분이 6월과 9월 사이에 집중되어 산사태로 인한 피해를 지속적으로 입어 왔으며, 최근 급변하는 기후에 따라 그 빈도가 점차 증가하고 있다. 특히, 강원도의 경우 지역적 특성상 대부분 산지로 이루어져 있으며 경사가 가파르고 토심 또한 얕아 산사태에 의해 많은 피해를 입고 있다. 본 논문에서는 2006년 7월 집중호우로 인해 대규모 산사태피해가 발생하였던 강원도 인제군 인제읍 덕산리 지역을 대상으로 로지스틱 회귀분석을 수행하여 산사태 위험도평가모형을 개발하였다. 분석을 위하여 대상 지역의 현장조사 및 피해 직후 촬영된 항공사진을 통해 수집한 정보를 이용하여 GIS DB를 구축하였다. 경사도의 경우 범주형 변수와 연속형 변수로 입력하는 두 가지 방법을 적용하였다. 생성된 예측모형에 대해 정오분류를 실시한 결과 각각 81.4%와 81.9%의 분류정확도를 보였다.

Drought forecasting over South Korea based on the teleconnected global climate variables

  • Taesam Lee;Yejin Kong;Sejeong Lee;Taegyun Kim
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2023년도 학술발표회
    • /
    • pp.47-47
    • /
    • 2023
  • Drought occurs due to lack of water resources over an extended period and its intensity has been magnified globally by climate change. In recent years, drought over South Korea has also been intensed, and the prediction was inevitable for the water resource management and water industry. Therefore, drought forecasting over South Korea was performed in the current study with the following procedure. First, accumulated spring precipitation(ASP) driven by the 93 weather stations in South Korea was taken with their median. Then, correlation analysis was followed between ASP and Df4m, the differences of two pair of the global winter MSLP. The 37 Df4m variables with high correlations over 0.55 was chosen and sorted into three regions. The selected Df4m variables in the same region showed high similarity, leading the multicollinearity problem. To avoid this problem, a model that performs variable selection and model fitting at once, least absolute shrinkage and selection operator(LASSO) was applied. The LASSO model selected 5 variables which showed a good agreement of the predicted with the observed value, R2=0.72. Other models such as multiple linear regression model and ElasticNet were also performed, but did not present a performance as good as LASSO. Therefore, LASSO model can be an appropriate model to forecast spring drought over South Korea and can be used to mange water resources efficiently.

  • PDF

PGA 투어의 골프 스코어 예측 및 분석 (Prediction of golf scores on the PGA tour using statistical models)

  • 임정은;임영인;송종우
    • 응용통계연구
    • /
    • 제30권1호
    • /
    • pp.41-55
    • /
    • 2017
  • 최근 골프는 많은 사람들의 취미 생활로서 자리를 잡아가고 있으며 골프와 관련된 연구도 다양하게 이루어지고 있다. 본 연구에서는 데이터 마이닝 기법을 사용하여 PGA 투어에 참여하는 선수들의 평균스코어를 예측하고 스코어에 유의한 영향을 미치는 변수들을 제시하고자 한다. 그리고 추가적으로 4개의 PGA 투어 플레이오프에 대해 상위 10명, 상위 25명의 선수들을 예측하는 것을 목표로 한다. 우리는 다양한 선형/비선형 회귀분석 방법을 이용하여 평균스코어를 예측하는데, 선형회귀분석 방법으로는 단계적 선택법, 모든 가능한 회귀모형, 라소(LASSO), 능형회귀, 주성분회귀분석을 사용하였으며 비선형회귀분석 방법으로는 트리(CART), 배깅, 그래디언트 부스팅, 신경망 모형, 랜덤 포레스트, 최근접이웃방법(KNN)을 사용하였다. 대부분의 모형에서 공통적으로 선택된 변수들을 살펴보면 페어웨이의 단단함와 그린의 풀의 높이, 평균최대풍속이 높을수록 선수들의 평균스코어는 높아지며 반대로 한 번에 퍼팅을 성공시키는 횟수와 그린적중률 실패 후 버디나 이글로 점수를 만드는 scrambling 변수들, 그리고 공을 멀리 보낼 수 있는 능력을 나타내는 longest drive는 그 값이 높아짐에 따라 선수들의 평균스코어가 낮아지는 경향이 있음을 알 수 있었다. 11가지 모형 모두 테스트 데이터인 2015년 경기 결과를 예측하는데 낮은 오류율을 보였으나 배깅과 랜덤 포레스트의 예측률이 가장 좋았으며 두 모형 모두 상위 10명과 상위 25명의 랭킹을 예측할 때 상당히 높은 적중률을 보였다.

수도(水稻) 적정시비량(適正施肥量) 결정(決定)에 대한 대체모형(代替模型) (An Alternative Model for Determining the Optimal Fertilizer Level)

  • 장석환
    • 한국토양비료학회지
    • /
    • 제13권1호
    • /
    • pp.21-32
    • /
    • 1980
  • Linear models, with and without site variables, have been investigated in order to develop an alternative methodology for determining optimal fertilizer levels. The resultant models are : (1) Model I is an ordinary quadratic response function formed by combining the simple response function estimated at each site in block diagonal form, and has parameters [${\gamma}^{(1)}_{m{\ell}}$], for m=1, 2, ${\cdots}$, n sites and degrees of polynomial, ${\ell}$=0, 1, 2. (2) Mode II is a multiple regression model with a set of site variables (including an intercept) repeated for each fertilizer level and the linear and quadratic terms of the fertilizer variables arranged in block diagonal form as in Model I. The parameters are equal to [${\beta}_h\;{\gamma}^{(2)}_{m{\ell}}$] for h=0, 1, 2, ${\cdots}$, k site variable, m=1, 2, ${\cdots}$ and ${\ell}$=1, 2. (3) Model III is a classical response surface model, I. e., a common quadratic polynomial model for the fertilizer variables augmented with site variables and interactions between site variables and the linear fertilizer terms. The parameters are equal to [${\beta}_h\;{\gamma}_{\ell}\;{\theta}_h$], for h=0, 1, ${\cdots}$, k, ${\ell}$=1, 2, and h'=1, 2, ${\cdots}$, k. (4) Model IV has the same basic structure as Mode I, but estimation procedure involves two stages. In stage 1, yields for each fertilizer level are regressed on the site variables and the resulting predicted yields for each site are then regressed on the fertilizer variables in stage 2. Each model has been evaluated under the assumption that Model III is the postulated true response function. Under this assumption, Models I, II and IV give biased estimators of the linear fertilizer response parameter which depend on the interaction between site variables and applied fertilizer variables. When the interaction is significant, Model III is the most efficient for calculation of optimal fertilizer level. It has been found that Model IV is always more efficient than Models I and II, with efficiency depending on the magnitude of ${\lambda}m$, the mth diagonal element of X (X' X)' X' where X is the site variable matrix. When the site variable by linear fertilizer interaction parameters are zero or when the estimated interactions are not important, it is demonstrated that Model IV can be a reasonable alternative model for calculation of optimal fertilizer level. The efficiencies of the models are compared us ing data from 256 fertilizer trials on rice conducted in Korea. Although Model III is usually preferred, the empirical results from the data analysis support the feasibility of using Model IV in practice when the estimated interaction term between measured soil organic matter and applied nitrogen is not important.

  • PDF

Assessment through Statistical Methods of Water Quality Parameters(WQPs) in the Han River in Korea

  • Kim, Jae Hyoun
    • 한국환경보건학회지
    • /
    • 제41권2호
    • /
    • pp.90-101
    • /
    • 2015
  • Objective: This study was conducted to develop a chemical oxygen demand (COD) regression model using water quality monitoring data (January, 2014) obtained from the Han River auto-monitoring stations. Methods: Surface water quality data at 198 sampling stations along the six major areas were assembled and analyzed to determine the spatial distribution and clustering of monitoring stations based on 18 WQPs and regression modeling using selected parameters. Statistical techniques, including combined genetic algorithm-multiple linear regression (GA-MLR), cluster analysis (CA) and principal component analysis (PCA) were used to build a COD model using water quality data. Results: A best GA-MLR model facilitated computing the WQPs for a 5-descriptor COD model with satisfactory statistical results ($r^2=92.64$,$Q{^2}_{LOO}=91.45$,$Q{^2}_{Ext}=88.17$). This approach includes variable selection of the WQPs in order to find the most important factors affecting water quality. Additionally, ordination techniques like PCA and CA were used to classify monitoring stations. The biplot based on the first two principal components (PCs) of the PCA model identified three distinct groups of stations, but also differs with respect to the correlation with WQPs, which enables better interpretation of the water quality characteristics at particular stations as of January 2014. Conclusion: This data analysis procedure appears to provide an efficient means of modelling water quality by interpreting and defining its most essential variables, such as TOC and BOD. The water parameters selected in a COD model as most important in contributing to environmental health and water pollution can be utilized for the application of water quality management strategies. At present, the river is under threat of anthropogenic disturbances during festival periods, especially at upstream areas.

Jet 폭기 시스템의 순환유량에 따른 산소전달 특성 및 순산소 적용성 검토 (Oxygen Transfer Characteristics & Pure Oxygen Application Study on Circulation Flow Rate of the JLB (Jet Loop Bioreactor))

  • 박노백;송용효;박준규;전항배
    • 한국물환경학회지
    • /
    • 제25권6호
    • /
    • pp.896-901
    • /
    • 2009
  • In this study, in order to apply the air and pure oxygen in the Jet Loop Reactor (JLB) in which the oxygen transfer rate is high, differentiate the operation mode according to each air flowrate and liquid flowrate and investigate the oxygen transfer characteristic, an experiment was carried out. The oxygen concentration with the air flowrate ($Q_g$) and liquid flowrate ($Q_L$) was identical but the oxygen transfer coefficient ($K_L{\cdot}a$) is linear depending on degree of two factors. The width of an increase is small in $0.1min^{-1}$ when the air flowrate is 0.2 L/min with increasing the liquid flowrate. Whereas, the increment was exposed to be very high for $1.5min^{-1}$ when the air flowrate was 5 L/min. In the experiments using the pure oxygen, it was 30 mg/L of oxygen concentration finally and it was 3.5 times than using the air. But the time reached the saturated concentration was similar to using the air, and $K_L{\cdot}a$ was similar to using the air too. Analysis between two independent variable and oxygen transfer of the correlation is the same model like $K_L{\cdot}a={0.0161Q_L}^{1.5371}{Q_g}^{0.5433}$ using with coefficient non linear regression analysis. It was resulted that the liquid flowrate were approximately three times than air flowrate on effect to oxygen transfer rate.