• Title/Summary/Keyword: Multiple regression model

Search Result 2,533, Processing Time 0.025 seconds

Predicting the Number of Movie Audiences Through Variable Selection Based on Information Gain Measure (정보 소득율 기반의 변수 선택을 통한 영화 관객 수 예측)

  • Park, Hyeon-Mock;Choi, Sang Hyun
    • Journal of Information Technology Applications and Management
    • /
    • v.26 no.3
    • /
    • pp.19-27
    • /
    • 2019
  • In this study, we propose a methodology for predicting the movie audience based on movie information that can be easily acquired before opening and effectively distinguishing qualitative variables. In addition, we constructed a model to estimate the number of movie audiences at the time of data acquisition through the configured variables. Another purpose of this study is to provide a criterion for categorizing success of movies with qualitative characteristics. As an evaluation criterion, we used information gain ratio which is the node selection criterion of C4.5 algorithm. Through the procedure we have selected 416 movie data features. As a result of the multiple linear regression model, the performance of the regression model using the variables selection method based on the information gain ratio was excellent.

Use of partial least squares analysis in concrete technology

  • Tutmez, Bulent
    • Computers and Concrete
    • /
    • v.13 no.2
    • /
    • pp.173-185
    • /
    • 2014
  • Multivariate analysis is a statistical technique that investigates relationship between multiple predictor variables and response variable and it is a very commonly used statistical approach in cement and concrete industry. During model building stage, however, many predictor variables are included in the model and possible collinearity problems between these predictors are generally ignored. In this study, use of partial least squares (PLS) analysis for evaluating the relationships among the cement and concrete properties is investigated. This regression method is known to decrease the model complexity by reducing the number of predictor variables as well as to result in accurate and reliable predictions. The experimental studies showed that the method can be used in the multivariate problems of cement and concrete industry effectively.

Multivariate quantile regression tree (다변량 분위수 회귀나무 모형에 대한 연구)

  • Kim, Jaeoh;Cho, HyungJun;Bang, Sungwan
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.3
    • /
    • pp.533-545
    • /
    • 2017
  • Quantile regression models provide a variety of useful statistical information by estimating the conditional quantile function of the response variable. However, the traditional linear quantile regression model can lead to the distorted and incorrect results when analysing real data having a nonlinear relationship between the explanatory variables and the response variables. Furthermore, as the complexity of the data increases, it is required to analyse multiple response variables simultaneously with more sophisticated interpretations. For such reasons, we propose a multivariate quantile regression tree model. In this paper, a new split variable selection algorithm is suggested for a multivariate regression tree model. This algorithm can select the split variable more accurately than the previous method without significant selection bias. We investigate the performance of our proposed method with both simulation and real data studies.

Optimal National Coordinate System Transform Model using National Control Point Network Adjustment Results (국가지준점 망조정 성과를 활용한 최적 국가 좌표계 변환 모델 결정)

  • Song, Dong-Seob;Jang, Eun-Seok;Kim, Tae-Woo;Yun, Hong-Sic
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.25 no.6_2
    • /
    • pp.613-623
    • /
    • 2007
  • The main purpose of this study is to investigate the coordinate transformation based on two different systems between local geodetic datum(tokyo datum) and international geocentric datum(new Korea geodetic datum). For this purpose, three methods were used to determine seven parameters as follows: Bursa-Wolf model, Molodensky-Badekas model, and Veis model. Also, we adopted multiple regression equation method to convert from Tokyo datum to KTRF. We used 935 control points as a common points and applied gross error analysis for detecting the outlier among those control points. The coordinate transformation was carried out using similarity transformation applied the obtained seven parameters and the precision of transformed coordinate was evaluated about 9,917 third or forth order control points. From these results, it was found that Bursa-Wolf model and Molodensky-Badekas model are more suitable than other for the determination of transformation parameters in Korea. And, transforming accuracy using MRE is lower than other similarity transformation model.

Predictive Model of Micro-Environment in a Naturally Ventilated Greenhouse for a Model-Based Control Approach (자연 환기식 온실의 모델 기반 환기 제어를 위한 미기상 환경 예측 모형)

  • Hong, Se-Woon;Lee, In-Bok
    • Journal of Bio-Environment Control
    • /
    • v.23 no.3
    • /
    • pp.181-191
    • /
    • 2014
  • Modern commercial greenhouse requires the use of advanced climate control system to improve crop production and to reduce energy consumption. As an alternative to classical sensor-based control method, this paper introduces a model-based control method that consists of two models: the predictive model and the evaluation model. As a first step, this paper presents straightforward models to predict the effect of natural ventilation in a greenhouse according to meteorological factors, such as outdoor air temperature, soil temperature, solar radiation and mean wind speed, and structural factor, opening rate of roof ventilators. A multiple regression analysis was conducted to develop the predictive models on the basis of data obtained by computational fluid dynamics (CFD) simulations. The output of the models are air temperature drops due to ventilation at 9 sub-volumes in the greenhouse and individual volumetric ventilation rate through 6 roof ventilators, and showed a good agreement with the CFD-computed results. The resulting predictive models have an advantage of ensuring quick and reasonable predictions and thereby can be used as a part of a real-time model-based control system for a naturally ventilated greenhouse to predict the implications of alternative control operation.

Development of Crop Growth Model under Different Soil Moisture Status

  • Goto, Keita;Yabuta, Shin;Sakagami, Jun-Ichi
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2019.09a
    • /
    • pp.19-19
    • /
    • 2019
  • It is necessary to maintain stable crop productions under the unsuitable environments, because the drought and flood may be frequently caused by the global warming. Therefore, it is agent to improve the crop growth model corresponded to soil moisture status. Chili pepper (Capsicum annuum) is one of the useful crop in Asia, and then it is affected by change of precipitation in consequence drought and flood occur however crop model to evaluate water stresses on chili pepper is not enough yet. In this study, development of crop model under different soil moisture status was attempted. The experiment was conducted on the slope fields in the greenhouse. The water level was kept at 20cm above the bottom of the container. Habanero (C. chinense) was used as material for crop model. Sap bleeding rate, SPAD value, chlorophyll content, stomatal conductance, leaf water potential, plant height, leaf area and shoot dry weight were measured at 10 days after treatment (DAT) and 13 DAT. Moreover, temperature and RH in the greenhouse, soil volume water contents (VWC) and soil water potential were measured. As a result, VWC showed 4.0% at the driest plot and 31.4% at the wettest plot at 13 DAT. The growth model was calculated using WVC and the growth analysis parameters. It was considered available, because its coefficient of determination showed 0.84 and there are significant relationship based on plants physiology among the parameters and the changes over time. Furthermore, we analyzed the important factors for higher accuracy prediction using multiple regression analysis.

  • PDF

Fibromyalgia diagnostic model derived from combination of American College of Rheumatology 1990 and 2011 criteria

  • Ghavidel-Parsa, Banafsheh;Bidari, Ali;Hajiabbasi, Asghar;Shenavar, Irandokht;Ghalehbaghi, Babak;Sanaei, Omid
    • The Korean Journal of Pain
    • /
    • v.32 no.2
    • /
    • pp.120-128
    • /
    • 2019
  • Background: We aimed to explore the American College of Rheumatology (ACR) 1990 and 2011 fibromyalgia (FM) classification criteria's items and the components of Fibromyalgia Impact Questionnaire (FIQ) to identify features best discriminating FM features. Finally, we developed a combined FM diagnostic (C-FM) model using the FM's key features. Methods: The means and frequency on tender points (TPs), ACR 2011 components and FIQ items were calculated in the FM and non-FM (osteoarthritis [OA] and non-OA) patients. Then, two-step multiple logistic regression analysis was performed to order these variables according to their maximal statistical contribution in predicting group membership. Partial correlations assessed their unique contribution, and two-group discriminant analysis provided a classification table. Using receiver operator characteristic analyses, we determined the sensitivity and specificity of the final model. Results: A total of 172 patients with FM, 75 with OA and 21 with periarthritis or regional pain syndromes were enrolled. Two steps multiple logistic regression analysis identified 8 key features of FM which accounted for 64.8% of variance associated with FM group membership: lateral epicondyle TP with variance percentages (36.9%), neck pain (14.5%), fatigue (4.7%), insomnia (3%), upper back pain (2.2%), shoulder pain (1.5%), gluteal TP (1.2%), and FIQ fatigue (0.9%). The C-FM model demonstrated a 91.4% correct classification rate, 91.9% for sensitivity and 91.7% for specificity. Conclusions: The C-FM model can accurately detect FM patients among other pain disorders. Re-inclusion of TPs along with saving of FM main symptoms in the C-FM model is a unique feature of this model.

Relationship Between a New Functional Evaluation Model and the Fugle-Meyer Assessment Scale for Evaluating the Upper Extremities of Stroke Patients

  • Kim, Jung-Hyun;Kim, Hyun-Jin;Lee, Seung-Gu;Song, Chang-Ho
    • PNF and Movement
    • /
    • v.18 no.3
    • /
    • pp.305-313
    • /
    • 2020
  • Purpose: The aim of this study was to investigate the relationship between a functional evaluation model and the Fugl-Meyer assessment (FMA) scale in evaluating the upper extremities of stroke patients Methods: Thirty-eight stroke patients were evaluated using the FMA and performed reaching and grasping motions using a three-dimensional motion analysis (Qquas 1 series, Qualisys AB, Sweden). The participants sat on a chair with a backrest. The position of the cup was located at a distance of 80% to the front arm length. The markers were attached to the sternum, acromion, elbow lateral epicondyle, ulnar styloid process, three metacarpal heads, and the distal phalanges of the thumb and index finger. The variables of the correlation between the functional evaluation model and the FMA scale were analyzed. Multiple regression (stepwise) was used to investigate the effect of the kinematic variables. Results: A significant negative correlation was found between the movement time (p < 0.05), movement unit (p < 0.05), and trunk displacement values (p < 0.05) in the FMA total scores, while a positive correlation was found between the peak velocity (p < 0.05) and maximum grip aperture values (p < 0.05). As a result of the multiple regression analysis, the most significant factor was the movement unit, followed by the general movement assessment and trunk displacement. The explained FMA total score value was 62%. Conclusion: This study presents a new functional evaluation model for assessing the reaching and grasping ability of stroke patients. The factors of the proposed functional evaluation model showed significant correlations with the FMA scale scores and confirmed that the new functional evaluation model explained the FMA by 67%. This suggests a new functional evaluation model for reaching and grasping stroke patients.

The Development of Synthetic Unit Hydrograph Suitable to the Hydrologic Characteristics in Korea (국내 수문특성에 적합한 합성단위도의 개발)

  • Jeong, Seong-Won;Mun, Jang-Won
    • Journal of Korea Water Resources Association
    • /
    • v.34 no.6
    • /
    • pp.627-640
    • /
    • 2001
  • Generally, the synthetic unit hydrograph method is presented to estimate the design flood in the ungaged watershed. However, due to the lack of rainfall-runoff data, the models developed in other countries such as U.S.A. and Japan have been widely used in Korea. Therefore, it may be essential to develope the rainfall-runoff model suitable for the hydrological char-acteristics in Korea. In this study, the representative unit hydrographs are derived from rainfall-runoff data at 19 basins in Selma-Cheon and 3-IHP experimental watersheds using ridge-regression method and Nash model. And a new synthetic unit hydrograph for Korea is suggested by integrating the described results and previous studies on unit hydrograph. The newly developed method is represented as two regression forms with three independent variables of watershed area, channel length, and channel slope by multiple regression analysis is carried out for each watershed, the coefficients of determination are not improved in all cases compared out for each watershed, the coefficients of determination are not improved n all cased the synthetic unit hydrograph for each watershed. Therefore, when the new method is applied to some watersheds, the result analyzed for all data has to be used.

  • PDF

Alterations of papilla dimensions after orthodontic closure of the maxillary midline diastema: a retrospective longitudinal study

  • Jeong, Jin-Seok;Lee, Seung-Youp;Chang, Moontaek
    • Journal of Periodontal and Implant Science
    • /
    • v.46 no.3
    • /
    • pp.197-206
    • /
    • 2016
  • Purpose: The aim of this study was to evaluate alterations of papilla dimensions after orthodontic closure of the diastema between maxillary central incisors. Methods: Sixty patients who had a visible diastema between maxillary central incisors that had been closed by orthodontic approximation were selected for this study. Various papilla dimensions were assessed on clinical photographs and study models before the orthodontic treatment and at the follow-up examination after closure of the diastema. Influences of the variables assessed before orthodontic treatment on the alterations of papilla height (PH) and papilla base thickness (PBT) were evaluated by univariate regression analysis. To analyze potential influences of the 3-dimensional papilla dimensions before orthodontic treatment on the alterations of PH and PBT, a multiple regression model was formulated including the 3-dimensional papilla dimensions as predictor variables. Results: On average, PH decreased by 0.80 mm and PBT increased after orthodontic closure of the diastema (P<0.01). Univariate regression analysis revealed that the PH (P=0.002) and PBT (P=0.047) before orthodontic treatment influenced the alteration of PH. With respect to the alteration of PBT, the diastema width (P=0.045) and PBT (P=0.000) were found to be influential factors. PBT before the orthodontic treatment significantly influenced the alteration of PBT in the multiple regression model. Conclusions: PH decreased but PBT increased after orthodontic closure of the diastema. The papilla dimensions before orthodontic treatment influenced the alterations of PH and PBT after closure of the diastema. The PBT increased more when the diastema width before the orthodontic treatment was larger.