• Title/Summary/Keyword: multiple linear analysis

Search Result 1,520, Processing Time 0.03 seconds

Price Determinant Factors of Artworks and Prediction Model Based on Machine Learning (작품 가격 추정을 위한 기계 학습 기법의 응용 및 가격 결정 요인 분석)

  • Jang, Dongryul;Park, Minjae
    • Journal of Korean Society for Quality Management
    • /
    • v.47 no.4
    • /
    • pp.687-700
    • /
    • 2019
  • Purpose: The purpose of this study is to investigate the interaction effects between price determinants of artworks. We expand the methodology in art market by applying machine learning techniques to estimate the price of artworks and compare linear regression and machine learning in terms of prediction accuracy. Methods: Moderated regression analysis was performed to verify the interaction effects of artistic characteristics on price. The moderating effects were studied by confirming the significance level of the interaction terms of the derived regression equation. In order to derive price estimation model, we use multiple linear regression analysis, which is a parametric statistical technique, and k-nearest neighbor (kNN) regression, which is a nonparametric statistical technique in machine learning methods. Results: Mostly, the influences of the price determinants of art are different according to the auction types and the artist 's reputation. However, the auction type did not control the influence of the genre of the work on the price. As a result of the analysis, the kNN regression was superior to the linear regression analysis based on the prediction accuracy. Conclusion: It provides a theoretical basis for the complexity that exists between pricing determinant factors of artworks. In addition, the nonparametric models and machine learning techniques as well as existing parameter models are implemented to estimate the artworks' price.

The Design and Performance Analysis of Physical Layer for VDL Mode-2 (VDL Mode-2 물리 계층 설계 및 성능 분석)

  • Choi, Jun-Su;Lee, Han-Seong;Kim, Tae-Sik;Kim, In-Kyu;Kim, Hyoun-Kyoung
    • Journal of Advanced Navigation Technology
    • /
    • v.11 no.1
    • /
    • pp.17-23
    • /
    • 2007
  • This paper, describes the VDR physical layer design in VDL Mode-2 in order to meet the requirements of International standards. VDR's frequency band is 117.975~137MHz, and CSMA(Carrier Sense Multiple Access), D8PSK(Differential Eight Phase Shift Keyed), 25KHz's channel bandwidth use. The analysis of the isolated channel from near channels, sensitivity of the receiver, dynamic range of the receiver, linear of the transmitter and energy of spurious for linear and non-linear simulation as a requirement condition of performance of VDR and teaches the course of design. The transmitting power level should be lower than 5dB from Po1dB point and the selected IF frequency is 45MHz to suppress the spurious signals. The receiver designed has 4.5dB of Noise figure, 27.52dB of Es/No, Mixer isolation up to 30dB, IIP3 power of LNA up to +10dBm to minimize the intermodulation.

  • PDF

Development of Accident Forecasting Models in Freeway Tunnels using Multiple Linear Regression Analysis (다중선형 회귀분석을 이용한 고속도로 터널구간의 교통사고 예측모형 개발)

  • Park, Ju-Hwan;Kim, Sang-Gu
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.11 no.6
    • /
    • pp.145-154
    • /
    • 2012
  • This paper analyzed the characteristics of traffic accidents in all tunnels on nationwide freeways and selected some various independent variables related to accident occurrence in tunnels. The study aims to develop reliable accident forecasting models using the various dependent variables such as the number of accident (no.), no./km, and no./MVK. Finally, reliable multiple linear regression models were proposed in this paper. This study tested the validity verification of developed models through statistics such as $R^2$, F values, multicollinearity, residual analysis. The paper selected the accident forecasting models considering the characteristics of tunnel accidents and two models were finally proposed according to two groups of tunnel length. In the selected models, natural logarithm of ln(no./MVK) is used for the dependent variable and AADT, vertical slope, and tunnel hight are used for the independent variables. The reliability of two models was proved by the comparison analysis between field data and estimating data using RMSE and MAE. These models may be not only effective in evaluating tunnel safety under design and planning phases of tunnel but also useful to reduce traffic accidents in tunnels and to manage the traffic flow of tunnel.

Interpretation and Comparison of High PM2.5 Characteristics in Seoul and Busan based on the PCA/MLR Statistics from Two Level Meteorological Observations (두 층 관측 기상인자의 주성분-다중회귀분석으로 도출되는 고농도 미세먼지의 부산-서울 지역차이 해석)

  • Choi, Daniel;Chang, Lim-Seok;Kim, Cheol-Hee
    • Atmosphere
    • /
    • v.31 no.1
    • /
    • pp.29-43
    • /
    • 2021
  • In this study, two-step statistical approach including Principal Component Analysis (PCA) and Multiple Linear Regression (MLR) was employed, and main meteorological factors explaining the high-PM2.5 episodes were identified in two regions: Seoul and Busan. We first performed PCA to isolate the Principal Component (PC) that is linear combination of the meteorological variables observed at two levels: surface and 850 hPa level. The employed variables at surface are: temperature (T2m), wind speed, sea level pressure, south-north and west-east wind component and those at 850 hPa upper level variables are: south-north (v850) and west-east (u850) wind component and vertical stability. Secondly we carried out MLR analysis and verified the relationships between PM2.5 daily mean concentration and meteorological PCs. Our two-step statistical approach revealed that in Seoul, dominant factors for influencing the high PM2.5 days are mainly composed of upper wind characteristics in winter including positive u850 and negative v850, indicating that continental (or Siberian) anticyclone had a strong influence. In Busan, however, the dominant factors in explanaining in high PM2.5 concentrations were associated with high T2m and negative u850 in summer. This is suggesting that marine anticyclone had a considerable effect on Busan's high PM2.5 with high temperature which is relevant to the vigorous photochemical secondary generation. Our results of both differences and similarities between two regions derived from only statistical approaches imply the high-PM2.5 episodes in Korea show their own unique characteristics and seasonality which are mostly explainable by two layer (surface and upper) mesoscale meteorological variables.

Outlier Identification in Regression Analysis using Projection Pursuit

  • Kim, Hyojung;Park, Chongsun
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.3
    • /
    • pp.633-641
    • /
    • 2000
  • In this paper, we propose a method to identify multiple outliers in regression analysis with only assumption of smoothness on the regression function. Our method uses single-linkage clustering algorithm and Projection Pursuit Regression (PPR). It was compared with existing methods using several simulated and real examples and turned out to be very useful in regression problem with the regression function which is far from linear.

  • PDF

A simple nonlinear model for estimating obturator foramen area in young bovines

  • Pares-Casanova, Pere M.
    • Korean Journal of Veterinary Research
    • /
    • v.53 no.2
    • /
    • pp.73-76
    • /
    • 2013
  • The aim of this study was to produce a simple and inexpensive technique for estimating the obturator foramen area (OFA) from young calves based on the hypothesis that OFA can be extrapolated from simple linear measurements. Three linear measurements - dorsoventral height, craneocaudal width and total perimeter of obturator foramen - were obtained from 55 bovine hemicoxae. Different algorithms for determining OFA were then produced with a regression analysis (curve fitting) and statistical analysis software. The most simple equation was OFA ($mm^2$) = [3,150.538 + ($36.111^*CW$)] - [147,856.033/DH] (where CW = craneocaudal width and DH = dorsoventral height, both in mm), representing a good nonlinear model with a standard deviation of error for the estimate of 232.44 and a coefficient of multiple determination of 0.846. This formula may be helpful as a repeatable and easily performed estimation of the obturator foramen area in young bovines. The area of the obturator foramen magnum can thus be estimated using this regression formula.

Community Based Study for Stress and It's Related Factors (일부 지역 주민들의 스트레스 관련요인에 대한 연구)

  • Lee, Jeong-Mi;Kil, Sang-Sun;Kwon, Keun-Sang;Oh, Gyung-Jae
    • Journal of Preventive Medicine and Public Health
    • /
    • v.36 no.2
    • /
    • pp.125-130
    • /
    • 2003
  • Objectives : This study evaluated the stress of community residents using the General Health Questionnaire, GHQ-60, as an instrument of stress measurement. Methods : The study included 2100 residents, aged 20 and over, living in three areas, a large city, a medium sized city and a rural area, between June and September 2001. A questionnaire interviewing method was used to collect data. The data were analyzed using a t-test, ANOVA, Pearson's correlation coefficients and multiple regression analysis. Results : In this study, the degree of stress, as measured by the GHQ-60, was shown to be significantly higher in the following categories: females, people over 60 years old, people engaged in the primary industries and labor work, low incomes, the divorced and the bereaved, people who received no more than an elementary education, people who suffer from chronic diseases and non-exercisers. A factor analysis suggested that there were three factors of social dysfunction factors; psychosomatic symptom, and depression and anxiety, The social dysfunction factors was statistically significant for the groups described above. The factor of psychosomatic symptoms was statistically significant in the rural residents, and in the groups describedabove. The depression and anxiety factor was statistically significant in the large city residents, people aged between 20-29 years, students, unmarried persons, university graduates and those having suffered from chronic diseases. From the multiple linear regression analyses, chronic disease, exercise, gender and income, proved to be significant stress related factors Conclusions : This study suggests that special attention should be given to the management of the chronic invalided, non-exercisers, females and snail income earners, in order to maintain and promote the psychological health of residents in a community.

Genetic parameters for worm resistance in Santa Inês sheep using the Bayesian animal model

  • Rodrigues, Francelino Neiva;Sarmento, Jose Lindenberg Rocha;Leal, Tania Maria;de Araujo, Adriana Mello;Filho, Luiz Antonio Silva Figueiredo
    • Animal Bioscience
    • /
    • v.34 no.2
    • /
    • pp.185-191
    • /
    • 2021
  • Objective: The objective of this study was to estimate the genetic parameters for worm resistance (WR) and associated characteristics, using the linear-threshold animal model via Bayesian inference in single- and multiple-trait analyses. Methods: Data were collected from a herd of Santa Inês breed sheep. All information was collected with animals submitted to natural contamination conditions. All data (number of eggs per gram of feces [FEC], Famacha score [FS], body condition score [BCS], and hematocrit [HCT]) were collected on the same day. The animals were weighed individually on the day after collection (after 12-h fasting). The WR trait was defined by the multivariate cluster analysis, using the FEC, HCT, BCS, and FS of material collected from naturally infected sheep of the Santa Inês breed. The variance components and genetic parameters for the WR, FEC, HCT, BCS, and FS traits were estimated using the Bayesian inference under the linear and threshold animal model. Results: A low magnitude was obtained for repeatability of worm-related traits. The mean values estimated for heritability were of low-to-high (0.05 to 0.88) magnitude. The FEC, HCT, BCS, FS, and body weight traits showed higher heritability (although low magnitude) in the multiple-trait model due to increased information about traits. All WR characters showed a significant genetic correlation, and heritability estimates ranged from low (0.44; single-trait model) to high (0.88; multiple-trait model). Conclusion: Therefore, we suggest that FS be included as a criterion of ovine genetic selection for endoparasite resistance using the trait defined by multivariate cluster analysis, as it will provide greater genetic gains when compared to any single trait. In addition, its measurement is easy and inexpensive, exhibiting greater heritability and repeatability and a high genetic correlation with the trait of resistance to worms.

Design and Performance Analysis of a DS/CDMA Multiuser Detection Algorithm in a Mixed Structure Form (혼합구조 형태의 DS/CDMA 다중사용자 검파 알고리즘 설계 및 성능 분석)

  • Lim, Jong-Min
    • Journal of the Institute of Electronics Engineers of Korea TE
    • /
    • v.39 no.3
    • /
    • pp.51-58
    • /
    • 2002
  • The conventional code division multiple access(CDMA) detector shows severe degradation in communication quality as the number of users increases due to multiple access interferences(MAI). This problem thus restricts the user capacity. Various multiuser detection algorithms have been proposed to overcome the MAI problem. The existing detectors can be generally classified into one of the two categories : linear multiuser detection and subtractive interference cancellation detectors. In the linear multiuser detection, a linear transform is applied to the soft outputs of the conventional detector. In the subtractive interference cancellation detection, estimates of the interference are generated and subtracted out from the received signal. There has been great interest in the family of the subtractive interference cancellation detection because the linear multiuser detection exhibits the disadvantage of taking matrix inversion operations. The successive interference cancellation (SIC) and the parallel interference cancellation (PIC) are the two most popular structures in the subtractive interference cancellation detector. The SIC structure is very simple in hardware complexity, but has the disadvantage of increased processing delay time, while the PIC structure is good in performance, but shows the disadvantage of increased hardware complexity. In this paper we propose a mixed structure form of SIC and PIC in order to achieve good performance as well as simple hardware complexity. A performance analysis of the proposed scheme has been made, and the superior characteristics of the mixed structure are demonstrated by extensive computer simulations. 

Evaluation of the Relationship between the Exposure Level to Mixed Hazardous Heavy Metals and Health Effects Using Factor Analysis (요인분석을 이용한 유해 중금속 복합 노출수준과 건강영향과의 관련성 평가)

  • Kim, Eunseop;Moon, Sun-In;Yim, Dong-Hyuk;Choi, Byung-Sun;Park, Jung-Duck;Eom, Sang-Yong;Kim, Yong-Dae;Kim, Heon
    • Journal of Environmental Health Sciences
    • /
    • v.48 no.4
    • /
    • pp.236-243
    • /
    • 2022
  • Background: In the case of multiple exposures to different types of heavy metals, such as the conditions faced by residents living near a smelter, it would be preferable to group hazardous substances with similar characteristics rather than individually related substances and evaluate the effects of each group on the human body. Objectives: The purpose of this study is to evaluate the utility of factor analysis in the assessment of health effects caused by exposure to two or more hazardous substances with similar characteristics, such as in the case of residents living near a smelter. Methods: Heavy metal concentration data for 572 people living in the vicinity of the Janghang smelter area were grouped based on several subfactors according to their characteristics using factor analysis. Using these factor scores as an independent variable, multiple regression analysis was performed on health effect markers. Results: Through factor analysis, three subfactors were extracted. Factor 1 contained copper and zinc in serum and revealed a common characteristic of the enzyme co-factor in the human body. Factor 2 involved urinary cadmium and arsenic, which are harmful metals related to kidney damage. Factor 3 encompassed blood mercury and lead, which are classified as related to cardiovascular disease. As a result of multiple linear regression analysis, it was found that using the factor index derived through factor analysis as an independent variable is more advantageous in assessing the relevance to health effects than when analyzing the two heavy metals by including them in a single regression model. Conclusions: The results of this study suggest that regression analysis linked with factor analysis is a good alternative in that it can simultaneously identify the effects of heavy metals with similar properties while overcoming multicollinearity that may occur in environmental epidemiologic studies on exposure to various types of heavy metals.