• Title/Summary/Keyword: principal component regression

Search Result 253, Processing Time 0.03 seconds

Alternative hitting ability index for KBO (한국프로야구에서 타자력 지수 제안)

  • Hong, Chong Sun;Kim, Jae Young;Shin, Dong Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.3
    • /
    • pp.677-687
    • /
    • 2016
  • Among lots of sabermetric statistics for baseball batters' ability, the wins above replacement (WAR) is the most popular statistic in MLB. However, there exists a difficulty applying WAR to KBO, since KBO data do not have position adjustment, league adjustment and park factor which are essential in calculating WAR. In this paper, using five statistics for both KBO and MLB qualified batters, we propose hitting ability index (HAI), an alternative sabermetric indices to represent batters' ability. Comparing HAI with WAR of MLB batters, we evaluate the validity of HAI and then applied HAI to 2015 KBO data in which HAI is analyzed statistically with respect to different teams, ages, and positions. Moreover, the linear relationship between KBO batter's HAI and their annual salary is discussed. Grouping 46 KBO batters based on confidence region of the regression model for annual salary, we also statistically investigate batter's annual salary in these groups with respect to several factors.

Determination of Nitrogen in Fresh and Dry Leaf of Apple by Near Infrared Technology (근적외 분석법을 응용한 사과의 생잎과 건조잎의 질소분석)

  • Zhang, Guang-Cai;Seo, Sang-Hyun;Kang, Yeon-Bok;Han, Xiao-Ri;Park, Woo-Churl
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.37 no.4
    • /
    • pp.259-265
    • /
    • 2004
  • A quicker method was developed for foliar analysis in diagnosis of nitrogen in apple trees based on multivariate calibration procedure using partial least squares regression (PLSR) and principal component regression (PCR) to establish the relationship between reflectance spectra in the near infrared region and nitrogen content of fresh- and dry-leaf. Several spectral pre-processing methods such as smoothing, mean normalization, multiplicative scatter correction (MSC) and derivatives were used to improve the robustness and performance of the calibration models. Norris first derivative with a seven point segment and a gap of six points on MSC gave the best result of partial least squares-1 PLS-1) model for dry-leaf samples with root mean square error of prediction (RMSEP) equal to $0.699g\;kg^{-1}$, and that the Savitzky-Golay first derivate with a seven point convolution and a quadratic polynomial on MSC gave the best results of PLS-1 model for fresh-samples with RMSEP of $1.202g\;kg^{-1}$. The best PCR model was obtained with Savitzky-Golay first derivative using a seven point convolution and a quadratic polynomial on mean normalization for dry leaf samples with RMSEP of $0.553g\;kg^{-1}$, and obtained with the Savitzky-Golay first derivate using a seven point convolution and a quadratic polynomial for fresh samples with RMSEP of $1.047g\;kg^{-1}$. The results indicate that nitrogen can be determined by the near infrared reflectance (NIR) technology for fresh- and dry-leaf of apple.

A Study on the Mediating Role of Mathematics Anxiety in the Influence of Self Efficacy on Mathematics Skills of College Students Majoring in Hospitality Management (호텔.레스토랑 전공 대학생들의 자기효능감과 수학실력의 관계에서 수학불안의 매개역할에 관한 연구)

  • Kim, Min-Jung;Kim, Hyun-Jung;Kim, Dong-Jin
    • Culinary science and hospitality research
    • /
    • v.18 no.4
    • /
    • pp.59-69
    • /
    • 2012
  • This study examines the role of mathematics anxiety as a mediator between self efficacy and mathematics skills using a series of regression analyses suggested by Baron RM & Kenny DA(1986). The participants include college students who enrolled in the Food Service Production and Operation course in a department of hotel and restaurant management at a college in the United States. Descriptive analysis, principal component analysis, reliability test, and a series of regression analyses were used for data analysis using SPSS 19.0. In order to collect data for the study, General Self Efficacy Scale(GSES) and Math Anxiety Rating Scale(MARS) were utilized, and they turned out to be reliable(${\alpha}$=.906 and ${\alpha}$=.890, respectively). A significant negative relationship was found between self efficacy and mathematics anxiety. In addition, it was found that self-efficacious students performed better mathematics skills than those who had lower level of self efficacy. However, the relationship was no longer significant when the concept of mathematics anxiety was added, which satisfies the condition of mediation.

  • PDF

Simultaneous Spectrophotometric Determination of Copper, Nickel, and Zinc Using 1-(2-Thiazolylazo)-2-Naphthol in the Presence of Triton X-100 Using Chemometric Methods (화학계량학적 방법을 사용한 Triton X-100이 함유된 1-(2-Thiazolylazo)-2-Naphthol을 사용한 구리, 니켈과 아연의 동시 분광광도법적 정량)

  • Low, Kah Hin;Zain, Sharifuddin Md.;Abas, Mhd. Radzi;Misran, Misni;Mohd, Mustafa Ali
    • Journal of the Korean Chemical Society
    • /
    • v.53 no.6
    • /
    • pp.717-726
    • /
    • 2009
  • Multivariate models were developed for the simultaneous spectrophotometric determination of copper (II), nickel (II) and zinc (II) in water with 1-(2-thiazolylazo)-2-naphthol as chromogenic reagent in the presence of Triton X-100. To overcome the drawback of spectral interferences, principal component regression (PCR) and partial least square (PLS) multivariate calibration approaches were applied. Performances were validated with several test sets, and their results were then compared. In general, no significant difference in analytical performance between PLS and PCR models. The root mean square error of prediction (RMSEP) using three components for $Cu^{2+}$, $Ni^{2+}$ and $Zn^{2+}$ were 0.018, 0.010, 0.011 ppm, respectively. Figures of merit such as sensitivity, analytical sensitivity, limit of detection (LOD) were also estimated. High reliability was achieved when the proposed procedure was applied to simultaneous determination of $Cu^{2+}$, $Ni^{2+}$ and $Zn^{2+}$ in synthetic mixture and tap water.

A Study of Tasseled Cap Transformation Coefficient for the Geostationary Ocean Color Imager (GOCI) (정지궤도 천리안위성 해양관측센서 GOCI의 Tasseled Cap 변환계수 산출연구)

  • Shin, Ji-Sun;Park, Wook;Won, Joong-Sun
    • Korean Journal of Remote Sensing
    • /
    • v.30 no.2
    • /
    • pp.275-292
    • /
    • 2014
  • The objective of this study is to determine Tasseled Cap Transformation (TCT) coefficients for the Geostationary Ocean Color Imager (GOCI). TCT is traditional method of analyzing the characteristics of the land area from multi spectral sensor data. TCT coefficients for a new sensor must be estimated individually because of different sensor characteristics of each sensor. Although the primary objective of the GOCI is for ocean color study, one half of the scene covers land area with typical land observing channels in Visible-Near InfraRed (VNIR). The GOCI has a unique capability to acquire eight scenes per day. This advantage of high temporal resolution can be utilized for detecting daily variation of land surface. The GOCI TCT offers a great potential for application in near-real time analysis and interpretation of land cover characteristics. TCT generally represents information of "Brightness", "Greenness" and "Wetness". However, in the case of the GOCI is not able to provide "Wetness" due to lack of ShortWave InfraRed (SWIR) band. To maximize the utilization of high temporal resolution, "Wetness" should be provided. In order to obtain "Wetness", the linear regression method was used to align the GOCI Principal Component Analysis (PCA) space with the MODIS TCT space. The GOCI TCT coefficients obtained by this method have different values according to observation time due to the characteristics of geostationary earth orbit. To examine these differences, the correlation between the GOCI TCT and the MODIS TCT were compared. As a result, while the GOCI TCT coefficients of "Brightness" and "Greenness" were selected at 4h, the GOCI TCT coefficient of "Wetness" was selected at 2h. To assess the adequacy of the resulting GOCI TCT coefficients, the GOCI TCT data were compared to the MODIS TCT image and several land parameters. The land cover classification of the GOCI TCT image was expressed more precisely than the MODIS TCT image. The distribution of land cover classification of the GOCI TCT space showed meaningful results. Also, "Brightness", "Greenness", and "Wetness" of the GOCI TCT data showed a relatively high correlation with Albedo ($R^2$ = 0.75), Normalized Difference Vegetation Index (NDVI) ($R^2$ = 0.97), and Normalized Difference Moisture Index (NDMI) ($R^2$ = 0.77), respectively. These results indicate the suitability of the GOCI TCT coefficients.

Statistical Analysis of Water Flow and Water Quality Data in the Imjin River Basin for Total Pollutant Load Management (임진강 유역 오염물질 총량관리를 위한 유량-수질 자료의 통계분석)

  • Cho, Yong-Chul;Choi, Hyeon-Mi;Lee, Young Joon;Ryu, Ingu;Lee, Myung-Gu;Gu, Donghoi;Choi, Kyungwan;Yu, Soonju
    • Journal of Environmental Impact Assessment
    • /
    • v.27 no.4
    • /
    • pp.353-366
    • /
    • 2018
  • The purpose of this study was assessment the quality of water by using the statistical analysis technique of the Water flow and water quality from January 2012 to December 2016 at the unit basin for total pollutant load management system (TPLMS) in the Imjin River. Water flow and water quality were monitored at an average of 8 day intervals, 11 parameters were used for correlation analysis, principal component analysis (PCA), factor analysis (FA), and cluster analysis (CA). The Hierarchical CA was classified into three according to the change of space, such as natural rivers, urban rivers, point with large influence of point pollution source, it was found that the type of contamination source the similarity of water quality affected the classification of cluster. Using one-way analysis of variance (ANOVA) and post-hoc Analysis, there were statistically significant differences between mean values among the clusters. Correlation analysis showed the correlation coefficient between $COD_{Mn}$ and TOC was 0.951 (p<0.01) and the correlation was statistically significantly higher. According to the result PCA and FA, 3 principal components can explaining 72% of the total variations in water quality characteristics and main factor was EC, $BOD_5$, $COD_{Mn}$, TN, TP and TOC indirect indicators of organic matter and nutrients were influenced. This study presented the regression equation obtained by applying the factor scores to the multiple linear regression analysis and concluded that the management Indirect indicators of organic matter and nutrients is important for water quality management in the Imjin River basin.

A METHOD OF CAPABILITY EVALUATION FOR KOREAN PADDY SOILS -Part 2. The rice yield prediction by soil fertility constituents and other characters (한국(韓國) 답토양(畓土壤)의 생산력(生産力) 평가방법에 관한 연구 -2 보(報)·비옥도(肥沃度) 구성인자(構成因子) 및 기타(其他) 특성(特性)에 의(依)한 쌀수확량(收穫量)의 추정(推定))

  • Hong, Ki-Chang;Maeng, Do-Won;Kazutake, Kyuma;Hisao, Furukawa;Suh, Yoon-Soo
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.12 no.1
    • /
    • pp.15-23
    • /
    • 1979
  • In the first paper of the series the five soil fertility factors were evaluated by means of principal component analysis and varimax method. They are interpreted as representing, 1) skeletal available phosporus status, 2) organnic matter status, 3) salt status 4) base status, and 5) free oxide status. In order to resynthesize such fragmented information for the overall soil fertility evaluation, the method of multiple regression analysis was adopted, using the five factor scores and yield data for Korean paddy soils as independent and dependent variables respectively. As test of linear models with different combinations of independent variables the results of t-test of regression coefficient were revealed that the organic matter status (FII) has no relevance to the yield of paddy and that the free oxides and salt supply has by it self only an insignificant contribution to the yield. The multiple correlation coefficient (R) revealed its multiple regression analysis was as low as 0.43. Introduction of quadratic terms to the linear model bettered the result. Thus multiple correlation coefficient (R) was increased as 0.59. Therefore, a coefficient of determination 0.35 was obtained by a quadratic model with interaction terms among the five fertility constituents. Generally we think that the fertility factor has more contribution to raise the rice yield in paddy and that the failure of yield prediction by fertility factor scores was caused by one of follows; 1) the roughness of the yield inspection, and 2) missextraction of fertility constituents. The second step in this study, assuming that the residuals by multiple regression analysis were due to factors other than soil fertility, we can now proceed to predicting the yield from the field characters with the classified fertility groups by means of Hayashi's theory of quantification No. 1. Such variables as fertility groups (FTYG), water availability (WATER), soil drainage (DRNG), climatic zone (CLIZ), surface soil's stickiness (STCKT), surface soil's dry consistence (DCNST), and surface soil's texture (FTEXT) are taken up as the explanatory variables. The quantification appears reasonable; the well to extremely well in soil drainage, very sticky of surface soil, inefficiency in water availability, coarse texture, and very hard to extremely hard dry consistence in soil are detrimental to the rice yield. The R was as high as 0.90 for the set of variables. But the given explanatory variables in this study were not quite effective in explaining rice yield. The method developed seems to be promising only if properly collected data are available. Conditions that should be satisfied in the yield inspection obtained from common cultivator for the purpose of deriving a prediction equation were put forward.

  • PDF

The Relationship of Psychosocial Factors to Blood Pressure (사회심리학적인 요인과 혈압의 관계)

  • Lee, Choong-Won;Lee, Sung-Kwan
    • Journal of Preventive Medicine and Public Health
    • /
    • v.21 no.1 s.23
    • /
    • pp.99-112
    • /
    • 1988
  • Questionnaires and blood pressure measurements were administered to 279 medical school undergraduates in 1987 to investigate the relationship between psychosocial factors and blood pressure as well as reliability and validity of the Framingham Type A Behavior Scale(FTA). The reliability coefficients of SCL-90-R and nh measured by Spearman-Brown haves split test were $0.57{\sim}0.91$. The factors of FTA extracted by principal component analysis were hard-driving competitiveness factor and impatience factor(2-factor solution) . The total score of nh was positively correlated with relative weight and place raised but the correlations were insignificant, and had significantly positive but weak correlations with depression, anxiety, hostility, paranoid, and psychoticism subscales of SCL-90-R. In the univariate analysis of blood pressures, relative weight and family history were significant in systolic pressure in males and economic status was significant in blood pressures in both sexes. For diastolic pressure, relative weight and frequency of alcohol intake were significant in males and relative weight was in females. After controlling relative weight, the frequency of alcohol intake for diastolic pressure and economic status for systolic pressure were significant in males. The important variables selected by stepwise regression analysis were relative weight and economic status for systolic pressure of males and relative weight and the frequency of alcohol intake for diastolic pressure. At the level of alpha 0.1, depression subscale was added to the model, changing coefficient of determination 0.206 to 0.217. In females, economic status and relative weight were selected for systolic pressure and for diastolic pressure body mass index alone, but the model of blood pressure for females was considered to be unstable due to small sample size(56). FTA was unrelated to the blood pressures in both sexes.

  • PDF

The Effect of Private Guards' Job Embeddedness on Dual Commitment (민간경비원의 직무착근도가 이중몰입에 미치는 영향)

  • Lim, Woon-Sik
    • Korean Security Journal
    • /
    • no.41
    • /
    • pp.123-151
    • /
    • 2014
  • The purpose of this study was focused on the relationship between private guards' job embeddedness and dual commitment. In this study, job embeddedness is selected as an independent variable and dual commitment is selected as a dependent variable one. job embeddedness was divided into three sub-factors such as "fit", "links", and "sacrifice", and dual commitment is again composed with organizational commitment, and career commitment. Moreover sex, age, academic background, service period, and income were selected as a control variable. To test the hypotheses, survey data from private guards in Kyungpook are collected and analyzed. Principal component method is used to see which items cluster together in each factor and to calculate factor scores. Multiple regression analysis identifies several factors which have significant effects on dual commitment. Key finding can be summarized as follow. Fist, the factor of "fit" have significant effects on organizational commitment, and career commitment. Second, the factor of "links" have significant effects on organizational commitment, and career commitment. Third the factor of "sacrifice" have significant effects on organizational commitment, and career commitment. Finally, when all the variables with significant effects are included in the final model, "links" disappear, while "fit" and "sacrifice" remain statistically significant. Based on these finding, this study suggests some policy issues to promote private guards' dual commitment.

  • PDF

Authentication and classification of strawberry varieties by analysis of their leaves using near infrared spectroscopy.

  • Lopez, Mercedes G.
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1617-1617
    • /
    • 2001
  • It is well known now that near infrared spectroscopy (NIRS) is a fast, no destructive, and inexpensive analytical technique that could be used to classify, identify, and authenticate a wide range of foods and food items. Therefore, the main aims of this study were to provide a new insight into the authentication of five strawberry (Fragaria x ananassa) varieties and to correlate them with geographical zones and the propagating methods used. Three weeks plants of five different strawberry varieties (F. x ananassa Duch. cv Camarosa, Seascape, Chandler, F. Chiloensis, and F. Virginiana) were cultivated in vitro first and then transferred to pots with special soil, and grown in a greenhouse at CINVESTAV, all varieties were acquired from California (USA). After 18 months, ten leaves from each variety were collected. Transmission spectra from each leave were recorded over a range of 10, 000-4, 000 cm$-^{1}$, 32 scans of each strawberry leave were collected using a resolution of 4 cm$-^{1}$ with a Paragon IdentiCheck FT-NIR System Spectrometer. Triplicates of each strawberry leave were used. All spectra were analyzed using principal component analysis (PCA) and soft independent modeling class analogy (SIMCA). The optimum number of components to be used in the regression was automatically determined by the software. Camarosa was the only variety grown from the same shoot but propagated by a different method (direct or in vitro). Five different classes (varieties) or clusters were observed among samples, however, larger inter class distances were presented by the two wildtype samples (F. Chiloensis and F. Virginiana). Camarosa direct and Camarosa in vitro displayed a small overlapping region between them. On the other hand, Seascape variety presented the smallest rejection percentage among all varieties (more similarities with the rest of the samples). Therefore, it can be concluded that the application of NIRS technique allowed the authentication of all strawberry varieties and geographical origin as well. It was also possible to form subclasses of the same materials. The results presented here demonstrate that NIRS is a very powerful and promising analytical tool since all materials were authenticated and classified based on their variety, origin, and treatment. This is of a tremendous relevance since the variety and origin of a plant material can be established even before it gives its typical fruit or flower.

  • PDF