• Title/Summary/Keyword: multi-linear regression analysis

Analysis of Algal Bloom Occurrence Characteristics Namyang Lake using Sentinel-2 MSI (Sentinel-2 MSI를 활용한 남양 간척담수호의 조류발생 특성 분석)

  • Wonjin Jang;Jinuk Kim;Jiwan Lee;Yongeun Park;Seongjoon Kim
    • Proceedings of the Korea Water Resources Association Conference / 2023.05a / pp.56-56 / 2023
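  • Namyang Lake is an estuarine freshwater lake built to supply agricultural water, and green algae bloom there every summer because of excessive nutrient accumulation. To analyze the characteristics of algal bloom occurrence, this study used remote-sensing reflectance spectra from the Sentinel-2 Multi Spectral Image (MSI) to estimate chlorophyll-a (Chl-a), which has inherent optical properties arising from phytoplankton and related degradation products, and thereby to identify bloom occurrence. To develop the Chl-a estimation algorithm, field water quality data were measured (27 times in 2022 and 27 times in 2023) at the 5-day interval that matches the combined revisit cycle of Sentinel-2 A and B. Chl-a concentration was measured with an EXO-YSI and ranged from 9.4 to 127.1 mg/L over the period. From the Sentinel-2 A and B data, reflectance values for bands B1 (443 nm) through B8A (865 nm) were extracted at the field sampling locations, taking weather conditions (cloud, fog, precipitation) into account. The input data were ratios between band reflectances, chosen in view of atmospheric and radiometric effects, together with reflectances used in previous studies. The algorithms were multiple linear regression (Multi Linear Regression Model, MLR) and Random Forest (RF). MLR achieved a coefficient of determination (R2) of 0.68 in training and 0.59 in validation, while RF achieved 0.94 and 0.85, respectively. The spatiotemporal Chl-a concentration data generated by these algorithms are expected to be useful for analyzing algal bloom characteristics in the freshwater lake and for efficient algal management and response.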
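
The training and validation R2 values reported above can be illustrated with a minimal sketch. It fits an ordinary least-squares MLR and scores it on two subsets. Everything in it is an assumption made for illustration: the data are synthetic, the three "band ratios" are hypothetical predictors, and the 40/14 split is arbitrary. It is not the authors' algorithm or data.

```python
import numpy as np


def fit_mlr(X, y):
    """Fit ordinary least squares with an intercept; return [b0, b1, ..., bk]."""
    design = np.column_stack([np.ones(len(X)), X])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef


def predict(coef, X):
    """Apply the fitted intercept and slopes to new predictor rows."""
    return coef[0] + X @ coef[1:]


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - np.mean(y_true)) ** 2)
    return 1.0 - ss_res / ss_tot


# Synthetic stand-in data (assumption): 54 samples, three hypothetical band ratios.
rng = np.random.default_rng(0)
n_samples = 54
band_ratios = rng.uniform(0.5, 2.0, size=(n_samples, 3))
chl_a = (10.0 + 30.0 * band_ratios[:, 0] - 5.0 * band_ratios[:, 1]
         + 8.0 * band_ratios[:, 2] + rng.normal(0.0, 5.0, n_samples))

# Hypothetical split: first 40 samples for training, the rest for validation.
X_train, y_train = band_ratios[:40], chl_a[:40]
X_valid, y_valid = band_ratios[40:], chl_a[40:]

coef = fit_mlr(X_train, y_train)
r2_train = r2_score(y_train, predict(coef, X_train))
r2_valid = r2_score(y_valid, predict(coef, X_valid))
print(f"R2 train = {r2_train:.2f}, R2 validation = {r2_valid:.2f}")
```

A validation R2 below the training R2, as in the abstract, is the usual pattern, because the coefficients are tuned to the training samples only.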

Water Quality Assessment and Turbidity Prediction Using Multivariate Statistical Techniques: A Case Study of the Cheurfa Dam in Northwestern Algeria

  • ADDOUCHE, Amina;RIGHI, Ali;HAMRI, Mehdi Mohamed;BENGHAREZ, Zohra;ZIZI, Zahia
    • Applied Chemistry for Engineering / v.33 no.6 / pp.563-573 / 2022
  • This work aimed to develop a new equation for turbidity (Turb) simulation and prediction using statistical methods based on principal component analysis (PCA) and multiple linear regression (MLR). For this purpose, water samples were collected monthly over a five-year period from the Cheurfa dam, an important reservoir in Northwestern Algeria, and analyzed for 12 parameters: temperature (T), pH, electrical conductivity (EC), turbidity (Turb), dissolved oxygen (DO), ammonium (NH4+), nitrate (NO3-), nitrite (NO2-), phosphate (PO43-), total suspended solids (TSS), biochemical oxygen demand (BOD5), and chemical oxygen demand (COD). The results revealed strong mineralization of the water and low DO content during the summer period. High levels of TSS and Turb were recorded during rainy periods. In addition, the water was loaded with phosphate (PO43-) throughout the study period. The PCA results revealed ten factors, three of which were significant (eigenvalues > 1) and explained 75.5% of the total variance. The F1 and F2 factors explained 36.5% and 26.7% of the total variance, respectively, and indicated anthropogenic pollution of domestic, agricultural, and industrial origin. The MLR turbidity simulation model exhibited a high coefficient of determination (R2 = 92.20%), indicating that 92.20% of the data variability can be explained by the model. TSS, DO, EC, NO3-, NO2-, and COD were the most significant contributing parameters (p values << 0.05) in turbidity prediction. The present study can support decision-making on the management and monitoring of the water quality of the dam, which is the primary source of drinking water in this region.
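
The "eigenvalues > 1" rule mentioned above (often called the Kaiser criterion) can be sketched as follows. This is a minimal illustration on synthetic data, assuming PCA is run on the correlation matrix of standardized variables. The 60 x 12 data shape, the latent-factor construction, and the function name `kaiser_pca` are assumptions for the example, not details taken from the paper.

```python
import numpy as np


def kaiser_pca(data):
    """PCA on the correlation matrix; flag components with eigenvalue > 1.

    Returns (eigenvalues in descending order, explained-variance ratios,
    boolean mask of retained components).
    """
    corr = np.corrcoef(data, rowvar=False)
    # eigvalsh returns ascending eigenvalues for a symmetric matrix.
    eigvals = np.linalg.eigvalsh(corr)[::-1]
    explained = eigvals / eigvals.sum()
    keep = eigvals > 1.0
    return eigvals, explained, keep


# Synthetic stand-in (assumption): 60 samples of 12 correlated variables
# driven by three hidden factors plus noise.
rng = np.random.default_rng(1)
latent = rng.normal(size=(60, 3))
loadings = rng.normal(size=(3, 12))
data = latent @ loadings + 0.5 * rng.normal(size=(60, 12))

eigvals, explained, keep = kaiser_pca(data)
print("retained components:", int(keep.sum()))
print("variance explained by retained components:",
      round(float(explained[keep].sum()), 3))
```

Because the eigenvalues of a correlation matrix sum to the number of variables, an eigenvalue above 1 marks a component that carries more variance than any single standardized variable.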

Reliability and Data Integration of Duplicated Test Results Using Two Bioelectrical Impedence Analysis Machines in the Korean Genome and Epidemiology Study

  • Park, Bo-Young;Yang, Jae-Jeong;Yang, Ji-Hyun;Kim, Ji-Min;Cho, Lisa-Y.;Kang, Dae-Hee;Shin, Chol;Hong, Young-Seoub;Choi, Bo-Youl;Kim, Sung-Soo;Park, Man-Suck;Park, Sue-K.
    • Journal of Preventive Medicine and Public Health / v.43 no.6 / pp.479-485 / 2010
  • Objectives: The Korean Genome and Epidemiology Study (KoGES), a multicenter-based multi-cohort study, has collected information on body composition using two different bioelectrical impedance analysis (BIA) machines. The aim of the study was to evaluate whether test values measured with different BIA machines can be integrated through a statistical adjustment algorithm, given excellent inter-rater reliability. Methods: We selected two centers to measure the inter-rater reliability of the two BIA machines. We set up the two machines side by side and measured subjects' body compositions between October and December 2007. Duplicated test values of 848 subjects were collected. Pearson and intra-class correlation coefficients for inter-rater reliability were estimated using results from the two machines. To assess the feasibility of data integration, we constructed statistical compensation models using linear regression models with residual analysis and R-square values. Results: All correlation coefficients indicated excellent reliability except for mineral mass. However, models using only duplicated body composition values for data integration were not feasible because of relatively low $R^2$ values of 0.8 for mineral mass and target weight. For integrating body composition data, models adjusted for four empirical variables (age, sex, weight, and height) were the most suitable (all $R^2$ > 0.9). Conclusions: The test values measured with the two BIA machines in the KoGES have excellent reliability for the nine body composition values. Based on this reliability, values can be integrated through algorithmic statistical adjustment using regression equations that include age, sex, weight, and height.

Sports Media Value in New Media Platform Era: The Role of Media Engagement and Empathy (뉴미디어 플랫폼 시대의 스포츠미디어 가치: 미디어 인게이지먼트와 공감의 역할)

  • Choi, Eui-Yul;Jeon, Yong-Bae;Kim, Hyun-Duck
    • Journal of the Korean Applied Science and Technology / v.39 no.3 / pp.433-441 / 2022
  • The purpose of this study was to investigate the relationships among media engagement, media empathy, and the media value of MCN sports broadcasting. To this end, a survey was conducted of 324 MCN sports broadcast viewers. Exploratory factor analysis was performed to confirm validity, and Cronbach's α was used to assess reliability. In addition, correlation analysis was performed to verify discriminant validity, and linear regression analysis was performed to test the research hypotheses. The following conclusions were drawn. Media engagement had a positive effect on media value. Media engagement had a positive effect on media empathy. Media empathy had a positive effect on media value.

Coastal Shallow-Water Bathymetry Survey through a Drone and Optical Remote Sensors (드론과 광학원격탐사 기법을 이용한 천해 수심측량)

  • Oh, Chan Young;Ahn, Kyungmo;Park, Jaeseong;Park, Sung Woo
    • Journal of Korean Society of Coastal and Ocean Engineers / v.29 no.3 / pp.162-168 / 2017
  • A shallow-water bathymetry survey was conducted using high-definition color images obtained by a drone at an altitude of 100 m above sea level. Shallow-water bathymetry data are among the most important inputs for research on beach erosion. Accurate bathymetry data within the closure depth are especially important, because most of the phenomena of interest occur in the surf zone. However, it is extremely difficult to obtain accurate bathymetry data in this region because of wave-induced currents and breaking waves. Optical remote sensing using a small drone is therefore considered an attractive alternative. This paper presents the potential of image processing algorithms that apply multi-variable linear regression to red, green, blue, and grey band images from a drone with an HD camera to estimate shallow water depth. Optical remote sensing analysis conducted at Wolpo beach showed promising results. Estimated water depths within 5 m showed a correlation coefficient of 0.99 and a maximum error of 0.2 m compared with water depths surveyed manually and by ship-board echo sounder.

The Impact of Medical Utilization on Subjective Health and Happiness Index and Quality of Life according to the Economic Level of the Elderly (노인의 경제적 수준에 따른 의료이용이 주관적 건강수준과 행복감 지수 및 삶의 질에 미치는 영향)

  • So, Kwon-Seob;Hwang, Hye-Jeong;Kim, Eun-Mi
    • Journal of the Korea Academia-Industrial cooperation Society / v.20 no.3 / pp.544-552 / 2019
  • The purpose of this study was to find concrete measures to improve the subjective health level, happiness, and quality of life of the elderly according to economic level, and to propose social and policy alternatives accordingly. Data on 63,929 elderly people aged 65 or older from the Community Health Survey (Indicator Bank) _v09 were used, and medical use by economic level, subjective health level, euphoria, and quality of life were examined with frequency analysis, chi-square analysis, and independent t-tests. Multivariate logistic regression analysis was performed with subjective health level as the dependent variable, and multiple linear regression analysis was performed to determine the factors affecting euphoria and quality of life. The results of the study are as follows. Medical use was lower among recipients than among non-recipients, and lower among those with a lower education level, women, those aged 75 or older, and those with less stress. Among present or past recipients, non-receipt increased as subjective health level worsened, and non-recipients had higher euphoria and quality of life. There is therefore a need for alternatives that increase opportunities for medical use among recipients, with particular attention paid to women and elderly people over 75 years old. The findings are expected to serve as basic data for effectively improving health promotion, happiness, and quality of life among low-income elderly people.

A Study on Variation of Earth Pressure (토압의 변동에 관한 연구)

  • Bae, Sang Kun
    • KSCE Journal of Civil and Environmental Engineering Research / v.14 no.1 / pp.179-193 / 1994
  • In the development of engineering designs, decisions are required irrespective of the completeness and quality of information, and are formulated under conditions of uncertainty. Under such conditions the design involves risk. Thus, in the design of structures, the currently used deterministic design method does not provide a realistic assessment of the actual safety or reliability of the structures. It is desirable that decisions required in the design process be based on reliability analysis. Properties of soil are subject to more uncertainty than those of other structural materials, so reliability-based design methods need to be developed in the field of soil mechanics and foundation engineering. In order to simplify reliability analysis or the reliability-based design of structures subject to active earth pressure, it is necessary to find the variation and the distribution type of the active earth pressure calculated from the basic properties of soils. Monte Carlo simulation is performed to obtain the relationship between the variation of the active earth pressure for cohesionless soils, calculated using the Rankine formula, and the basic soil properties, as well as the distribution type of the earth pressure. A series of regression equations obtained by multi-linear regression analysis is suggested in this paper, and the sensitivity of the variation of the earth pressure to the basic soil properties is investigated. When the basic soil variables are normally distributed, the distribution of the active earth pressure was found to be the beta distribution, or very similar to it, in most cases.
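
The Monte Carlo step described above can be sketched in a few lines. The example assumes the standard Rankine case of a vertical wall with horizontal cohesionless backfill, where Ka = tan^2(45° - φ/2) and the active thrust per unit wall length is Pa = 0.5 · Ka · γ · H^2. The means, standard deviations, wall height, and sample count are hypothetical values chosen for illustration, and the sketch covers only the sampling step, not the paper's regression equations or distribution fitting.

```python
import math
import random
import statistics


def rankine_active_thrust(phi_deg, gamma, height):
    """Rankine active thrust per unit wall length for cohesionless soil
    with horizontal backfill: Pa = 0.5 * Ka * gamma * H^2,
    where Ka = tan^2(45 deg - phi / 2)."""
    ka = math.tan(math.radians(45.0 - phi_deg / 2.0)) ** 2
    return 0.5 * ka * gamma * height ** 2


def simulate(n, phi_mean, phi_sd, gamma_mean, gamma_sd, height, seed=0):
    """Draw normally distributed friction angle (deg) and unit weight
    (kN/m^3) and return the resulting active thrust samples (kN/m)."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        phi = rng.gauss(phi_mean, phi_sd)
        gamma = rng.gauss(gamma_mean, gamma_sd)
        samples.append(rankine_active_thrust(phi, gamma, height))
    return samples


# Hypothetical soil statistics and wall height (assumptions for illustration).
samples = simulate(10_000, phi_mean=30.0, phi_sd=3.0,
                   gamma_mean=18.0, gamma_sd=0.9, height=5.0)
mean_pa = statistics.fmean(samples)
cov_pa = statistics.stdev(samples) / mean_pa  # coefficient of variation
print(f"mean Pa = {mean_pa:.1f} kN/m, coefficient of variation = {cov_pa:.3f}")
```

Repeating the simulation over a grid of input variabilities would give the kind of data set to which regression equations for the variation of the earth pressure could then be fitted.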

The Impact of Human Resource Development on Job Satisfaction and Organizational Commitment : Mediating Effects of Learning Culture (인적자원개발제도, 조직몰입, 직무만족 간의 관계 : 조직수준의 학습문화의 매개효과 검증)

  • Kim, Sung Hwan
    • Asia-Pacific Journal of Business Venturing and Entrepreneurship / v.9 no.3 / pp.119-128 / 2014
  • One of the theoretically and empirically grounded black boxes in the link between HRD and firm performance is employees' attitudes, such as organizational commitment and job satisfaction. However, most studies have been conducted with regression analysis at the organizational level. This study used hierarchical linear modeling (HLM), which made it possible to estimate more accurately the relationships between variables measured at two different levels. In addition, this study attempted to open the black box (learning culture) in the relationship between HRD and employee attitudes. The results showed that HRD has a positive effect on organizational commitment and job satisfaction. Learning culture also showed a full mediation effect in the relationship between HRD and organizational commitment and job satisfaction. The results further showed that HRD in 2007 had a positive effect on employees' attitudes in 2009. These findings suggest that systematic HRD, such as employee education and training, must be established, and that a positive culture for employee learning, such as management support for a learning organization, must be fostered in order to improve organizational performance (organizational commitment, job satisfaction).

A Study of Anomaly Detection for ICT Infrastructure using Conditional Multimodal Autoencoder (ICT 인프라 이상탐지를 위한 조건부 멀티모달 오토인코더에 관한 연구)

  • Shin, Byungjin;Lee, Jonghoon;Han, Sangjin;Park, Choong-Shik
    • Journal of Intelligence and Information Systems / v.27 no.3 / pp.57-73 / 2021
  • Maintaining ICT infrastructure and preventing failures through anomaly detection is becoming increasingly important. System monitoring data is multidimensional time series data, and it is difficult to account for the characteristics of both multidimensional data and time series data at the same time. With multidimensional data, the correlation between variables must be considered. Existing probability-based, linear, and distance-based methods degrade because of the curse of dimensionality. In addition, time series data is usually preprocessed with a sliding window and time series decomposition for autocorrelation analysis. These techniques increase the dimension of the data, so they need to be supplemented. Anomaly detection is a long-established research field; statistical methods and regression analysis were used in the early days, and machine learning and artificial neural networks are now being actively studied. Statistical methods are difficult to apply when the data is non-homogeneous and do not detect local outliers well. Regression-based methods learn a regression formula based on parametric statistics and detect anomalies by comparing the predicted value with the actual value. Their performance drops when the model is not robust or when the data contains noise or outliers, so there are restrictions on using training data that contains noise or outliers. An autoencoder based on artificial neural networks is trained to produce output as similar as possible to its input. It has many advantages over existing probability-based and linear models, cluster analysis, and supervised learning. It can be applied to data that does not satisfy a probability distribution or linearity assumption. In addition, it can be trained in an unsupervised manner, without labeled data. However, it is limited in identifying local outliers in multidimensional data, and the dimension of the data increases greatly because of the characteristics of time series data. In this study, we propose a Conditional Multimodal Autoencoder (CMAE) that improves anomaly detection performance by considering local outliers and time series characteristics. First, we applied a Multimodal Autoencoder (MAE) to overcome the limitation in identifying local outliers in multidimensional data. Multimodal models are commonly used to learn from different types of input, such as voice and image. The different modalities share the autoencoder's bottleneck and thereby learn their correlation. In addition, a Conditional Autoencoder (CAE) was used to learn the characteristics of time series data effectively without increasing the dimension of the data. Conditional inputs are usually categorical variables, but in this study time was used as the condition in order to learn periodicity. The proposed CMAE model was verified by comparison with a Unimodal Autoencoder (UAE) and a Multimodal Autoencoder (MAE). Reconstruction performance for 41 variables was checked in the proposed model and the comparison models. Reconstruction performance differed by variable: the loss was small for the Memory, Disk, and Network modalities in all three autoencoder models, indicating that reconstruction worked well. The Process modality showed no significant difference among the three models, and the CPU modality showed excellent performance in CMAE. ROC curves were prepared to evaluate anomaly detection performance in the proposed and comparison models, and AUC, accuracy, precision, recall, and F1-score were compared. On all indicators, performance ranked in the order CMAE, MAE, AE. In particular, recall was 0.9828 for CMAE, confirming that it detects nearly all anomalies. The accuracy of the model also improved to 87.12%, and the F1-score was 0.8883, which is considered suitable for anomaly detection. In practical terms, the proposed model has an advantage beyond the performance improvement. Techniques such as time series decomposition and sliding windows add procedures that must be managed, and the dimensional increase they cause can slow down inference. The proposed model is easy to apply in practice in terms of inference speed and model management.
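
The evaluation metrics named above follow directly from confusion-matrix counts. The sketch below assumes the common autoencoder setup in which a sample is flagged as anomalous when its reconstruction error exceeds a threshold. The error values, labels, threshold, and function names are hypothetical and chosen only to illustrate the arithmetic; they are not the paper's data or implementation.

```python
def flag_anomalies(errors, threshold):
    """Label a sample 1 (anomalous) when its reconstruction error
    exceeds the threshold, else 0 (normal)."""
    return [1 if e > threshold else 0 for e in errors]


def classification_metrics(y_true, y_pred):
    """Accuracy, precision, recall, and F1 for binary labels (1 = anomaly)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical reconstruction errors and ground-truth labels.
errors = [0.9, 0.8, 0.2, 0.1, 0.7, 0.05, 0.1, 0.95]
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = flag_anomalies(errors, threshold=0.5)
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Recall is the share of true anomalies that are flagged, which is why a recall near 1 corresponds to detecting almost all abnormalities, while precision and F1 also account for false alarms.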

Diagnosis of Nitrogen Content in the Leaves of Apple Tree Using Spectral Imagery (분광 영상을 이용한 사과나무 잎의 질소 영양 상태 진단)

  • Jang, Si Hyeong;Cho, Jung Gun;Han, Jeom Hwa;Jeong, Jae Hoon;Lee, Seul Ki;Lee, Dong Yong;Lee, Kwang Sik
    • Journal of Bio-Environment Control / v.31 no.4 / pp.384-392 / 2022
  • The objective of this study was to estimate nitrogen content and chlorophyll using RGB and hyperspectral sensors in order to diagnose the nitrogen nutrition status of apple tree leaves. Spectral data were acquired through image processing after imaging two-year-old 'Hongro/M.9' apple trees with high-resolution RGB and hyperspectral sensors. As growth data, chlorophyll and leaf nitrogen content (LNC) were measured immediately after imaging. The growth model was developed by regression analysis (simple, multiple, and partial least squares) of the growth data (chlorophyll, LNC) against the spectral data (SPAD meter, color vegetation index, wavelength). Chlorophyll and LNC differed significantly according to nitrogen fertilizer level regardless of date. Leaf color became paler over time as the nutrients in the leaf were transferred to the fruit. The RGB sensor showed a statistically significant difference at the red wavelength regardless of date. The hyperspectral sensor showed a greater spectral difference by nitrogen fertilizer level in the non-visible wavelengths than in the visible wavelengths on June 10th and July 14th. For estimating chlorophyll and LNC, partial least squares regression using hyperspectral data performed better than simple and multiple linear regression using RGB data (chlorophyll R2: 81%, LNC: 81%). This is attributed to the hyperspectral sensor's narrow full width at half maximum (FWHM) and broad wavelength range (400-1,000 nm), which made it possible to analyze the spectral response of the crop to stress caused by nitrogen deficiency. In future studies, diagnosis models of physiology and pests for all growth stages of the tree using hyperspectral imagery are expected to contribute to the development of high-quality and stable fruit production technology.