• Title/Summary/Keyword: multivariate data analysis

Search Result 1,405, Processing Time 0.032 seconds

Analyzing Market Integration of Wild Caught Fish Species (자연산 어류의 시장 통합성 분석)

  • Kim, Do-Hoon
    • The Journal of Fisheries Business Administration
    • /
    • v.44 no.1
    • /
    • pp.71-79
    • /
    • 2013
  • This study is aimed to estimate market integration of wild caught fish species on the Korean market, using both multivariate and bivariate cointegration analysis. For the analysis of market integration between wild caught fish species, major four fish species those are most popular fish in the market and caught by the large purse seine fishery-chub mackerel, jack mackerel, hairtail and spanish mackerel-were selected as analytical target fish species. And their real monthly price data from January 2000 to December 2011 were used in the analysis. The results of the multivariate cointegration test for four wild caught fish species showed that there would be long-term equilibrium relationships among prices of four wild caught fish species, and consequently, the markets for wild caught fish species were estimated to be integrated. The results of exclusion test and bivariate cointegration test also supported that there would be a clear evidence to suggest that all target wild caught fish species were cointegrated each other.

High-resolution 1H NMR Spectroscopy of Green and Black Teas

  • Jeong, Ji-Ho;Jang, Hyun-Jun;Kim, Yongae
    • Journal of the Korean Chemical Society
    • /
    • v.63 no.2
    • /
    • pp.78-84
    • /
    • 2019
  • High-resolution $^1H$ NMR spectroscopic technique has been widely used as one of the most powerful analytical tools in food chemistry as well as to define molecular structure. The $^1H$ NMR spectra-based metabolomics has focused on classification and chemometric analysis of complex mixtures. The principal component analysis (PCA), an unsupervised clustering method and used to reduce the dimensionality of multivariate data, facilitates direct peak quantitation and pattern recognition. Using a combination of these techniques, the various green teas and black teas brewed were investigated via metabolite profiling. These teas were characterized based on the leaf size and country of cultivation, respectively.

Effects of Two Chemotherapy Regimens, Anthracycline-based and CMF, on Breast Cancer Disease Free Survival in the Eastern Mediterranean Region and Asia: A Meta-Analysis Approach for Survival Curves

  • Zare, Najaf;Ghanbari, Saeed;Salehi, Alireza
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.14 no.3
    • /
    • pp.2013-2017
    • /
    • 2013
  • Background: To compare the effects of two adjuvant chemotherapy regimens, anthracycline-based and cyclophosphamide, methotrexate, fluorourical (CMF) on disease free survival for breast cancer patients in the Eastern Mediterranean region and Asia. Methods: In a systematic review with a multivariate mixed model meta-analysis, the reported survival proportion at multiple time points in different studies were combined. Our data sources were studies linking the two chemotherapy regimens on an adjuvant basis with disease free survival published in English and Persian in the Eastern Mediterranean region and Asia. All survival curves were generated with Graphdigitizer software. Results: 14 retrospective cohort studies were located from electronic databases. We analyzed data for 1,086 patients who received anthracycline-based treatment and 1,109 given CMF treatment. For determination of survival proportions and time we usesb the transformation Ln (-Ln(S)) and Ln (time) to make precise estimations and then fit the model. All analyses were carried out with STATA software. Conclusions: Our findings showed a significant efficacy of anthracycline-based adjuvant therapy regarding disease free survival of breast cancer. As a limitation in this meta-analysis we used studies with different types of anthracycline-based regimens.

Analysis of Hyperspectral Dentin Data Using Independent Component Analysis

  • Jung, Sung-Hwan
    • Journal of Korea Multimedia Society
    • /
    • v.12 no.12
    • /
    • pp.1755-1760
    • /
    • 2009
  • In this research, for the first time, we tried to analyse Raman hyperspectral dentin data using Independent Component Analysis (ICA) to see its possibility of adoption for the dental analysis software. We captured hyperspectral dentin data on 569 spots on a molar with dental lesion by HR800 Micro Raman Spectrometer at UMKC-CRISP (University of Missouri at Kansas City-Center for Research on Interfacial Structure and Properties). Each spot has 1,005 hyperspectral data. We applied ICA to the captured hyperspectral data of dentin for evaluating ICA approach, and compared it with the well known multivariate analysis method, PCA. As a result of the experiment, ICA approach shows better local characteristic of dentin than the result of PCA. We confirmed that ICA also could be a good method along with PCA in the dental analysis software.

  • PDF

Functional ARCH analysis for a choice of time interval in intraday return via multivariate volatility (함수형 ARCH 분석 및 다변량 변동성을 통한 일중 로그 수익률 시간 간격 선택)

  • Kim, D.H.;Yoon, J.E.;Hwang, S.Y.
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.3
    • /
    • pp.297-308
    • /
    • 2020
  • We focus on the functional autoregressive conditional heteroscedasticity (fARCH) modelling to analyze intraday volatilities based on high frequency financial time series. Multivariate volatility models are investigated to approximate fARCH(1). A formula of multi-step ahead volatilities for fARCH(1) model is derived. As an application, in implementing fARCH(1), a choice of appropriate time interval for the intraday return is discussed. High frequency KOSPI data analysis is conducted to illustrate the main contributions of the article.

Statistical analysis of metagenomics data

  • Calle, M. Luz
    • Genomics & Informatics
    • /
    • v.17 no.1
    • /
    • pp.6.1-6.9
    • /
    • 2019
  • Understanding the role of the microbiome in human health and how it can be modulated is becoming increasingly relevant for preventive medicine and for the medical management of chronic diseases. The development of high-throughput sequencing technologies has boosted microbiome research through the study of microbial genomes and allowing a more precise quantification of microbiome abundances and function. Microbiome data analysis is challenging because it involves high-dimensional structured multivariate sparse data and because of its compositional nature. In this review we outline some of the procedures that are most commonly used for microbiome analysis and that are implemented in R packages. We place particular emphasis on the compositional structure of microbiome data. We describe the principles of compositional data analysis and distinguish between standard methods and those that fit into compositional data analysis.

Estimation of Genetic Variance Components of Body Size Measurements in Hanwoo (Korean Cattle) Using a Multivariate Linear Model

  • Lee, Jung-Jae;Kim, Nae-Soo
    • Journal of Animal Science and Technology
    • /
    • v.52 no.3
    • /
    • pp.167-174
    • /
    • 2010
  • The objectives of this study were to quantify the combination values of the principal components and factors calculated using body measurements of Hanwoo (Korean Cattle) and estimate their heritabilities. The technique of multivariate analysis was used to reduce a large number of variables to a smaller number of new variables and characterize cattle according to body shape. The analyses were performed using 1,979 cattle at 12 months of age and 936 cattle at 24 months of age. The data for the analyses was obtained from progeny tests performed on Korean Cattle for 6 years from 2003 to 2008. The phenotypic correlations among these traits were estimated to range from 0.32 to 0.90 at 12 months of age and from 0.21 to 0.82 at 24 months of age. The first principal components (PC1s) indicated a weighed average of overall body measurements, accounting for 99.91% of the total variation for both periods of test. The two first PCs had positive coefficients for all body measurements. The major sources of PC, such as chest girth (CG), body length (BL), rump height (RH), and wither height (WH) were similar for both test periods. The heritabilities for PC1, the first factor score (FS1), and the second factor score (FS2) were estimated by multivariate REML method. The estimated heritabilities for PC1, FS1, and FS2 were 0.33, 0.38, and 0.40, respectively, at 12 months of age and 0.26, 0.76, and 0.58 at 24 months of age. Further studies are needed to determine whether the heritabilities of FS1 and FS2 at 24 months of age were overestimated.

Estimation of Brain Connectivity during Motor Imagery Tasks using Noise-Assisted Multivariate Empirical Mode Decomposition

  • Lee, Ki-Baek;Kim, Ko Keun;Song, Jaeseung;Ryu, Jiwoo;Kim, Youngjoo;Park, Cheolsoo
    • Journal of Electrical Engineering and Technology
    • /
    • v.11 no.6
    • /
    • pp.1812-1824
    • /
    • 2016
  • The neural dynamics underlying the causal network during motor planning or imagery in the human brain are not well understood. The lack of signal processing tools suitable for the analysis of nonlinear and nonstationary electroencephalographic (EEG) hinders such analyses. In this study, noise-assisted multivariate empirical mode decomposition (NA-MEMD) is used to estimate the causal inference in the frequency domain, i.e., partial directed coherence (PDC). Natural and intrinsic oscillations corresponding to the motor imagery tasks can be extracted due to the data-driven approach of NA-MEMD, which does not employ predefined basis functions. Simulations based on synthetic data with a time delay between two signals demonstrated that NA-MEMD was the optimal method for estimating the delay between two signals. Furthermore, classification analysis of the motor imagery responses of 29 subjects revealed that NA-MEMD is a prerequisite process for estimating the causal network across multichannel EEG data during mental tasks.

Short-term Construction Investment Forecasting Model in Korea (건설투자(建設投資)의 단기예측모형(短期豫測模型) 비교(比較))

  • Kim, Kwan-young;Lee, Chang-soo
    • KDI Journal of Economic Policy
    • /
    • v.14 no.1
    • /
    • pp.121-145
    • /
    • 1992
  • This paper examines characteristics of time series data related to the construction investment(stationarity and time series components such as secular trend, cyclical fluctuation, seasonal variation, and random change) and surveys predictibility, fitness, and explicability of independent variables of various models to build a short-term construction investment forecasting model suitable for current economic circumstances. Unit root test, autocorrelation coefficient and spectral density function analysis show that related time series data do not have unit roots, fluctuate cyclically, and are largely explicated by lagged variables. Moreover it is very important for the short-term construction investment forecasting to grasp time lag relation between construction investment series and leading indicators such as building construction permits and value of construction orders received. In chapter 3, we explicate 7 forecasting models; Univariate time series model (ARIMA and multiplicative linear trend model), multivariate time series model using leading indicators (1st order autoregressive model, vector autoregressive model and error correction model) and multivariate time series model using National Accounts data (simple reduced form model disconnected from simultaneous macroeconomic model and VAR model). These models are examined by 4 statistical tools that are average absolute error, root mean square error, adjusted coefficient of determination, and Durbin-Watson statistic. This analysis proves two facts. First, multivariate models are more suitable than univariate models in the point that forecasting error of multivariate models tend to decrease in contrast to the case of latter. Second, VAR model is superior than any other multivariate models; average absolute prediction error and root mean square error of VAR model are quitely low and adjusted coefficient of determination is higher. This conclusion is reasonable when we consider current construction investment has sustained overheating growth more than secular trend.

  • PDF

Multivariate Classification of Choson Coins (다변수 분석법에 의한 조선시대 동전의 분류연구)

  • Lee, Chang-Keun;Kang, Hyung-Tai;Goh, Sung-Hee
    • 보존과학연구
    • /
    • s.8
    • /
    • pp.1-12
    • /
    • 1987
  • Fifty ancient Korean coins originated in Choson dynasty have been determined for 9 elements such as Sn, Fe, As, Ag, Co, Sb, Ir, Ru and Ni by instrumental neutron activation analysis and for 3 elements such as Cu, Pb, and Zn by atomicalsorption spectrometry. Bronze coins originated in early days of the dynasty contain as major constituents Cu, Pb and Sn approximately in the ratio 90 : 4 : 3, where as, those in latter days contain in the ratio 7 : 2 : 0. Brass coins which had begun in 17century contain as major constituents Cu, Zn and Pb approximately in the ratio 7 : 1: 1. The multivariate date have been analyzed for the relation among elemental contents through the variance-covariance matrix. The data have been fur theranalyzed by a principal component mapping method. As the results training set of 8class have been chosen, based on the spread of sample points in an eigenvector plotand archaeolgical data such as age and the office of minting.

  • PDF