• Title/Summary/Keyword: R-package

Search Result 539, Processing Time 0.031 seconds

Genome re-sequencing to identify single nucleotide polymorphism markers for muscle color traits in broiler chickens

  • Kong, H.R.;Anthony, N.B.;Rowland, K.C.;Khatri, B.;Kong, B.C.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.31 no.1
    • /
    • pp.13-18
    • /
    • 2018
  • Objective: Meat quality including muscle color in chickens is an important trait and continuous selective pressures for fast growth and high yield have negatively impacted this trait. This study was conducted to investigate genetic variations responsible for regulating muscle color. Methods: Whole genome re-sequencing analysis using Illumina HiSeq paired end read method was performed with pooled DNA samples isolated from two broiler chicken lines divergently selected for muscle color (high muscle color [HMC] and low muscle color [LMC]) along with their random bred control line (RAN). Sequencing read data was aligned to the chicken reference genome sequence for Red Jungle Fowl (Galgal4) using reference based genome alignment with NGen program of the Lasergene software package. The potential causal single nucleotide polymorphisms (SNPs) showing non-synonymous changes in coding DNA sequence regions were chosen in each line. Bioinformatic analyses to interpret functions of genes retaining SNPs were performed using the ingenuity pathways analysis (IPA). Results: Millions of SNPs were identified and totally 2,884 SNPs (1,307 for HMC and 1,577 for LMC) showing >75% SNP rates could induce non-synonymous mutations in amino acid sequences. Of those, SNPs showing over 10 read depths yielded 15 more reliable SNPs including 1 for HMC and 14 for LMC. The IPA analyses suggested that meat color in chickens appeared to be associated with chromosomal DNA stability, the functions of ubiquitylation (UBC) and quality and quantity of various subtypes of collagens. Conclusion: In this study, various potential genetic markers showing amino acid changes were identified in differential meat color lines, that can be used for further animal selection strategy.

Identification and Functional Analysis of Differentially Expressed Genes Related to Metastatic Osteosarcoma

  • Niu, Feng;Zhao, Song;Xu, Chang-Yan;Chen, Lin;Ye, Long;Bi, Gui-Bin;Tian, Gang;Gong, Ping;Nie, Tian-Hong
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.15 no.24
    • /
    • pp.10797-10801
    • /
    • 2015
  • Background: To explore the molecular mechanisms of metastatic osteosarcoma (OS) by using the microarray expression profiles of metastatic and non-metastatic OS samples. Materials and Methods: The gene expression profile GSE37552 was downloaded from Gene Expression Omnibus database, including 2 human metastatic OS cell line models and 2 two non-metastatic OS cell line models. The differentially expressed genes (DEGs) were identified by Multtest package in R language. In addition, functional enrichment analysis of the DEGs was performed by WebGestalt, and the protein-protein interaction (PPI) networks were constructed by Hitpredict, then the signal pathways of the genes involved in the networks were performed by Kyoto Encyclopaedia of Genes and Genomes (KEGG) automatic annotation server (KAAS). Results: A total of 237 genes were classified as DEGs in metastatic OS. The most significant up- and down-regulated genes were A2M (alpha-2-macroglobulin) and BCAN (brevican). The DEGs were significantly related to the response to hormone stimulus, and the PPI network of A2M contained IL1B (interleukin), LRP1 (low-density lipoprotein receptor-related protein 1) and PDGF (platelet-derived growth factor). Furthermore, the MAPK signaling pathway and focal adhesion were significantly enriched. Conclusions: A2M and its interactive proteins, such as IL1B, LRP1 and PDGF may be candidate target molecules to monitor, diagnose and treat metastatic OS. The response to hormone stimulus, MAPK signaling pathway and focal adhesion may play important roles in metastatic OS.

Identifying Differentially Expressed Genes and Small Molecule Drugs for Prostate Cancer by a Bioinformatics Strategy

  • Li, Jian;Xu, Ya-Hong;Lu, Yi;Ma, Xiao-Ping;Chen, Ping;Luo, Shun-Wen;Jia, Zhi-Gang;Liu, Yang;Guo, Yu
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.14 no.9
    • /
    • pp.5281-5286
    • /
    • 2013
  • Purpose: Prostate cancer caused by the abnormal disorderly growth of prostatic acinar cells is the most prevalent cancer of men in western countries. We aimed to screen out differentially expressed genes (DEGs) and explore small molecule drugs for prostate cancer. Materials and Methods: The GSE3824 gene expression profile of prostate cancer was downloaded from Gene Expression Omnibus database which including 21 normal samples and 18 prostate cancer cells. The DEGs were identified by Limma package in R language and gene ontology and pathway enrichment analyses were performed. In addition, potential regulatory microRNAs and the target sites of the transcription factors were screened out based on the molecular signature database. In addition, the DEGs were mapped to the connectivity map database to identify potential small molecule drugs. Results: A total of 6,588 genes were filtered as DEGs between normal and prostate cancer samples. Examples such as ITGB6, ITGB3, ITGAV and ITGA2 may induce prostate cancer through actions on the focal adhesion pathway. Furthermore, the transcription factor, SP1, and its target genes ARHGAP26 and USF1 were identified. The most significant microRNA, MIR-506, was screened and found to regulate genes including ITGB1 and ITGB3. Additionally, small molecules MS-275, 8-azaguanine and pyrvinium were discovered to have the potential to repair the disordered metabolic pathways, abd furthermore to remedy prostate cancer. Conclusions: The results of our analysis bear on the mechanism of prostate cancer and allow screening for small molecular drugs for this cancer. The findings have the potential for future use in the clinic for treatment of prostate cancer.

Bayesian estimation for frequency using resampling methods (재표본 방법론을 활용한 베이지안 주파수 추정)

  • Pak, Ro Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.6
    • /
    • pp.877-888
    • /
    • 2017
  • Spectral analysis is used to determine the frequency of time series data. We first determine the frequency of the series through the power spectrum or the periodogram and then calculate the period of a cycle that may exist in a time series. Estimating the frequency using a Bayesian technique has been developed and proven to be useful; however, the Bayesian estimator for the frequency cannot be analytically solved through mathematical equations and may be handled numerically or computationally. In this paper, we make an inference on the Bayesian frequency through both resampling a parameter by Markov chain Monte Carlo (MCMC) methods and resampling data by bootstrap methods for a time series. We take the Korean real estate price index as an example for Bayesian frequency estimation. We have found a difference in the periods between the sale price index and the long term rental price index, but the difference is not statistically significant.

근적외 분광분석법을 이용한 한국산과 미국산 잎담배의 판별분석

  • 장기철;김용옥;이경구
    • Journal of the Korean Society of Tobacco Science
    • /
    • v.20 no.2
    • /
    • pp.191-197
    • /
    • 1998
  • Discriminant analysis using near infrared spectra derived from Korean Flue-cured(KF) and American Flue-cured(AF), and also Korean Burley(KB) and American Burley(AB) tobacco was done to classify flue-cured and burley tobacco as either grown in Korea or grown in the USA. Samples were scanned in the wavelength of 400 ~ 2500 nm by near infrared analyzer(NIRSystem Co., model 6500). The discrimination equations for flue-cured and burley tobacco were developed using partial least square 2 method in Infrasoft International NIRS 3 software package. KF samples used for the development of the discrimination equations were higher contents of total sugar, crude ash and chlorine, and higher value of leaf density and brightness, but lower contents of nicotine, total nitrogen and ether extracts, and higher value of redness than those of AF samples. KB samples were higher contents of nicotine, crude ash and chlorine, but lower contents of ether extracts and value of brightness than those of AB samples. On 3 dimensional graph drawn with 3 principal component scores calculated with 3 principal component from KF and KB sample spectra, KF sample spectra were significantly different from AF, and also KB sample spectra were significantly different from AB. The discrimination equations of flue-cured and burley were developed with 3 principal component, respectively. The discrimination equations for flue-cured and burley had a standard error of 0.03 and 0.04, and a R2 of 0.88 and 0.84, respectively. The tobacco samples used for the development of discrimination equation were perfectly classified as KF and AF by flue-cured discrimination equation, and also perfectly classified KB and AB by burley discrimination equation, respectively. The correct classification rates of KF and AF samples not used for the development of discrimination equations were 9S % (828 out of 869 samples) and 98 % (98 out of 100 samples) by flue-cured discrimination equations, and KB and AB samples were 94%(345 out of 368 samples) and 100%(42 out of 42 samples) by burley discrimination equations, respectively.

  • PDF

Study on the Characteristics of a Dash System Based on Test and Simulation for Vehicle Noise Reduction (승용차량의 소음저감을 위한 시험과 시뮬레이션을 이용한 대시 시스템의 특성 연구)

  • Yoo, Ji Woo;Chae, Ki-Sang;Cho, Jin Ho
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.22 no.11
    • /
    • pp.1071-1077
    • /
    • 2012
  • Low frequency noises(up to about 200 Hz) such as booming are mainly caused by particular modes, and in general the solutions may be found based on mode controls where conventional methods such as FEM can be used. However, at higher frequencies between 0.3~1 kHz, as the number of modes rapidly increases, radiation characteristics from structures, performances of damping sheets and sound packages may be more crucial rather than particular modes, and consequently the conventional FEM may be less practical in dealing with this kinds of structure-borne problems. In this context, so-called 'mid-frequency simulation model' based on FE-SEA hybrid method is studied and validated to reduce noise in this frequency region. Energy transmission loss(i.e. air borne noise) is also studied. A dash panel component is chosen for this study, which is an important path that transmits both structure-borne and air borne energies into the cavity. Design modifications including structural modifications, attachment of damping sheets and application of different sound packages are taken into account and the corresponding noise characteristics are experimentally identified. It is found that the dash member behaves as a noise path. The damping sheet and sound packages have similar influences on both sound radiation and transmission loss. The comparison between experiments and simulations shows that this model could be used to predict the tendency of noise improvement.

A Study of the Knowledge, Attitude and Needs of AIDS Education of Senior High School Students (고등학생들의 에이즈(후천성 면역 결핍증)에 대한 지식과 태도에 관한 연구 - K시 일부 고등학생 대상 -)

  • Park, In-Hyae;Yoon, Hyun-Sook;Han, You-Jeong
    • Journal of the Korean Society of School Health
    • /
    • v.9 no.2
    • /
    • pp.239-248
    • /
    • 1996
  • This descriptive study was done to investigate the degree of knowledge and attitudes, the need of health education, and the relationship between knowledge and attitudes concerning AIDS in senior high school students. The data was collected from 386 senior high school students(200 boys, 180 girls) in K city. The data was gathered by a self reporting questionnaire, from Nov. 22nd to the 30th, 1995. Data was analyzed using the descriptive statistics methods: t-test, F-test, Pearson-Correlation Coefficient, GLM with the statistical computer package, SAS. The Result of this study were as follows : 1. The mean score of knowledge about AIDS was 42.70 from a total score of 58, showing significant differences between boys and girls (p<0.0001). 2. The mean score of attitudes about AIDS was 29.95 from a total score of 40, showing no significant differences between boys and girls. 3. The selected contents that the students want to learn were prevention, etiology, transmission, symptoms, and treatment of AIDS in that order. 4. The relationship between knowledge and attitude about AIDS show a positive correlation and is statistically significant(r=0.27, p<0.0001). The information obtained from this study will provi a useful data to develop a practical health education program about AIDS for senior high school students.

  • PDF

Combining Support Vector Machine Recursive Feature Elimination and Intensity-dependent Normalization for Gene Selection in RNAseq (RNAseq 빅데이터에서 유전자 선택을 위한 밀집도-의존 정규화 기반의 서포트-벡터 머신 병합법)

  • Kim, Chayoung
    • Journal of Internet Computing and Services
    • /
    • v.18 no.5
    • /
    • pp.47-53
    • /
    • 2017
  • In past few years, high-throughput sequencing, big-data generation, cloud computing, and computational biology are revolutionary. RNA sequencing is emerging as an attractive alternative to DNA microarrays. And the methods for constructing Gene Regulatory Network (GRN) from RNA-Seq are extremely lacking and urgently required. Because GRN has obtained substantial observation from genomics and bioinformatics, an elementary requirement of the GRN has been to maximize distinguishable genes. Despite of RNA sequencing techniques to generate a big amount of data, there are few computational methods to exploit the huge amount of the big data. Therefore, we have suggested a novel gene selection algorithm combining Support Vector Machines and Intensity-dependent normalization, which uses log differential expression ratio in RNAseq. It is an extended variation of support vector machine recursive feature elimination (SVM-RFE) algorithm. This algorithm accomplishes minimum relevancy with subsets of Big-Data, such as NCBI-GEO. The proposed algorithm was compared to the existing one which uses gene expression profiling DNA microarrays. It finds that the proposed algorithm have provided as convenient and quick method than previous because it uses all functions in R package and have more improvement with regard to the classification accuracy based on gene ontology and time consuming in terms of Big-Data. The comparison was performed based on the number of genes selected in RNAseq Big-Data.

A Study on the Prediction Model Considering the Multicollinearity of Independent Variables in the Seawater Reverse Osmosis (역삼투압 해수담수화(SWRO) 플랜트에서 독립변수의 다중공선성을 고려한 예측모델에 관한 연구)

  • Han, In sup;Yoon, Yeon-Ah;Chang, Tai-Woo;Kim, Yong Soo
    • Journal of Korean Society for Quality Management
    • /
    • v.48 no.1
    • /
    • pp.171-186
    • /
    • 2020
  • Purpose: The purpose of this study is conducting of predictive models that considered multicollinearity of independent variables in order to carry out more efficient and reliable predictions about differential pressure in seawater reverse osmosis. Methods: The main variables of each RO system are extracted through factor analysis. Common variables are derived through comparison of RO system # 1 and RO system # 2. In order to carry out the prediction modeling about the differential pressure, which is the target variable, we constructed the prediction model reflecting the regression analysis, the artificial neural network, and the support vector machine in R package, and figured out the superiority of the model by comparing RMSE. Results: The number of factors extracted from factor analysis of RO system #1 and RO system #2 is same. And the value of variability(% Var) increased as step proceeds according to the analysis procedure. As a result of deriving the average RMSE of the models, the overall prediction of the SVM was superior to the other models. Conclusion: This study is meaningful in that it has been conducting a demonstration study of considering the multicollinearity of independent variables. Before establishing a predictive model for a target variable, it would be more accurate predictive model if the relevant variables are derived and reflected.

Changes in Food Companies' Product and Promotion Activities after Restriction of TV Advertising of Energy-Dense and Nutrient-Poor Foods in Korea (고열량.저영양 식품의 TV 광고 제한에 따른 식품회사의 제품 및 촉진 활동 변화)

  • Kim, Hyungjun;Yoon, Jihyun;Lee, Youngmi
    • Journal of the East Asian Society of Dietary Life
    • /
    • v.24 no.3
    • /
    • pp.291-300
    • /
    • 2014
  • This study determined the effects of television (TV) advertising restriction of Energy-Dense and Nutrient-Poor Foods on product and promotion activities by food companies producing or selling children's favorite foods. A survey using a selfadministered questionnaire was conducted via on-line or fax by marketers or R&D managers from 108 food companies. The data from 55 respondents (50.9%) were analyzed. Restriction exerted positive effects on food products with respect to compliance with labeling requirements (4.0 out of 5 points) and reinforcement of nutritional contents examination of new products (3.7 out of 5 points). Reformulations of products such as reduction of nutrients like sodium, sugar and fat were also reported. In addition, food companies underwent diverse changes in promotional activities, including modification of package designs and displays in stores, offering free gifts, discounts, etc. In conclusion, restriction of TV food advertising may contribute to improvement of children's food environment by encouraging food companies to make favorable product changes. On the other hand, the results also revealed that food companies adopt diverse marketing channels that are not yet under regulation. Hence, to make policies more effective, regulation needs to be extended from TV to other marketing channels to which children are easily exposed.