A Review of Three Different Studies on Hidden Markov Models for Epigenetic Problems: A Computational Perspective

Lee, Kyung-Eun;Park, Hyun-Seok;

doi:10.5808/GI.2014.12.4.145

Genomics & Informatics

제12권4호
/
Pages.145-150
/
2014
/
1598-866X(pISSN)
/
2234-0742(eISSN)

한국유전체학회 (Korea Genome Organization)

DOI QR Code

A Review of Three Different Studies on Hidden Markov Models for Epigenetic Problems: A Computational Perspective

Lee, Kyung-Eun (Ewha Information and Telecommunication Institute, Ewha Womans University) ;
Park, Hyun-Seok (Ewha Information and Telecommunication Institute, Ewha Womans University)

투고 : 2014.10.17
심사 : 2014.11.23
발행 : 2014.12.31

https://doi.org/10.5808/GI.2014.12.4.145 인용 PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

Recent technical advances, such as chromatin immunoprecipitation combined with DNA microarrays (ChIp-chip) and chromatin immunoprecipitation-sequencing (ChIP-seq), have generated large quantities of high-throughput data. Considering that epigenomic datasets are arranged over chromosomes, their analysis must account for spatial or temporal characteristics. In that sense, simple clustering or classification methodologies are inadequate for the analysis of multi-track ChIP-chip or ChIP-seq data. Approaches that are based on hidden Markov models (HMMs) can integrate dependencies between directly adjacent measurements in the genome. Here, we review three HMM-based studies that have contributed to epigenetic research, from a computational perspective. We also give a brief tutorial on HMM modelling-targeted at bioinformaticians who are new to the field.

키워드

참고문헌

Park HS, Galbadrakh B, Kim YM. Recent progresses in the linguistic modeling of biological sequences based on formal language theory. Genomics Inform 2011;9:5-11. https://doi.org/10.5808/GI.2011.9.1.005
Searls DB. The language of genes. Nature 2002;420:211-217. https://doi.org/10.1038/nature01255
Munch K, Krogh A. Automatic generation of gene finders for eukaryotic species. BMC Bioinformatics 2006;7:263. https://doi.org/10.1186/1471-2105-7-263
Durbin R, Eddy SR, Krogh A, Mitchison G. Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge: Cambridge University Press, 1998.
Pachter L, Alexandersson M, Cawley S. Applications of generalized pair hidden Markov models to alignment and gene finding problems. J Comput Biol 2002;9:389-399. https://doi.org/10.1089/10665270252935520
Liang KC, Wang X, Anastassiou D. Bayesian basecalling for DNA sequence analysis using hidden Markov models. IEEE/ACM Trans Comput Biol Bioinform 2007;4:430-440.
Lottaz C, Iseli C, Jongeneel CV, Bucher P. Modeling sequencing errors by combining Hidden Markov models. Bioinformatics 2003;19 Suppl 2:ii103-ii112.
Won KJ, Hamelryck T, Prugel-Bennett A, Krogh A. An evolutionary method for learning HMM structure: prediction of protein secondary structure. BMC Bioinformatics 2007;8:357. https://doi.org/10.1186/1471-2105-8-357
Zhang S, Borovok I, Aharonowitz Y, Sharan R, Bafna V. A sequence- based filtering method for ncRNA identification and its application to searching for riboswitch elements. Bioinformatics 2006;22:e557-e565. https://doi.org/10.1093/bioinformatics/btl232
Yoon BJ, Vaidyanathan PP. Structural alignment of RNAs using profile-csHMMs and its application to RNA homology search: overview and new results. IEEE Trans Automat Contr 2008;53:10-25. https://doi.org/10.1109/TAC.2007.911322
Harmanci AO, Sharma G, Mathews DH. Efficient pairwise RNA structure prediction using probabilistic alignment constraints in Dynalign. BMC Bioinformatics 2007;8:130. https://doi.org/10.1186/1471-2105-8-130
Weinberg Z, Ruzzo WL. Sequence-based heuristics for faster annotation of non-coding RNA families. Bioinformatics 2006; 22:35-39. https://doi.org/10.1093/bioinformatics/bti743
Shen L, Waterland RA. Methods of DNA methylation analysis. Curr Opin Clin Nutr Metab Care 2007;10:576-581. https://doi.org/10.1097/MCO.0b013e3282bf6f43
Bailey T, Krajewski P, Ladunga I, Lefebvre C, Li Q, Liu T, et al. Practical guidelines for the comprehensive analysis of ChIP-seq data. PLoS Comput Biol 2013;9:e1003326. https://doi.org/10.1371/journal.pcbi.1003326
ENCODE Project Consortium. The ENCODE (ENCyclopedia Of DNA Elements) Project. Science 2004;306:636-640. https://doi.org/10.1126/science.1105136
ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature 2012;489:57-74. https://doi.org/10.1038/nature11247
Li W, Meyer CA, Liu XS. A hidden Markov model for analyzing ChIP-chip experiments on genome tiling arrays and its application to p53 binding sequences. Bioinformatics 2005;21 Suppl 1:i274-i282. https://doi.org/10.1093/bioinformatics/bti1046
Xu H, Wei CL, Lin F, Sung WK. An HMM approach to genome- wide identification of differential histone modification sites from ChIP-seq data. Bioinformatics 2008;24:2344-2349. https://doi.org/10.1093/bioinformatics/btn402
Ernst J, Kellis M. Discovery and characterization of chromatin states for systematic annotation of the human genome. Nat Biotechnol 2010;28:817-825. https://doi.org/10.1038/nbt.1662
Lieberfarb ME, Lin M, Lechpammer M, Li C, Tanenbaum DM, Febbo PG, et al. Genome-wide loss of heterozygosity analysis from laser capture microdissected prostate cancer using single nucleotide polymorphic allele (SNP) arrays and a novel bioinformatics platform dChipSNP. Cancer Res 2003;63:4781-4785.
Baum LE, Petrie T, Soules G, Weiss N. A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann Math Stat 1970;41:164-171. https://doi.org/10.1214/aoms/1177697196
Ji H, Wong WH. TileMap: create chromosomal map of tiling array hybridizations. Bioinformatics 2005;21:3629-3636. https://doi.org/10.1093/bioinformatics/bti593
Martin-Magniette ML, Mary-Huard T, Berard C, Robin S. ChIPmix: mixture model of regressions for two-color ChIPchip analysis. Bioinformatics 2008;24:i181-i186. https://doi.org/10.1093/bioinformatics/btn280
Johannes F, Wardenaar R, Colome-Tatche M, Mousson F, de Graaf P, Mokry M, et al. Comparing genome-wide chromatin profiles using ChIP-chip or ChIP-seq. Bioinformatics 2010;26: 1000-1006. https://doi.org/10.1093/bioinformatics/btq087
Moghaddam AM, Roudier F, Seifert M, Bérard C, Magniette ML, Ashtiyani RK, et al. Additive inheritance of histone modifications in Arabidopsis thaliana intra-specific hybrids. Plant J 2011;67:691-700. https://doi.org/10.1111/j.1365-313X.2011.04628.x
Seifert M, Cortijo S, Colome-Tatche M, Johannes F, Roudier F, Colot V. MeDIP-HMM: genome-wide identification of distinct DNA methylation states from high-density tiling arrays. Bioinformatics 2012;28:2930-2939. https://doi.org/10.1093/bioinformatics/bts562
Arand J, Spieler D, Karius T, Branco MR, Meilinger D, Meissner A, et al. In vivo control of CpG and non-CpG DNA methylation by DNA methyltransferases. PLoS Genet 2012;8:e1002750. https://doi.org/10.1371/journal.pgen.1002750
Jaschek R, Tanay A. Spatial clustering of multivariate genomic and epigenomic information. Res Comput Mol Biol 2009;5541:170-183. https://doi.org/10.1007/978-3-642-02008-7_12
Ernst J, Kheradpour P, Mikkelsen TS, Shoresh N, Ward LD, Epstein CB, et al. Mapping and analysis of chromatin state dynamics in nine human cell types. Nature 2011;473:43-49. https://doi.org/10.1038/nature09906

피인용 문헌

A critical assessment of hidden markov model sub-optimal sampling strategies applied to the generation of peptide 3D models vol.37, pp.21, 2016, https://doi.org/10.1002/jcc.24422

Genomics & Informatics

A Review of Three Different Studies on Hidden Markov Models for Epigenetic Problems: A Computational Perspective

초록

키워드

참고문헌

피인용 문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)