Acknowledgement
Supported by: National Research Foundation of Korea
References
- Stephens, Zachary D., et al., "Big data: astronomical or genomical?," PLoS Biol, 13.7 (2015): e1002195. https://doi.org/10.1371/journal.pbio.1002195
- http://www.ncbi.nlm.nih.gov/refseq
- Altschul, Stephen F., et al., "Basic local alignment search tool," Journal of molecular biology, 215.3 (1990): 403-410. https://doi.org/10.1016/S0022-2836(05)80360-2
- Edgar, Robert C., "Search and clustering orders of magnitude faster than BLAST," Bioinformatics, 26.19 (2010): 2460-2461. https://doi.org/10.1093/bioinformatics/btq461
- Cole, James R., et al., "The Ribosomal Database Project: improved alignments and new tools for rRNA analysis," Nucleic acids research 37.suppl 1 (2009): D141-D145. https://doi.org/10.1093/nar/gkn879
- Sikic, Kresimir, and Oliviero Carugo, "Protein sequence redundancy reduction: comparison of various methods," Bioinformation 5.6 (2010): 234-239. https://doi.org/10.6026/97320630005234
- Loh, Po-Ru, Michael Baym, and Bonnie Berger, "Compressive genomics," Nature biotechnology 30.7 (2012): 627-630. https://doi.org/10.1038/nbt.2241
- Smith, Temple F., and Michael S. Waterman, "Identification of common molecular subsequences," Journal of molecular biology, 147.1 (1981): 195-197. https://doi.org/10.1016/0022-2836(81)90087-5
- Needleman, Saul B., and Christian D. Wunsch, "A general method applicable to the search for similarities in the amino acid sequence of two proteins," Journal of molecular biology 48.3 (1970): 443-453. https://doi.org/10.1016/0022-2836(70)90057-4
- Li, Weizhong, and Adam Godzik, "Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences," Bioinformatics 22.13 (2006): 1658-1659. https://doi.org/10.1093/bioinformatics/btl158
- DeSantis, Todd Z., et al., "Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB," Applied and environmental microbiology 72.7 (2006): 5069-5072. https://doi.org/10.1128/AEM.03006-05
- Maaten, Laurens van der, and Geoffrey Hinton, "Visualizing data using t-SNE," Journal of Machine Learning Research 9.Nov (2008): 2579-2605.