A note on the distance distribution paradigm for Mosaab-metric to process segmented genomes of influenza virus

  • Received : 2020.01.09
  • Accepted : 2020.02.27
  • Published : 2020.03.31

Abstract

In this paper, we present a few technical notes on the distance distribution paradigm for the Mosaab-metric, using 1-, 2-, and 3-gram feature extraction techniques to analyze composite data points in high-dimensional feature spaces. This technical analysis will help specialists in bioinformatics and biotechnology explore in depth the biodiversity of the influenza virus genome as a composite data point. Various technical examples are presented; in addition, the integrated statistical learning pipeline for processing segmented genomes of influenza virus is illustrated as a sequential-parallel computational pipeline.
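As a rough illustration of the n-gram feature extraction step mentioned above, the following minimal sketch maps each segment of a segmented genome to a vector of n-gram relative frequencies, so that the whole genome becomes a composite data point (one vector per segment). The function names, the toy sequences, and the choice of relative frequencies are assumptions made for illustration only; the sketch does not implement the Mosaab-metric itself, which is not defined in this abstract.

```python
from itertools import product

ALPHABET = "ACGT"  # nucleotide alphabet assumed for this sketch


def ngram_features(sequence, n):
    """Return relative frequencies of all 4**n n-grams, in lexicographic order."""
    grams = ["".join(p) for p in product(ALPHABET, repeat=n)]
    counts = dict.fromkeys(grams, 0)
    for i in range(len(sequence) - n + 1):
        gram = sequence[i:i + n]
        if gram in counts:  # skip windows with ambiguous symbols such as N
            counts[gram] += 1
    total = sum(counts.values())
    return [counts[g] / total if total else 0.0 for g in grams]


def composite_point(segments, n):
    """Map each genome segment to one feature vector (hypothetical helper)."""
    return [ngram_features(seg, n) for seg in segments]


# Hypothetical toy segments; real influenza A genomes have eight RNA segments.
toy_genome = ["ATGGCA", "ATGAAT", "GGCCTA"]
point = composite_point(toy_genome, 2)
```

With n set to 1, 2, or 3, each segment is embedded in a 4-, 16-, or 64-dimensional feature space, respectively; a distance between two composite data points would then be computed over these per-segment vectors.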
