Converting Panax ginseng DNA and chemical fingerprints into two-dimensional barcode

  • Cai, Yong (State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau) ;
  • Li, Peng (State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau) ;
  • Li, Xi-Wen (State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau) ;
  • Zhao, Jing (State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau) ;
  • Chen, Hai (Information Technology College of Beijing Normal University Zhuhai Campus) ;
  • Yang, Qing (State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University) ;
  • Hu, Hao (State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau)
  • Received: 2016.04.07
  • Accepted: 2016.06.29
  • Published: 2017.07.15

Abstract

Background: In this study, we investigated how to convert the Panax ginseng DNA sequence code and chemical fingerprints into a two-dimensional code. To improve compression efficiency, the GATC2Bytes and digital merger compression algorithms are proposed.

Methods: HPLC chemical fingerprint data from 10 groups of P. ginseng from Northeast China were prepared for conversion, with the internal transcribed spacer 2 (ITS2) sequence code serving as the DNA sequence code. The data were converted into a two-dimensional code in six steps. First, the chemical fingerprint characteristic data sets were obtained through the inflection filtering algorithm. Second, these data sets were precompressed. Third, the P. ginseng DNA (ITS2) sequence codes were precompressed. Fourth, the precompressed chemical fingerprint data and the DNA (ITS2) sequence code were combined in accordance with the set data format. Fifth, the combined data were compressed with Zlib, an open source data compression library. Finally, the compressed data were encoded as a two-dimensional code, specifically a quick response code (QR code).

Results: The conversion process greatly reduced the number of bytes needed to store the P. ginseng chemical fingerprints and DNA (ITS2) sequence code. After GATC2Bytes processing, the ITS2 compression rate reached 75%, and the chemical fingerprint compression rate exceeded 99.65% after filtration and digital merger compression. The overall compression ratio therefore exceeded 99.36%. The resulting QR code has a capacity of around 0.5k and can be read and identified easily by any smartphone.

Conclusion: After data processing, the P. ginseng chemical fingerprints and DNA (ITS2) sequence code can be encoded in a QR code, which can therefore serve as an effective carrier of P. ginseng authenticity and quality information. This study provides a theoretical basis for developing a quality traceability system for traditional Chinese medicine based on two-dimensional codes.
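
The abstract does not describe the internals of GATC2Bytes, so the following is only a minimal Python sketch of one plausible reading: each base is stored in 2 bits, so four bases fit in one byte. Assuming one byte per base in the text form, that gives a reduction of 1 - 1/4 = 75%, which matches the reported ITS2 compression rate. The bit assignment in `BASE_TO_BITS` and the function names `pack_bases`, `unpack_bases`, and `compress_payload` are hypothetical and are not taken from the paper. The QR encoding step is omitted because it needs a third-party library.

```python
import zlib

# Hypothetical 2-bit code table; the source does not specify the actual mapping.
BASE_TO_BITS = {"G": 0b00, "A": 0b01, "T": 0b10, "C": 0b11}
BITS_TO_BASE = {bits: base for base, bits in BASE_TO_BITS.items()}


def pack_bases(seq: str) -> bytes:
    """Pack a DNA string into bytes, four bases per byte (2 bits each)."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | BASE_TO_BITS[base]
        # Left-align a short final chunk so unpacking reads from the high bits.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return bytes(out)


def unpack_bases(data: bytes, length: int) -> str:
    """Reverse pack_bases; `length` trims the padding in the last byte."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BITS_TO_BASE[(byte >> shift) & 0b11])
    return "".join(bases[:length])


def compress_payload(payload: bytes) -> bytes:
    """Final general-purpose compression pass using zlib (level 9)."""
    return zlib.compress(payload, 9)


if __name__ == "__main__":
    seq = "GATC" * 25  # 100 bases of made-up example data, not a real ITS2 sequence
    packed = pack_bases(seq)
    print(len(seq), "bases ->", len(packed), "bytes")
    restored = unpack_bases(zlib.decompress(compress_payload(packed)), len(seq))
    print("round trip ok:", restored == seq)
```

The sequence length has to be stored alongside the packed bytes, because the last byte may contain padding. This sketch also handles only the four unambiguous bases; a real sequence containing symbols such as N would need an extended scheme.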
