References
- Y. Yang et al., "Query by Document," ACM Int. Conf. Web Search Data Mining, Barcelona, Spain, Feb. 9-12, 2009, pp. 34-43.
- M.A. Sanchez-Perez, G. Sidorov, and A. Gelbukh, "A Winning Approach to Text Alignment for Text Reuse Detection at PAN 2014," Notebook PAN CLEF, Sheffield, UK, Sept. 15-18, 2014.
- C. Trapnell and S.L. Salzberg, "How to Map Billions of Short Reads onto Genomes," Nature Biotechnology, vol. 27, 2009, pp. 455-457. https://doi.org/10.1038/nbt0509-455
- P. Ferragina and G. Manzini, "Opportunistic Data Structures with Applications," Ann. Symp. Foundations Computer Sci., Redondo Beach, CA, USA, Nov. 12-14, 2000, pp. 390-398.
- M. Burrows and D.J. Wheeler, "A Block-Sorting Lossless Data Compression Algorithm," Technical Report 124, Digital Equipment Corporation, 1994.
- U. Manber and G. Myers, "Suffix Arrays: A New Method for Online String Searches," SIAM J. Comput., vol. 22, no. 5, Oct. 1993, pp. 935-948. https://doi.org/10.1137/0222058
- H. Li and R. Durbin, "Fast and Accurate Short Read Alignment with Burrows-Wheeler Transform," Bioinformatics, vol. 25, no. 14, 2009, pp. 1754-1760. https://doi.org/10.1093/bioinformatics/btp324
- M. Potthast et al., "Overview of the 6th International Competition on Plagiarism Detection," Notebook PAN CLEF, Sheffield, UK, Sept. 15-18, 2014.
- K. Williams, H. Chen, and C. Giles, "Supervised Ranking for Plagiarism Source Retrieval," Notebook PAN CLEF, Sheffield, UK, Sept. 15-18, 2014.
- M.A. Sanchez-Perez, G. Sidorov, and A. Gelbukh, "A Winning Approach to Text Alignment for Text Reuse Detection at PAN 2014," Notebook PAN CLEF, Sheffield, UK, Sept. 15-18, 2014.
- S.F. Altschul et al., "Basic Local Alignment Search Tool," J. Molecular Biology, vol. 215, no. 3, Oct. 1990, pp. 403-410. https://doi.org/10.1016/S0022-2836(05)80360-2
- R. Li et al., "SOAP2: An Improved Ultrafast Tool for Short Read Alignment," Bioinformatics, vol. 25, no. 15, Aug. 2009, pp. 1966-1967. https://doi.org/10.1093/bioinformatics/btp336
- PAN 2013, Accessed June 19, 2015. http://pan.webis.de
- P. Ferragina and G. Navarro, Pizza & Chili Corpus, Accessed June 29, 2015. http://pizzachili.dcc.uchile.cl
- Y. Sun, J. Qin, and W. Wang, "Near Duplicate Text Detection Using Frequency-Biased Signatures," Web Inf. Syst. Eng., Int. Conf., Nanjing, China, Oct. 13-15, 2013, pp. 277-291.
- C.S. Ock et al., "A Fast Searchong for Similar Text Using Genomc Read Mapping Method," IEEE Int. Conf. Comput. Sci. Eng., Sydney, Australia, Dec. 3-5, 2013, pp. 219-226.
- S.-H. Kim and H.-G. Cho, "A New Approach for Approximate Text Search Using Genomic Short-Read Mapping Model," ACM Int. Conf. Ubiquitous Inf. Manag. Commun., Bali, Indonesia, Jan. 8-10, 2015.
- R. Raman, V. Raman, and S.S. Rao, "Succinct Indexable Dictionaries with Applications to Encoding k-ary Trees and Multisets," ACM-SIAM Symp. Discrete Algorithms, San Francisco, CA, USA, Jan. 6-8, 2002, pp.233-242.
- S. Gog, Succinct Data Structure Library 2.0, Accessed Dec. 1, 2015. https://github.com/simongog/sdsl-lite