DOI QR코드

DOI QR Code

A Dataset of Online Handwritten Assamese Characters

  • Baruah, Udayan (Dept. of Information Technology, Sikkim Manipal Institute of Technology) ;
  • Hazarika, Shyamanta M. (Dept. of Computer Science and Engineering, Tezpur University)
  • Received : 2013.07.08
  • Accepted : 2014.04.16
  • Published : 2015.09.30

Abstract

This paper describes the Tezpur University dataset of online handwritten Assamese characters. The online data acquisition process involves the capturing of data as the text is written on a digitizer with an electronic pen. A sensor picks up the pen-tip movements, as well as pen-up/pen-down switching. The dataset contains 8,235 isolated online handwritten Assamese characters. Preliminary results on the classification of online handwritten Assamese characters using the above dataset are presented in this paper. The use of the support vector machine classifier and the classification accuracy for three different feature vectors are explored in our research.

Keywords

References

  1. R. Plamondon and S. Srihari, "Online and offline handwriting recognition: a comprehensive survey," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, no. 1, pp. 63-84, 2000. https://doi.org/10.1109/34.824821
  2. E. Alpaydin and Fevzi. Alimoglu, "Pen-based recognition of handwritten digits dataset," 1998; http://archive.ics.uci.edu/ml/datasets/Pen-Based+Recognition+of+Handwritten+Digits.
  3. C. Vivard-Gaurdin, P. M. Lallican, S. Knerr, and P. Binter, "IRESTE On/Off (IRONOFF) dual handwriting database," in Proceeding of the 5th International Conference on Document Analysis and Recognition, Bangalore, India, 1999, pp. 455-458.
  4. D. Llorens, F. Prat, A. Marzal, J. M. Vilar, M. J. Castro, J. C. Amengual, S. Barrachina, A. Castellanos, S. Espana, J. A. Gomez, J. Gorbe, A. Gordo, V. Palazón, G. Peris, R. Ramos-Garijo, and F. Zamora, "The UJIpenchars Database: a pen-based database of isolated handwritten characters," in Proceeding of the 6th International Conference on Language Resources and Evaluation (LREC), Marrakech, Morocco, 2008, pp. 2647-2651.
  5. I. Guyon, L. Schomaker, R. Plamondon, M. Liberman, and S. Janet, "UNIPEN project of on-line data exchange and recognizer benchmarks," in Proceeding of the 12th IAPR International Conference on Pattern Recognition, Jerusalem, Israel, 1994, pp. 29-33.
  6. A. Bharath and S. Madhvanath, "Hidden Markov model for online handwritten Tamil word recognition," in Proceedings of the 9th International Conference on Document Analysis and Recognition, Curtiba, Brazil, 2007, pp. 506-510.
  7. L. Prasanth, J. Babu, R. Sharma, and P. Rao, "Elastic matching of online handwritten Tamil and Telegu scripts using local features," in Proceedings of the 9th International Conference on Document Analysis and Recognition, Curtiba, Brazil, 2007, pp. 1028-1032.
  8. U. Bhattacharya, B. K. Gupta, and S. K. Parui, "Direction code based features for recognition of online handwritten characters of Bangla," in Proceedings of the 9th International Conference on Document Analysis and Recognition, Curtiba, Brazil, 2007, pp. 58-62.
  9. N. Joshi, G. Sita, A. G. Ramakrishnan, V. Deepu, and S. Madhvanath, "Machine Recognition of Online Handwritten Devanagari Characters," in Proceedings of the 8th International Conference on Document Analysis and Recognition, Seoul, Korea, 2005, pp. 1156-1160.
  10. A. Sharma, R. Kumar, and R. K. Sharma, "Online handwritten Gurmukhi character recognition using elastic matching," in Proceedings of the Congress on Image and Signal Processing, Hainan, China, 2008, pp. 391-396.
  11. A. S. Bhaskarabhatla and S. Madhvanath, "Experiences in collection of handwriting data for online handwriting recognition in Indic scripts," in Proceedings of the 4th International Conference on Language Resources and Evaluation, Lisbon, Portugal, 2004.
  12. B. B. Chaudhuri, "A complete handwritten numeral database of Bangla: a major Indic script," in Proceedings of the 10th International Workshop on Frontiers in Handwriting Recognition, France, 2006.
  13. K. C. Santosh, C. Nattee, and Bart Lamiroy, "Relative positioning of stroke based clustering: a new approach to on-line handwritten Devanagari character recognition," International Journal of Image & Graphics (IJIG), vol. 12, no. 2, 2012.
  14. N. Saharia and K. M. Konwar, "LuitPad: a fully unicode compatible Assamese writing software," in, Proceedings of the 2nd Workshop on Advances in Text Input Methods (WTIM 2), Mumbai, India, 2012, pp. 79-88.
  15. N. S. Bhabendra, Amar Akhar (Second Part). Guwahati: Assam Book Hive, 2008.
  16. C. W. Hsu, C. C. Chang, and C. J. Lin, "A practical guide to support vector classification," Department of Computer Science and Information Engineering, National Taiwan University, Taipei, Taiwan, 2003.
  17. A. R. Ahmed, C. V. Gaudin, M. Khalid, and R. Yusof, "Online handwriting recognition using support vector machine," in Proceedings of the 2nd International Conference on Artificial Intelligence in Engineering and Technology, 2004, Sabah, Malaysia, pp. 250-256.
  18. U. Baruah and S. M. Hazarika, "Online handwritten Assamese characters dataset," 2011; http://mlr.cs.umass.edu/ml/datasets/Online+Handwritten+Assamese+Characters+Dataset.
  19. G. S. Reddy, P Sharma, S. R. M. Prasanna, C. Mahanta and L. N. Sharma, "Combined online and offline Assamese handwritten numeral recognizer," in Proceedings of the National Conference on Communications (NCC2012), Kharagpur, 2012, pp. 1-5.