Character Recognition in General Images Using Deep Learning

Jeong, Gyu-Hwan (VUNO Inc.)
Kim, Hyeon-Jun (VUNO Inc.)
Lee, Ye-Ha (VUNO Inc.)
Publication Information
Korea Information Processing Society Review / v.22, no.1, 2015, pp. 42-54
References
1 K. Jung, K. I. Kim and A. K. Jain, "Text information extraction in images and video: a survey", Pattern Recognition, vol. 37, no. 5, pp. 977-997, 2004
2 S. Singh, "Optical character recognition techniques: a survey", Journal of Emerging Trends in Computing and Information Sciences, vol. 4, no. 6, pp. 545-550, 2013
3 C. Patel, A. Patel and D. Patel, "Optical character recognition by open source OCR tool Tesseract: a case study", International Journal of Computer Applications, vol. 55, no. 10, pp. 50-56, 2012
4 C. Yao, X. Bai and W. Liu, "A unified framework for multioriented text detection and recognition", IEEE Transactions on Image Processing, vol. 23, no. 11, pp. 4737-4749, 2014
5 Y. Bengio and Y. LeCun, "Scaling learning algorithms towards AI", Large-scale Kernel Machines 34, pp. 1-41, 2007
6 A. Krizhevsky, I. Sutskever and G. E. Hinton, "ImageNet classification with deep convolutional neural networks", Advances in Neural Information Processing Systems 25, 2012, pp. 1097-1105
7 I. J. Goodfellow, Y. Bulatov, J. Ibarz, S. Arnoud and V. Shet, "Multi-digit number recognition from street view imagery using deep convolutional neural networks", arXiv:1312.6082, 2014
8 R. Girshick, J. Donahue, T. Darrell and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation", arXiv:1311.2524, 2013
9 A. Bissacco, M. Cummins, Y. Netzer and H. Neven, "PhotoOCR: reading text in uncontrolled conditions", in Proceedings of the IEEE International Conference on Computer Vision, 2013, pp. 785-792
10 K. Kavukcuoglu, P. Sermanet, Y. Boureau, K. Gregor, M. Mathieu and Y. LeCun, "Learning convolutional feature hierarchies for visual recognition", Advances in Neural Information Processing Systems 23, 2010, pp. 1090-1098
11 K. Wang, B. Babenko and S. Belongie, "End-to-end scene text recognition", in Proceedings of the IEEE International Conference on Computer Vision, 2011, pp. 1457-1464
12 T. Wang, D. J. Wu, A. Coates and A. Y. Ng, "End-to-end text recognition with convolutional neural networks", in Proceedings of the International Conference on Pattern Recognition, 2012, pp. 3304-3308
13 O. Alsharif and J. Pineau, "End-to-end text recognition with hybrid HMM maxout models", arXiv:1310.1811, 2013
14 D. E. Rumelhart and J. L. McClelland, "Parallel distributed processing: explorations in the microstructure of cognition", Cambridge: MIT Press, 1986
15 G. E. Hinton and R. R. Salakhutdinov, "Reducing the dimensionality of data with neural networks", Science, vol. 313, no. 5786, pp. 504-507, 2006
16 P. Baldi and P. J. Sadowski, "Understanding dropout", Advances in Neural Information Processing Systems 26, 2013, pp. 2814-2822
17 H. Lee, A. Battle, R. Raina and A. Y. Ng, "Efficient sparse coding algorithms", Advances in Neural Information Processing Systems 19, 2007, pp. 584-592
18 V. Nair and G. E. Hinton, "Rectified linear units improve restricted Boltzmann machines", in Proceedings of International Conference on Machine Learning, 2010, pp. 807-814
19 C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke and A. Rabinovich, "Going deeper with convolutions", arXiv:1409.4842, 2014
20 X. Glorot, A. Bordes and Y. Bengio, "Deep sparse rectifier neural networks", in Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, 2011, pp. 315-323
21 G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever and R. R. Salakhutdinov, "Improving neural networks by preventing co-adaptation of feature detectors", arXiv:1207.0580, 2012
22 A. Coates, B. Carpenter, C. Case, S. Satheesh, B. Suresh, T. Wang, D. J. Wu and A. Y. Ng, "Text detection and character recognition in scene images with unsupervised feature learning", in Proceedings of the International Conference on Document Analysis and Recognition, 2011, pp. 440-445
23 A. Coates, H. Lee and A. Y. Ng, "An analysis of single-layer networks in unsupervised feature learning", AISTATS, 2011
24 A. Neubeck and L. Van Gool, "Efficient non-maximum suppression", in Proceedings of the International Conference on Pattern Recognition, 2006, pp. 850-855