Comparison of Spatial and Frequency Images for Character Recognition

Abdurakhmon, Abduraimjonov;Choi, Hyeon-yeong;Ko, Jaepil;

한국정보통신학회:학술대회논문집 (Proceedings of the Korean Institute of Information and Commucation Sciences Conference)

한국정보통신학회 (The Korea Institute of Information and Commucation Engineering)

문자인식을 위한 공간 및 주파수 도메인 영상의 비교

Comparison of Spatial and Frequency Images for Character Recognition

;
최현영 (금오공과대학교) ;
고재필 (금오공과대학교)

Abdurakhmon, Abduraimjonov (Kumoh National Institute of Technology) ;
Choi, Hyeon-yeong (Kumoh National Institute of Technology) ;
Ko, Jaepil (Kumoh National Institute of Technology)

발행 : 2019.05.23

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

딥러닝은 객체인식 분야에서에서 강력하고, 강건한 학습 알고리즘이다. 딥러닝에서 자주 활용되고, 객체인식 분야에서 최고의 성능을 보여주는 네트워크는 Convolutional Neural Network(CNN) 이다. 숫자 필기 인식을 위한 MNIST 데이터셋를 CNN으로 학습하면 성능이 매우 뛰어나다. 이는 MNIST 데이터 셋의 숫자들이 중앙에 잘 정렬되어 있기 때문이다. 하지만, 실제 데이터들은 중앙에 정렬이 잘 되어있지 않다. 이러한 경우에 CNN은 이전과 같이 우수한 성능을 보여주지 못한다. 이를 해결하기 위해, 우리는 FFT를 활용하여 이미지를 주파수 공간으로 변환하여 입력으로 주는 방법을 제안한다.

Deep learning has become a powerful and robust algorithm in Artificial Intelligence. One of the most impressive forms of Deep learning tools is that of the Convolutional Neural Networks (CNN). CNN is a state-of-the-art solution for object recognition. For instance when we utilize CNN with MNIST handwritten digital dataset, mostly the result is well. Because, in MNIST dataset, all digits are centralized. Unfortunately, the real world is different from our imagination. If digits are shifted from the center, it becomes a big issue for CNN to recognize and provide result like before. To solve that issue, we have created frequency images from spatial images by a Fast Fourier Transform (FFT).

한국정보통신학회:학술대회논문집 (Proceedings of the Korean Institute of Information and Commucation Sciences Conference)

문자인식을 위한 공간 및 주파수 도메인 영상의 비교

Comparison of Spatial and Frequency Images for Character Recognition

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)