통합 검색 | Korea Science

Development of a Low-cost Industrial OCR System with an End-to-end Deep Learning Technology

Subedi, Bharat;Yunusov, Jahongir;Gaybulayev, Abdulaziz;Kim, Tae-Hyong
- 대한임베디드공학회논문지
- /
- 제15권2호
- /
- pp.51-60
- /
- 2020
Optical character recognition (OCR) has been studied for decades because it is very useful in a variety of places. Nowadays, OCR's performance has improved significantly due to outstanding deep learning technology. Thus, there is an increasing demand for commercial-grade but affordable OCR systems. We have developed a low-cost, high-performance OCR system for the industry with the cheapest embedded developer kit that supports GPU acceleration. To achieve high accuracy for industrial use on limited computing resources, we chose a state-of-the-art text recognition algorithm that uses an end-to-end deep learning network as a baseline model. The model was then improved by replacing the feature extraction network with the best one suited to our conditions. Among the various candidate networks, EfficientNet-B3 has shown the best performance: excellent recognition accuracy with relatively low memory consumption. Besides, we have optimized the model written in TensorFlow's Python API using TensorFlow-TensorRT integration and TensorFlow's C++ API, respectively.
https://doi.org/10.14372/IEMEK.2020.15.2.51 인용 PDF KSCI

후처리를 이용한 음성 다이얼링 시스템의 성능향상 (Performance Improvement of Voice Dialing System using Post-Processing)

김원구
- 한국음향학회지
- /
- 제19권5호
- /
- pp.9-12
- /
- 2000
음성 다이얼링 시스템은 화자의 음성을 인식하여 원하는 전화번호로 자동으로 전화를 걸어주는 시스템으로 주로 이동 전화나 휴대형 통신 장비에 유용하게 사용된다. 개인 음성 다이얼링 시스템의 경우, 다이얼링에 사용되는 모든 구문은 사용자가 선택하고 사용자의 음성을 사용하여 학습되어 음성 인식을 위한 HMM을 생성한다. 이러한 시스템은 화자독립 시스템 보다 매우 적은 메모리 공간과 계산량으로 구현이 가능하다. 그러나 이러한 시스템은 학습시 각 단어당 2-3개의 음성만을 사용하므로 음성인식 시스템의 성능을 개선하기 위한 각 상태에서의 상태지속분포을 추정하기는 매우 어렵다. 따라서 본 논문에서는 성능개선을 위한 후처리기를 제안하였다. 전화선을 통하여 구성된 데이터베이스를 이용한 실험에서 제안된 후처리기가 인식 시스템의 성능을 향상시킴을 확인하였다.
PDF

Information Processing in Primate Retinal Ganglion

Je, Sung-Kwan;Cho, Jae-Hyun;Kim, Gwang-Baek
- Journal of information and communication convergence engineering
- /
- 제2권2호
- /
- pp.132-137
- /
- 2004
Most of the current computer vision theories are based on hypotheses that are difficult to apply to the real world, and they simply imitate a coarse form of the human visual system. As a result, they have not been showing satisfying results. In the human visual system, there is a mechanism that processes information due to memory degradation with time and limited storage space. Starting from research on the human visual system, this study analyzes a mechanism that processes input information when information is transferred from the retina to ganglion cells. In this study, a model for the characteristics of ganglion cells in the retina is proposed after considering the structure of the retina and the efficiency of storage space. The MNIST database of handwritten letters is used as data for this research, and ART2 and SOM as recognizers. The results of this study show that the proposed recognition model is not much different from the general recognition model in terms of recognition rate, but the efficiency of storage space can be improved by constructing a mechanism that processes input information.
PDF KSCI

Video Representation via Fusion of Static and Motion Features Applied to Human Activity Recognition

Arif, Sheeraz;Wang, Jing;Fei, Zesong;Hussain, Fida
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제13권7호
- /
- pp.3599-3619
- /
- 2019
In human activity recognition system both static and motion information play crucial role for efficient and competitive results. Most of the existing methods are insufficient to extract video features and unable to investigate the level of contribution of both (Static and Motion) components. Our work highlights this problem and proposes Static-Motion fused features descriptor (SMFD), which intelligently leverages both static and motion features in the form of descriptor. First, static features are learned by two-stream 3D convolutional neural network. Second, trajectories are extracted by tracking key points and only those trajectories have been selected which are located in central region of the original video frame in order to to reduce irrelevant background trajectories as well computational complexity. Then, shape and motion descriptors are obtained along with key points by using SIFT flow. Next, cholesky transformation is introduced to fuse static and motion feature vectors to guarantee the equal contribution of all descriptors. Finally, Long Short-Term Memory (LSTM) network is utilized to discover long-term temporal dependencies and final prediction. To confirm the effectiveness of the proposed approach, extensive experiments have been conducted on three well-known datasets i.e. UCF101, HMDB51 and YouTube. Findings shows that the resulting recognition system is on par with state-of-the-art methods.
https://doi.org/10.3837/tiis.2019.07.015 인용 PDF KSCI HTML

기억부하가 얼굴과 공간 작업기억의 유지에 미치는 효과: 사건유관 fMRI 연구 (The Effect of Memory Load on Maintenance in Face and Spatial Working Memory: An Event-Related fMRI Study)

김정희;정광우;강형근;이무석;박태진
- 인지과학
- /
- 제21권2호
- /
- pp.359-386
- /
- 2010
본 연구는 공간작업기억과 비공간작업기억의 유지 처리에 대한 영역특수모형과 처리특수모형을 검증하고자 수행한 사건유관 fMRI 연구이다. 지연반응과제에서 얼굴자극을 표적자극으로 사용하고 표적자극의 수를 달리하여 기억부하를 조작하였으며 일정한 지연기간 후 위치 및 얼굴 재인판단을 요구하였는데, 지연 동안 정보유형(위치와 얼굴)이 기억부하효과의 신경상관물에 미치는 영향을 뇌영상 처리로 분석하였다. 독립변인은 정보유형(자극영역)과 기억부하(표적자극의 수)로서, 정보유형변인은 얼굴작업기억(비공간작업기억) 및 공간작업기억의 두 수준, 기억부하변인은 표적자극이 1개인 조건(저부하)과 3개인 조건(고부하)이었다. 행동수행 분석 결과, 재인정확도에 있어 얼굴작업기억과 공간작업기억 간에는 차이가 없었으며 두 작업기억 모두에서 동등한 정도의 기억부하효과가 확인되었다. 뇌 영상분석 결과, 유지단계에서 전전두피질의 하측 및 중/상측영역이 모두 얼굴작업기억과 공간작업기억 양자와 관련되고 특히 VLPFC가 두 작업기억에 공통적으로 관련된다는 것이 밝혀졌는데, 이러한 결과는 처리성분에 따른 전전두피질의 기능적 전문화를 주장하는 처리특수모형에 더 잘 부합된다.
PDF

가이드라인을 이용한 동적 손동작 인식 (Dynamic Hand Gesture Recognition using Guide Lines)

김건우;이원주;전창호
- 전자공학회논문지CI
- /
- 제47권5호
- /
- pp.1-9
- /
- 2010
일반적으로 동적 손동작 인식을 위해서는 전처리, 손 추적, 손 모양 검출의 단계가 필요하다. 본 논문에서는 전처리와 손 모양 검출 방법을 개선함으로써 성능을 향상시킨 동적 손동작 인식 방법을 제안한다. 전처리 단계에서는 동적테이블을 이용하여 노이즈제거 성능을 높이고, YCbCr 컬러공간을 이용한 기존의 피부색 검출 방식에서 피부색의 범위를 조절할 수 있도록 하여 피부색 검출 성능을 높인다. 특히 손 모양 검출 단계에서는 가이드라인을 이용하여 동적 손동작 인식의 요소인 시작이미지(Start Image)와 정지 이미지(Stop Image)를 검출하여 동적 손동작을 인식하기 때문에 학습예제를 사용한 손동작 인식 방법에 비해 인식 속도가 빠르다는 이점이 있다. 가이드라인이란 웹캠을 통해 입력되는 손의 모양과 비교하여 검출하기 위해 화면에 출력하는 손 모양의 라인이다. 가이드라인을 이용한 동적 손동작 인식 방법의 성능을 평가하기 위해 웹캠을 사용하여 복잡한 배경과 단순한 배경으로 구분된 9가지 동영상을 대상으로 실험하였다. 그 결과 CPU 점유율이 낮고, 메모리 사용량도 적기 때문에 시스템 부하가 높은 환경에 효과적임을 알 수 있었다.
PDF KSCI

웨이블릿 변환의 특성을 이용한 얼굴 인식 성능 개선 (Performance Improvement of the Face Recognition Using the Properties of Wavelet Transform)

박경준;서석용;고형화
- 한국항행학회논문지
- /
- 제17권6호
- /
- pp.726-735
- /
- 2013
본 논문에서는 웨이블릿 변환의 특성을 이용한 얼굴인식 방법을 제안하여 인식성능 향상에 관한 연구를 진행하였다. 사용한 이산 웨이블릿 변환은 모웨이블릿의 특징과 비슷한 Daubechies D4 필터이다. 웨이블릿 변환영역 중 LL 대역의 데이터만을 이용할 경우 원본 데이터에 비하여 크기가 줄어들게 되어 인식과정의 속도와 메모리 사용량을 줄일 수 있게 된다. 또한 2차원 데이터의 변형없이 손실을 줄여 인식률을 향상시키기 위하여 2차원 LDA 방법을 적용하였다. 그리고 여기서 얻은 특징벡터를 이용하여 SVM을 수행하도록 하였다. 실험은 Matlab 프로그램을 통하여 ORL 얼굴 데이터베이스와 Yale 얼굴 데이터베이스를 이용하여 실험을 하였고 기존의 방법들과 인식률과 수행시간을 비교를 함으로써 제안한 방법의 우수성을 입증하였다.
https://doi.org/10.12673/jkoni.2013.17.6.726 인용 PDF KSCI

자율주행 제어를 위한 향상된 주변환경 인식 알고리즘 (Improved Environment Recognition Algorithms for Autonomous Vehicle Control)

배인환;김영후;김태경;오민호;주현수;김슬기;신관준;윤선재;이채진;임용섭;최경호
- 자동차안전학회지
- /
- 제11권2호
- /
- pp.35-43
- /
- 2019
This paper describes the improved environment recognition algorithms using some type of sensors like LiDAR and cameras. Additionally, integrated control algorithm for an autonomous vehicle is included. The integrated algorithm was based on C++ environment and supported the stability of the whole driving control algorithms. As to the improved vision algorithms, lane tracing and traffic sign recognition were mainly operated with three cameras. There are two algorithms developed for lane tracing, Improved Lane Tracing (ILT) and Histogram Extension (HIX). Two independent algorithms were combined into one algorithm - Enhanced Lane Tracing with Histogram Extension (ELIX). As for the enhanced traffic sign recognition algorithm, integrated Mutual Validation Procedure (MVP) by using three algorithms - Cascade, Reinforced DSIFT SVM and YOLO was developed. Comparing to the results for those, it is convincing that the precision of traffic sign recognition is substantially increased. With the LiDAR sensor, static and dynamic obstacle detection and obstacle avoidance algorithms were focused. Therefore, improved environment recognition algorithms, which are higher accuracy and faster processing speed than ones of the previous algorithms, were proposed. Moreover, by optimizing with integrated control algorithm, the memory issue of irregular system shutdown was prevented. Therefore, the maneuvering stability of the autonomous vehicle in severe environment were enhanced.
https://doi.org/10.22680/kasa2019.11.2.035 인용 PDF KSCI HTML

파지기간에 따른 모니터 화면상 광각이미지의 경계축소현상 (Boundary Contraction for Wide-Angle Images on Monitor Screen: An Effect of Retention Interval)

장필식
- 한국컴퓨터정보학회논문지
- /
- 제12권4호
- /
- pp.61-68
- /
- 2007
모니터 화면상에서 광각으로 멀리보이는 이미지에 대한 기억왜곡현상을 고찰하기 위해 피실험자 170명을 대상으로 두 개의 실험이 실시되었다. 기억 파지기간을 세 가지(즉시, 1시간, 24시간)로 나누고, 시각기억에 대한 재생실험과 재인실험을 실시하였다. 재생실험 결과, 파지기간에 상관없이 피실험자들은 원본이미지보다 전경과 배경을 더 크게 확대 하여 표현하였다. 재인실험결과 파지기간이 1시간과 48시간인 경우에 피실험자들은 원본과 동일한 이미지에 대해 더 광각으로 멀리 촬영된 것으로 인식하였다. 이 결과들은 광각이미지의 시각적 기억, 회상에 경계축소현상이 나타난다는 것을 확인시켜주며, 기존 연구들의 주장과는 달리 경계확장이 일관성 있으며 단방향적으로만 일어나는 현상이 아니라는 것을 보여준다. 또한 연구결과는 경계축소현상을 기억스키마 가설로 설명하는 것이 타당함을 보여준다.
PDF

홍화씨와 흰민들레 복합물의 Scopolamine 유도 기억력 손상에 대한 보호 효과 (Protective Effects of Combination of Carthamus tinctorius L. Seed and Taraxacum coreanum on Scopolamine-induced Memory Impairment in Mice)

김지현;;김민조;박찬흠;이재양;신유수;조은주
- 한국약용작물학회지
- /
- 제28권2호
- /
- pp.85-94
- /
- 2020
Background: Alzheimer's disease (AD) is caused by various factors, such as cholinergic dysfunction, regulation of neurotrophic factor expression, and accumulation of amyloid-beta. We investigated whether or not a combination of Carthamus tinctorius L. seed and Taraxacum coreanum (CT) has a protective effect on scopolamine-induced memory impairment in a mouse model. Methods and Results: Mice were orally pretreated with CT (50, 100 and 200 mg/kg/day) for 14 days, and scopolamine (1 mg/kg/day) was injected intraperitoneally before subjecting them to behavior tests. CT-administered mice showed better novel object recognition and working memory ability than scopolamine-treated control mice. In T-maze and Morris water maze tests, CT (100 and 200 mg/kg/day) significantly increased space perceptive ability and occupancy to the target quadrant, respectively. In addition, 100 and 200 mg/kg/day of CT attenuated cholinergic dysfunction through inhibition of butyryl cholinesterase in brain tissue. Furthermore, CT-administered mice showed higher cyclic adenosine monophosphate-response element-binding protein (CREB) levels and lower amyloid precursor protein (APP) levels compared to scopolamine-treated control mice. Conclusions: CT improved scopolamine-induced memory impairment through inhibition of cholinergic dysfunction, up-regulation of CREB, and down-regulation of APP. Therefore, CT could be a useful therapeutic agent for AD with protective effects on cognitive impairment.
https://doi.org/10.7783/KJMCS.2020.28.2.85 인용 PDF KSCI

검색결과 473건 처리시간 0.025초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)