통합 검색 | Korea Science

사람 행동 인식에서 반복 감소를 위한 저수준 사람 행동 변화 감지 방법 (Detection of Low-Level Human Action Change for Reducing Repetitive Tasks in Human Action Recognition)

노요환;김민정;이도훈
- 한국멀티미디어학회논문지
- /
- 제22권4호
- /
- pp.432-442
- /
- 2019
Most current human action recognition methods based on deep learning methods. It is required, however, a very high computational cost. In this paper, we propose an action change detection method to reduce repetitive human action recognition tasks. In reality, simple actions are often repeated and it is time consuming process to apply high cost action recognition methods on repeated actions. The proposed method decides whether action has changed. The action recognition is executed only when it has detected action change. The action change detection process is as follows. First, extract the number of non-zero pixel from motion history image and generate one-dimensional time-series data. Second, detecting action change by comparison of difference between current time trend and local extremum of time-series data and threshold. Experiments on the proposed method achieved 89% balanced accuracy on action change data and 61% reduced action recognition repetition.
https://doi.org/10.9717/kmms.2019.22.4.432 인용 PDF KSCI HTML

Human Motion Recognition Based on Spatio-temporal Convolutional Neural Network

Hu, Zeyuan;Park, Sange-yun;Lee, Eung-Joo
- 한국멀티미디어학회논문지
- /
- 제23권8호
- /
- pp.977-985
- /
- 2020
Aiming at the problem of complex feature extraction and low accuracy in human action recognition, this paper proposed a network structure combining batch normalization algorithm with GoogLeNet network model. Applying Batch Normalization idea in the field of image classification to action recognition field, it improved the algorithm by normalizing the network input training sample by mini-batch. For convolutional network, RGB image was the spatial input, and stacked optical flows was the temporal input. Then, it fused the spatio-temporal networks to get the final action recognition result. It trained and evaluated the architecture on the standard video actions benchmarks of UCF101 and HMDB51, which achieved the accuracy of 93.42% and 67.82%. The results show that the improved convolutional neural network has a significant improvement in improving the recognition rate and has obvious advantages in action recognition.
https://doi.org/10.9717/kmms.2020.23.8.977 인용 PDF KSCI HTML

강원도 중소기업 품질경영 운영 방안 사례 (A study on Quality Management in Small and Medium Enterprises)

박노국
- 대한안전경영과학회지
- /
- 제8권1호
- /
- pp.131-144
- /
- 2006
Quality system management adapted by small and medium enterprises in Kangwon province to enhance the competitiveness was studied. Variance analysis on several questionnaire answers was performed. Motives for acquiring the accreditation, such as product export, adjustment to international trend, enhancement of brand/product recognition, CEO's mind change, and management innovation, have been changed significantly among business types. Mind changes after the accreditations were setting company's first priority on quality, enhanced recognition on compliance of in-house standards and regulations, employee's performance with the recognition of quality. Amongst service problems to maintain the ace reditations were difficulties in maintaining the recognition of the company's finality management, labor increase to maintain the ISO 9000 enforcement team, and financial burden to keep the accreditation. Quality recognition after the accreditations was significantly improved in setting company's first priority on quality, enhanced recognition on compliance of in-house standards and regulations, employee's performance with the recognition of quality.
PDF KSCI

Multi-Style License Plate Recognition System using K-Nearest Neighbors

Park, Soungsill;Yoon, Hyoseok;Park, Seho
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제13권5호
- /
- pp.2509-2528
- /
- 2019
There are various styles of license plates for different countries and use cases that require style-specific methods. In this paper, we propose and illustrate a multi-style license plate recognition system. The proposed system performs a series of processes for license plate candidates detection, structure classification, character segmentation and character recognition, respectively. Specifically, we introduce a license plate structure classification process to identify its style that precedes character segmentation and recognition processes. We use a K-Nearest Neighbors algorithm with pre-training steps to recognize numbers and characters on multi-style license plates. To show feasibility of our multi-style license plate recognition system, we evaluate our system for multi-style license plates covering single line, double line, different backgrounds and character colors on Korean and the U.S. license plates. For the evaluation of Korean license plate recognition, we used a 50 minutes long input video that contains 138 vehicles of 6 different license plate styles, where each frame of the video is processed through a series of license plate recognition processes. From two experiments results, we show that various LP styles can be recognized under 50 ms processing time and with over 99% accuracy, and can be extended through additional learning and training steps.
https://doi.org/10.3837/tiis.2019.05.015 인용 PDF KSCI HTML

A Study on Design and Implementation of Speech Recognition System Using ART2 Algorithm

Kim, Joeng Hoon;Kim, Dong Han;Jang, Won Il;Lee, Sang Bae
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- 제4권2호
- /
- pp.149-154
- /
- 2004
In this research, we selected the speech recognition to implement the electric wheelchair system as a method to control it by only using the speech and used DTW (Dynamic Time Warping), which is speaker-dependent and has a relatively high recognition rate among the speech recognitions. However, it has to have small memory and fast process speed performance under consideration of real-time. Thus, we introduced VQ (Vector Quantization) which is widely used as a compression algorithm of speaker-independent recognition, to secure fast recognition and small memory. However, we found that the recognition rate decreased after using VQ. To improve the recognition rate, we applied ART2 (Adaptive Reason Theory 2) algorithm as a post-process algorithm to obtain about 5% recognition rate improvement. To utilize ART2, we have to apply an error range. In case that the subtraction of the first distance from the second distance for each distance obtained to apply DTW is 20 or more, the error range is applied. Likewise, ART2 was applied and we could obtain fast process and high recognition rate. Moreover, since this system is a moving object, the system should be implemented as an embedded one. Thus, we selected TMS320C32 chip, which can process significantly many calculations relatively fast, to implement the embedded system. Considering that the memory is speech, we used 128kbyte-RAM and 64kbyte ROM to save large amount of data. In case of speech input, we used 16-bit stereo audio codec, securing relatively accurate data through high resolution capacity.
https://doi.org/10.5391/IJFIS.2004.4.2.149 인용 PDF KSCI

Human Face Recognition Based on improved CNN Model with Multi-layers

Zhang, Ruyang;Lee, Eung-Joo
- 한국멀티미디어학회논문지
- /
- 제24권5호
- /
- pp.701-708
- /
- 2021
As one of the most widely used technology in the world right now, Face recognition has already received widespread attention by all the researcher and institutes. It has been used in many fields such as safety protection, surveillance system, crime control and even in our ordinary life such as home security and so on. This technology with today's technology has advantages such as high connectivity and real time transformation. But we still need to improve its recognition rate, reaction time and also reduce impact of different environmental status to the whole system. So in this paper we proposed a face recognition system model with improved CNN which combining the characteristics of flat network and residual network, integrated learning, simplify network structure and enhance portability and also improve the recognition accuracy. We also used AR and ORL database to do the experiment and result shows higher recognition rate, efficiency and robustness for different image conditions.
https://doi.org/10.9717/kmms.2021.24.5.701 인용 PDF KSCI HTML

Intelligent Activity Recognition based on Improved Convolutional Neural Network

Park, Jin-Ho;Lee, Eung-Joo
- 한국멀티미디어학회논문지
- /
- 제25권6호
- /
- pp.807-818
- /
- 2022
In order to further improve the accuracy and time efficiency of behavior recognition in intelligent monitoring scenarios, a human behavior recognition algorithm based on YOLO combined with LSTM and CNN is proposed. Using the real-time nature of YOLO target detection, firstly, the specific behavior in the surveillance video is detected in real time, and the depth feature extraction is performed after obtaining the target size, location and other information; Then, remove noise data from irrelevant areas in the image; Finally, combined with LSTM modeling and processing time series, the final behavior discrimination is made for the behavior action sequence in the surveillance video. Experiments in the MSR and KTH datasets show that the average recognition rate of each behavior reaches 98.42% and 96.6%, and the average recognition speed reaches 210ms and 220ms. The method in this paper has a good effect on the intelligence behavior recognition.
https://doi.org/10.9717/kmms.2022.25.6.807 인용 PDF KSCI HTML

Convolutional Neural Network와 Stereo Image를 이용한 얼굴 인식 (Face Recognition Using Convolutional Neural Network and Stereo Images)

기철민;조태훈
- 한국정보통신학회:학술대회논문집
- /
- 한국정보통신학회 2016년도 춘계학술대회
- /
- pp.359-362
- /
- 2016
얼굴은 홍채, 지문 등과 같은 사람마다 가진 특수한 정보이다. 얼굴 인식에 대한 연구들은 과거부터 현재까지 지속적으로 진행되고 있으며, 이러한 연구들을 통해 여러 가지의 얼굴 인식 방법들이 나타났다. 이 중에는 스테레오로 구성된 얼굴 데이터를 이용하여 얼굴 인식을 진행하는 알고리즘들이 있다. 본 논문에서는 기계학습의 방법인 Convolutional Neural Network를 이용하여 스테레오로 구성된 얼굴 이미지를 하나의 신경망으로 학습을 진행하였다. 또한 스테레오로 구성된 얼굴 이미지는 카메라 2대를 이용하여 취득하였다. 이 방법은 얼굴 인식에서 보편적으로 많이 사용되는 알고리즘인 PCA를 이용한 스테레오 얼굴 인식의 결과보다 더욱 좋은 성능을 보였다.
PDF

시간지연 회귀 신경회로망을 이용한 피치 악센트 인식 (Automatic Recognition of Pitch Accents Using Time-Delay Recurrent Neural Network)

Kim, Sung-Suk;Kim, Chul;Lee, Wan-Joo
- The Journal of the Acoustical Society of Korea
- /
- 제23권4E호
- /
- pp.112-119
- /
- 2004
This paper presents a method for the automatic recognition of pitch accents with no prior knowledge about the phonetic content of the signal (no knowledge of word or phoneme boundaries or of phoneme labels). The recognition algorithm used in this paper is a time-delay recurrent neural network (TDRNN). A TDRNN is a neural network classier with two different representations of dynamic context: delayed input nodes allow the representation of an explicit trajectory F0(t), while recurrent nodes provide long-term context information that can be used to normalize the input F0 trajectory. Performance of the TDRNN is compared to the performance of a MLP (multi-layer perceptron) and an HMM (Hidden Markov Model) on the same task. The TDRNN shows the correct recognition of $91.9{\%}\;of\;pitch\;events\;and\;91.0{\%}$ of pitch non-events, for an average accuracy of $91.5{\%}$ over both pitch events and non-events. The MLP with contextual input exhibits $85.8{\%},\;85.5{\%},\;and\;85.6{\%}$ recognition accuracy respectively, while the HMM shows the correct recognition of $36.8{\%}\;of\;pitch\;events\;and\;87.3{\%}$ of pitch non-events, for an average accuracy of $62.2{\%}$ over both pitch events and non-events. These results suggest that the TDRNN architecture is useful for the automatic recognition of pitch accents.
PDF KSCI

인식 단위로서의 한국어 음절에 대한 연구 (A Study on the Korean Syllable As Recognition Unit)

김유진;김회린;정재호
- 한국음향학회지
- /
- 제16권3호
- /
- pp.64-72
- /
- 1997
본 논문에서는 한국어 대용량 어휘 인식 시스템에 적합한 인식 단위에 대하여 연구 및 실험하였다. 특히 현재 인식 시스템의 인식 단위로 주로 사용되는 음소와 한국어의 특징을 잘 나타내는 음절을 선택하고, 인식 실험을 통해 음절이 한국어 인식 시스템의 인식 단위로서 적합한가를 음소와 비교하였다. 객관적인 비교 인식 실험 결과를 제시하기 위하여 동일한 남성 화자의 음성 데이터를 수집하고, 수작업 음소 경계 및 레이블링 과정을 거친 음성 데이터 베이스를 구축하였다. 또한 각 인식 단위에 동일한 HMM 기반의 훈련 및 인식 알고리즘을 적용하기 위해 Entropic사의 HTK (HMM Tool Kit) 2.0을 사용하였다. 각 인식 단위의 훈련을 위해 5상태 3출력, 8상태 6출력 HMM 모델의 연속 HMM (Continuous HMM)을 적용하였고, PBW 3회분, POW 1회분을 훈련에 사용하고 PBW 1회분을 각 인식 단위로서 인식하는 화자 종속 단어 인식 실험을 구성하였다. 실험 결과 8상태 6출력 모델을 사용한 경우 음소 단위는 95.65%, 음절 단위는 94.41%의 인식률을 나타내었다. 한편 인식 속도에서는 음절이 음소보다 약 25% 빠른 것으로 나타났다.
PDF

검색결과 9,780건 처리시간 0.039초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)