Integrated Search | Korea Science

Image Feature Detection and Contrast Enhancement Algorithms Based on Statistical Tests

Kim, Yeong-Hwa;Nam, Ji-Ho
- Journal of the Korean Data and Information Science Society / Vol. 18, No. 2 / pp. 385-399 / 2007
In many image processing applications, random noise causes problems because most video enhancement functions produce visual artifacts if the prior information about the noise is incorrect. The basic difficulty is that noise and signal are hard to distinguish. Typical unsharp masking (UM) enhances the visual appearance of an image, but it also amplifies the image's noise components, so the use of UM is limited when noise is present. This paper proposes statistical algorithms based on parametric and nonparametric tests that adaptively enhance image features, taking the noise into account, while applying UM. The proposed algorithms make it possible to enhance the local contrast of an image without amplifying the noise.
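The abstract does not say which tests the authors use, so the following is only a minimal sketch of the general idea: apply the unsharp-masking boost at a pixel only when a statistical test says the local variation is larger than noise alone would explain. The chi-square variance test, the 3x3 window, and the names `adaptive_unsharp`, `noise_var`, and `gain` are assumptions made for illustration, not the paper's algorithm.

```python
from statistics import mean, variance

# Upper 5% point of the chi-square distribution with 8 degrees of freedom
# (a 3x3 window has 9 samples, hence 8 degrees of freedom).
CHI2_CRIT_DF8 = 15.507


def window(img, r, c):
    """Return the 3x3 neighborhood around (r, c), clamping at the borders."""
    h, w = len(img), len(img[0])
    return [img[min(max(r + dr, 0), h - 1)][min(max(c + dc, 0), w - 1)]
            for dr in (-1, 0, 1) for dc in (-1, 0, 1)]


def adaptive_unsharp(img, noise_var, gain=1.0):
    """Unsharp masking that is switched off where the window looks like pure noise.

    img is a list of rows of floats; noise_var is the assumed noise variance.
    Output values are not clipped to a display range.
    """
    out = []
    for r in range(len(img)):
        row = []
        for c in range(len(img[0])):
            win = window(img, r, c)
            local_mean = mean(win)
            # Test statistic for H0: local variance equals the noise variance.
            stat = 8 * variance(win) / noise_var
            detail = img[r][c] - local_mean          # high-pass component
            k = gain if stat > CHI2_CRIT_DF8 else 0.0
            row.append(img[r][c] + k * detail)
        out.append(row)
    return out
```

On a flat region the statistic stays below the critical value and the pixels pass through unchanged, while across a strong edge the detail term is added and the local contrast increases.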

Enhanced Representation for Object Tracking

윤석민;유한주;최진영
- Conference Proceedings of the Institute of Electronics Engineers of Korea (IEEK) / Proceedings of the IEEK 2009 Information and Control Symposium / pp. 408-410 / 2009
We present an efficient and robust measurement model for visual tracking. This approach builds on and extends work on subspace representations of the measurement model. Subspace-based tracking algorithms were introduced to the visual tracking literature a decade ago and show considerable tracking performance owing to their robustness in matching. However, the measures used in their measurement models are often restricted to a few approaches. We propose a novel measure of object matching, Angle In Feature Space, which aims to improve the discriminability of matching in the subspace. As a result, our tracking algorithm can distinguish the target from similar background clutter, which often causes erroneous drift with the conventional Distance From Feature Space measure. Experiments demonstrate the effectiveness of the proposed tracking algorithm against severely cluttered backgrounds.
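To make the contrast between the two measures concrete, here is a small sketch under stated assumptions: the subspace is given by orthonormal basis vectors, the input is already mean-centered, Distance From Feature Space is the norm of the residual after projection, and the angle measure is the angle between the input and its projection. The abstract does not define Angle In Feature Space precisely, so `angle_to_subspace` is one plausible reading, not the authors' formula.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def project(y, basis):
    """Project y onto the span of the orthonormal vectors in basis."""
    p = [0.0] * len(y)
    for u in basis:
        coeff = dot(u, y)
        p = [pi + coeff * ui for pi, ui in zip(p, u)]
    return p


def dffs(y, basis):
    """Distance From Feature Space: norm of the projection residual."""
    p = project(y, basis)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y, p)))


def angle_to_subspace(y, basis):
    """Angle (radians) between y and its projection onto the subspace."""
    norm_y = math.sqrt(dot(y, y))
    if norm_y == 0.0:
        return 0.0
    p = project(y, basis)
    cos_theta = min(1.0, math.sqrt(dot(p, p)) / norm_y)
    return math.acos(cos_theta)


# Two candidates with the same residual but different alignment.
basis = [[1.0, 0.0, 0.0]]
well_aligned = [10.0, 1.0, 0.0]
poorly_aligned = [1.0, 1.0, 0.0]
```

Both candidates have a residual of length 1, so the distance measure cannot separate them, whereas the angle is small for `well_aligned` and pi/4 for `poorly_aligned`.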

Lip Feature Extraction using Contrast of YCbCr

김우성;민경원;고한석
- Conference Proceedings of the Institute of Electronics Engineers of Korea (IEEK) / IEEK 2006 Summer Conference / pp. 259-260 / 2006
Since audio speech recognition is affected by noise in real environments, visual speech recognition is used to support it. For visual speech recognition, this paper proposes extracting lip features using two types of image segmentation and a reduced ASM. Input images are transformed to YCbCr, and the lips are segmented using the contrast in Y/Cb/Cr between lip and face. Subsequently, a lip-shape model trained by PCA is placed on the segmented lip region, and lip features are then extracted using the ASM.
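The color-space step can be sketched as follows. The conversion uses the standard full-range BT.601 (JPEG) coefficients; the segmentation rule, which flags pixels whose Cr stands out from the region's mean Cr, and the `margin` value are illustrative assumptions, since the abstract does not give the actual contrast criterion.

```python
def rgb_to_ycbcr(r, g, b):
    """Full-range BT.601 RGB -> YCbCr for 8-bit channel values."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128.0 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128.0 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def lip_mask(pixels, margin=10.0):
    """Flag pixels whose Cr exceeds the mean Cr of the region by `margin`.

    pixels is a list of (r, g, b) tuples from a face region; the rule and
    the margin are hypothetical, chosen only to illustrate Cr contrast.
    """
    crs = [rgb_to_ycbcr(*p)[2] for p in pixels]
    mean_cr = sum(crs) / len(crs)
    return [cr > mean_cr + margin for cr in crs]


# Three skin-toned pixels and one redder, lip-like pixel (made-up values).
face_pixels = [(200, 150, 130), (200, 150, 130), (200, 150, 130), (180, 60, 80)]
mask = lip_mask(face_pixels)
```

A real pipeline would threshold per image and clean the mask before fitting the shape model; this sketch shows only the contrast computation.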

An Intelligent Visual Servoing Method using Vanishing Point Features

Lee, Joon-Soo;Suh, Il-Hong
- Journal of Electrical Engineering and Information Science / Vol. 2, No. 6 / pp. 177-182 / 1997
A visual servoing method is proposed for a robot with a camera in hand. Specifically, vanishing point features are derived from a perspective-projection viewing model to calculate the relative roll, pitch, and yaw angles between the object and the camera. To compensate for the dynamic characteristics of the robot, desired feature trajectories for learning visually guided line-of-sight robot motion are obtained by measuring features with the in-hand camera, not over the entire workspace, but along a single linear path that the robot follows under a commercially provided linear-motion function. Control actions of the camera are then approximated by fuzzy-neural networks so that it follows these desired feature trajectories. To show the validity of the proposed algorithm, experimental results are presented for a four-axis SCARA robot with a B/W CCD camera.
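As background for the vanishing point features mentioned here: under perspective projection, the images of parallel 3D lines meet at a vanishing point, which can be computed as the intersection of two image lines in homogeneous coordinates. The sketch below shows only that standard construction; how the paper maps vanishing points to roll, pitch, and yaw is not described in the abstract, and the function names are hypothetical.

```python
def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def line_through(p, q):
    """Homogeneous line through image points p = (x, y) and q = (x, y)."""
    return cross((p[0], p[1], 1.0), (q[0], q[1], 1.0))


def vanishing_point(segment_a, segment_b):
    """Intersection of the lines through two segments, or None if they
    are parallel in the image (vanishing point at infinity)."""
    v = cross(line_through(*segment_a), line_through(*segment_b))
    if v[2] == 0:
        return None
    return (v[0] / v[2], v[1] / v[2])


# Images of two parallel scene edges (made-up pixel coordinates).
vp = vanishing_point(((0, 0), (1, 1)), ((0, 2), (2, 3)))
```

With noisy line detections, more than two lines would normally be combined in a least-squares estimate instead of a single pairwise intersection.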

Multimodal audiovisual speech recognition architecture using a three-feature multi-fusion method for noise-robust systems

Sanghun Jeon;Jieun Lee;Dohyeon Yeo;Yong-Ju Lee;SeungJun Kim
- ETRI Journal / Vol. 46, No. 1 / pp. 22-34 / 2024
Exposure to varied noisy environments impairs the recognition performance of artificial intelligence-based speech recognition technologies. Degraded-performance services can be utilized as limited systems that assure good performance in certain environments, but impair the general quality of speech recognition services. This study introduces an audiovisual speech recognition (AVSR) model robust to various noise settings, mimicking human dialogue recognition elements. The model converts word embeddings and log-Mel spectrograms into feature vectors for audio recognition. A dense spatial-temporal convolutional neural network model extracts features from log-Mel spectrograms, transformed for visual-based recognition. This approach exhibits improved aural and visual recognition capabilities. We assess the signal-to-noise ratio in nine synthesized noise environments, with the proposed model exhibiting lower average error rates. The error rate for the AVSR model using a three-feature multi-fusion method is 1.711%, compared to the general 3.939% rate. This model is applicable in noise-affected environments owing to its enhanced stability and recognition rate.
https://doi.org/10.4218/etrij.2023-0266

LiDAR Data Interpolation Algorithm for 3D-2D Motion Estimation

전현호;고윤호
- Journal of Korea Multimedia Society / Vol. 20, No. 12 / pp. 1865-1873 / 2017
Feature-based visual SLAM requires 3D positions for the extracted feature points in order to perform 3D-2D motion estimation. LiDAR can provide reliable and accurate 3D position information at low computational cost, whereas a stereo camera suffers from three problems: stereo matching fails in image regions with simple texture, depth values are inaccurate because of errors in the intrinsic and extrinsic camera parameters, and the number of depth values is limited by the permissible stereo disparity. However, the sparsity of LiDAR data may reduce the accuracy of motion estimation and can even cause it to fail. Therefore, in this paper, we propose three methods for interpolating sparse LiDAR data. Simulation results obtained by applying these three methods to a visual odometry algorithm demonstrate that selective bilinear interpolation performs best in terms of computation speed and accuracy.
https://doi.org/10.9717/kmms.2017.20.12.1865
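For readers unfamiliar with the interpolation step, the sketch below estimates depth at a feature point from four surrounding LiDAR samples by plain bilinear interpolation. The abstract does not explain what makes the paper's variant "selective", so the guard in `guarded_bilinear_depth`, which refuses to interpolate across a large depth discontinuity, is purely an assumption for illustration, as are all function and parameter names.

```python
def bilinear_depth(x, y, x0, y0, x1, y1, depths):
    """Bilinear interpolation of depth at (x, y).

    depths = (d00, d10, d01, d11) are the samples at
    (x0, y0), (x1, y0), (x0, y1), (x1, y1) respectively.
    """
    d00, d10, d01, d11 = depths
    tx = (x - x0) / (x1 - x0)
    ty = (y - y0) / (y1 - y0)
    top = d00 * (1 - tx) + d10 * tx        # interpolate along x at y0
    bottom = d01 * (1 - tx) + d11 * tx     # interpolate along x at y1
    return top * (1 - ty) + bottom * ty    # then along y


def guarded_bilinear_depth(x, y, x0, y0, x1, y1, depths, max_spread=1.0):
    """Hypothetical guard: skip interpolation when the four samples differ
    by more than max_spread, since they likely straddle an object boundary."""
    if max(depths) - min(depths) > max_spread:
        return None
    return bilinear_depth(x, y, x0, y0, x1, y1, depths)


center = bilinear_depth(0.5, 0.5, 0, 0, 1, 1, (10.0, 12.0, 14.0, 16.0))
```

Returning `None` lets the caller drop the feature from 3D-2D motion estimation instead of feeding it a depth blended across two surfaces.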

Lip Detection using Color Distribution and Support Vector Machine for Visual Feature Extraction of Bimodal Speech Recognition System

정지년;양현승
- Journal of KISS (Korea Information Science Society): Software and Applications / Vol. 31, No. 4 / pp. 403-410 / 2004
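Bimodal speech recognizers were devised to improve speech recognition performance in noisy environments. In a bimodal speech recognizer, visual feature extraction from images plays a very important role, and lip localization is an important prerequisite for it. This paper proposes a lip localization method for visual feature extraction that uses color distribution and an SVM. The proposed method learns the face and lip color distributions to quickly find an initial lip position and then uses an SVM to find the precise position, so that the lips are located both quickly and accurately. Experiments show that the method is suitable for use in a bimodal recognizer.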

A Tree Regularized Classifier-Exploiting Hierarchical Structure Information in Feature Vector for Human Action Recognition

Luo, Huiwu;Zhao, Fei;Chen, Shangfeng;Lu, Huanzhang
- KSII Transactions on Internet and Information Systems (TIIS) / Vol. 11, No. 3 / pp. 1614-1632 / 2017
Bag of visual words is a popular model in human action recognition, but it usually suffers from the loss of spatial and temporal configuration information of local features and from large quantization error in its feature coding procedure. In this paper, to overcome these two deficiencies, we combine sparse coding with a spatio-temporal pyramid for human action recognition and regard this method as the baseline. More importantly, and this is the focus of the paper, we find that there is a hierarchical structure in the feature vector constructed by the baseline method. To exploit this hierarchical structure information for better recognition accuracy, we propose a tree regularized classifier. The main contributions of this paper can be summarized as follows. First, we introduce a tree regularized classifier to encode the hierarchical structure information in the feature vector for human action recognition. Second, we present an optimization algorithm to learn the parameters of the proposed classifier. Third, we evaluate the proposed classifier on the YouTube, Hollywood2, and UCF50 datasets; the experimental results show that it performs better than SVM and other popular classifiers and achieves promising results on all three datasets.
https://doi.org/10.3837/tiis.2017.03.020

Gradual Block-based Efficient Lossy Location Coding for Image Retrieval

최경민;정현일;김해광
- Journal of Broadcast Engineering / Vol. 18, No. 2 / pp. 319-322 / 2013
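The image retrieval and matching algorithms used on modern mobile devices and servers, which are being standardized in MPEG-7 CDVS (Compact Descriptor for Visual Search), are built on feature points with robust descriptors such as SIFT (scale invariant feature transform) and SURF (speeded up robust features). Each feature point consists broadly of a location (coordinates) and a descriptor. For fast and accurate retrieval, feature points must be transmitted freely from device to server or from server to device, so a number of compression algorithms have been proposed. This paper observes and studies the distribution and correlation of feature points and proposes a gradual block-size based lossy location coding algorithm that compresses coordinate information efficiently while preserving accuracy. Experimental results show that, compared with the most efficient existing algorithm, the bits per feature point decrease by 0.3 to 0.4 bits (5% to 6%) on average, and accuracy (TP, FP, TN) is maintained or slightly improved depending on the type of data.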
https://doi.org/10.5909/JBE.2013.18.2.319

Study on Co-Simulation Method of Dynamics and Guidance Algorithms for Strap-Down Image Tracker Using Unity3D

마린미카엘;김태호;방효충;조한진;조영기;최용훈
- Journal of the Korean Society for Aeronautical and Space Sciences / Vol. 46, No. 11 / pp. 911-920 / 2018
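This study investigates how to effectively track the look angle between a guided weapon equipped with a strap-down image seeker and its target, and builds a test bed in which this can be simulated visually. It describes how to maintain a high-quality feature point distribution when implementing sparse feature point tracking algorithms, such as the Lucas-Kanade optical flow algorithm, for target tracking from image information, and it extends the feature point tracking problem to the concept of feature point management. For co-simulation, the dynamic system was modeled in Matlab Simulink, the visual environment was built with the Unity3D engine, and the image processing and computer vision tasks were implemented with OpenCV.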
https://doi.org/10.5139/JKSAS.2018.46.11.911

742 search results (processing time: 0.028 s)
