Search | Korea Science

Recognition Algorithm using MFCC Feature Parameter (MFCC 특징 파라미터를 이용한 인식 알고리즘)

Choi, Jae-seung
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2016.10a
- /
- pp.773-774
- /
- 2016
배경잡음은 음성신호의 특징을 왜곡하기 때문에 음성인식 시스템의 인식율 향상의 방해요소가 된다. 따라서 본 논문에서는 배경잡음이 존재하는 환경에서의 음성인식을 실시하기 위해서, 신경회로망과 Mel 주파수 켑스트럼 계수를 사용하여 연속음성 식별 알고리즘을 제안한다. 본 논문의 실험에서는 본 알고리즘을 사용하여 배경잡음이 섞인 음성신호에 대하여 음성인식의 식별율 개선을 실현할 수 있도록 연구를 진행하며, 본 알고리즘이 유효하다는 것을 실험을 통하여 명백히 한다.
PDF

Planar-Object Position Estimation by using Scale & Affine Invariant Features (불변하는 스케일-아핀 특징 점을 이용한 평면객체의 위치 추정)

Lee, Seok-Jun;Jung, Soon-Ki
- 한국HCI학회:학술대회논문집
- /
- 2008.02a
- /
- pp.795-800
- /
- 2008
카메라로 입력되는 영상에서 객체를 인식하기 위한 노력은, 기존의 컴퓨터 비전분야에서 좋은 이슈로 연구되고 있다. 영상 내부에 등장하는 객체를 인식하고 해당 객체를 포함하고 있는 전체 이미지에서 현재 영상의 위치를 인식하기 위해서는, 영상 내에 등장할 객체에 대한 트레이닝이 필요하다. 본 논문에서는 영상에 등장할 객체에 대해서, 특징 점을 검출(feature detection)하고 각 점들이 가지는 픽셀 그라디언트 방향의 벡터 값들을 그 이웃하는 벡터 값들과 함께 DoG(difference-of-Gaussian)함수를 이용하여 정형화 한다. 이는 추후에 입력되는 영상에서 검출되는 특징 점들과 그 이웃들 간의 거리나 스케일의 비율 등의 파리미터를 이용하여 비교함으로써, 현재 특징 점들의 위치를 추정하는 정보로 사용된다. 본 논문에서는 광역의 시설 단지를 촬영한 인공위성 영상을 활용하여 시설물 내부에 존재는 건물들에 대한 초기 특징 점들을 검출하고 데이터베이스로 저장한다. 트레이닝이 마친 후에는, 프린트된 인공위성 영상내부의 특정 건물을 카메라를 이용하여 촬영하고, 이 때 입력된 영상의 특징 점을 해석하여 기존에 구축된 데이터베이스 내의 특징 점과 비교하는 과정을 거친다. 매칭되는 특징 점들은 DoG로 정형화된 벡터 값들을 이용하여 해당 건물에 대한 위치를 추정하고, 3차원으로 기 모델링 된 건물을 증강현실 기법을 이용하여 영상에 정합한 후 가시화 한다.
PDF

3D object Modeling based on Superquadrics and Constructive Solid Geometry (Superquadric 과 CSG에 기반한 3차원 모델링)

김대현;이선호;김태은;최종수
- Proceedings of the Korea Multimedia Society Conference
- /
- 2000.04a
- /
- pp.149-152
- /
- 2000
3차원 물체 형상 모델링은 인식에 있어서 중요한 역할을 차지하고 있다. 기존의 픽셀(pixel)기반 영상표현은 물체 고유의 유기적 구조를 반영할 수 없고, 에지(edge)나 기반 물체 표현법은 물체의 자세한 표현이 가능하지만 물체인식을 위해서는 많은 양의 속성들을 만들어내게된다. 따라서 물체인식을 위해서는 물체의 형상특징을 직선적으로 기술할 수 있는 체적소 기반 물체 표현 방법이 필요하다. 본 논문에서는 몇 개의 파리미터를 이용하여 3차원 정보를 효과적으로 얻을 수 있는 superquadric과 이를 기본 단위로 한 CSG(Constructive Solid Geometry) tree를 이용하여 3 차원 물체 형상모델링에 대해서 기술한다.
PDF

Background Noise Classification in Noisy Speech of Short Time Duration Using Improved Speech Parameter (개량된 음성매개변수를 사용한 지속시간이 짧은 잡음음성 중의 배경잡음 분류)

Choi, Jae-Seung
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.20 no.9
- /
- pp.1673-1678
- /
- 2016
In the area of the speech recognition processing, background noises are caused the incorrect response to the speech input, therefore the speech recognition rates are decreased by the background noises. Accordingly, a more high level noise processing techniques are required since these kinds of noise countermeasures are not simple. Therefore, this paper proposes an algorithm to distinguish between the stationary background noises or non-stationary background noises and the speech signal having short time duration in the noisy environments. The proposed algorithm uses the characteristic parameter of the improved speech signal as an important measure in order to distinguish different types of the background noises and the speech signals. Next, this algorithm estimates various kinds of the background noises using a multi-layer perceptron neural network. In this experiment, it was experimentally clear the estimation of the background noises and the speech signals.
https://doi.org/10.6109/jkiice.2016.20.9.1673 인용 PDF KSCI

A Performance Evaluation of Factors Influencing the ROI Coding Quality in JPEG2000 (JPEG2000에서 ROI 코딩 품질에 영향을 미치는 요소의 성능 평가)

Ki Jun-Kang;Kim Hyun-Joo;Lee Jum-Sook
- Journal of the Korea Society of Computer and Information
- /
- v.11 no.4 s.42
- /
- pp.197-206
- /
- 2006
One of the most significant characteristics of JPEG2000. the emerging still image standards. is the ROI (Region of Interest) coding. JPEG2000 provides a number of ROI coding mechanisms and ROI parameters. To apply them to an application, it must select the applicable values. In this paper, we evaluate how the ROI coding mechanisms and the ROI parameters influencing JPEG2000 qualify affect the ROI quality and the whole image quality. The ROI coding mechanisms are Maxshift and Implicit. and the parameters are tile size and ROI size, codeblock size, number of DWT decomposition levels and ROI importance. The bigger the tile size, the better the quality. The bigger the ROI size, the ROI importance and the number of DWT decomposition levels, the worse the qualify. In code block $32{\times}32$ of Maxshift and Implicit, it has the best qualify.
PDF

Vision-based Real-time Lane Detection and Tracking for Mobile Robots in a Constrained Track Environment

Kim, Young-Ju
- Journal of the Korea Society of Computer and Information
- /
- v.24 no.11
- /
- pp.29-39
- /
- 2019
As mobile robot applications increase in real life, the need of low cost autonomous driving are gradually increasing. We propose a novel vision-based real-time lane detection and tracking system that supports autonomous driving of mobile robots in constrained tracks which are designed considering indoor driving conditions of mobile robots. Considering the processing of lanes with various shapes and the pre-adjustment of operation parameters, the system structure with multi-operation modes are designed. In parameter tuning mode, thresholds of the color filter is dynamically adjusted based on the geometric property of the lane thickness. And in the unstable input mode of curved tracks and the stable input mode of straight tracks, lane feature pixels are adaptively extracted based on the geometric and temporal characteristics of the lanes and the lane model is fitted using the least-squared method. The track centerline is calculated using lane models and the motion model is simplified and tracked by a linear Kalman filter. In the driving experiments, it was confirmed that even in low-performance robot configurations, real-time processing produces the accurate autonomous driving in the constrained track.
https://doi.org/10.9708/jksci.2019.24.11.029 인용 PDF KSCI

Search Result 6, Processing Time 0.019 seconds

Recognition Algorithm using MFCC Feature Parameter (MFCC 특징 파라미터를 이용한 인식 알고리즘)

Planar-Object Position Estimation by using Scale & Affine Invariant Features (불변하는 스케일-아핀 특징 점을 이용한 평면객체의 위치 추정)

3D object Modeling based on Superquadrics and Constructive Solid Geometry (Superquadric 과 CSG에 기반한 3차원 모델링)

Background Noise Classification in Noisy Speech of Short Time Duration Using Improved Speech Parameter (개량된 음성매개변수를 사용한 지속시간이 짧은 잡음음성 중의 배경잡음 분류)

A Performance Evaluation of Factors Influencing the ROI Coding Quality in JPEG2000 (JPEG2000에서 ROI 코딩 품질에 영향을 미치는 요소의 성능 평가)

Vision-based Real-time Lane Detection and Tracking for Mobile Robots in a Constrained Track Environment

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)