• 제목/요약/키워드: Scene Recognition

검색결과 193건 처리시간 0.03초

Accurate Location Identification by Landmark Recognition

  • Jian, Hou;Tat-Seng, Chua
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2009년도 IWAIT
    • /
    • pp.164-169
    • /
    • 2009
  • As one of the most interesting scenes, landmarks constitute a large percentage of the vast amount of scene images available on the web. On the other hand, a specific "landmark" usually has some characteristics that distinguish it from surrounding scenes and other landmarks. These two observations make the task of accurately estimating geographic information from a landmark image necessary and feasible. In this paper, we propose a method to identify landmark location by means of landmark recognition in view of significant viewpoint, illumination and temporal variations. We use GPS-based clustering to form groups for different landmarks in the image dataset. The images in each group rather fully express the possible views of the corresponding landmark. We then use a combination of edge and color histogram to match query to database images. Initial experiments with Zubud database and our collected landmark images show that is feasible.

  • PDF

3차원 거리 측정 장치를 이용한 물체 인식 (Object Recognition using 3D Depth Measurement System.)

  • 김성찬;고수홍;김형석
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2006년도 하계종합학술대회
    • /
    • pp.941-942
    • /
    • 2006
  • A depth measurement system to recognize 3D shape of objects using single camera, line laser and a rotating mirror has been investigated. The camera and the light source are fixed, facing the rotating mirror. The laser light is reflected by the mirror and projected to the scene objects whose locations are to be determined. The camera detects the laser light location on object surfaces through the same mirror. The scan over the area to be measured is done by mirror rotation. The Segmentation process of object recognition is performed using the depth data of restored 3D data. The Object recognition domain can be reduced by separating area of interest objects from complex background.

  • PDF

이동물체의 광학적 인식을 위한 합성 HMT (Synthetic hit-miss transform for optical recognition of a moving target)

  • 김종찬;김정우;이하운;도양회;김수중
    • 전자공학회논문지D
    • /
    • 제35D권3호
    • /
    • pp.82-90
    • /
    • 1998
  • A hit-miss transform(HMT) using synthetic structuring elements(SE's) for optical recognition of a moving target is proposed. A moving target which was obtained from a fixed view point has objects. In proposed HMT, SE's are synthesized by using SDF(synthetic discriminant function) algorithm for efficient recognitionof various shapes of true class objects in noisy and cluttered scene. The synthetic hit SE and the synthetic miss SE are composed of SDF of hit SE's and miss SE's for each true class object. Simulation results show the proposed method can be used for the recognition of various shapes of the true class with one one HMT operation.

  • PDF

면 법선 영상 기반형 3차원 물체인식에서의 새로운 매칭 기법 (A New Matching Strategy for SNI-based 3-D Object Recognition)

  • 박종훈;최종수
    • 전자공학회논문지B
    • /
    • 제30B권7호
    • /
    • pp.59-69
    • /
    • 1993
  • In this paper, a new matching strategy for 3-D object recognition, based on the Surface Normal Images (SNIs), is proposed. The matching strategy using the similarity decision function [9,10] lost the efficiency and the reliability of matching, because all features of models within model base must be compared with the scene object features, and the weights of the attributes of features is given by heuristic manner. However, the proposed matching strategy can solve these problems by using a new approach. In the approach, by searching the model base, a model object whose features are fully matched with the features of sceme object is selected. In this paper, the model base is constructed for the total 26 objects, and systhetic and real range images are used in the test of the system operation. Experimental result is performed to show the possibility that this strategy can be effectively used for the SNI based recognition.

  • PDF

잡음환경에서의 음성인식 성능 향상을 위한 이중채널 음성의 CASA 기반 전처리 방법 (CASA-based Front-end Using Two-channel Speech for the Performance Improvement of Speech Recognition in Noisy Environments)

  • 박지훈;윤재삼;김홍국
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2007년도 하계종합학술대회 논문집
    • /
    • pp.289-290
    • /
    • 2007
  • In order to improve the performance of a speech recognition system in the presence of noise, we propose a noise robust front-end using two-channel speech signals by separating speech from noise based on the computational auditory scene analysis (CASA). The main cues for the separation are interaural time difference (ITD) and interaural level difference (ILD) between two-channel signal. As a result, we can extract 39 cepstral coefficients are extracted from separated speech components. It is shown from speech recognition experiments that proposed front-end has outperforms the ETSI front-end with single-channel speech.

  • PDF

3D Res-Inception Network Transfer Learning for Multiple Label Crowd Behavior Recognition

  • Nan, Hao;Li, Min;Fan, Lvyuan;Tong, Minglei
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권3호
    • /
    • pp.1450-1463
    • /
    • 2019
  • The problem towards crowd behavior recognition in a serious clustered scene is extremely challenged on account of variable scales with non-uniformity. This paper aims to propose a crowed behavior classification framework based on a transferring hybrid network blending 3D res-net with inception-v3. First, the 3D res-inception network is presented so as to learn the augmented visual feature of UCF 101. Then the target dataset is applied to fine-tune the network parameters in an attempt to classify the behavior of densely crowded scenes. Finally, a transferred entropy function is used to calculate the probability of multiple labels in accordance with these features. Experimental results show that the proposed method could greatly improve the accuracy of crowd behavior recognition and enhance the accuracy of multiple label classification.

화상에서의 각도 변화를 이용한 3차원 물체 인식 (View Variations and Recognition of 2-D Objects)

  • 황보택근
    • 한국정보처리학회논문지
    • /
    • 제4권11호
    • /
    • pp.2840-2848
    • /
    • 1997
  • 컴퓨터 비전을 이용한 3차원 물체 인식은 카메라의 위치에 따라 화상에 투영되는 물체의 형상이 변하기 때문에 매우 복잡하고 어렵다. 따라서 컴퓨터 비전을 이용한 효과적인 인식 시스템을 구축하기 위해서는 각 3차원 물체에 있어서 유일하고 중요한 특성이 보는 각도에 따라 어떻게 변화하는가를 분석하고 이해하는 것이 매우 중요한 요소이다. 본 연구에서는 특징점들(landmarks)간에 이루어지는 각도 또는 3차원 다각형의 모서리(edge) 사이의 각도를 중요한 특성으로 선택하였고, orthographic 투영과 isotropic view orientation 아래에서 그러한 각도들의 보는 방향에 따른 화상에서의 변화를 2차원 결합 밀도 함수로 유도하였다. 본 논문에서 구한 수리적인 결합 밀도 함수는 통계적인 판단 규칙을 적용하여 효과적으로 물체 인식에 적용될 수 있다. 제안된 방법의 타당성 검토를 위하여 간단한 실험을 수행하였으며, 실험결과 본 방법 이 매우 효과적인 것으로 나타났다.

  • PDF

텍스트 인식률 개선을 위한 한글 텍스트 이미지 초해상화 (Korean Text Image Super-Resolution for Improving Text Recognition Accuracy)

  • 권준형;조남익
    • 방송공학회논문지
    • /
    • 제28권2호
    • /
    • pp.178-184
    • /
    • 2023
  • 카메라로 촬영한 야외 일반 영상에서 텍스트 이미지를 찾아내고 그 내용을 인식하는 기술은 로봇 비전, 시각 보조 등의 기반으로 활용될 수 있는 매우 중요한 기술이다. 하지만 텍스트 이미지가 저해상도인 경우에는 텍스트 이미지에 포함된 노이즈나 블러 등의 열화가 더 두드러지기 때문에 텍스트 내용 인식 성능의 하락이 발생하게 된다. 본 논문에서는 일반 영상에서의 저해상도 한글 텍스트에 대한 이미지 초해상화를 통해서 텍스트 인식 정확도를 개선하였다. 트랜스포머에 기반한 모델로 한글 텍스트 이미지 초해상화를 수행 하였으며, 직접 구축한 고해상도-저해상도 한글 텍스트 이미지 데이터셋에 대하여 제안한 초해상화 방법을 적용했을 때 텍스트 인식 성능이 개선되는 것을 확인하였다.

CASA 기반 음성분리 성능 향상을 위한 형태 분석 기술의 응용 (Application of Shape Analysis Techniques for Improved CASA-Based Speech Separation)

  • 이윤경;권오욱
    • 대한음성학회지:말소리
    • /
    • 제65호
    • /
    • pp.153-168
    • /
    • 2008
  • We propose a new method to apply shape analysis techniques to a computational auditory scene analysis (CASA)-based speech separation system. The conventional CASA-based speech separation system extracts speech signals from a mixture of speech and noise signals. In the proposed method, we complement the missing speech signals by applying the shape analysis techniques such as labelling and distance function. In the speech separation experiment, the proposed method improves signal-to-noise ratio by 6.6 dB. When the proposed method is used as a front-end of speech recognizers, it improves recognition accuracy by 22% for the speech-shaped stationary noise condition and 7.2% for the two-talker noise condition at the target-to-masker ratio than or equal to -3 dB.

  • PDF

사전정보를 이용한 차량번호판 영역의 분리 (Isolating vehicle license plate area using the known information)

  • 문기주;신영석;최효돈
    • 경영과학
    • /
    • 제13권2호
    • /
    • pp.1-11
    • /
    • 1996
  • Two different methods to extract the license plate area of a vehicle have been used for automatic recognition purposes. One method is with a color vision system and the other is with an edge detecting operator. The system with color vision has some problems if the colors of license plate and vehicle's body are similar. The various plate colors in Korea also drops the system performance. The edge detecting operator also has a problem for a real time processing since it performs on all pixels of the scene. In this paper a possible method using gray level vision system and available pre-known information of license plates is suggested. The suggested procedure searches the lower boundary of the plate by counting high contrast points between one and near pixel from the bottom line of the scene. It finds the upper boundary from the bottom line by adding number plate height after finding the lower boundary. The left and right boundaries are found by similar processes.

  • PDF