Search | Korea Science

Query-based Visual Attention Algorithm for Object Recognition of A Mobile Robot (이동로봇의 물체인식을 위한 질의 기반 시각 집중 알고리즘)

Ryu, Gwang-Geun;Lee, Sang-Hoon;Suh, Il-Hong
- Journal of the Institute of Electronics Engineers of Korea SC / v.44 no.1 / pp.50-58 / 2007
In this paper, we propose a query-based visual attention algorithm that helps a vision-based mobile robot find objects effectively. The algorithm extends conventional bottom-up visual attention algorithms. Various conspicuity maps are merged into a saliency map, with weighting values determined by query-dependent object properties. The saliency map is then used to find possible attentive locations of the queried object. To validate the proposed algorithm, its performance is compared with that of conventional bottom-up approaches on several objects. Color is used here as one example of a query-dependent object property.
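The merging step described in this abstract can be illustrated with a minimal sketch. It assumes the saliency map is a normalised weighted sum of conspicuity maps; the function names, map names, and weight values are hypothetical and are not taken from the paper.

```python
def merge_conspicuity_maps(maps, weights):
    """Weighted sum of equally sized 2-D conspicuity maps (assumed merging rule)."""
    total = sum(weights.get(name, 0.0) for name in maps)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    first = next(iter(maps.values()))
    rows, cols = len(first), len(first[0])
    saliency = [[0.0] * cols for _ in range(rows)]
    for name, cmap in maps.items():
        w = weights.get(name, 0.0) / total  # normalise so the weights sum to 1
        for r in range(rows):
            for c in range(cols):
                saliency[r][c] += w * cmap[r][c]
    return saliency


def most_salient_location(saliency):
    """Return (row, col) of the largest saliency value."""
    best_value, best_pos = float("-inf"), None
    for r, row in enumerate(saliency):
        for c, value in enumerate(row):
            if value > best_value:
                best_value, best_pos = value, (r, c)
    return best_pos


# Hypothetical 2x2 conspicuity maps.
maps = {
    "color": [[0.1, 0.9], [0.2, 0.3]],
    "intensity": [[1.0, 0.0], [0.2, 0.1]],
}
bottom_up = merge_conspicuity_maps(maps, {"color": 1.0, "intensity": 1.0})
query_based = merge_conspicuity_maps(maps, {"color": 0.8, "intensity": 0.2})
```

With equal weights the intensity peak wins, while a query that emphasises color moves attention to the color peak, which is the effect the abstract attributes to query-dependent weighting.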

A Study on the Robust Bimodal Speech-recognition System in Noisy Environments (잡음 환경에 강인한 이중모드 음성인식 시스템에 관한 연구)

이철우;고인선;계영철
- The Journal of the Acoustical Society of Korea / v.22 no.1 / pp.28-34 / 2003
Recent research has focused on jointly using lip motion (i.e., visual speech) and speech for reliable speech recognition in noisy environments. This paper likewise combines the result of a visual speech recognizer with that of a conventional speech recognizer by weighting each result. It proposes a method for determining proper weights, which in particular are determined autonomously depending on the amount of noise in the speech and on the image quality. Simulation results show that combining audio and visual recognition with the proposed method yields a recognition performance of 84% even in severely noisy environments. It is also shown that when images are blurred, the newly proposed weighting method, which also takes the blur into account, performs better than the other methods.
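A rough sketch of this kind of weighted late fusion follows. The abstract does not give the weighting rule, so the logistic mapping from SNR to audio reliability, the `image_quality` score in [0, 1], and all parameter values below are assumptions made purely for illustration.

```python
import math


def audio_weight(snr_db, image_quality, midpoint_db=10.0, slope=0.3):
    """Hypothetical rule: audio weight rises with SNR and falls as image quality improves."""
    audio_reliability = 1.0 / (1.0 + math.exp(-slope * (snr_db - midpoint_db)))
    visual_reliability = min(max(image_quality, 0.0), 1.0)
    denom = audio_reliability + visual_reliability
    return 0.5 if denom == 0 else audio_reliability / denom


def fuse_scores(audio_scores, visual_scores, w_audio):
    """Pick the word with the best weighted sum of per-recognizer scores."""
    fused = {
        word: w_audio * audio_scores[word] + (1.0 - w_audio) * visual_scores[word]
        for word in audio_scores
    }
    return max(fused, key=fused.get)


# Hypothetical log-likelihood scores from each recognizer.
audio_scores = {"yes": -1.0, "no": -2.0}
visual_scores = {"yes": -3.0, "no": -1.0}
decision = fuse_scores(audio_scores, visual_scores, audio_weight(0.0, 1.0))
```

In this sketch a blurred image (low `image_quality`) shifts weight back toward the audio recognizer, mirroring the abstract's point that the weights should account for blur as well as acoustic noise.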

Landscape Information Visualization of Landscape Potential Index in Hilly Openspace Conservation of Urban Fringe Area (도시주변 녹지경관의 보전.관리에 있어 경관잠재력 지표의 경관정보화와 가시화 연구)

Cho, Tong-Buhm
- Journal of Korean Society of Rural Planning / v.7 no.1 s.13 / pp.37-48 / 2001
The purpose of this study is to propose a landscape potential index for visualizing landscape information in the conservation of hilly landscapes on the urban fringe. For a visual and quantitative approach to topological landscape assessment, numerical DEM (digital elevation model) data were processed with CAD-based utilities that we developed, focusing mainly on the analysis of visibility and visual sensitivity. Results from assessing the greenbelt area of Eodeung Mt. in Gwangju proved to be of considerable use for the landscape assessment of suburban hilly landscapes. 1) Since viewpoints and viewpoint fields are critical to landscape structure, 194 randomized points (at 500 m spatial intervals) were used to assess what we call the generalized visual sensitivity. Because the distribution patterns were similar to those obtained with 56 points and 18 points chosen appropriately, using a small number of widely located viewpoints could be more efficient. 2) A regression function was derived to represent the relationship between the probability of visibility frequency and the topological factors (topological dominance, landform complexity, and relational aspect) of the target field. 3) The visibility score of each viewpoint was calculated by summing the visual sensitivity indices within a scene. Scores for the upper part, including the ridge line, were more representative of the overall distribution of visual sensitivity. In addition, the weighting values of viewpoints could be estimated rotationally from the sum of deviations of the sensitivity indices from each single point's specific index. 4) The deviational distribution of visual sensitivity classes within the topological unit of the target field was shown to represent the visual vulnerability of the landform. 5) Landscape potential indices combining visual sensitivity with the DGN (degree of green naturality) were proposed as visualized landscape information distributed by topological unit.

Development of Integrity Evaluation Techniques for Concrete Structures (콘크리트 구조물의 건전성 평가 기법 개발)

정연주;김도겸;이장화;조명석;송영철
- Proceedings of the Korea Concrete Institute Conference / 1999.10a / pp.623-626 / 1999
The structural integrity of concrete structures is affected by material and environmental factors, so developing an objective integrity evaluation method is extremely difficult. In this study, a preliminary integrity evaluation method for concrete structures is proposed. It relies on visual and detailed inspection of in-situ conditions and is based on weighting factors for the structural significance and the integrity-degrading factors of each element constituting the structure.

Spatial Histograms for Region-Based Tracking

Birchfield, Stanley T.;Rangarajan, Sriram
- ETRI Journal / v.29 no.5 / pp.697-699 / 2007
Spatiograms are histograms augmented with spatial means and covariances to capture a richer description of the target. We present a particle filtering framework for region-based tracking using spatiograms. Unlike mean shift, the framework allows for non-differentiable similarity measures to compare two spatiograms; we present one such similarity measure, a combination of a recent weighting scheme and histogram intersection. Experimental results show improved performance with the new measure as well as the importance of global spatial information for tracking. The performance of spatiograms is compared with color histograms and several texture histogram methods.
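As a point of reference for the similarity measure mentioned above, plain histogram intersection can be sketched as follows. This shows only the standard intersection of two normalised histograms; the spatial weighting scheme that the paper combines it with is not described in the abstract and is omitted here. The function names are hypothetical.

```python
def normalise(hist):
    """Scale a histogram so that its bins sum to 1."""
    total = sum(hist)
    if total <= 0:
        raise ValueError("histogram must have positive mass")
    return [h / total for h in hist]


def histogram_intersection(p, q):
    """Sum of bin-wise minima of two normalised histograms, in [0, 1]."""
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    p, q = normalise(p), normalise(q)
    return sum(min(a, b) for a, b in zip(p, q))


similarity = histogram_intersection([2, 2], [1, 3])
```

Because the bin-wise minimum is not differentiable, a measure like this cannot be used directly with gradient-based mean shift, which is consistent with the abstract's motivation for a particle filtering framework.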

Estimation of speech feature vectors and enhancement of speech recognition performance using lip information (입술정보를 이용한 음성 특징 파라미터 추정 및 음성인식 성능향상)

Min So-Hee;Kim Jin-Young;Choi Seung-Ho
- MALSORI / no.44 / pp.83-92 / 2002
Speech recognition performance is severely degraded in noisy environments. One approach to this problem is audio-visual speech recognition. In this paper, we discuss experimental results for bimodal speech recognition based on speech feature vectors enhanced using lip information. We try various speech features, such as linear prediction coefficients, cepstrum, and log area ratio, for transforming lip information into speech parameters. The experimental results show that the cepstrum parameter is the best feature in terms of recognition rate. We also present desirable weighting values for audio and visual information depending on the signal-to-noise ratio.

Wavelet Packet-Based Progressive Image Transmission (Wavelet Packet 기반 점진적 영상 전송)

Song, Joon-Ho;Lee, Gi-Hun;Park, Rae-Hong
- Journal of the Korean Institute of Telematics and Electronics S / v.35S no.8 / pp.77-85 / 1998
This paper proposes progressive image transmission (PIT) methods based on the wavelet packet transform, in which quantizers are optimized at each stage for the given bit rate. Scalar and vector quantizers are used, and the performance of each quantizer is compared. After quantization, selected subbands are ordered by their priority for transmission. The subjective quality of the reconstructed image is improved by human visual system (HVS) weighting.

MPEG-4 Rate Control Using GOV Structure (GOV구조를 이용한 MPEG-4 비트율 제어기법)

박지호;김종호;정제창
- Proceedings of the IEEK Conference / 2003.07e / pp.2056-2059 / 2003
Rate control is very important for overcoming bit-rate difficulties in transmission over a channel and for improving video quality. The number of output bits produced by encoding with a rate controller can cause many problems in channel transmission, and the decoded output bitstream directly affects the visual quality of the displayed subject. In this paper, an effective rate control algorithm based on rate-distortion modeling with an MPEG-4 encoder is proposed. The proposed rate control applies different weighting according to VOP prediction type, and even within the same VOP prediction type, a VOP used as a prediction reference is allocated more bits. Through this bit allocation, distortion can be minimized by preventing the propagation of quantization error. The bits saved by the proposed algorithm are allocated to the I-VOP, using region of interest (ROI) selective enhancement, in the encoding of the next GOV, which improves visual quality.

Representative Rating of Bridges using Condition Assessment Data (상태평가 결과를 이용한 교량의 대표등급 산정방법)

Oh, Byung-Hwan;Kim, Kwang-Soo;Shin, Kyung-Joon;Lee, Sang-Cheol
- Journal of the Korea institute for structural maintenance and inspection / v.6 no.1 / pp.111-118 / 2002
Currently, bridge inspection is conducted on the parts or elements of a bridge, and the inspection results are reported for those local elements. Therefore, a representative rating of the bridge as a whole system is not given. The purpose of the present study is to propose a reasonable method that yields a realistic representative rating for an actual bridge. The proposed method consists of two steps, i.e., a visual inspection step and a safety assessment step. The importance of members is considered by introducing weighting factors, and the number of spans is also considered, to obtain the representative rating of the whole bridge system. The proposed method may be used efficiently to calculate realistic representative ratings of bridge structures.

A Video Watermarking Method using Global Masking (전역 마스킹을 이용한 비디오 워터마킹 방법)

문지영;호요성
- Journal of Broadcast Engineering / v.8 no.3 / pp.268-277 / 2003
In this paper, we propose a new video watermarking method that exploits the human visual system (HVS) to find effective locations in the video frames, making the watermark robust and imperceptible at the same time. In particular, we propose a new HVS-optimized weighting map for hiding the watermark by considering the HVS in three different aspects: frequency, spatial, and motion masking effects. The global masking map is modeled by combining the frequency masking, the spatial masking, and the motion masking. We use a watermark generated by the bitwise exclusive-OR operation between a logo image and a random sequence. The amount of watermark is weighted by a control parameter. Furthermore, we embed the watermark in the uncompressed video sequence so that the method is a general one applicable to various coding schemes. Simulation results show that the watermark is imperceptible and that the proposed method offers good watermark capacity. It is also demonstrated that the proposed method is robust against various attacks, such as MPEG coding, MPEG re-encoding, and frame attacks.
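The watermark generation step (bitwise exclusive-OR of a logo image with a random sequence) can be sketched as below. The flat list of logo bits, the integer seed acting as a key, and the function names are assumptions for illustration; the masking map and the embedding into video frames are not shown.

```python
import random


def random_bits(n, seed):
    """Reproducible pseudo-random bit sequence; the seed plays the role of a key."""
    rng = random.Random(seed)
    return [rng.getrandbits(1) for _ in range(n)]


def make_watermark(logo_bits, seed):
    """XOR each logo bit with the key sequence to produce the watermark bits."""
    key = random_bits(len(logo_bits), seed)
    return [b ^ k for b, k in zip(logo_bits, key)]


def recover_logo(watermark_bits, seed):
    """XOR with the same key sequence undoes the scrambling."""
    key = random_bits(len(watermark_bits), seed)
    return [w ^ k for w, k in zip(watermark_bits, key)]


logo = [1, 0, 1, 1, 0, 0, 1, 0]  # hypothetical binary logo, flattened
watermark = make_watermark(logo, seed=42)
restored = recover_logo(watermark, seed=42)
```

Since XOR is its own inverse, anyone holding the same seed can recover the logo from extracted watermark bits, while the watermark itself looks like noise.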

