통합 검색 | Korea Science

Multi-level Cross-attention Siamese Network For Visual Object Tracking

Zhang, Jianwei;Wang, Jingchao;Zhang, Huanlong;Miao, Mengen;Cai, Zengyu;Chen, Fuguo
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제16권12호
- /
- pp.3976-3990
- /
- 2022
Currently, cross-attention is widely used in Siamese trackers to replace traditional correlation operations for feature fusion between template and search region. The former can establish a similar relationship between the target and the search region better than the latter for robust visual object tracking. But existing trackers using cross-attention only focus on rich semantic information of high-level features, while ignoring the appearance information contained in low-level features, which makes trackers vulnerable to interference from similar objects. In this paper, we propose a Multi-level Cross-attention Siamese network(MCSiam) to aggregate the semantic information and appearance information at the same time. Specifically, a multi-level cross-attention module is designed to fuse the multi-layer features extracted from the backbone, which integrate different levels of the template and search region features, so that the rich appearance information and semantic information can be used to carry out the tracking task simultaneously. In addition, before cross-attention, a target-aware module is introduced to enhance the target feature and alleviate interference, which makes the multi-level cross-attention module more efficient to fuse the information of the target and the search region. We test the MCSiam on four tracking benchmarks and the result show that the proposed tracker achieves comparable performance to the state-of-the-art trackers.
https://doi.org/10.3837/tiis.2022.12.011 인용 PDF KSCI HTML

Webcam-Based 2D Eye Gaze Estimation System By Means of Binary Deformable Eyeball Templates

Kim, Jin-Woo
- Journal of information and communication convergence engineering
- /
- 제8권5호
- /
- pp.575-580
- /
- 2010
Eye gaze as a form of input was primarily developed for users who are unable to use usual interaction devices such as keyboard and the mouse; however, with the increasing accuracy in eye gaze detection with decreasing cost of development, it tends to be a practical interaction method for able-bodied users in soon future as well. This paper explores a low-cost, robust, rotation and illumination independent eye gaze system for gaze enhanced user interfaces. We introduce two brand-new algorithms for fast and sub-pixel precise pupil center detection and 2D Eye Gaze estimation by means of deformable template matching methodology. In this paper, we propose a new algorithm based on the deformable angular integral search algorithm based on minimum intensity value to localize eyeball (iris outer boundary) in gray scale eye region images. Basically, it finds the center of the pupil in order to use it in our second proposed algorithm which is about 2D eye gaze tracking. First, we detect the eye regions by means of Intel OpenCV AdaBoost Haar cascade classifiers and assign the approximate size of eyeball depending on the eye region size. Secondly, using DAISMI (Deformable Angular Integral Search by Minimum Intensity) algorithm, pupil center is detected. Then, by using the percentage of black pixels over eyeball circle area, we convert the image into binary (Black and white color) for being used in the next part: DTBGE (Deformable Template based 2D Gaze Estimation) algorithm. Finally, using DTBGE algorithm, initial pupil center coordinates are assigned and DTBGE creates new pupil center coordinates and estimates the final gaze directions and eyeball size. We have performed extensive experiments and achieved very encouraging results. Finally, we discuss the effectiveness of the proposed method through several experimental results.
https://doi.org/10.6109/jicce.2010.8.5.575 인용 PDF KSCI

WebGen: 템플릿 기반 웹 스크립트 생성기 (WebGen: a Template-based Web Script Generator)

음두헌
- 정보처리학회논문지D
- /
- 제14D권5호
- /
- pp.509-516
- /
- 2007
데이터베이스와 연동하는 웹 응용에 대한 수요가 비즈니스론 포함하는 모든 분야에서 급속히 증가하고 있다. 그러나 급증하는 수요에 비해 웹 응용의 작성 및 유지 보수에 많은 시간과 노력이 소요되고 있다. 본 논문에서 소개하는 웹 스크립트 자동 생성기인 WebGen은 웹 응용에 필요한 폼들과 이 폼들을 통해 이루어지는 질의에 대해 데이터베이스와 연동하여 처리하는 웹 스크립트들을 자동 생성하는 소프트웨어 도구다. WebGen은 웹 응용 개발자가 작성하는 구성파일(configuration file)에 정의된 선언적인 내용을, 생성될 스크립트의 기본 원형인 내장된 템플릿(template)에 반영하여 5개의 웹 스크립트들(Search, Select, Edit, Information, Action)을 생성한다. Action 스크립트를 제외한 나머지 스크립트들은 사용자 인터페이스로 각각 해당되는 웹 폼을 생성한다. 따라서 WebGen은 웹 응용 작성을 위한 시간과 노력을 크게 줄여 웹 응용의 생산성을 향상시킨다. 상용 웹 스크립트 생성기들과 달리, WebGen은 상호 독립적인 템플릿들을 기반으로 하기 때문에 버전 관리가 용이하고 한 폼에 표현 가능한 정보도 관심의 대상인 엔티티 외에 이 엔티티와 직 간접적으로 연관된 모든 엔티티들을 포함한다.
https://doi.org/10.3745/KIPSTD.2007.14-D.5.509 인용 PDF KSCI

연속 CT 영상에서 템플릿 매칭을 이용한 폐결절 정합 (Pulmonary Nodule Registration using Template Matching in Serial CT Scans)

조현희;홍헬렌
- 한국정보과학회논문지:소프트웨어및응용
- /
- 제36권8호
- /
- pp.623-632
- /
- 2009
본 논문에서는 연속시점에서 촬영한 CT 영상에서 대응되는 폐결절을 추적 관찰하기 위한 폐결절 정합 방법을 제안한다. 제안 방법은 다음과 같은 다섯 단계로 구생된다. 첫째, 분할된 폐를 포함하는 최적경계볼륨의 중심으로 위치 차이를 보정한다. 둘째, 초기 CT 영상과 추적 CT 영상에서 가장 높은 밝기값을 가지고 있는 갈비뼈 구조를 포함하는 관상최대강도투사 영상을 생성한다. 셋째, 두 관상최대강도투사 영상 간의 정규화된 평균 밝기값 차이를 통해 강체 변환을 최적화한다. 넷째, 강체 정합 후에 폐결절 중심 간의 유클라디안 거리 측정을 통해 대응되는 폐결절 대응 후보를 정의한다. 마지막으로, 폐결절을 매칭하기 위하여 초기 CT 영상 내에 폐결절 템플릿과 추적 CT 영상 내에 탐색 볼륨 간의 템플릿 매칭을 수행 한다. 본 제안 방법의 결과를 평가하기 위하여 육안 평가, 정확성 및 수행시간 측정을 수행하였다. 실험결과 관상최대강도투사를 기반으로 하는 강체정합과 지역적 템플릿 매칭을 이용하여 폐결절이 정확하고 빠르게 정합됨을 알 수 있었다.
PDF KSCI

Haar-like 특징과 템플릿을 이용한 귀 검출 (Ear Detection using Haar-like Feature and Template)

한상일;차형태
- 방송공학회논문지
- /
- 제13권6호
- /
- pp.875-882
- /
- 2008
영상으로부터 사람의 귀를 검출하는 것은 생체 인식 분야에 있어서 매우 중요한 분야이다. 따라서 본 논문에서는 측면 얼굴 영상으로부터 귀를 검출하는 알고리즘을 제안한다. 제안하는 알고리즘은 먼저 피부색을 이용하여 얼굴 영역을 검출하고 검출된 얼굴 영역으로부터 Haar-like 특징을 이용하여 귀를 검출한다. 그리고 검출된 귀를 검증하기 위해 표준 템플릿을 이용하여 검출된 귀를 검증한다. 실험 결과 본 논문에서 제안된 방법은 기존의 연구에 비해 60%의 처리 속도 향상과 92%의 검출 성공률을 보였다.
https://doi.org/10.5909/JBE.2008.13.6.875 인용 PDF KSCI

템플릿 매칭을 이용한 넙치용 백신자동접종시스템 개발 (Development of a vaccine automation injection system for flatfish using a template matching)

이동길;양용수;박성욱;차봉진;허국성;김종락
- 수산해양기술연구
- /
- 제48권2호
- /
- pp.165-173
- /
- 2012
Nationally, flatfish vaccination has been performed manually, and is a laborious and time-consuming procedure with low accuracy. The handling requirement also makes it prone to contamination. With a view to eliminating these drawbacks, we designed an automatic vaccine system in which the injection is delivered by a Cartesian coordinate robot guided by a vision system. The automatic vaccine injection system is driven by an injection site location algorithm that uses a template-matching technique. The proposed algorithm was designed to derive the time and possible angles of injection by comparing a search area with a template. The algorithm is able to vaccinate various sizes of flatfish, even when they are loaded at different angles. We validated the performance of the proposed algorithm by analyzing the injection error under randomly generated loading angles. The proposed algorithm allowed an injection rate of 2000 per hour on average. Vaccination of flatfish with a body length of up to 500mm was possible, even when the orientation of the fish was random. The injection errors in various sizes of flatfish were very small, ranging from 0 to 0.6mm.
https://doi.org/10.3796/KSFT.2012.48.2.165 인용 PDF KSCI

PCA와 LDA를 이용한 실시간 얼굴 검출 및 검증 기법 (Real-time Face Detection and Verification Method using PCA and LDA)

홍은혜;고병철;변혜란
- 한국정보과학회논문지:소프트웨어및응용
- /
- 제31권2호
- /
- pp.213-223
- /
- 2004
본 논문에서는 실시간 응용을 위해 형판 정합 방법을 기반으로 하면서 동시에 외형 기반 (appearance_based) 방법에서 제시하는 학습 모델을 이용한 새로운 얼굴 검출 방법을 제안한다. 우선, 빛이나 조명의 영향에 의한 오류를 방지하기 위한 효과적인 전처리 과정으로 최소-최대 정규화(Min-max Normalization) 방법과 히스토그램 정규화 방법을 적용시킨다. 그런 뒤에 입력 영상과 형판을 PCA 변환하여 각각의 주성분(PC : Principal Component)을 생성하고 이를 LDA 변환한다. PCA 및 LDA 변환된 형판을 이용하여 입력 영상과의 거리 값을 구한 후 거리 값이 가장 작은 영역을 얼굴 영역으로 선택하고, 선택된 영역은 SVM을 이용하여 얼굴인지 아닌지를 검증하는 과정을 거친다. 또한, 본 논문에서는 실시간 얼굴 검출 방법을 위해 전체 영역이 아닌 $\pm$12 화소 크기의 탐색 윈도우를 이용하여 시스템의 속도 및 정확도를 고려하도록 하였다. 실제 환경과 같은 6개 부류의 동영상을 중심으로 실험한 결과, 본 논문에서 제안하는 방법이 기존의 PCA 변환만을 이용한 방법보다 좋은 성능을 보여줌을 알 수 있었고, 또한 SVM을 이용한 얼굴 검증 과정을 추가한 방법이 PCA 변환과 LDA 변환을 사용한 방법보다 좋은 성능을 보여줌을 알 수 있었다.
PDF KSCI

신경회로망에 기초한 자동얼굴인식 (Automatic Face Recognition Using Neural Network)

김재철;이민중;김현식;최영규
- 제어로봇시스템학회:학술대회논문집
- /
- 제어로봇시스템학회 2000년도 제15차 학술회의논문집
- /
- pp.417-417
- /
- 2000
This paper proposes a face detection and recognition method that combines the template matching method and the eigenface method with the neural network. In the face extraction step, the skin color information is used. Therefore, the search region is reduced. The global property of the face is achieved by the eigenface method. Face recognition is performed by a neural network that can learn the face property.
PDF

The 3D-QSAR study of non-peptide bradykinin antagonists by CoMFA

Park, Hea-Young;Choi, Su-Young;Lee, Su-Jin;Kam, Yu-Rim
- 대한약학회:학술대회논문집
- /
- 대한약학회 2003년도 Proceedings of the Convention of the Pharmaceutical Society of Korea Vol.2-2
- /
- pp.186.1-186.1
- /
- 2003
Bradykinin is an autocoid related to acute and chronic pain and inflammation. The non-peptide bradykinin antagonists are of interest as novel anti-inflammatory therapeutics. Some active compounds such as FR 173657, LF 160687, and bradyzide were reported very recently. In our search for the new bradykinin antagonists, we designed and synthesized the iminodiacetic acid derivatives having two or three amide bonds and lipophilic ring system in each molecule. Liquid phase combinatorial synthesis using the iminodiacetic acid template gave diverse individual compounds rapidly and efficiently on a 10-50 mg scale. (omitted)
PDF

PCB 검사를 위한 개선된 통계적 그레이레벨 모델 (Improved Statistical Grey-Level Models for PCB Inspection)

복진섭;조태훈
- 반도체디스플레이기술학회지
- /
- 제12권1호
- /
- pp.1-7
- /
- 2013
Grey-level statistical models have been widely used in many applications for object location and identification. However, conventional models yield some problems in model refinement when training images are not properly aligned, and have difficulties for real-time recognition of arbitrarily rotated models. This paper presents improved grey-level statistical models that align training images using image or feature matching to overcome problems in model refinement of conventional models, and that enable real-time recognition of arbitrarily rotated objects using efficient hierarchical search methods. Edges or features extracted from a mean training image are used for accurate alignment of models in the search image. On the aligned position and orientation, fitness measure based on grey-level statistical models is computed for object recognition. It is demonstrated in various experiments in PCB inspection that proposed methods are superior to conventional methods in recognition accuracy and speed.
PDF KSCI

검색결과 64건 처리시간 0.03초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)