Search | Korea Science

Real-Time Visual Grounding for Natural Language Instructions with Deep Neural Network (심층 신경망을 이용한 자연어 지시의 실시간 시각적 접지)

Hwang, Jisu;Kim, Incheol
- Proceedings of the Korea Information Processing Society Conference
- /
- 한국정보처리학회 2019년도 춘계학술발표대회
- /
- pp.487-490
- /
- 2019
시각과 언어 기반의 이동(VLN)은 3차원 실내 환경에서 실시간 입력 영상과 자연어 지시들을 이해함으로써, 에이전트 스스로 목적지까지 이동해야 하는 인공지능 문제이다. 이 문제는 에이전트의 영상 및 자연어 이해 능력뿐만 아니라, 상황 추론과 행동 계획 능력도 함께 요구하는 복합 지능 문제이다. 본 논문에서는 시각과 언어 기반의 이동(VLN) 작업을 위한 새로운 심층 신경망 모델을 제안한다. 제안모델에서는 입력 영상에서 합성곱 신경망을 통해 추출하는 시각적 특징과 자연어 지시에서 순환 신경망을 통해 추출하는 언어적 특징 외에, 자연어 지시에서 언급하는 장소와 랜드마크 물체들을 영상에서 별도로 탐지해내고 이들을 추가적으로 행동 선택을 위한 특징들로 이용한다. 다양한 3차원 실내 환경들을 제공하는 Matterport3D 시뮬레이터와 Room-to-Room(R2R) 벤치마크 데이터 집합을 이용한 실험들을 통해, 본 논문에서 제안하는 모델의 높은 성능과 효과를 확인할 수 있었다.
https://doi.org/10.3745/PKIPS.y2019m05a.487 인용 PDF

Real-Time Monocular Camera Pose Estimation which is Robust to Dynamic Environment (동적 환경에 강인한 단안 카메라의 실시간 자세 추정 기법)

Bak, Junhyeong;Park, In Kyu
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 한국방송∙미디어공학회 2021년도 하계학술대회
- /
- pp.322-323
- /
- 2021
증강현실이나 자율 주행, 드론 등의 기술에서 현재 위치와 시점을 파악하기 위해서는 실시간 카메라 자세 추정이 필요하다. 이를 위해 가장 일반적인 방식인 연속적인 단안 영상으로부터 카메라 자세를 추정하는 방식은 두 영상의 정적 객체 간에 견고한 특징점 매칭이 이루어져야한다. 하지만 일반적인 영상들은 다양한 이동 객체가 존재하는 동적 환경이므로 정적 객체만의 매칭을 보장하기 어렵다는 문제가 있다. 본 논문은 이 같은 동적 환경 문제를 해결하기 위해, 신경망 기반의 객체 분할 기법으로 영상 속 객체를 추출하고, 객체별 특징점 매칭 및 자세 추정 결과로 정적 객체를 특정해 매칭하는 방법을 제안한다. 또한, 제안하는 정적 객체 특정 방식에 적합한 신경망 기반 특징점 추출 방법을 사용하면 동적 환경에 보다 강인한 카메라 자세 추정이 가능함을 실험을 통해 확인한다.
PDF

Feature Extraction System for High-Speed Fingerprint Recognition using the Multi-Access Memory System (다중 접근 메모리 시스템을 이용한 고속 지문인식 특징추출 시스템)

Park, Jong Seon;Kim, Jea Hee;Ko, Kyung-Sik;Park, Jong Won
- Journal of Korea Multimedia Society
- /
- 제16권8호
- /
- pp.914-926
- /
- 2013
Among the recent security systems, security system with fingerprint recognition gets many people's interests through the strengths such as exclusiveness, convenience, etc, in comparison with other security systems. The most important matters for fingerprint recognition system are reliability of matching between the fingerprint in database and user's fingerprint and rapid process of image processing algorithms used for fingerprint recognition. The existing fingerprint recognition system reduces the processing time by removing some processes in the feature extraction algorithms but has weakness of a reliability. This paper realizes the fingerprint recognition algorithm using MAMS(Multi-Access Memory System) for both the rapid processing time and the reliability in feature extraction and matching accuracy. Reliability of this process is verified by the correlation between serial processor's results and MAMS-PP64's results. The performance of the method using MAMS-PP64 is 1.56 times faster than compared serial processor.
https://doi.org/10.9717/kmms.2013.16.8.914 인용 PDF KSCI

Extracting Ganglion in Ultrasound Image using DBSCAN and FCM based 2-layer Clustering (DBSCAN과 FCM 기반 2-Layer 클러스터링을 이용한 초음파 영상에서의 결절종 추출)

Park, Tae-eun;Song, Jae-uk;Kim, Kwang-baek
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 한국정보통신학회 2021년도 추계학술대회
- /
- pp.186-188
- /
- 2021
본 논문에서는 초음파 영상에서 DBSCAN(Density-based spatial clustering of applications with noise)과 FCM 클러스터링 기반 양자화 기법을 적용하여 결절종을 추출하는 방법을 제안한다. 본 논문에서는 초음파 영상 촬영 시 좌우 상단의 지방층 영역과 하단 영역의 명암도가 어두운 영역을 잡음 영역으로 설정한다. 그리고 초음파 영상에 퍼지스트레칭 기법을 적용하여 잡음 영역을 최대한 제거 한 후에 ROI 영역을 추출한다. 추출된 ROI 영역에서 밀도 분포를 분석하기 위하여 히스토그램을 분석한 후에 DBSCAN을 적용하여 초음파 영상에서 결절종 후보에 해당되는 명암도를 추출한다. 추출한 후보 명암도를 대상으로 FCM 클러스터링 기법을 적용한다. FCM을 적용하는 단계에서 결절종의 저에코 혹은 무에코의 특징을 이용하여 클러스터 중심 값이 가장 낮은 클러스터를 양자화 한 후에 라벨링 기법을 적용시켜 결절종의 후보 객체를 추출한다. 제안된 결절종 추출 방법의 성능을 분석하기 위해 전문의가 결절종 영역을 표기한 초음파 영상과 표기되지 않은 초음파 영상 120쌍을 대상으로 DBSCAN, FCM, 그리고 제안된 방법 간의 성능을 비교 분석하였다. 제안된 방법에서는 120개의 초음파 영상에서 106개 결절종 영역이 추출되었고 FCM 기법에서는 80개가 추출되었고 DBSCAN에서는 36개가 추출되었다. 따라서 제안된 방법이 결절종 추출에 효율적인 것을 확인하였다.
PDF

Co-registration of PET-CT Brain Images using a Gaussian Weighted Distance Map (가우시안 가중치 거리지도를 이용한 PET-CT 뇌 영상정합)

Lee, Ho;Hong, Helen;Shin, Yeong-Gil
- Journal of KIISE:Software and Applications
- /
- 제32권7호
- /
- pp.612-624
- /
- 2005
In this paper, we propose a surface-based registration using a gaussian weighted distance map for PET-CT brain image fusion. Our method is composed of three main steps: the extraction of feature points, the generation of gaussian weighted distance map, and the measure of similarities based on weight. First, we segment head using the inverse region growing and remove noise segmented with head using region growing-based labeling in PET and CT images, respectively. And then, we extract the feature points of the head using sharpening filter. Second, a gaussian weighted distance map is generated from the feature points in CT images. Thus it leads feature points to robustly converge on the optimal location in a large geometrical displacement. Third, weight-based cross-correlation searches for the optimal location using a gaussian weighted distance map of CT images corresponding to the feature points extracted from PET images. In our experiment, we generate software phantom dataset for evaluating accuracy and robustness of our method, and use clinical dataset for computation time and visual inspection. The accuracy test is performed by evaluating root-mean-square-error using arbitrary transformed software phantom dataset. The robustness test is evaluated whether weight-based cross-correlation achieves maximum at optimal location in software phantom dataset with a large geometrical displacement and noise. Experimental results showed that our method gives more accuracy and robust convergence than the conventional surface-based registration.
PDF KSCI

Geometry and Camera Recovery for Indoor Images using Homographies and Image Segmentation (Homography와 영상 분할을 미용한 실내 영상으로부터의 기하정보와 카메라 정보의 추출)

박태준;권대현;오광만
- Proceedings of the Korea Multimedia Society Conference
- /
- 한국멀티미디어학회 2000년도 추계학술발표논문집
- /
- pp.143-146
- /
- 2000
본 논문에서는 다수의 실내 영상으로부터 영상을 촬영한 카메라의 속성정보와 실내 환경에 대한 기하정보를 추출하는 방법을 제안한다. BSP-Tree를 이용하여 주어진 실영상을 각각의 부분 영역이 실제로도 평면 영역에 해당되도록 분할하였으며, 특징점 대응을 통해 각 분할된 영역의 영상간 대응을 찾고 이로부터 각 분할 영역의 homography를 계산하였다 또한 간단한 가정을 통해 계산된 homography로부터 각 분할영역에 대응된 평면의 방정식과 각 영상을 촬영한 카메라의 속성을 찾아낼 수 있믐을 보였다. 본 논문에서 제안한 방법은 현재 본 연구팀이 구현 중인 영상기반 모델링 시스템에서 핵심적인 기능을 수행하리라 기대된다.
PDF

Video Segmentation Using Image signal and Human characteristic (영상신호 특성 및 Human 특징을 이용한 실시간 영상 분류)

Kim, Min-Joon;Kim, Won-Ha
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 한국방송∙미디어공학회 2016년도 하계학술대회
- /
- pp.284-287
- /
- 2016
영상에서 배경으로부터 객체를 분류하는 영상 분류 알고리즘은 물체 인식 및 추적 등 다양한 응용분야에서 중요하다. 본 논문에서는 고정된 카메라에서 다수의 초기 프레임을 참조하여 실시간 영상 분류 방법을 제안한다. 먼저 전경과 배경을 구분하는 확률모델을 제안하였으며 초기 프레임 동안에 카메라의 특성을 추출하여 카메라에 적응적으로 영상을 분류한다. 또한 분류된 영상에서 human의 특징을 이용하여 분류된 결과를 보정하는 방법을 제안한다. 마지막으로 제안한 알고리즘의 실시간 분류 처리를 위하여 복잡도를 최소화 하였다.
PDF

A Selective Attention Based Target Detection System in Noisy Images (잡영 영상에서의 선택적 주의 기반 목표물 탐지 시스템)

최경주;이일병
- Proceedings of the Korean Information Science Society Conference
- /
- 한국정보과학회 2002년도 봄 학술발표논문집 Vol.29 No.1 (B)
- /
- pp.622-624
- /
- 2002
본 논문에서는 선택적 주의에 기반한 잡영 영상에서의 목표물 탐지 방법에 대해 기술한다. 특히 제안하는 방법은 목표물에 대한 아무런 지식을 사용하지 않고, 단지 입력되는 영상의 상향식 단서만을 사용하여 목표물을 탐지해냄으로써 여러 다양한 분야에 일반적으로 사용될 수 있다. 제안하는 시스템에서는 몇 가지 기본 특징들이 입력된 영상에서 바로 추출되며, 이러한 특징들이 서로 통합되어 가는 과정에서 목표물 탐지에 유용하지 않은 정보는 자연스럽게 걸러지며, 유용한 정보는 추가되고 부각되어진다. 간단한 영상부터 복잡한 자연영상에 이르는 다양한 잡영 영상을 대상으로 실험하여 제안하는 시스템의 성능을 평가하였다.
PDF

A Study on Moving Vehicles Segmentation and Tracking using Logic Operations (논리 연산을 이용한 주행차량 분할 및 추적에 관한 연구)

조경민;최기호
- Proceedings of the Korea Multimedia Society Conference
- /
- 한국멀티미디어학회 2004년도 춘계학술발표대회논문집
- /
- pp.211-214
- /
- 2004
본 논문은 논리 연산을 이용한 실시간 주행 차량 분할 및 추적에 관한 알고리즘을 제안하였다. 연속된 프레임 간에 논리연산을 이용하여 영상을 분할하고, 배경과 잡음을 제거하였으며 영상에서 주행차량의 이동 영역을 추출하였다. 주행차량들을 논리 연산을 이용하여 영상분할 함으로써 기존 방법에 비해 평활화 및 에지추출 단계에서 나타날 수 있는 문제점들을 제거하였고, 전처리 단계를 줄였으며, 알고리즘을 단순화 하였다. 또한 추적되는 영상으로부터 위치와 컬러등의 주행 차량의 특징을 직접 추출 가능하도륵 하였다.
PDF

Automatic Detection of Dissimilar Regions through Multiple Feature Analysis (다중의 특징 분석을 통한 비 유사 영역의 자동적인 검출)

Jang, Seok-Woo;Jung, Myunghee
- Journal of the Korea Academia-Industrial cooperation Society
- /
- 제21권2호
- /
- pp.160-166
- /
- 2020
As mobile-based hardware technology develops, many kinds of applications are also being developed. In addition, there is an increasing demand to automatically check that the interface of these applications works correctly. In this paper, we describe a method for accurately detecting faulty images from applications by comparing major characteristics from input color images. For this purpose, our method first extracts major characteristics of the input image, then calculates the differences in the extracted major features, and decides if the test image is a normal image or a faulty image dissimilar to the reference image. Experiment results show that the suggested approach robustly determines similar and dissimilar images by comparing major characteristics from input color images. The suggested method is expected to be useful in many real application areas related to computer vision, like video indexing, object detection and tracking, image surveillance, and so on.
https://doi.org/10.5762/KAIS.2020.21.2.160 인용 PDF KSCI

검색결과 2,333건 처리시간 0.033초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)