• 제목/요약/키워드: Visual Feature Extraction

검색결과 141건 처리시간 0.024초

A new approach for content-based video retrieval

  • Kim, Nac-Woo;Lee, Byung-Tak;Koh, Jai-Sang;Song, Ho-Young
    • International Journal of Contents
    • /
    • 제4권2호
    • /
    • pp.24-28
    • /
    • 2008
  • In this paper, we propose a new approach for content-based video retrieval using non-parametric based motion classification in the shot-based video indexing structure. Our system proposed in this paper has supported the real-time video retrieval using spatio-temporal feature comparison by measuring the similarity between visual features and between motion features, respectively, after extracting representative frame and non-parametric motion information from shot-based video clips segmented by scene change detection method. The extraction of non-parametric based motion features, after the normalized motion vectors are created from an MPEG-compressed stream, is effectively fulfilled by discretizing each normalized motion vector into various angle bins, and by considering the mean, variance, and direction of motion vectors in these bins. To obtain visual feature in representative frame, we use the edge-based spatial descriptor. Experimental results show that our approach is superior to conventional methods with regard to the performance for video indexing and retrieval.

Robust Control of Robot Manipulators using Vision Systems

  • 이영찬;지민석;이강웅
    • 한국항행학회논문지
    • /
    • 제7권2호
    • /
    • pp.162-170
    • /
    • 2003
  • In this paper, we propose a robust controller for trajectory control of n-link robot manipulators using feature based on visual feedback. In order to reduce tracking error of the robot manipulator due to parametric uncertainties, integral action is included in the dynamic control part of the inner control loop. The desired trajectory for tracking is generated from feature extraction by the camera mounted on the end effector. The stability of the robust state feedback control system is shown by the Lyapunov method. Simulation and experimental results on a 5-link robot manipulator with two degree of freedom show that the proposed method has good tracking performance.

  • PDF

An approach to visual pattern recognition by neural network system

  • Hatakeyama, Yasuhiro;Kakazu, Yukinori
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1992년도 한국자동제어학술회의논문집(국제학술편); KOEX, Seoul; 19-21 Oct. 1992
    • /
    • pp.61-64
    • /
    • 1992
  • In this paper, a visual pattern recognition system is proposed, which can recognize both a pattern and its location. This system, referred to as the expanded neocognitron, has the following capabilities: (1) A higher performance in extraction of features, and (2) A new capability for recognizing the locations of patterns. This system adopts the learning and recognizing mechanism of the neocognitron. First, the ability to classify pattern is enhanced by improving the mechanisms of feature extraction and learning algorithm. Second, the function of detecting the location of each pattern is realized by developing an architecture which does not reduce structure, i.e., the unit density is constant all the way from the input stage to the output stage.

  • PDF

Metadata Processing Technique for Similar Image Search of Mobile Platform

  • Seo, Jung-Hee
    • Journal of information and communication convergence engineering
    • /
    • 제19권1호
    • /
    • pp.36-41
    • /
    • 2021
  • Text-based image retrieval is not only cumbersome as it requires the manual input of keywords by the user, but is also limited in the semantic approach of keywords. However, content-based image retrieval enables visual processing by a computer to solve the problems of text retrieval more fundamentally. Vision applications such as extraction and mapping of image characteristics, require the processing of a large amount of data in a mobile environment, rendering efficient power consumption difficult. Hence, an effective image retrieval method on mobile platforms is proposed herein. To provide the visual meaning of keywords to be inserted into images, the efficiency of image retrieval is improved by extracting keywords of exchangeable image file format metadata from images retrieved through a content-based similar image retrieval method and then adding automatic keywords to images captured on mobile devices. Additionally, users can manually add or modify keywords to the image metadata.

시각 음성인식을 위한 영상 기반 접근방법에 기반한 강인한 시각 특징 파라미터의 추출 방법 (Robust Feature Extraction Based on Image-based Approach for Visual Speech Recognition)

  • 송민규;;민소희;김진영;나승유;황성택
    • 한국지능시스템학회논문지
    • /
    • 제20권3호
    • /
    • pp.348-355
    • /
    • 2010
  • 음성 인식 기술의 발전에도 불구하고 잡음 환경하의 음성 인식은 여전히 어려운 분야이다. 이를 해결하기 위한 방안으로 음성 정보 이외에 시각 정보를 이용한 시각 음성인식에 대한 연구가 진행되고 있다. 하지만 시각 정보 또한 음성과 마찬가지로 주위 조명 환경이나 기타, 다른 요인에 따른 영상잡음이 존재하며, 이런 영상잡음은 시각 음성 인식의 성능 저하를 야기한다. 따라서 인식 성능 향상을 위해 시각 특징 파라미터를 어떻게 추출하느냐는 하나의 관심분야이다. 본 논문에서는 HMM기반 시각 음성인식의 인식 성능 향상을 위한 영상 기반 접근방법에 따른 시각 특징 파라미터의 추출 방법에 대하여 논하고 그에 따른 인식성능을 비교하였다. 실험을 위해 105명에 화자에 대한 62단어의 데이터베이스를 구축하고, 이를 이용하여 히스토그램 매칭, 입술 접기, 프레임 간 필터링 기법, 선형마스크, DCT, PCA 등을 적용하여 시각 특징 파라미터를 추출하였다. 실험결과, 제안된 방법에 의해 추출된 특징 파라미터를 인식기에 적용하였을 때의 인식 성능은 기본 파라미터에 비해 약21%의 성능 향상이 됨을 알 수 있다.

기하학적 불변벡터기반 랜드마크 인식방법 (Landmark Recognition Method based on Geometric Invariant Vectors)

  • 차정희
    • 한국컴퓨터정보학회논문지
    • /
    • 제10권3호
    • /
    • pp.173-182
    • /
    • 2005
  • 본 논문에서는 항해 시 위치인식에 사용하기 위하여 카메라의 뷰포인트에 무관한 랜드마크를 인식하는 방법을 제안한다. 기존연구에서 사용된 특징들은 카메라의 뷰포인트에 따라 변하고 이에따른 정보 양의 증가로 위치확인을 위한 시각적인 랜드마크의 추출이 어렵다. 본 논문에서 제안된 방법은 특징 추출단계, 학습과 인식단계, 정합단계의 삼단계로 구성된다. 특징 추출단계에서는 영상의 관심영역을 설정, 이 영역 안에서 코너점을 추출하는데, 추출 시 작은 고유값의 통계적 분석을 통해 보다 정확하고 잡음에 강한 특징을 추출하는 방법을 제안한다. 학습 및 인식단계에서는 5개의 특징점으로 구성된 특징모델이 뷰포인트에 무관한 특징점인지를 검사하여 강건 특징모델을 구성한다. 정합단계에서는 시간 복잡도를 줄이고 정확한 대응점을 산출하기 위하여 유사도 평가함수와 Graham 탐색방법을 이용한 정합 방법을 제안한다. 실험에서는 다양한 실내영상을 가지고 제안한 방법과 기존방법을 비교 분석함으로써 제안한 방법의 우수함을 보였다.

  • PDF

GPU 가속화를 통한 이미지 특징점 기반 RGB-D 3차원 SLAM (Image Feature-Based Real-Time RGB-D 3D SLAM with GPU Acceleration)

  • 이동화;김형진;명현
    • 제어로봇시스템학회논문지
    • /
    • 제19권5호
    • /
    • pp.457-461
    • /
    • 2013
  • This paper proposes an image feature-based real-time RGB-D (Red-Green-Blue Depth) 3D SLAM (Simultaneous Localization and Mapping) system. RGB-D data from Kinect style sensors contain a 2D image and per-pixel depth information. 6-DOF (Degree-of-Freedom) visual odometry is obtained through the 3D-RANSAC (RANdom SAmple Consensus) algorithm with 2D image features and depth data. For speed up extraction of features, parallel computation is performed with GPU acceleration. After a feature manager detects a loop closure, a graph-based SLAM algorithm optimizes trajectory of the sensor and builds a 3D point cloud based map.

A Novel Technique for Detection of Repacked Android Application Using Constant Key Point Selection Based Hashing and Limited Binary Pattern Texture Feature Extraction

  • MA Rahim Khan;Manoj Kumar Jain
    • International Journal of Computer Science & Network Security
    • /
    • 제23권9호
    • /
    • pp.141-149
    • /
    • 2023
  • Repacked mobile apps constitute about 78% of all malware of Android, and it greatly affects the technical ecosystem of Android. Although many methods exist for repacked app detection, most of them suffer from performance issues. In this manuscript, a novel method using the Constant Key Point Selection and Limited Binary Pattern (CKPS: LBP) Feature extraction-based Hashing is proposed for the identification of repacked android applications through the visual similarity, which is a notable feature of repacked applications. The results from the experiment prove that the proposed method can effectively detect the apps that are similar visually even that are even under the double fold content manipulations. From the experimental analysis, it proved that the proposed CKPS: LBP method has a better efficiency of detecting 1354 similar applications from a repository of 95124 applications and also the computational time was 0.91 seconds within which a user could get the decision of whether the app repacked. The overall efficiency of the proposed algorithm is 41% greater than the average of other methods, and the time complexity is found to have been reduced by 31%. The collision probability of the Hashes was 41% better than the average value of the other state of the art methods.

Visual Semantic Based 3D Video Retrieval System Using HDFS

  • Ranjith Kumar, C.;Suguna, S.
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제10권8호
    • /
    • pp.3806-3825
    • /
    • 2016
  • This paper brings out a neoteric frame of reference for visual semantic based 3d video search and retrieval applications. Newfangled 3D retrieval application spotlight on shape analysis like object matching, classification and retrieval not only sticking up entirely with video retrieval. In this ambit, we delve into 3D-CBVR (Content Based Video Retrieval) concept for the first time. For this purpose we intent to hitch on BOVW and Mapreduce in 3D framework. Here, we tried to coalesce shape, color and texture for feature extraction. For this purpose, we have used combination of geometric & topological features for shape and 3D co-occurrence matrix for color and texture. After thriving extraction of local descriptors, TB-PCT (Threshold Based- Predictive Clustering Tree) algorithm is used to generate visual codebook. Further, matching is performed using soft weighting scheme with L2 distance function. As a final step, retrieved results are ranked according to the Index value and produce results .In order to handle prodigious amount of data and Efficacious retrieval, we have incorporated HDFS in our Intellection. Using 3D video dataset, we fiture the performance of our proposed system which can pan out that the proposed work gives meticulous result and also reduce the time intricacy.

Automatic Visual Feature Extraction And Measurement of Mushroom (Lentinus Edodes L.)

  • Heon-Hwang;Lee, C.H.;Lee, Y.K.
    • 한국농업기계학회:학술대회논문집
    • /
    • 한국농업기계학회 1993년도 Proceedings of International Conference for Agricultural Machinery and Process Engineering
    • /
    • pp.1230-1242
    • /
    • 1993
  • In a case of mushroom (Lentinus Edodes L.) , visual features are crucial for grading and the quantitative evaluation of the growth state. The extracted quantitative visual features can be used as a performance index for the drying process control or used for the automatic sorting and grading task. First, primary external features of the front and back sides of mushroom were analyzed. And computer vision based algorithm were developed for the extraction and measurement of those features. An automatic thresholding algorithm , which is the combined type of the window extension and maximum depth finding was developed. Freeman's chain coding was modified by gradually expanding the mask size from 3X3 to 9X9 to preserve the boundary connectivity. According to the side of mushroom determined from the automatic recognition algorithm size thickness, overall shape, and skin texture such as pattern, color (lightness) ,membrane state, and crack were quantified and measured. A portion of t e stalk was also identified and automatically removed , while reconstructing a new boundary using the Overhauser curve formulation . Algorithms applied and developed were coded using MS_C language Ver, 6.0, PC VISION Plus library functions, and VGA graphic function as a menu driven way.

  • PDF