• 제목/요약/키워드: Segmentation and feature extraction

검색결과 190건 처리시간 0.027초

Creepage Distance Measurement Using Binocular Stereo Vision on Hot-line for High Voltage Insulator

  • He, Wenjun;Wang, Jiake;Fu, Yuegang
    • Current Optics and Photonics
    • /
    • 제2권4호
    • /
    • pp.348-355
    • /
    • 2018
  • How to measure the creepage distance of an insulator quickly and accurately is a problem for the power industry at present, and the noticeable concern is that the high voltage insulation equipment cannot be measured online in the charged state. In view of this situation, we develop an on-line measurement system of creepage distance for high voltage insulators based on binocular stereo vision. We have proposed a method of generating linear structured light using a conical off-axis mirror. The feasibility and effect of two ways to solve the interference problem of strong sunlight have been discussed, one way is to use bandpass filters to enhance the contrast ratio of linear structured light in the images, and the other way is to process the images with adaptive threshold segmentation and feature point extraction. After the system is calibrated, we tested the measurement error of the on-line measurement system with a composite insulator sample. Experimental results show that the maximum relative error is 1.45% and the average relative error is 0.69%, which satisfies the task requirement of not more than 5% of the maximum relative error.

Automatic proficiency assessment of Korean speech read aloud by non-natives using bidirectional LSTM-based speech recognition

  • Oh, Yoo Rhee;Park, Kiyoung;Jeon, Hyung-Bae;Park, Jeon Gue
    • ETRI Journal
    • /
    • 제42권5호
    • /
    • pp.761-772
    • /
    • 2020
  • This paper presents an automatic proficiency assessment method for a non-native Korean read utterance using bidirectional long short-term memory (BLSTM)-based acoustic models (AMs) and speech data augmentation techniques. Specifically, the proposed method considers two scenarios, with and without prompted text. The proposed method with the prompted text performs (a) a speech feature extraction step, (b) a forced-alignment step using a native AM and non-native AM, and (c) a linear regression-based proficiency scoring step for the five proficiency scores. Meanwhile, the proposed method without the prompted text additionally performs Korean speech recognition and a subword un-segmentation for the missing text. The experimental results indicate that the proposed method with prompted text improves the performance for all scores when compared to a method employing conventional AMs. In addition, the proposed method without the prompted text has a fluency score performance comparable to that of the method with prompted text.

도심지역의 고해상도 위성영상 정합에 대한 그림자 영향 분석 (Analysis of Shadow Effect on High Resolution Satellite Image Matching in Urban Area)

  • 염준호;한유경;김용일
    • 대한공간정보학회지
    • /
    • 제21권2호
    • /
    • pp.93-98
    • /
    • 2013
  • 다중 시기에 수집된 고해상도 위성영상은 효과적인 도심지 분석과 모니터링을 위한 필수적인 자료이다. 그러나 같은 지역에 대해 다른 센서에서 수집된 영상은 물론, 동일 센서 영상이라 하더라도 두 영상간의 기하학적 위치정보가 서로 일치하지 않는 문제가 존재한다. 따라서 다중 영상의 효과적인 활용을 위해서는 영상 정합을 위해 매칭 포인트를 추출하는 일이 필수적이다. 그러나 도심지의 경우 건물, 교량, 나무, 기타 인공 구조물 등의 영향으로 넓은 영역에 그림자가 분포하며 그림자의 방향과 강도는 영상 수집 시기에 따라 달라지기 때문에 정확한 매칭 포인트를 추출하는데 어려움이 있다. 본 연구에서는 대표적인 매칭점 추출 기법인 SIFT(Scale-Invariant Feature Transform) 기법과 자동 그림자 추출 기법을 적용하여 도심지역의 그림자가 영상 정합에 미치는 영향을 분석하였다. 영상 분할을 통해 생성된 세그먼트의 분광 및 공간인자를 이용하여 그림자 객체를 추출하였으며 이 때 건물 버퍼 영역을 그림자의 인접정보로서 활용하였다. SIFT 기법을 통해 추출된 매칭점이 그림자에 위치하는 경우 이를 제거하고 영상 정합을 수행하였다. 최종적으로 고해상도 위성영상의 정합에 대한 그림자의 영향을 분석하기 위해 추출된 매칭점과 정합 결과의 정확도를 정량적, 시각적으로 평가하였다.

Classification of Leukemia Disease in Peripheral Blood Cell Images Using Convolutional Neural Network

  • Tran, Thanh;Park, Jin-Hyuk;Kwon, Oh-Heum;Moon, Kwang-Seok;Lee, Suk-Hwan;Kwon, Ki-Ryong
    • 한국멀티미디어학회논문지
    • /
    • 제21권10호
    • /
    • pp.1150-1161
    • /
    • 2018
  • Classification is widely used in medical images to categorize patients and non-patients. However, conventional classification requires a complex procedure, including some rigid steps such as pre-processing, segmentation, feature extraction, detection, and classification. In this paper, we propose a novel convolutional neural network (CNN), called LeukemiaNet, to specifically classify two different types of leukemia, including acute lymphoblastic leukemia (ALL) and acute myeloid leukemia (AML), and non-cancerous patients. To extend the limited dataset, a PCA color augmentation process is utilized before images are input into the LeukemiaNet. This augmentation method enhances the accuracy of our proposed CNN architecture from 96.9% to 97.2% for distinguishing ALL, AML, and normal cell images.

3D Rendering of Magnetic Resonance Images using Visualization Toolkit and Microsoft.NET Framework

  • Madusanka, Nuwan;Zaben, Naim Al;Shidaifat, Alaaddin Al;Choi, Heung-Kook
    • Journal of Multimedia Information System
    • /
    • 제2권2호
    • /
    • pp.207-214
    • /
    • 2015
  • In this paper, we proposed new software for 3D rendering of MR images in the medical domain using C# wrapper of Visualization Toolkit (VTK) and Microsoft .NET framework. Our objective in developing this software was to provide medical image segmentation, 3D rendering and visualization of hippocampus for diagnosis of Alzheimer disease patients using DICOM Images. Such three dimensional visualization can play an important role in the diagnosis of Alzheimer disease. Segmented images can be used to reconstruct the 3D volume of the hippocampus, and it can be used for the feature extraction, measure the surface area and volume of hippocampus to assist the diagnosis process. This software has been designed with interactive user interfaces and graphic kernels based on Microsoft.NET framework to get benefited from C# programming techniques, in particular to design pattern and rapid application development nature, a preliminary interactive window is functioning by invoking C#, and the kernel of VTK is simultaneously embedded in to the window, where the graphics resources are then allocated. Representation of visualization is through an interactive window so that the data could be rendered according to user's preference.

동적 윤곽 모델을 이용한 이동 물체 추적 (Moving Object Tracking Using Active Contour Model)

  • 한규범;백윤수
    • 대한기계학회논문집A
    • /
    • 제27권5호
    • /
    • pp.697-704
    • /
    • 2003
  • In this paper, the visual tracking system for arbitrary shaped moving object is proposed. The established tracking system can be divided into model based method that needs previous model for target object and image based method that uses image feature. In the model based method, the reliable tracking is possible, but simplification of the shape is necessary and the application is restricted to definite target mod el. On the other hand, in the image based method, the process speed can be increased, but the shape information is lost and the tracking system is sensitive to image noise. The proposed tracking system is composed of the extraction process that recognizes the existence of moving object and tracking process that extracts dynamic characteristics and shape information of the target objects. Specially, active contour model is used to effectively track the object that is undergoing shape change. In initializatio n process of the contour model, the semi-automatic operation can be avoided and the convergence speed of the contour can be increased by the proposed effective initialization method. Also, for the efficient solution of the correspondence problem in multiple objects tracking, the variation function that uses the variation of position structure in image frame and snake energy level is proposed. In order to verify the validity and effectiveness of the proposed tracking system, real time tracking experiment for multiple moving objects is implemented.

단일 2차원 라이다 기반의 다중 특징 비교를 이용한 장애물 분류 기법 (Obstacle Classification Method using Multi Feature Comparison Based on Single 2D LiDAR)

  • 이무현;허수정;박용완
    • 제어로봇시스템학회논문지
    • /
    • 제22권4호
    • /
    • pp.253-265
    • /
    • 2016
  • We propose an obstacle classification method using multi-decision factors and decision sections based on Single 2D LiDAR. The existing obstacle classification method based on single 2D LiDAR has two specific advantages: accuracy and decreased calculation time. However, it was difficult to classify obstacle type, and therefore accurate path planning was not possible. To overcome this problem, a method of classifying obstacle type based on width data was proposed. However, width data was not sufficient to enable accurate obstacle classification. The proposed algorithm of this paper involves the comparison between decision factor and decision section to classify obstacle type. Decision factor and decision section was determined using width, standard deviation of distance, average normalized intensity, and standard deviation of normalized intensity data. Experiments using a real autonomous vehicle in a real environment showed that calculation time decreased in comparison with 2D LiDAR-based method, thus demonstrating the possibility of obstacle type classification using single 2D LiDAR.

Adaptive Cross-Device Gait Recognition Using a Mobile Accelerometer

  • Hoang, Thang;Nguyen, Thuc;Luong, Chuyen;Do, Son;Choi, Deokjai
    • Journal of Information Processing Systems
    • /
    • 제9권2호
    • /
    • pp.333-348
    • /
    • 2013
  • Mobile authentication/identification has grown into a priority issue nowadays because of its existing outdated mechanisms, such as PINs or passwords. In this paper, we introduce gait recognition by using a mobile accelerometer as not only effective but also as an implicit identification model. Unlike previous works, the gait recognition only performs well with a particular mobile specification (e.g., a fixed sampling rate). Our work focuses on constructing a unique adaptive mechanism that could be independently deployed with the specification of mobile devices. To do this, the impact of the sampling rate on the preprocessing steps, such as noise elimination, data segmentation, and feature extraction, is examined in depth. Moreover, the degrees of agreement between the gait features that were extracted from two different mobiles, including both the Average Error Rate (AER) and Intra-class Correlation Coefficients (ICC), are assessed to evaluate the possibility of constructing a device-independent mechanism. We achieved the classification accuracy approximately $91.33{\pm}0.67%$ for both devices, which showed that it is feasible and reliable to construct adaptive cross-device gait recognition on a mobile phone.

Extraction of User Preference for Video Stimuli Using EEG-Based User Responses

  • Moon, Jinyoung;Kim, Youngrae;Lee, Hyungjik;Bae, Changseok;Yoon, Wan Chul
    • ETRI Journal
    • /
    • 제35권6호
    • /
    • pp.1105-1114
    • /
    • 2013
  • Owing to the large number of video programs available, a method for accessing preferred videos efficiently through personalized video summaries and clips is needed. The automatic recognition of user states when viewing a video is essential for extracting meaningful video segments. Although there have been many studies on emotion recognition using various user responses, electroencephalogram (EEG)-based research on preference recognition of videos is at its very early stages. This paper proposes classification models based on linear and nonlinear classifiers using EEG features of band power (BP) values and asymmetry scores for four preference classes. As a result, the quadratic-discriminant-analysis-based model using BP features achieves a classification accuracy of 97.39% (${\pm}0.73%$), and the models based on the other nonlinear classifiers using the BP features achieve an accuracy of over 96%, which is superior to that of previous work only for binary preference classification. The result proves that the proposed approach is sufficient for employment in personalized video segmentation with high accuracy and classification power.

특징 분석과 프랙탈 차원을 이용한 객체 기반 영상검색 (A Object-Based Image Retrieval Using Feature Analysis and Fractal Dimension)

  • 이정봉;박장춘
    • 한국멀티미디어학회논문지
    • /
    • 제7권2호
    • /
    • pp.173-186
    • /
    • 2004
  • 영상 검색의 수행 방법으로 사람의 시각 시스템의 특성을 기반으로 주요 의미를 갖는 객체의 효과적인 특징 추출을 통한 내용기반 영상 검색 시스템을 제안한다. 관심 객체 영역이 우선적으로 검출되도록 하기 위해 영상 내에서 비교적 면적이 크고 배경색상과의 차이가 크면서 영상의 가운데 위치하는 영역을 의미를 갖는 주요 객체로 판단하였다. 영상 고유의 특징을 얻기 위해서는 영상의 객체 윤곽선의 전체 길이를 정규화 된 일정한 세그먼트로 분할한 후에 객체 윤곽선의 세그먼트가 갖는 편각차분 벡터들의 누적 합과 양분된 객체의 시그너처를 추출하여 물체의 회전과 크기 변화에 적응적인 형태 특징으로 사용한다. 이와 같은 형태 특징을 필두로 해서 질감 샘플과 칼라, 그리고 이심률 정보를 결합하여 유사도를 측정함으로써 이동, 회전 크기 변화에 강건한 검색이 가능했으며 영역의 부분적인 변화나 손상으로 인한 객체 특성의 왜곡 현상에 덜 민감하게 반응하였다. 또한 Box-Counting Dimension에 의한 프랙탈 차원을 이용하여 측정한 객체간 복잡도 관계를 기반으로 하여 영상 특징에 서로 다른 유사도 가중치를 부여하는 방법이 잘못된 검색을 최소로 하여 더욱 효율적인 검색율을 보였다.

  • PDF