• 제목/요약/키워드: vision-based techniques

검색결과 293건 처리시간 0.031초

Photon-counting linear discriminant analysis for face recognition at a distance

  • Yeom, Seok-Won
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • 제12권3호
    • /
    • pp.250-255
    • /
    • 2012
  • Face recognition has wide applications in security and surveillance systems as well as in robot vision and machine interfaces. Conventional challenges in face recognition include pose, illumination, and expression, and face recognition at a distance involves additional challenges because long-distance images are often degraded due to poor focusing and motion blurring. This study investigates the effectiveness of applying photon-counting linear discriminant analysis (Pc-LDA) to face recognition in harsh environments. A related technique, Fisher linear discriminant analysis, has been found to be optimal, but it often suffers from the singularity problem because the number of available training images is generally much smaller than the number of pixels. Pc-LDA, on the other hand, realizes the Fisher criterion in high-dimensional space without any dimensionality reduction. Therefore, it provides more invariant solutions to image recognition under distortion and degradation. Two decision rules are employed: one is based on Euclidean distance; the other, on normalized correlation. In the experiments, the asymptotic equivalence of the photon-counting method to the Fisher method is verified with simulated data. Degraded facial images are employed to demonstrate the robustness of the photon-counting classifier in harsh environments. Four types of blurring point spread functions are applied to the test images in order to simulate long-distance acquisition. The results are compared with those of conventional Eigen face and Fisher face methods. The results indicate that Pc-LDA is better than conventional facial recognition techniques.

딥 러닝 기반의 팬옵틱 분할 기법 분석 (Survey on Deep Learning-based Panoptic Segmentation Methods)

  • 권정은;조성인
    • 대한임베디드공학회논문지
    • /
    • 제16권5호
    • /
    • pp.209-214
    • /
    • 2021
  • Panoptic segmentation, which is now widely used in computer vision such as medical image analysis, and autonomous driving, helps understanding an image with holistic view. It identifies each pixel by assigning a unique class ID, and an instance ID. Specifically, it can classify 'thing' from 'stuff', and provide pixel-wise results of semantic prediction and object detection. As a result, it can solve both semantic segmentation and instance segmentation tasks through a unified single model, producing two different contexts for two segmentation tasks. Semantic segmentation task focuses on how to obtain multi-scale features from large receptive field, without losing low-level features. On the other hand, instance segmentation task focuses on how to separate 'thing' from 'stuff' and how to produce the representation of detected objects. With the advances of both segmentation techniques, several panoptic segmentation models have been proposed. Many researchers try to solve discrepancy problems between results of two segmentation branches that can be caused on the boundary of the object. In this survey paper, we will introduce the concept of panoptic segmentation, categorize the existing method into two representative methods and explain how it is operated on two methods: top-down method and bottom-up method. Then, we will analyze the performance of various methods with experimental results.

VR 환경에서의 객체의 이동 및 충돌 감지를 위한 iBeacon 신호의 활용 기법 (iBeacon Sinals Utilizing Techniques for the Moving Object and Collision Detection in VR Environment)

  • 윤창표;황치곤
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2016년도 추계학술대회
    • /
    • pp.333-334
    • /
    • 2016
  • 최근 가상현실 서비스를 위한 기술 및 디바이스의 발전과 더불어 다양한 응용 서비스 분야에 가상현실 기술이 이용되고 있다. 그러나 증강현실이 아닌 가상현실이 경우 시야를 확보하기 어렵기 때문에 이동형이 아닌 고정 상태에서의 서비스만이 가능하다는 문제를 갖는다. 본 논문에서는 가상현실 시스템에 iBeacon 기술을 이용한 실내 위치 기반 서비스의 기술을 응용하여 가상의 공간에서의 이동성 지원을 위한 다른 이동 객체와의 충돌을 감지할 수 있는 iBeacon 신호의 응용 기법을 제안한다.

  • PDF

Influence of Spiritual Leadership, Organizational Commitment and its Effect on the Performance of Lembaga Perkreditan Desa

  • RIANA, I Gede
    • The Journal of Asian Finance, Economics and Business
    • /
    • 제8권2호
    • /
    • pp.1111-1124
    • /
    • 2021
  • This research aims to investigate The Role of Calling and Membership as Mediation on the Influence of Spiritual Leadership against Organizational Commitment and its Effect on the Performance of Lembaga Perkreditan Desa (LPD). This study has outlined organizational commitment as a construct thought to be theoretically strategic to empower organizations through calling and membership. The research was conducted upon LPDs in Bali. Inferential analysis techniques have been used to test the empirical model and the hypotheses proposed in this study. The analysis technique used is the structural equation model (SEM) based on variance or component, called Partial Least Square (PLS). The significant findings of this research are, first, spiritual leadership has a positive and significant effect on calling; second, calling has a positive and significant impact on organizational commitment; and third, calling has a positive and significant effect on organizational performance. Several studies have been conducted to estimate the mediating role of calling in explaining the relationship of spiritual leadership with organizational commitment. The results of the study by Bodla and Ali (2012) and Bodla et al. (2013) state that spiritual leadership, which consists of vision and altruistic love, has a positive and significant influence on organizational commitment. Likewise, spiritual leadership has a positive and significant effect on calling.

Transfer Learning Based Real-Time Crack Detection Using Unmanned Aerial System

  • Yuvaraj, N.;Kim, Bubryur;Preethaa, K. R. Sri
    • 국제초고층학회논문집
    • /
    • 제9권4호
    • /
    • pp.351-360
    • /
    • 2020
  • Monitoring civil structures periodically is necessary for ensuring the fitness of the structures. Cracks on inner and outer surfaces of the building plays a vital role in indicating the health of the building. Conventionally, human visual inspection techniques were carried up to human reachable altitudes. Monitoring of high rise infrastructures cannot be done using this primitive method. Also, there is a necessity for more accurate prediction of cracks on building surfaces for ensuring the health and safety of the building. The proposed research focused on developing an efficient crack classification model using Transfer Learning enabled EfficientNet (TL-EN) architecture. Though many other pre-trained models were available for crack classification, they rely on more number of training parameters for better accuracy. The TL-EN model attained an accuracy of 0.99 with less number of parameters on large dataset. A bench marked METU dataset with 40000 images were used to test and validate the proposed model. The surfaces of high rise buildings were investigated using vision enabled Unmanned Arial Vehicles (UAV). These UAV is fabricated with TL-EN model schema for capturing and analyzing the real time streaming video of building surfaces.

SDCN: Synchronized Depthwise Separable Convolutional Neural Network for Single Image Super-Resolution

  • Muhammad, Wazir;Hussain, Ayaz;Shah, Syed Ali Raza;Shah, Jalal;Bhutto, Zuhaibuddin;Thaheem, Imdadullah;Ali, Shamshad;Masrour, Salman
    • International Journal of Computer Science & Network Security
    • /
    • 제21권11호
    • /
    • pp.17-22
    • /
    • 2021
  • Recently, image super-resolution techniques used in convolutional neural networks (CNN) have led to remarkable performance in the research area of digital image processing applications and computer vision tasks. Convolutional layers stacked on top of each other can design a more complex network architecture, but they also use more memory in terms of the number of parameters and introduce the vanishing gradient problem during training. Furthermore, earlier approaches of single image super-resolution used interpolation technique as a pre-processing stage to upscale the low-resolution image into HR image. The design of these approaches is simple, but not effective and insert the newer unwanted pixels (noises) in the reconstructed HR image. In this paper, authors are propose a novel single image super-resolution architecture based on synchronized depthwise separable convolution with Dense Skip Connection Block (DSCB). In addition, unlike existing SR methods that only rely on single path, but our proposed method used the synchronizes path for generating the SISR image. Extensive quantitative and qualitative experiments show that our method (SDCN) achieves promising improvements than other state-of-the-art methods.

차내 경험의 디지털 트랜스포메이션과 오디오 기반 인터페이스의 동향 및 시사점 (Trends and Implications of Digital Transformation in Vehicle Experience and Audio User Interface)

  • 김기현;권성근
    • 한국멀티미디어학회논문지
    • /
    • 제25권2호
    • /
    • pp.166-175
    • /
    • 2022
  • Digital transformation is driving so many changes in daily life and industry. The automobile industry is in a similar situation. In some cases, element techniques in areas called metabuses are also being adopted, such as 3D animated digital cockpit, around view, and voice AI, etc. Through the growth of the mobile market, the norm of human-computer interaction (HCI) has been evolving from keyboard-mouse interaction to touch screen. The core area was the graphical user interface (GUI), and recently, the audio user interface (AUI) has partially replaced the GUI. Since it is easy to access and intuitive to the user, it is quickly becoming a common area of the in-vehicle experience (IVE), especially. The benefits of a AUI are freeing the driver's eyes and hands, using fewer screens, lower interaction costs, more emotional and personal, effective for people with low vision. Nevertheless, when and where to apply a GUI or AUI are actually different approaches because some information is easier to process as we see it. In other cases, there is potential that AUI is more suitable. This is a study on a proposal to actively apply a AUI in the near future based on the context of various scenes occurring to improve IVE.

A Novel Whale Optimized TGV-FCMS Segmentation with Modified LSTM Classification for Endometrium Cancer Prediction

  • T. Satya Kiranmai;P.V.Lakshmi
    • International Journal of Computer Science & Network Security
    • /
    • 제23권5호
    • /
    • pp.53-64
    • /
    • 2023
  • Early detection of endometrial carcinoma in uterus is essential for effective treatment. Endometrial carcinoma is the worst kind of endometrium cancer among the others since it is considerably more likely to affect the additional parts of the body if not detected and treated early. Non-invasive medical computer vision, also known as medical image processing, is becoming increasingly essential in the clinical diagnosis of various diseases. Such techniques provide a tool for automatic image processing, allowing for an accurate and timely assessment of the lesion. One of the most difficult aspects of developing an effective automatic categorization system is the absence of huge datasets. Using image processing and deep learning, this article presented an artificial endometrium cancer diagnosis system. The processes in this study include gathering a dermoscopy images from the database, preprocessing, segmentation using hybrid Fuzzy C-Means (FCM) and optimizing the weights using the Whale Optimization Algorithm (WOA). The characteristics of the damaged endometrium cells are retrieved using the feature extraction approach after the Magnetic Resonance pictures have been segmented. The collected characteristics are classified using a deep learning-based methodology called Long Short-Term Memory (LSTM) and Bi-directional LSTM classifiers. After using the publicly accessible data set, suggested classifiers obtain an accuracy of 97% and segmentation accuracy of 93%.

의상 특징 기반의 동일인 식별 (Person Identification based on Clothing Feature)

  • 최유주;박선미;조위덕;김구진
    • 한국컴퓨터그래픽스학회논문지
    • /
    • 제16권1호
    • /
    • pp.1-7
    • /
    • 2010
  • 비전 기반의 감시 시스템에서 동일인의 식별은 매우 중요하다. 감시 시스템에서 주로 사용되는 CCTV 카메라의 영상은 상대적으로 낮은 해상도를 가지므로 얼굴 인식 기법을 이용하여 동일인을 식별하기는 어렵다. 본 논문에서는 CCTV 카메라 영상에서 의상 특징을 이용하여 동일인을 식별하는 알고리즘을 제안한다. 건물의 주출입구에서 출입자가 인증을 받을 때, 의상 특징이 데이터베이스에 저장된다. 그 후, 건물 내에서 촬영한 영상에 대해 배경 차감 및 피부색 발견 기법을 이용하여 의상 영역을 발견한다. 의상의 특징 벡터는 텍스처와 색상 특징을 이용하여 구성한다. 텍스처 특징은 지역적 에지 히스토그램을 이용하여 추출된다. 색상 특징은 색상 지도의 옥트리 기반 양자화(octree-based quantization)를 이용하여 추출된다. 건물 내의 촬영 영상이 주어질 때, 데이터베이스에서 의상 특징이 가장 유사한 사람을 발견함으로써 동일인을 식별하며, 의상 특징 벡터 간의 유사도 측정을 위해서는 유클리디안 거리(Euclidean distance)를 사용한다. 실험 결과, 얼굴인식 기법이 최대 43%의 성공률을 보인 데 비해, 의상 특징을 이용하여 80%의 성공률로 동일인을 식별하였다.

미시적 교통정보자료의 취득을 위한 영상기반 차량추적기술 개발 (Development of Vision-Based Vehicle Tracking for Extracting Microscopic Traffic Information)

  • 이기영;장명순
    • 대한교통학회지
    • /
    • 제23권7호
    • /
    • pp.137-148
    • /
    • 2005
  • 일정구간의 도로를 주행하는 차량에 대한 단위시간대별 위치정보를 취득하게 되면, 도로의 교통상황에 대한 정보와 개별차량의 미시적인 주행행태를 파악할 수 있게 된다. 기존 사용되는 영상기술은 짧은 지점에 대한 교통량, 속도 등의 제한적인 자료만의 취득이 가능하여 도로구간의 교통상황을 대표하는데 한계가 있다. 또한 기존 영상기술은 주행차량의 미시적행태분석을 위해서 비디오로 촬영된 영상을 한 프레임씩 수동으로 작동하여 데이터를 수집함으로써 많은 인력과 시간이 소요되었다. 본 연구에서는 차량의 단위시간대별 위치자료를 자동으로 얻어낼 수 있는 규칙기반 차량추적기술을 개발하였다. 또한 기술의 검증을 위해 130m의 도로구간에서 차량의 주행위치를 0.05초 단위로 추적한 기초 자료를 추출하였으며, 이 데이터의 가공을 통해 산출된 속도와 실측된 속도와의 비교를 통해 차량추적의 정확도를 검증하였다. 향후 이러한 차량추적기술은 도로의 교통상황에 대한 주요 정보의 제공 등의 실용적 측면과 차량의 주행행태 분석 등의 학문적 분야에 널리 활용될 수 있을 것이다.