• Title/Summary/Keyword: 영상 전처리

Search Result 1,103, Processing Time 0.035 seconds

Design of a deep learning model to determine fire occurrence in distribution switchboard using thermal imaging data (열화상 영상 데이터 기반 배전반 화재 발생 판별을 위한 딥러닝 모델 설계)

  • Dongjoon Park;Minyoung Kim
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.5
    • /
    • pp.737-745
    • /
    • 2023
  • This paper discusses a study on developing an artificial intelligence model to detect incidents of fires in distribution switchboard using thermal images. The objective of the research is to preprocess collected thermal images into suitable data for object detection models and design a model capable of determining the occurrence of fires within distribution panels. The study utilizes thermal image data from AI-HUB's industrial complex for training. Two CNN-based deep learning object detection algorithms, namely Faster R-CNN and RetinaNet, are employed to construct models. The paper compares and analyzes these two models, ultimately proposing the optimal model for the task.

Design of a Contactless Access Security System using Palm Creases and Palm Vein Pattern Matching (손금과 정맥혈관 패턴매칭을 이용한 비접촉 출입 보안시스템 설계)

  • Ki-Jung Kim
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.19 no.1
    • /
    • pp.327-334
    • /
    • 2024
  • In this paper, we developed a system with a near-infrared LED light source with a wavelength of 950nm to acquire palm vein images and a white LED light source to acquire palm creases based on Raspberry Pi. In addition, we implemented a unique pattern-extractable image processing technology that can prevent counterfeiting and enhance security of mixed creases and palmprints through image pre-processing (Gray scaling, Histogram Equalization, Blurring, Thresholding, Thinning) for the acquired vein and palm images, and secured a source technology that can be used in a security-enhanced system.

(Very Low Bitrate Image Compression Coding Based on Fractal) (프랙탈 기반 저전송율 영상 압축 부호화)

  • 곽성근
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.8
    • /
    • pp.1085-1092
    • /
    • 2002
  • Studies on image information processing have been performed since long time ago because in daily life most of information are acquired by the since of sight. Since there should be a lot of data to describe image as a digital form, data compression is required in order to store or transmit digital image. Lately among most of image compression methods adopted on image compression standards, transform coding methods have been primarily used which transforms the correlations between pixels of image on frequency domain before image compression. It is blown that the standard methods using especially DCT features blocking effect which is the major cause of degrading the quality of image at high compression rate. Fractal encoding using quadtree partition is applied after reducing original image, and we are to find a optimal encoding for the number of scaling bit and offset bit.

  • PDF

Free-viewpoint Stereoscopic TIP Generation Using Virtual Camera and Depth Map (가상 카메라와 깊이 맵을 활용하는 자유시점 입체 TIP 생성)

  • Lee, Kwang-Hoon;Jo, Cheol-Yong;Choi, Chang-Yeol;Kim, Man-Bae
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.11a
    • /
    • pp.219-222
    • /
    • 2009
  • 자유시점 비디오는 단순히 수동적으로 비디오를 보는 것이 아니라 원하는 시점을 자유로이 선택하여 보는 능동형 비디오이다. 일반적으로 다양한 위치 및 다양한 각도에 위치하는 다수의 카메라로부터 촬영된 영상을 이용하여 제작하는데, 이 기술은 박물관 투어, 엔터테인먼트 등의 다양한 분야에서 활용된다. 본 논문에서는 자유시점 비디오의 새로운 분야로 한 장의 영상을 가상 카메라와 깊이맵을 이용하여 영상 내부를 네비게이션하는 자유시점 입체 Tour-Into-Picture (TIP)을 제안한다. 오래전부터 TIP가 연구되어 왔는데, 이 분야는 한 장의 사진 내부를 탐험하면서 애니메이션으로 볼 수 있게 하는 기술이다. 제안 방법은 전처리과정으로 전경 마스크, 배경영상, 및 깊이맵을 자동 및 수동 방법으로 구한다. 다음에는 영상 내부를 항해하면서 투영 영상들을 획득한다. 배경영상과 전객객체의 3D 모델링 데이터를 기반으로 가상 카메라의 3차원 공간 이동, yaw, pitch, rolling의 회전, look-around effect, 줌인 등의 다양한 카메라 기능을 활용하여 자유시점 비디오를 구현한다. 또한 깊이정보의 특성 및 구조에 따라 놀라운 시청효과를 전달하는 카메라 기능의 설정 방법을 소개한다. 소프트웨어는 OpenGL 및 MFC Visual C++ 기반으로 구축되었으며, 실험영상으로 조선시대의 작품인 신윤복의 단오풍정을 사용하였고, 입체 애니메이션으로 제작되어 보다 실감있는 콘텐츠를 제공한다.

  • PDF

Design and Implementation of Deep Learning based System for Object Identification of Multimedia Data (멀티미디어 데이터에서 객체 식별을 위한 딥러닝 기반의 시스템 설계 및 구현)

  • Ko, Sang-Gyun;Kim, Bongjae;Kim, Jeong-Dong
    • Annual Conference of KIPS
    • /
    • 2018.10a
    • /
    • pp.606-608
    • /
    • 2018
  • 최근 CCTV나 블랙박스 등 멀티미디어 데이터를 생성해내는 장치의 사용이 늘어나고 있다. 이러한 대용량 멀티미디어 데이터가 증가함에 따라 사용자가 동영상과 같은 멀티미디어 데이터 내의 객체를 식별하기 위해서는 많은 시간을 할애하여 매뉴얼하게 일일이 찾아야 하는 한계점이 있다. 본 논문에서는 사용자가 동영상 및 이미지에서와 같은 멀티미디어 데이터에서 객체를 자동으로 식별할 수 있 수 있는 딥러닝 기반의 객체 식별 및 검색 모델을 제안한다. 제안하는 객체 식별 검색은 이미지 검색과 동영상 검색을 지원한다. 이미지 검색에서는 이미지에 존재하는 동일한 객체를 검색 대상 이미지들에서 객체를 식별하고, 이미지에 존재하는 객체를 검색하여 결과로 반환한다. 또한 동영상 검색에서는 동영상에서 검색하고자 하는 객체를 식별하고 객체가 출현하는 시간을 전처리과정을 통해 기록하며, 검색하고자 하는 동영상 내에 존재하는 객체의 검색이 가능하다. 따라서 사용자가 동영상에서 객체의 검색 시 키워드 검색이 가능하여 동영상을 모두 재생하서 객체를 식별해야 하는 번거로움을 해결할 수 있다.

A Study on Clutter Rejection using PCA and Stochastic features of Edge Image (주성분 분석법 및 외곽선 영상의 통계적 특성을 이용한 클러터 제거기법 연구)

  • Kang, Suk-Jong;Kim, Do-Jong;Bae, Hyeon-Deok
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.47 no.6
    • /
    • pp.12-18
    • /
    • 2010
  • Automatic Target Detection (ATD) systems that use forward-looking infrared (FLIR) consists of three stages. preprocessing, detection, and clutter rejection. All potential targets are extracted in preprocessing and detection stages. But, this results in a high false alarm rates. To reduce false alarm rates of ATD system, true targets are extracted in the clutter rejection stage. This paper focuses on clutter rejection stage. This paper presents a new clutter rejection technique using PCA features and stochastic features of clutters and targets. PCA features are obtained from Euclidian distances using which potential targets are projected to reduced eigenspace selected from target eigenvectors. CV is used for calculating stochastic features of edges in targets and clutters images. To distinguish between target and clutter, LDA (Linear Discriminant Analysis) is applied. The experimental results show that the proposed algorithm accurately classify clutters with a low false rate compared to PCA method or CV method

Hand Biometric Information Recognition System of Mobile Phone Image for Mobile Security (모바일 보안을 위한 모바일 폰 영상의 손 생체 정보 인식 시스템)

  • Hong, Kyungho;Jung, Eunhwa
    • Journal of Digital Convergence
    • /
    • v.12 no.4
    • /
    • pp.319-326
    • /
    • 2014
  • According to the increasing mobile security users who have experienced authentication failure by forgetting passwords, user names, or a response to a knowledge-based question have preference for biological information such as hand geometry, fingerprints, voice in personal identification and authentication. Therefore biometric verification of personal identification and authentication for mobile security provides assurance to both the customer and the seller in the internet. Our study focuses on human hand biometric information recognition system for personal identification and personal Authentication, including its shape, palm features and the lengths and widths of the fingers taken from mobile phone photographs such as iPhone4 and galaxy s2. Our hand biometric information recognition system consists of six steps processing: image acquisition, preprocessing, removing noises, extracting standard hand feature extraction, individual feature pattern extraction, hand biometric information recognition for personal identification and authentication from input images. The validity of the proposed system from mobile phone image is demonstrated through 93.5% of the sucessful recognition rate for 250 experimental data of hand shape images and palm information images from 50 subjects.

An image enhancement algorithm for detecting the license plate region using the image of the car personal recorder (차량 번호판 검출을 위한 자동차 개인 저장 장치 이미지 향상 알고리즘)

  • Yun, Jong-Ho;Choi, Myung-Ryul;Lee, Sang-Sun
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.3
    • /
    • pp.1-8
    • /
    • 2016
  • We propose an adaptive histogram stretching algorithm for application to a car's personal recorder. The algorithm was used for pre-processing to detect the license plate region in an image from a personal recorder. The algorithm employs a Probability Density Function (PDF) and Cumulative Distribution Function (CDF) to analyze the distribution diagram of the images. These two functions are calculated using an image obtained by sampling at a certain pixel interval. The images were subjected to different levels of stretching, and experiments were done on the images to extract their characteristics. The results show that the proposed algorithm provides less deterioration than conventional algorithms. Moreover, contrast is enhanced according to the characteristics of the image. The algorithm could provide better performance than existing algorithms in applications for detecting search regions for license plates.

Advanced Seam Finding Algorithm for Stitching of 360 VR Images (개선된 Seam Finder를 이용한 360 VR 이미지 스티칭 기술)

  • Son, Hui-Jeong;Han, Jong-Ki
    • Journal of Broadcast Engineering
    • /
    • v.23 no.5
    • /
    • pp.656-668
    • /
    • 2018
  • VR (Virtual Reality) is one of the important research topics in the field of multimedia application system. The quality of the visual data composed from multiple pictures depends on the performance of stitching technique. The stitching module consists of feature extraction, mapping of those, warping, seam finding, and blending. In this paper, we proposed a preprocessing scheme to provide the efficient mask for seam finder. Incorporating of the proposed mask removes the distortion, such as ghost and blurring, in the stitched image. The simulation results show that the proposed algorithm outperforms other conventional techniques in the respect of the subjective quality and the computational complexity.

A Parallel Implementation of JPEG2000 4K Ultra High Definition Image using OpenCL (OpenCL을 이용한 JPEG2000 4K 초고화질 영상처리의 병렬고속화 구현)

  • Park, Daeseung;Kim, Cheong Ghil
    • Journal of Satellite, Information and Communications
    • /
    • v.10 no.1
    • /
    • pp.1-5
    • /
    • 2015
  • With the help of fast growing multimedia technology and high preference for users of large screens, the newest video coding standard, HEVC (High Efficiency Video Coding) high-quality video compression), has been introduced. Therefore, the high definition image services which are four times more clear than conventional HD video, are getting popular. JPEG 2000 also has stated to support 4K and 8K UHD. As a result, it requires fast processing technology to read and write UHD images. This paper introduces a study on fast parallel processing technology for UHD images. For this purpose, first, JPEG 2000 is reviewed and a GPU based parallel implementation is proposed for a preprocessing of color conversion stage. The parallelled algorithm is implemented with OpenCL (Open Computing Language). The simulation results show that the proposed method shows 5 times performance improvements on processing speed for 4K UHD over the method using threads.