• 제목/요약/키워드: image resizing

검색결과 54건 처리시간 0.027초

Content-Aware Convolutional Neural Network for Object Recognition Task

  • Poernomo, Alvin;Kang, Dae-Ki
    • International journal of advanced smart convergence
    • /
    • 제5권3호
    • /
    • pp.1-7
    • /
    • 2016
  • In existing Convolutional Neural Network (CNNs) for object recognition task, there are only few efforts known to reduce the noises from the images. Both convolution and pooling layers perform the features extraction without considering the noises of the input image, treating all pixels equally important. In computer vision field, there has been a study to weight a pixel importance. Seam carving resizes an image by sacrificing the least important pixels, leaving only the most important ones. We propose a new way to combine seam carving approach with current existing CNN model for object recognition task. We attempt to remove the noises or the "unimportant" pixels in the image before doing convolution and pooling, in order to get better feature representatives. Our model shows promising result with CIFAR-10 dataset.

Image-based Extraction of Histogram Index for Concrete Crack Analysis

  • Kim, Bubryur;Lee, Dong-Eun
    • 국제학술발표논문집
    • /
    • The 9th International Conference on Construction Engineering and Project Management
    • /
    • pp.912-919
    • /
    • 2022
  • The study is an image-based assessment that uses image processing techniques to determine the condition of concrete with surface cracks. The preparations of the dataset include resizing and image filtering to ensure statistical homogeneity and noise reduction. The image dataset is then segmented, making it more suited for extracting important features and easier to evaluate. The image is transformed into grayscale which removes the hue and saturation but retains the luminance. To create a clean edge map, the edge detection process is utilized to extract the major edge features of the image. The Otsu method is used to minimize intraclass variation between black and white pixels. Additionally, the median filter was employed to reduce noise while keeping the borders of the image. Image processing techniques are used to enhance the significant features of the concrete image, especially the defects. In this study, the tonal zones of the histogram and its properties are used to analyze the condition of the concrete. By examining the histogram, the viewer will be able to determine the information on the image through the number of pixels associated and each tonal characteristic on a graph. The features of the five tonal zones of the histogram which implies the qualities of the concrete image may be evaluated based on the quality of the contrast, brightness, highlights, shadow spikes, or the condition of the shadow region that corresponds to the foreground.

  • PDF

OBLIQUE PROJECTION OPERATION FOR NEAR OPTIMAL IMAGE RESIZING

  • Lee, Chulhee
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 1996년도 학술대회
    • /
    • pp.209-212
    • /
    • 1996
  • In this paper, we propose to re-size images using an oblique projection operator instead of the orthogonal one in order to obtain faster, simpler, and more general algorithms. The main advantage is that it becomes perfectly feasible to use higher order models(e.g., splines of degree n 3). We develop the theoretical background and present a simple and practical implementation procedure that uses B-splines. Experiments show that the proposed algorithm consistently outperforms the standard interpolation method and that it essentially provides the same performance as the optimal procedure (least squares solution) with considerably less computations.

  • PDF

웨이블릿 영역에서 근사 계수의 증감 정보를 이용한 블라인드 워터마크 (A Blind Watermarking Technique Using Difference of Approximation Coefficients in Wavelet Domain)

  • 윤혜진;성영경;최태선
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 하계종합학술대회 논문집(4)
    • /
    • pp.219-222
    • /
    • 2002
  • In this paper, we propose a new blind image watermarking method in wavelet domain. It is necessary to find out watermark insertion location in blind watermark. We use horizontal and vertical difference of LL components to select watermark insertion location, because increment or decrement of successive components is rarely changed in LL band. A pseudo-random sequence is used as a watermark. Experimental results show that the proposed method is robust to various kinds of attacks such as JPEG lossy compression, averaging, median filtering, resizing, histogram equalization, and additive Gaussian noise.

  • PDF

마스크 생산 라인에서 영상 기반 마스크 필터 검사를 위한 계층적 상관관계 기반 이상 현상 탐지 (Hierarchical Correlation-based Anomaly Detection for Vision-based Mask Filter Inspection in Mask Production Lines)

  • 오건희;이효진;이헌철
    • 대한임베디드공학회논문지
    • /
    • 제16권6호
    • /
    • pp.277-283
    • /
    • 2021
  • This paper addresses the problem of vision-based mask filter inspection for mask production systems. Machine learning-based approaches can be considered to solve the problem, but they may not be applicable to mask filter inspection if normal and anomaly mask filter data are not sufficient. In such cases, handcrafted image processing methods have to be considered to solve the problem. In this paper, we propose a hierarchical correlation-based approach that combines handcrafted image processing methods to detect anomaly mask filters. The proposed approach combines image rotation, cropping and resizing, edge detection of mask filter parts, average blurring, and correlation-based decision. The proposed approach was tested and analyzed with real mask filters. The results showed that the proposed approach was able to successfully detect anomalies in mask filters.

히스토그램과 블록분할을 이용한 매칭 알고리즘 (Matching Algorithm using Histogram and Block Segmentation)

  • 박성곤;최연호;조내수;임성운;권우현
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2009년도 정보 및 제어 심포지움 논문집
    • /
    • pp.231-233
    • /
    • 2009
  • The object recognition is one of the major computer vision fields. The object recognition using features(SIFT) is finding common features in input images and query images. But the object recognition using feature methods has suffered of difficulties due to heavy calculations when resizing input images and query images. In this paper, we focused on speed up finding features in the images. we proposed method using block segmentation and histogram. Block segmentation used diving input image and than histogram decided correlation between each 1]lock and query image. This paper has confirmed that tile matching time reduced for object recognition since reducing block.

  • PDF

Attack Detection on Images Based on DCT-Based Features

  • Nirin Thanirat;Sudsanguan Ngamsuriyaroj
    • Asia pacific journal of information systems
    • /
    • 제31권3호
    • /
    • pp.335-357
    • /
    • 2021
  • As reproduction of images can be done with ease, copy detection has increasingly become important. In the duplication process, image modifications are likely to occur and some alterations are deliberate and can be viewed as attacks. A wide range of copy detection techniques has been proposed. In our study, content-based copy detection, which basically applies DCT-based features for images, namely, pixel values, edges, texture information and frequency-domain component distribution, is employed. Experiments are carried out to evaluate robustness and sensitivity of DCT-based features from attacks. As different types of DCT-based features hold different pieces of information, how features and attacks are related can be shown in their robustness and sensitivity. Rather than searching for proper features, use of robustness and sensitivity is proposed here to realize how the attacked features have changed when an image attack occurs. The experiments show that, out of ten attacks, the neural networks are able to detect seven attacks namely, Gaussian noise, S&P noise, Gamma correction (high), blurring, resizing (big), compression and rotation with mostly related to their sensitive features.

DCT 변환 계수를 이용한 축소/확대 (Upsampling and Downsampling using DCT Coefficients)

  • 박일철;권구락
    • 한국정보통신학회논문지
    • /
    • 제15권8호
    • /
    • pp.1714-1719
    • /
    • 2011
  • 각종 시각 매체들이 발달함에 따라 대부분의 영상들은 고화질의 영상을 사용하고 있다. 그 만큼 전송할 때 많은 용량을 전송해야 하기 때문에 압축된 형태를 지향하고 있으며 이뿐만 아니라 소형기기의 디스플레이 장치에 알맞은 영상을 제공해야 하는 필요성이 제기되고 있다. 본 논문에서는 DCT 영역에서 영상을 축소/확대하여 계산 량을 줄이면서 디스플레이 장치에 알맞은 영상 크기 조절 방법을 제시한다. 제안하는 방법은 DCT 영역에서 영상의 해상도를 조절할 수 있기 때문에 기존의 방법들에 비해 높은 PSNR 값을 보인다.

재보간의 특성을 이용한 디지털 이미지의 합성 영역 및 필터링 영역 검출 (Detection of Forged Regions and Filtering Regions of Digital Images Using the Characteristics of Re-interpolation)

  • 황민구;하동환
    • 한국멀티미디어학회논문지
    • /
    • 제15권2호
    • /
    • pp.179-194
    • /
    • 2012
  • 디지털 합성 이미지는 이미지가 담고 있는 진정성을 왜곡하기 때문에 사회적인 문제가 되고 있다. 이러한 디지털 합성 이미지들은 인터넷, 잡지 또는 정치적 광고를 위한 이미지들에서 흔히 볼 수 있다. 이러한 왜곡된 매체들은 이미지가 담고 있는 정보에 대한 신뢰도를 떨어트릴 수 있다. 본 논문에서는 이와 같이 대중에게 전달되는 정보의 혼란을 예방하기 위한 연구로써 디지털 합성 이미지를 판독하는데 목적이 있다. 대부분의 합성 이미지들은 이미지 크기 조절 및 회전을 이용하는 방법을 사용하기 때문에 합성 영역에 보간 (Interpolation)이 적용되게 된다. 본 논문은 보간의 흔적을 검출하는 연구로써 이미 보간이 적용된 영역과 그렇지 않은 영역에 재보간을 적용하여 두 영영간의 주파수 패턴을 검출하는 실험을 하였다. 이를 통해 합성에 사용된 보간 흔적을 검출하였으며 이미지 리터칭에 사용된 필터링 영역도 검출할 수 있었다.

이미지 데이터를 모니터링하는 관리도에서 이미지와 ROI 크기 조정의 영향 (Resizing effect of image and ROI in using control charts to monitor image data)

  • 이주형;윤형욱;이성민;이재헌
    • 응용통계연구
    • /
    • 제30권3호
    • /
    • pp.487-501
    • /
    • 2017
  • 최근 산업의 생산공정에서는 머신비전시스템을 통하여 제품의 품질특성치에 대한 정보를 이미지 데이터로 제공하는 경우가 많다. 따라서 산업과 의학 현장에서 이미지 데이터의 모니터링을 위해 관리도 절차의 필요성이 많이 대두되고 있다. 이미지 데이터를 모니터링하는 관리도 절차는 전통적으로 사용하는 관리도 절차와 유사한 점도 있지만, 데이터의 구조를 비롯하여 각 이미지에서 ROI를 설정하여 관리도 절차를 적용하는 등 서로 다른 점도 많이 있다. 이 논문에서는 생산공정에서 제공되는 이미지 데이터에 대해 관리도를 사용하는 절차를 소개하고, 이미지 또는 ROI 크기의 확대와 축소가 제품의 이상원인을 탐지하는데 어떠한 영향이 주는지를 모의실험을 통하여 알아보았고 각 관리도의 성능 또한 비교하였다.