• Title/Summary/Keyword: invariant

Search Result 2,157, Processing Time 0.031 seconds

CBIRS/TB Using Color Feature Information for A tablet Recognition (알약 인식을 위해 색 특징정보를 이용한 CBIRS/TB)

  • Koo, Gun-Seo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.2
    • /
    • pp.49-56
    • /
    • 2014
  • This thesis proposes CBIRS/TB method that uses a tablet's color distribution information and form distinctive in content-based search. CBIRS/TB can avoid misuses and improper tablet uses by conducting content-based search in commonly prescribed tablets. The existing FE-CBIRS system is limited to recognizing only the image of color and shape of the tablet, that leads to applying insufficient form-specific information. While CBIRS/TB utilizes average, standard deviation, hue and saturation of each tablets in color, brightness, and contrast, FE-CBIRS has partial-sphere application problem; only applying the typical color of the tablet. Also, in case of the shape-specific-information, Invariant Moment is mainly used for the extracted partial-spheres. This causes delayed processing time and accuracy problems. Therefore, to improve this setback, this thesis indexed color-specific-information of the extracted images into categorized classification for improved search speed and accuracy.

Parameterized Modeling of Spatially Varying PSF for Lens Aberration and Defocus

  • Wang, Chao;Chen, Juan;Jia, Hongguang;Shi, Baosong;Zhu, Ruifei;Wei, Qun;Yu, Linyao;Ge, Mingda
    • Journal of the Optical Society of Korea
    • /
    • v.19 no.2
    • /
    • pp.136-143
    • /
    • 2015
  • Image deblurring by a deconvolution method requires accurate knowledge of the blur kernel. Existing point spread function (PSF) models in the literature corresponding to lens aberrations and defocus are either parameterized and spatially invariant or spatially varying but discretely defined. In this paper, a parameterized model is developed and presented for a PSF which is spatially varying due to lens aberrations and defocus in an imaging system. The model is established from the Seidel third-order aberration coefficient and the Hu moment. A skew normal Gauss model is selected for parameterized PSF geometry structure. The accuracy of the model is demonstrated with simulations and measurements for a defocused infrared camera and a single spherical lens digital camera. Compared with optical software Code V, the visual results of two optical systems validate our analysis and proposed method in size, shape and direction. Quantitative evaluation results reveal the excellent accuracy of the blur kernel model.

Face Recognition under Varying Pose using Local Area obtained by Side-view Pose Normalization (측면 포즈정규화를 통한 부분 영역을 이용한 포즈 변화에 강인한 얼굴 인식)

  • Ahn, Byeong-Doo;Ko, Han-Seok
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.4 s.304
    • /
    • pp.59-68
    • /
    • 2005
  • This paper proposes a face recognition under varying poses using local area obtained by side-view pose normalization. General normalization methods for face recognition under varying pose have a problem with the information about invisible area of face. Generally this problem is solved by compensation, but there are many cases where the image is distorted or features lost due to compensation .To solve this problem we normalize the face pose in side-view to reduce distortion that happens mainly in areas that have large depth variation. We only use undistorted area, removing the area that has been distorted by normalization. We consider two cases of yaw pose variation and pitch pose variation, and by experiments, we confirm the improvement of recognition performance.

Recognition of Printed Hangul Text Using Circular Pattern Vectors (원형 패턴 벡터를 이용한 인쇄체 한글 인식)

  • Jeong, Ji-Ho;Choe, Tae-Yeong
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.38 no.3
    • /
    • pp.269-281
    • /
    • 2001
  • This thesis deals with a novel font-dependent Hangul recognition algorithm invariant to position translation, scaling, and rotation using circular pattern vectors. The proposed algorithm removes noise from input letters using binary morphology and generates the circular pattern vectors. The generated circular pattern vectors represent spatial distributions on several concentric circles from the center of gravity in a given letter. Then the algorithm selects the letter minimizing the distance between the reference vectors and the generated circular pattern vectors. In order to estimate performances of the proposed algorithm, the completed Batang Hangul 2,350 letters were used as test images with scaling and rotational transformations. Experimental results show that the proposed algorithm are better than conventional algorithm using the ring projection in the recognition rates of Hangul letters with scaling and rotational transformation.

  • PDF

Ringing Artifact Removal in Image Restoration Using Wavelet Transform (웨이블릿 변환을 이용한 영상복원의 물결현상 제거 방법)

  • Youn, Jin-Young;Yoo, Yoon-Jong;Jun, Sin-Young;Shin, Jeong-Ho;Paik, Joon-Ki
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.6
    • /
    • pp.78-87
    • /
    • 2008
  • Digital image find own level core media in multimedia as image restoration technology fields, which remove degradation factor for image enhancement, have been growing. Linear space-invariant image restoration algorithm often introduce ringing artifacts near sharp intensity transition areas. This paper presents a new adaptive post-filtering algorithm for reducing ringing artifact. The proposed method extracts an edge map of the image using wavelet transform Based on the edge information, ringing artifacts are detected, and removed by an adaptive bilateral filter. Experimental results show that the proposed algorithm can efficiently remove ringing artifacts with edge preservation.

GAN-based Image-to-image Translation using Multi-scale Images (다중 스케일 영상을 이용한 GAN 기반 영상 간 변환 기법)

  • Chung, Soyoung;Chung, Min Gyo
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.4
    • /
    • pp.767-776
    • /
    • 2020
  • GcGAN is a deep learning model to translate styles between images under geometric consistency constraint. However, GcGAN has a disadvantage that it does not properly maintain detailed content of an image, since it preserves the content of the image through limited geometric transformation such as rotation or flip. Therefore, in this study, we propose a new image-to-image translation method, MSGcGAN(Multi-Scale GcGAN), which improves this disadvantage. MSGcGAN, an extended model of GcGAN, performs style translation between images in a direction to reduce semantic distortion of images and maintain detailed content by learning multi-scale images simultaneously and extracting scale-invariant features. The experimental results showed that MSGcGAN was better than GcGAN in both quantitative and qualitative aspects, and it translated the style more naturally while maintaining the overall content of the image.

A Novel Multi-focus Image Fusion Technique Using Directional Multiresolution Transform (방향성 다해상도 변환을 사용한 새로운 다중초점 이미지 융합 기법)

  • Park, Dae-Chul;Atole, Ronnel R.
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.9 no.4
    • /
    • pp.59-68
    • /
    • 2009
  • This paper addresses a hybrid multi-focus image fusion scheme using the recent curvelet transform constructions. Hybridization is obtained by combining the MS fusion rule with a novel "copy" method. The proposed scheme use MS rule to fuse the m most significant terms in spectrum of an image at each decomposition level. The scheme is dubbed in this work as m-term fusion in adherence to its use of the MSC (most significant coefficients) in the transform set at any given scale, orientation, and translation. We applied the edge-sensitive objective quality measure proposed by Xydeas and Petrovic to evaluate the method. Experimental results show that the proposed scheme is a potential alternative to the redundant, shift-invariant Dual-Tree Complex Wavelet transforms. In particular, it was confirmed that a 50% m-term fusion produces outputs with no visible quality degradation.

  • PDF

Lift-Off Invariance Transformations for Electromagnetic Eddy Current Nondestructive Evaluation Signals (다양한 센서 측정 거리로부터 획득한 자기적 와전류 신호의 불변 변환 처리 기법)

  • Kim, Dae-Won
    • Journal of the Korean Magnetics Society
    • /
    • v.14 no.6
    • /
    • pp.207-212
    • /
    • 2004
  • Eddy current (EC) testing methods are widely used in a variety of applications including the inspection of steam generator tubes in nuclear power plants, aircraft parts and airframes. A key factor that affects the EC signal is lift-off which means the physical distance between a sensor and a specimen in the testing. In practice, it is difficult to keep track of the actual value of the lift -off during a specific experiment, simulation or testing in the field, which is essential for accurate interpretation of the signal to be used in the following steps. Hence it is necessary to have a scheme to render the EC signal invariant to the effects of lift-off in spite of the changes in the real world. This paper describes a new method for compensating EC signals for variations in lift-off by acquiring an invariance feature using a homomorphic operator and neural network techniques. The signals from various lift-offs are transformed to obtain a zero lift-off equivalent signal that can be subsequently used for defect characterization in the next step.

New Template Based Face Recognition Using Log-polar Mapping and Affine Transformation (로그폴라 사상과 어파인 변환을 이용한 새로운 템플릿 기반 얼굴 인식)

  • Kim, Mun-Gab;Choi, Il;Chien, Sung-Il
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.39 no.2
    • /
    • pp.1-10
    • /
    • 2002
  • This paper presents the new template based human face recognition methods to improve the recognition performance against scale and in-plane rotation variations of face images. To enhance the recognition performance, the templates are generated by linear or nonlinear operation on multiple images including different scales and rotations of faces. As the invariant features to allow for scale and rotation variations of face images, we adopt the affine transformation, the log-polar mapping, and the log-polar image based FFT. The proposed recognition methods are evaluated in terms of the recognition rate and the processing time. Experimental results show that the proposed template based methods lead to higher recognition rate than the single image based one. The affine transformation based face recognition method shows marginally higher recognition rate than those of the log-polar mapping based method and the log-polar image based FFT, while, in the aspect of processing time, the log-polar mapping based method is the fastest one.

Visual Voice Activity Detection and Adaptive Threshold Estimation for Speech Recognition (음성인식기 성능 향상을 위한 영상기반 음성구간 검출 및 적응적 문턱값 추정)

  • Song, Taeyup;Lee, Kyungsun;Kim, Sung Soo;Lee, Jae-Won;Ko, Hanseok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.34 no.4
    • /
    • pp.321-327
    • /
    • 2015
  • In this paper, we propose an algorithm for achieving robust Visual Voice Activity Detection (VVAD) for enhanced speech recognition. In conventional VVAD algorithms, the motion of lip region is found by applying an optical flow or Chaos inspired measures for detecting visual speech frames. The optical flow-based VVAD is difficult to be adopted to driving scenarios due to its computational complexity. While invariant to illumination changes, Chaos theory based VVAD method is sensitive to motion translations caused by driver's head movements. The proposed Local Variance Histogram (LVH) is robust to the pixel intensity changes from both illumination change and translation change. Hence, for improved performance in environmental changes, we adopt the novel threshold estimation using total variance change. In the experimental results, the proposed VVAD algorithm achieves robustness in various driving situations.