• Title/Summary/Keyword: Image-to-Image Translation

Search Result 303, Processing Time 0.035 seconds

Wavelet Transform Technology for Translation-invariant Iris Recognition (위치 이동에 무관한 홍채 인식을 위한 웨이블렛 변환 기술)

  • Lim, Cheol-Su
    • The KIPS Transactions:PartB
    • /
    • v.10B no.4
    • /
    • pp.459-464
    • /
    • 2003
  • This paper proposes the use of a wavelet based image transform algorithm in human iris recognition method and the effectiveness of this technique will be determined in preprocessing of extracting Iris image from the user´s eye obtained by imaging device such as CCD Camera or due to torsional rotation of the eye, and it also resolves the problem caused by invariant under translations and dilations due to tilt of the head. This technique values through the proposed translation-invariant wavelet transform algorithm rather than the conventional wavelet transform method. Therefore we extracted the best-matching iris feature values and compared the stored feature codes with the incoming data to identify the user. As result of our experimentation, this technique demonstrate the significant advantage over verification when it compares with other general types of wavelet algorithm in the measure of FAR & FRR.

ACL-GAN: Image-to-Image translation GAN with enhanced learning and hyper-parameter searching speed using new loss function (ACL-GAN: 새로운 loss 를 사용하여 하이퍼 파라메터 탐색속도와 학습속도를 향상시킨 영상변환 GAN)

  • Cho, JeongIk;Yoon, Kyoungro
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2019.11a
    • /
    • pp.41-43
    • /
    • 2019
  • Image-to-image 변환에서 인상적인 성능을 보이는 StarGAN 은 모델의 성능에 중요한 영향을 끼치는 adversarial weight, classification weight, reconstruction weight 라는 세가지 하이퍼파라미터의 결정을 전제로 하고 있다. 본 연구에서는 이 중 conditional GAN loss 인 adversarial loss 와 classification loss 를 대치할 수 있는 attribute loss를 제안함으로써, adversarial weight와 classification weight 를 최적화하는 데 걸리는 시간을 attribute weight 의 최적화에 걸리는 시간으로 대체하여 하이퍼파라미터 탐색에 걸리는 시간을 획기적으로 줄일 수 있게 하였다. 제안하는 attribute loss 는 각 특징당 GAN 을 만들 때 각 GAN 의 loss 의 합으로, 이 GAN 들은 hidden layer 를 공유하기 때문에 연산량의 증가를 거의 가져오지 않는다. 또한 reconstruction loss 를 단순화시켜 연산량을 줄인 simplified content loss 를 제안한다. StarGAN 의 reconstruction loss 는 generator 를 2 번 통과하지만 simplified content loss 는 1 번만 통과하기 때문에 연산량이 줄어든다. 또한 이미지 Framing 을 통해 배경의 왜곡을 방지하고, 양방향 성장을 통해 학습 속도를 향상시킨 아키텍쳐를 제안한다.

  • PDF

Content-based Retrieval System using Image Shape Features (영상 형태 특징을 이용한 내용 기반 검색 시스템)

  • 황병곤;정성호;이상열
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.6 no.1
    • /
    • pp.33-38
    • /
    • 2001
  • In this paper, we present an image retrieval system using shape features. The preprocessing to gain shape feature includes edge extraction using chain code. The shape features consist of center of mass, standard deviation, ratio of major axis and minor axis length. The similarity is estimated as comparing the features of query image with the features of images in database. Thus, the candidates of images are retrieved according to the order of similarity. The result of an experimentation is dullness for scale, rotation and translation. We evaluate the performance of shape features for image retrieval on a database with over 170 images. The Recall and the Precision is each 0.72 and 0.83 in the result of average experiment. So the proposed method is presented useful method.

  • PDF

A Cycle GAN-based Wallpaper Image Transformation Method for Interior Simulation (Cycle GAN 기반 벽지 인테리어 이미지 변환 기법)

  • Seong-Hoon Kim;Yo-Han Kim;Sun-Yong Kim
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.18 no.2
    • /
    • pp.349-354
    • /
    • 2023
  • As the population interested in interior design has been increasing, the global interior market has grown significantly. Global interior companies are developing and providing simulation services for various interior elements. Although wallpaper design is the most important interior element, existing wallpaper design simulation services are difficult to use due to drawbacks such as differences between expected and actual results, long simulation time, and the need for professional skills. We proposed a wallpaper image transformation method for interior design using cycle generative adversarial networks (GAN). The proposed method demonstrates that users can simulate wallpaper design within a short period of time based on interior image data using various types of wallpaper.

Development of QA Phantom Prototype for Imaged Based Radiation Treatment System (영상기반 방사선 치료기기를 위한 QA 팬텀 시작품 개발)

  • Chang, Jin-A;Oh, Seoung-Jong;Jung, Won-Kyun;Jang, Hong-Suk;Kim, Hoi-Nam;Kang, Dae-Gyu;Lee, Doo-Hyun;Suh, Tae-Suk
    • Progress in Medical Physics
    • /
    • v.19 no.2
    • /
    • pp.120-124
    • /
    • 2008
  • In this study, we developed the protopype of QA phantom for image QA including an additional component for image based radiation treatment system. The new phantom considered two main parts: Image quality and fusion accuracy. Image quality part included for daily CT number linearity and spatial resolution, and fusion accuracy part designed to simulate a simple translation-rotation setting. The CT scans of the phantom obtained from conventional CT, MVCT of Tomotherapy unit, and both image sets were satisfied the recommendation of spatial resolution. This phantom was simple and efficient for daily imaging QA, and it is important to provide a new concept of verification of image registration.

  • PDF

Object-based Image Retrieval Using Dominant Color Pair and Color Correlogram (Dominant 컬러쌍 정보와 Color Correlogram을 이용한 객체기반 영상검색)

  • 박기태;문영식
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.40 no.2
    • /
    • pp.1-8
    • /
    • 2003
  • This paper proposes an object-based image retrieval technique based on the dominant color pair information. Most of existing methods for content based retrieval extract the features from an image as a whole, instead of an object of interest. As a result, the retrieval performance tends to degrade due to the background colors. This paper proposes an object based retrieval scheme, in which an object of interest is used as a query and the similarity is measured on candidate regions of DB images where the object may exist. From the segmented image, the dominant color pair information between adjacent regions is used for selecting candidate regions. The similarity between the query image and DB image is measured by using the color correlogram technique. The dominant color pair information is robust against translation, rotation, and scaling. Experimental results show that the performance of the proposed method has been improved by reducing the errors caused by background colors.

Generating a Stereoscopic Image from a Monoscopic Camera (단안 카메라를 이용한 입체영상 생성)

  • Lee, Dong-Woo;Lee, Kwan-Wook;Kim, Man-Bae
    • Journal of Broadcast Engineering
    • /
    • v.17 no.1
    • /
    • pp.17-25
    • /
    • 2012
  • In this paper, we propose a method of producing a stereoscopic image from multiple images captured from a monoscopic camera. By translating a camera in the horizontal direction, left and right images are chosen among N captured images. For this, image edges are extracted and a rotational angle is estimated from edge orientation. Also, a translational vector is also estimated from the correlation of projected image data. Then, two optimal images are chosen and subsequently compensated using the rotational angle as well as the translational vector in order to make a satisfactory stereoscopic image. The proposed method was performed on thirty-two test image set. The subjective visual fatigue test was carried out to validate the 3D quality of stereoscopic images. In terms of visual fatigue, the 3D satisfaction ratio reached approximately 84%.

Automatic Registration Between KOMPSAT-2 and TerraSAR-X Images (KOMPSAT-2 영상과 TerraSAR-X 영상 간 자동기하보정)

  • Han, You-Kyung;Byun, Young-Gi;Chae, Tae-Byeong;Kim, Yong-Il
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.29 no.6
    • /
    • pp.667-675
    • /
    • 2011
  • In this paper, we propose an automatic image-to-image registration between high resolution multi-sensor images. To do this, TerraSAR-X image was shifted according to the initial translation differences of the x and y directions between images estimated using Mutual Information method. After that, the Canny edge operator was applied to both images to extract linear features. These features were used to design a cost function that finds matching points based on the similarities of their locations and gradient orientations. For extracting large number of evenly distributed matching points, only one point within each regular grid constructed throughout the image was extracted to the final matching point pair. The model, which combined the piecewise linear function with the global affine transformation, was applied to increase the accuracy of the geometric correction, and the proposed method showed RMSE lower than 5m in all study sites.

Sign2Gloss2Text-based Sign Language Translation with Enhanced Spatial-temporal Information Centered on Sign Language Movement Keypoints (수어 동작 키포인트 중심의 시공간적 정보를 강화한 Sign2Gloss2Text 기반의 수어 번역)

  • Kim, Minchae;Kim, Jungeun;Kim, Ha Young
    • Journal of Korea Multimedia Society
    • /
    • v.25 no.10
    • /
    • pp.1535-1545
    • /
    • 2022
  • Sign language has completely different meaning depending on the direction of the hand or the change of facial expression even with the same gesture. In this respect, it is crucial to capture the spatial-temporal structure information of each movement. However, sign language translation studies based on Sign2Gloss2Text only convey comprehensive spatial-temporal information about the entire sign language movement. Consequently, detailed information (facial expression, gestures, and etc.) of each movement that is important for sign language translation is not emphasized. Accordingly, in this paper, we propose Spatial-temporal Keypoints Centered Sign2Gloss2Text Translation, named STKC-Sign2 Gloss2Text, to supplement the sequential and semantic information of keypoints which are the core of recognizing and translating sign language. STKC-Sign2Gloss2Text consists of two steps, Spatial Keypoints Embedding, which extracts 121 major keypoints from each image, and Temporal Keypoints Embedding, which emphasizes sequential information using Bi-GRU for extracted keypoints of sign language. The proposed model outperformed all Bilingual Evaluation Understudy(BLEU) scores in Development(DEV) and Testing(TEST) than Sign2Gloss2Text as the baseline, and in particular, it proved the effectiveness of the proposed methodology by achieving 23.19, an improvement of 1.87 based on TEST BLEU-4.

Region-Based 3D Image Registration Technique for TKR (전슬관절치환술을 위한 3차원 영역기반 영상정합 기술)

  • Key, J.H.;Seo, D.C.;Park, H.S.;Youn, I.C.;Lee, M.K.;Yoo, S.K.;Choi, K.W.
    • Journal of Biomedical Engineering Research
    • /
    • v.27 no.6
    • /
    • pp.392-401
    • /
    • 2006
  • Image Guided Surgery (IGS) system which has variously tried in medical engineering fields is able to give a surgeon objective information of operation process like decision making and surgical planning. This information is displayed through 3D images which are acquired from image modalities like CT and MRI for pre-operation. The technique of image registration is necessary to construct IGS system. Image registration means that 3D model and the object operated by a surgeon are matched on the common frame. Major techniques of registration in IGS system have been used by recognizing fiducial markers placed on the object. However, this method has been criticized due to additional trauma, its invasive protocol inserting fiducial markers in patient's bone and generating noise data when 2D slice images are acquired by image modality because many markers are made of metal. Therefore, this paper developed shape-based registration technique to improve the limitation of fiducial marker based IGS system. Iterative Closest Points (ICP) algorithm was used to match corresponding points and quaternion based rotation and translation transformation using closed form solution applied to find the optimized cost function of transformation. we assumed that this algorithm were used in Total Knee replacement (TKR) operation. Accordingly, we have developed region-based 3D registration technique based on anatomical landmarks and this registration algorithm was evaluated in a femur model. It was found that region-based algorithm can improve the accuracy in 3D registration.