• Title/Summary/Keyword: pix2pix

Search Result 59

Normal map generation based on Pix2Pix for rendering fabric image (옷감 이미지 렌더링을 위한 Pix2Pix 기반의 Normal map 생성)

  • Nam, Hyeongil;Park, Jong-Il
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2020.07a / pp.257-260 / 2020
  • This paper presents a method that uses Pix2Pix to generate a normal map from a single fabric image for virtual graphics rendering. Specifically, to generate a normal map from a single image, Pix2Pix is trained on a dataset of paired color images and normal maps. Color images from the test dataset are then given as input, and the generated normal maps are examined. The normal maps generated with Pix2Pix are compared with those from the U-Net approach used in prior work, using SSIM (Structural Similarity Index) as the metric. In addition, the generated normal map is resized to fit the virtual object to be rendered, and the result of rendering it with OpenGL is examined. The paper confirms that a normal map generated by Pix2Pix from a single pattern image can realistically express the detail of fabric.
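The SSIM comparison mentioned in this abstract can be illustrated with a minimal sketch. It assumes grayscale images given as flat lists of pixel values and computes SSIM over one global window; standard SSIM averages the same statistic over local windows, and the abstract does not state which variant or settings the authors used. The function name `global_ssim` and the sample pixel values are hypothetical.

```python
def global_ssim(x, y, data_range=255.0):
    """Single-window SSIM between two equally sized grayscale images.

    x and y are flat lists of pixel values. Using one global window is a
    simplification of the usual locally windowed SSIM.
    """
    if len(x) != len(y) or not x:
        raise ValueError("images must be non-empty and the same size")
    n = len(x)
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    var_x = sum((a - mu_x) ** 2 for a in x) / n
    var_y = sum((b - mu_y) ** 2 for b in y) / n
    cov_xy = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / n
    # Stabilizing constants with the conventional K1 = 0.01, K2 = 0.03.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


# Hypothetical pixel values: a reference map and two candidate outputs.
reference = [10, 50, 90, 130, 170, 210]
candidate_a = [12, 48, 93, 128, 171, 207]   # close to the reference
candidate_b = [210, 170, 130, 90, 50, 10]   # structure reversed

print(global_ssim(reference, candidate_a))
print(global_ssim(reference, candidate_b))
```

A higher score means the candidate is structurally closer to the reference, which is how two generators would be ranked against ground-truth normal maps.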

A Study on the Image Preprosessing model linkage method for usability of Pix2Pix (Pix2Pix의 활용성을 위한 학습이미지 전처리 모델연계방안 연구)

  • Kim, Hyo-Kwan;Hwang, Won-Yong
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.15 no.5 / pp.380-386 / 2022
  • This paper proposes a method for structuring the preprocessing of training images when colorizing with Pix2Pix, one of the generative adversarial network techniques. It focuses on the fact that the prediction result can be degraded depending on the degree of light reflection in the training image. Therefore, image preprocessing and parameters for model optimization were configured before the model was applied. Increasing the image resolution of the training and prediction results requires modifying the model, so this part is designed to be tuned through parameters. In addition, logic that processes only the regions where the prediction result is damaged by light reflection was configured, together with preprocessing logic that does not distort the prediction result. To improve usability, accuracy was improved through experiments on applying the light reflection tuning filter to the training images of the Pix2Pix model and on the parameter configuration.

Image-to-Image Translation Based on U-Net with R2 and Attention (R2와 어텐션을 적용한 유넷 기반의 영상 간 변환에 관한 연구)

  • Lim, So-hyun;Chun, Jun-chul
    • Journal of Internet Computing and Services / v.21 no.4 / pp.9-16 / 2020
  • In image processing and computer vision, the problem of reconstructing one image from another or generating a new image has drawn steady attention as hardware advances. However, computer-generated images still tend to look unnatural to the human eye. With the recent surge in deep learning research, image generation and enhancement based on it are also being actively studied, and among these approaches the Generative Adversarial Network (GAN) performs well at image generation. Various GAN models have been presented since the original GAN was proposed, allowing more natural images to be generated than in earlier image generation research. Among them, pix2pix is a conditional GAN model, a general-purpose network that performs well on various datasets. pix2pix is based on U-Net, but several U-Net based networks perform better than U-Net itself. Therefore, in this study, images are generated by applying various networks in place of the U-Net of pix2pix, and the results are compared and evaluated. The generated images confirm that the pix2pix models with Attention, R2, and Attention-R2 networks perform better than the existing pix2pix model using U-Net. The limitations of the best-performing network are examined and suggested as a topic for future study.

Performance Improvement of Image-to-Image Translation with RAPGAN and RRDB (RAPGAN와 RRDB를 이용한 Image-to-Image Translation의 성능 개선)

  • Dongsik Yoon;Noyoon Kwak
    • Journal of Internet of Things and Convergence / v.9 no.1 / pp.131-138 / 2023
  • This paper concerns improving the performance of Image-to-Image translation using Relativistic Average Patch GAN and Residual in Residual Dense Block. Its purpose is to compensate for the shortcomings of the previous pix2pix, a type of Image-to-Image translation, through technical improvements in three aspects. First, unlike the previous pix2pix generator, it enables deeper learning by using Residual in Residual Dense Blocks in the part that encodes the input image. Second, a loss function based on Relativistic Average Patch GAN predicts how real the original image is relative to the generated image, so both images influence adversarial training. Finally, the generator is pre-trained to prevent the discriminator from learning prematurely. The proposed method generated images that outperform the previous pix2pix by more than 13% on average in terms of FID.
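The relativistic average idea in this abstract, scoring how real an original image is relative to generated ones, can be sketched as follows. This assumes the standard relativistic average GAN loss with a sigmoid cross-entropy form, applied to per-patch discriminator logits given as plain lists; the abstract does not give the authors' exact loss, and the function names are hypothetical.

```python
import math


def softplus(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def mean(values):
    return sum(values) / len(values)


def ra_discriminator_loss(real_logits, fake_logits):
    """Relativistic average discriminator loss over patch logits.

    Each real logit is compared with the mean fake logit and vice versa,
    so both image sets contribute to every term.
    """
    mean_real = mean(real_logits)
    mean_fake = mean(fake_logits)
    # -log(sigmoid(r - mean_fake)) == softplus(-(r - mean_fake))
    loss_real = mean([softplus(-(r - mean_fake)) for r in real_logits])
    # -log(1 - sigmoid(f - mean_real)) == softplus(f - mean_real)
    loss_fake = mean([softplus(f - mean_real) for f in fake_logits])
    return loss_real + loss_fake


def ra_generator_loss(real_logits, fake_logits):
    """Generator loss: the same expression with the roles swapped."""
    mean_real = mean(real_logits)
    mean_fake = mean(fake_logits)
    loss_fake = mean([softplus(-(f - mean_real)) for f in fake_logits])
    loss_real = mean([softplus(r - mean_fake) for r in real_logits])
    return loss_fake + loss_real


# Hypothetical patch logits from a discriminator that separates the sets well.
real = [3.0, 4.0]
fake = [-3.0, -4.0]
print(ra_discriminator_loss(real, fake))  # small: discriminator is winning
print(ra_generator_loss(real, fake))      # large: generator is penalized
```

Because the generator loss also depends on the real logits, gradients from real images reach the generator, which is the property the abstract describes as both images affecting adversarial training.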

A Study on the Restoration of Korean Traditional Palace Image by Adjusting the Receptive Field of Pix2Pix (Pix2Pix의 수용 영역 조절을 통한 전통 고궁 이미지 복원 연구)

  • Hwang, Won-Yong;Kim, Hyo-Kwan
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.15 no.5 / pp.360-366 / 2022
  • This paper presents an AI model structure for restoring photographs of traditional Korean palaces, which survive only in black and white, to color photographs using Pix2Pix, one of the generative adversarial network techniques. Pix2Pix combines a generator model that synthesizes images with a discriminator model that determines whether a synthetic image is real or fake. This paper adjusts the receptive field of the discriminator and analyzes the results in light of the characteristics of ancient palace photographs. The receptive field of Pix2Pix used to restore black-and-white photographs has commonly been fixed in size, but a fixed receptive field is not suitable for photographs that contain a wide variety of changes within an image. This paper observes the results of changing the size of the previously fixed receptive field to identify the discriminator size that best reflects the characteristics of ancient palaces. In the experiment, the receptive field of the discriminator was adjusted based on the prepared ancient palace photos. The paper measures the loss of the model as the receptive field of the discriminator changes and examines the restored photos produced by a well-trained AI model from the experiments.
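How the discriminator's receptive field changes with its depth can be worked out with a short calculation. The sketch below assumes the commonly used PatchGAN layout of 4×4 convolutions, with a run of stride-2 layers followed by two stride-1 layers; the configurations actually tested in the paper are not given in the abstract, and the helper names are hypothetical.

```python
def receptive_field(layers):
    """Receptive field (in input pixels) of one output unit.

    layers is a list of (kernel_size, stride) pairs in forward order.
    """
    rf = 1      # receptive field so far
    jump = 1    # distance in input pixels between adjacent units
    for kernel, stride in layers:
        rf += (kernel - 1) * jump
        jump *= stride
    return rf


def patchgan_layers(n_strided):
    """Assumed PatchGAN layout: n stride-2 convs, then two stride-1 convs."""
    return [(4, 2)] * n_strided + [(4, 1)] * 2


for n in (1, 2, 3):
    print(n, receptive_field(patchgan_layers(n)))
```

With three stride-2 layers this layout gives the 70×70 patch of the original pix2pix discriminator; adding or removing strided layers is one way to enlarge or shrink the patch each discriminator output judges.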

Loss of βPix Causes Defects in Early Embryonic Development, and Cell Spreading and Platelet-Derived Growth Factor-Induced Chemotaxis in Mouse Embryonic Fibroblasts

  • Kang, TaeIn;Lee, Seung Joon;Kwon, Younghee;Park, Dongeun
    • Molecules and Cells / v.42 no.8 / pp.589-596 / 2019
  • βPix is a guanine nucleotide exchange factor for the Rho family small GTPases, Rac1 and Cdc42. It is known to regulate focal adhesion dynamics and cell migration. However, the in vivo role of βPix is currently not well understood. Here, we report the production and characterization of βPix-KO mice. Loss of βPix results in embryonic lethality accompanied by abnormal developmental features, such as incomplete neural tube closure, impaired axial rotation, and failure of allantois-chorion fusion. We also generated βPix-KO mouse embryonic fibroblasts (MEFs) to examine βPix function in mouse fibroblasts. βPix-KO MEFs exhibit decreased Rac1 activity, and defects in cell spreading and platelet-derived growth factor (PDGF)-induced ruffle formation and chemotaxis. The average size of focal adhesions is increased in βPix-KO MEFs. Interestingly, βPix-KO MEFs showed increased motility in random migration and rapid wound healing with elevated levels of MLC2 phosphorylation. Taken together, our data demonstrate that βPix plays essential roles in early embryonic development, cell spreading, and cell migration in fibroblasts.

Depth Map Extraction from the Single Image Using Pix2Pix Model (Pix2Pix 모델을 활용한 단일 영상의 깊이맵 추출)

  • Gang, Su Myung;Lee, Joon Jae
    • Journal of Korea Multimedia Society / v.22 no.5 / pp.547-557 / 2019
  • To extract a depth map from a single image, a number of CNN-based deep learning methods have been proposed in recent research. In this study, the GAN structure of Pix2Pix is maintained. This model converges well because it has both a generator and a discriminator, but its convolutions take a long time to compute. We therefore change the convolutions in the generator to depthwise convolutions to improve speed while preserving the result: the seven downsampling convolutional hidden layers in the generator U-Net are changed to depthwise convolutions. This type of convolution reduces the number of parameters and also shortens computation time. The proposed model predicts depth maps similar to those of the existing structure, and the computation time for inference is reduced by 64%.
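The parameter reduction claimed for depthwise convolution can be checked with simple arithmetic. The sketch below assumes the common depthwise separable arrangement (a per-channel k×k convolution followed by a 1×1 pointwise convolution) and an illustrative layer of 64 input channels, 128 output channels, and a 4×4 kernel; the abstract does not state whether a pointwise step is used or what the layer sizes are.

```python
def standard_conv_params(c_in, c_out, k, bias=True):
    """Weights (and biases) of a regular k x k convolution."""
    return k * k * c_in * c_out + (c_out if bias else 0)


def depthwise_separable_params(c_in, c_out, k, bias=True):
    """Depthwise k x k conv per input channel, then a 1 x 1 pointwise conv."""
    depthwise = k * k * c_in + (c_in if bias else 0)
    pointwise = c_in * c_out + (c_out if bias else 0)
    return depthwise + pointwise


# Illustrative layer sizes (assumed, not taken from the paper).
c_in, c_out, k = 64, 128, 4
full = standard_conv_params(c_in, c_out, k)
separable = depthwise_separable_params(c_in, c_out, k)
print(full, separable, full / separable)
```

For this assumed layer the regular convolution has 131200 parameters and the separable version 9408, roughly a 14-fold reduction; actual speedups depend on the hardware and framework, so the parameter ratio is only a rough guide to runtime.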

Evaluation of Dynamic X-ray Imaging Sensor and Detector Composing of Multiple In-Ga-Zn-O Thin Film Transistors in a Pixel (픽셀내 다수의 산화물 박막트랜지스터로 구성된 동영상 엑스레이 영상센서와 디텍터에 대한 평가)

  • Seung Ik Jun;Bong Goo Lee
    • Journal of the Korean Society of Radiology / v.17 no.3 / pp.359-365 / 2023
  • To satisfy the requirements of dynamic X-ray imaging with a high frame rate and low image lag, it is critical to minimize the parasitic capacitance of the photodiode and the overlapped electrodes in a pixel. This study presents the duoPIX™ dynamic X-ray imaging sensor, composed of a readout thin film transistor, a reset thin film transistor, and a photodiode in each pixel. Furthermore, a dynamic X-ray detector using the duoPIX™ imaging sensor was manufactured, and its X-ray imaging performance, including frame rate, sensitivity, noise, MTF, and image lag, was evaluated. The duoPIX™ dynamic X-ray detector has a 150 × 150 mm² imaging area, a 73 µm pixel pitch, a 2048 × 2048 matrix resolution (4.2M pixels), and a maximum of 50 frames per second. Compared with a conventional dynamic X-ray detector, the duoPIX™ dynamic X-ray detector showed better overall performance, as shown in the previous study.

Accuracy Analysis of Satellite Imagery in Road Construction Site Using UAV (도로 토목 공사 현장에서 UAV를 활용한 위성 영상 지도의 정확도 분석)

  • Shin, Seung-Min;Ban, Chang-Woo
    • Journal of the Korean Society of Industry Convergence / v.24 no.6_2 / pp.753-762 / 2021
  • Google provides mapping services based on satellite imagery, which are widely used in research. Over the past 20 years or so, research and business using drones have been expanding, and Pix4D is widely used to create 3D information models from drone data. This study compared distance errors at a road construction site between the DSM data of Google Earth and that of Pix4D, in order to assess the reliability of distance measurements in Google Earth. A DTM result of 3.08 cm/pixel was obtained by matching 49666 key points for each image. The length and altitude in Pix4D and Google Earth were measured and compared using the obtained PCD. The average distance error relative to the Pix4D data was 0.68 m, confirming that the error was relatively small. When the altitudes from Google Earth and Pix4D were compared, however, the maximum error was 83.214 m, showing that altitude measured from satellite images has a large error and is inaccurate. These results confirm that analyzing and acquiring data at road construction sites with Google Earth is difficult and that point cloud data acquired with drones is necessary.