• Title/Abstract/Keywords: Structural Similarity (SSIM)

79 search results (processing time: 0.024 s)

Simulation and Experimental Studies of Super Resolution Convolutional Neural Network Algorithm in Ultrasound Image

  • 이영진
    • 한국방사선학회논문지 / Vol. 17, No. 5 / pp. 693-699 / 2023
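  • Ultrasound is widely used in medicine for non-destructive, non-invasive diagnosis of disease. Improving spatial resolution is very important for raising the diagnostic accuracy of medical images. This study models a super-resolution convolutional neural network (SRCNN) algorithm for ultrasound images and analyzes its applicability. The study comprised a Field II simulation and an experimental study using open-source clinical ultrasound images of liver hemangioma. The proposed SRCNN algorithm was modeled so that end-to-end learning from low resolution (LR) to high resolution could be applied. In the simulation, the full width at half maximum (FWHM) measured in phantom images generated with the Field II program improved by 41.01% with SRCNN compared with LR. In addition, evaluation by peak signal-to-noise ratio (PSNR) and structural similarity index (SSIM) showed that SRCNN gave the best values for both the simulated images and the real liver hemangioma images. In conclusion, the study demonstrated the applicability of SRCNN to ultrasound images, and the method is expected to be usable in a range of diagnostic medical fields.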
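The two quality metrics named in this abstract, and in most of the entries in this list, can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the function names `psnr` and `global_ssim` are hypothetical, 8-bit intensities (`data_range=255`) are assumed, and SSIM is computed once over the whole image instead of with the sliding window that standard implementations average over, so the values will differ from those a windowed implementation reports.

```python
import numpy as np

def psnr(reference, test, data_range=255.0):
    """Peak signal-to-noise ratio in dB; higher means closer to the reference."""
    reference = np.asarray(reference, dtype=np.float64)
    test = np.asarray(test, dtype=np.float64)
    mse = np.mean((reference - test) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(data_range ** 2 / mse)

def global_ssim(reference, test, data_range=255.0):
    """SSIM evaluated once over the whole image (assumption: no sliding window)."""
    x = np.asarray(reference, dtype=np.float64)
    y = np.asarray(test, dtype=np.float64)
    # Stabilizing constants with the commonly used K1 = 0.01 and K2 = 0.03.
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mu_x, mu_y = x.mean(), y.mean()
    var_x, var_y = x.var(), y.var()
    cov_xy = np.mean((x - mu_x) * (y - mu_y))
    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator

# Synthetic stand-ins for a reference image and a degraded version of it.
rng = np.random.default_rng(0)
hr = rng.uniform(0, 255, size=(64, 64))
noisy = np.clip(hr + rng.normal(0, 10, size=hr.shape), 0, 255)
print(psnr(hr, noisy), global_ssim(hr, noisy))
```

Both functions compare a test image against a reference, so in a super-resolution or denoising study the reference would be the high-resolution or noise-free image.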

An Efficient CT Image Denoising using WT-GAN Model

  • Hae Chan Jeong;Dong Hoon Lim
    • 한국컴퓨터정보학회논문지 / Vol. 29, No. 5 / pp. 21-29 / 2024
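  • Reducing the radiation dose in CT imaging lowers the exposure risk, but it greatly degrades image resolution and introduces noise, which reduces diagnostic usefulness. Noise removal in CT images is therefore a very important and essential step in image restoration. There are limits to separating noise from the original signal in the image domain and removing only the noise. In this paper, we aim to remove noise from CT images effectively using a wavelet transform-based GAN model (WT-GAN). The GAN model used here generates denoised images through a generator with a U-Net structure and a discriminator with a PatchGAN structure. To evaluate the proposed WT-GAN model, experiments were conducted on CT images corrupted by several kinds of noise: Gaussian noise, Poisson noise, and speckle noise. In these experiments, the WT-GAN model outperformed a traditional filter, the BM3D filter, as well as the existing deep learning models DnCNN, CDAE, and U-Net GAN, both qualitatively and on the quantitative measures PSNR (Peak Signal-to-Noise Ratio) and SSIM (Structural Similarity Index Measure).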

DETECTION AND RESTORATION OF NON-RADIAL VARIATION OVER FULL-DISK SOLAR IMAGES

  • Yang, Yunfei;Lin, Jiaben;Feng, Song;Deng, Hui;Wang, Feng;Ji, Kaifan
    • 천문학회지 / Vol. 46, No. 5 / pp. 191-200 / 2013
  • Full-disk solar images are provided by many solar telescopes around the world. However, the observed images show Non-Radial Variation (NRV) over the disk. In this paper, we propose algorithms for detecting distortions and restoring these images. For detecting NRV, the cross-correlation coefficients matrix of radial profiles is calculated and the minimum value in the matrix is defined as the Index of Non-radial Variation (INV). This index has been utilized to evaluate the Hα images of GONG, and systemic variations of different instruments are obtained. For obtaining the NRV's image, a Multi-level Morphological Filter (MMF) is designed to eliminate structures produced by solar activities over the solar surface. Compared with the median filter, the proposed filter is a better choice. The experimental results show that the effect of our automatic detection and restoration methods is significant for getting a flat and high contrast full-disk image. For investigating the effect of our method on solar features, the structural similarity (SSIM) index is utilized. The high SSIM indices (close to 1) of solar features show that the details of the structures remain after NRV restoring.
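The Index of Non-radial Variation described in this abstract (the minimum entry of the correlation-coefficient matrix of radial profiles) can be sketched as below. The abstract does not say how the radial profiles are extracted, so the nearest-pixel ray sampling in `radial_profiles`, the function names, and the synthetic disk are all assumptions made for illustration.

```python
import numpy as np

def radial_profiles(image, center, radius, n_angles=36, n_radii=100):
    """Sample intensity along rays from the disk center (nearest-pixel lookup)."""
    cy, cx = center
    angles = np.linspace(0.0, 2.0 * np.pi, n_angles, endpoint=False)
    radii = np.linspace(0.0, radius, n_radii)
    rows = np.rint(cy + np.outer(np.sin(angles), radii)).astype(int)
    cols = np.rint(cx + np.outer(np.cos(angles), radii)).astype(int)
    rows = np.clip(rows, 0, image.shape[0] - 1)
    cols = np.clip(cols, 0, image.shape[1] - 1)
    return image[rows, cols]  # shape (n_angles, n_radii)

def index_of_nonradial_variation(profiles):
    """Minimum entry of the correlation-coefficient matrix of the profiles."""
    return float(np.min(np.corrcoef(profiles)))

# Synthetic disk whose brightness depends only on distance from the center.
yy, xx = np.mgrid[0:201, 0:201]
r = np.hypot(yy - 100, xx - 100)
radial_disk = 1.0 - 0.5 * (r / 90.0) ** 2
# Same disk with a left-to-right brightness gradient, i.e. non-radial variation.
tilted_disk = radial_disk + 0.5 * (xx - 100) / 100.0

inv_radial = index_of_nonradial_variation(radial_profiles(radial_disk, (100, 100), 90))
inv_tilted = index_of_nonradial_variation(radial_profiles(tilted_disk, (100, 100), 90))
print(inv_radial, inv_tilted)
```

A purely radial brightness pattern makes every profile nearly identical, so the index stays close to 1; a gradient across the disk makes profiles on opposite sides diverge and pulls the minimum correlation down.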

Human Visual System-aware Dimming Method Combining Pixel Compensation and Histogram Specification for TFT-LCDs

  • Jin, Jeong-Chan;Kim, Young-Jin
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 11, No. 12 / pp. 5998-6016 / 2017
  • In thin-film transistor liquid-crystal displays (TFT-LCDs), which are most commonly used in mobile devices, the backlight accounts for about 70% of the power consumption. Therefore, most low-power-related studies focus on realizing power savings through backlight dimming. Image compensation is performed to mitigate the visual distortion caused by the backlight dimming. Therefore, popular techniques include pixel compensation for brightness recovery and contrast enhancement, such as histogram equalization. However, existing pixel compensation techniques often have limitations with respect to blur owing to the pixel saturation phenomenon, or because contrast enhancement cannot adequately satisfy the human visual system (HVS). To overcome these, in this study, we propose a novel dimming technique to achieve both power saving and HVS-awareness by combining the pixel compensation and histogram specifications, which convert the original cumulative density function (CDF) by designing and using the desired CDF of an image. Because the process of obtaining the desired CDF is customized to consider image characteristics, histogram specification is found to achieve better HVS-awareness than histogram equalization. For the experiments, we employ the LIVE image database, and we use the structural similarity (SSIM) index to measure the degree of visual satisfaction. The experimental results show that the proposed technique achieves up to 15.9% increase in the SSIM index compared with existing dimming techniques that use pixel compensation and histogram equalization in the case of the same low-power ratio. Further, the results indicate that it achieves improved HVS-awareness and increased power saving concurrently compared with previous techniques.
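Histogram specification, the technique this abstract contrasts with histogram equalization, remaps gray levels so that the image's cumulative distribution follows a chosen target CDF. The sketch below shows only the generic mapping step for 8-bit grayscale images. The function name is hypothetical, and the flat target histogram in the demo is a placeholder that reduces to equalization; the paper designs its desired CDF from image characteristics, and that design is not reproduced here.

```python
import numpy as np

def histogram_specification(image, desired_hist):
    """Remap 8-bit gray levels so the image CDF follows the desired CDF."""
    image = np.asarray(image, dtype=np.uint8)
    src_hist = np.bincount(image.ravel(), minlength=256).astype(np.float64)
    src_cdf = np.cumsum(src_hist) / src_hist.sum()
    desired_hist = np.asarray(desired_hist, dtype=np.float64)
    desired_cdf = np.cumsum(desired_hist) / desired_hist.sum()
    # For each source level, pick the first target level whose CDF reaches it.
    lut = np.searchsorted(desired_cdf, src_cdf, side="left")
    lut = np.clip(lut, 0, 255).astype(np.uint8)
    return lut[image]

# A dark image (levels 0..99) stands in for a frame shown under a dimmed backlight.
rng = np.random.default_rng(0)
dark = rng.integers(0, 100, size=(64, 64)).astype(np.uint8)
flat_target = np.ones(256)  # placeholder target histogram (assumption)
out = histogram_specification(dark, flat_target)
print(dark.max(), out.max())
```

Replacing `flat_target` with any other 256-bin histogram changes the shape the output distribution is pulled toward, which is the degree of freedom histogram equalization lacks.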

Super-resolution Reconstruction Method for Plenoptic Images based on Reliability of Disparity

  • 정민창;김송란;강현수
    • 한국정보통신학회논문지 / Vol. 22, No. 3 / pp. 425-433 / 2018
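  • This paper proposes a super-resolution reconstruction algorithm for plenoptic images based on the reliability of disparity. Sub-aperture images generated from the plenoptic camera image are used for disparity estimation based on the TV_L1 algorithm and for super-resolution image reconstruction. In particular, the proposed algorithm shows improved performance in boundary regions, where disparity may be estimated inaccurately. The reliability of a disparity vector is judged by considering the variance over the regions at the top, bottom, left, and right positions of the sub-aperture images. Disparity vectors with low reliability are excluded from super-resolution reconstruction. The proposed method was evaluated against bicubic interpolation, an existing disparity-based method, and a dictionary-based method. In this evaluation, the proposed method gave the best super-resolution reconstruction results in terms of PSNR and SSIM.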

The Evaluation of Denoising PET Image Using Self Supervised Noise2Void Learning Training: A Phantom Study

  • 윤석환;박찬록
    • 대한방사선기술학회지:방사선기술과학 / Vol. 44, No. 6 / pp. 655-661 / 2021
  • Positron emission tomography (PET) images are affected by acquisition time: short acquisition times result in low gamma counts, and the resulting statistical noise degrades image quality. Noise2Void (N2V) is a self-supervised denoising model based on a convolutional neural network (CNN). The purpose of this study is to evaluate the denoising performance of N2V for PET images with a short acquisition time. The phantom was scanned in list mode for 10 min using a Biograph mCT40 PET/CT scanner (Siemens Healthcare, Erlangen, Germany). Using the NEMA image-quality phantom, we compared PET images for the standard acquisition time (10 min), a short acquisition time (2 min), and the simulated PET image (S2 min). To evaluate the performance of N2V, the peak signal-to-noise ratio (PSNR), normalized root mean square error (NRMSE), structural similarity index (SSIM), and radioactivity recovery coefficient (RC) were used. Compared with the 10 min PET image, the 2 min and S2 min PET images gave PSNR values of 30.983 and 33.936, NRMSE values of 9.954 and 7.609, and SSIM values of 0.916 and 0.934, respectively. The RC for the spheres in the S2 min PET image also met the European Association of Nuclear Medicine Research Ltd. (EARL) FDG PET accreditation program. We confirmed that the S2 min PET image generated by N2V deep learning showed improved results compared with the 2 min PET image, and on visual analysis the 10 min and S2 min PET images were comparable. In conclusion, the N2V denoising network model can improve the image quality of PET images made noisy by a short acquisition time, without underestimating radioactivity.

Super-Resolution Transmission Electron Microscope Image of Nanomaterials Using Deep Learning

  • 남충희
    • 한국재료학회지 / Vol. 32, No. 8 / pp. 345-353 / 2022
  • In this study, using deep learning, super-resolution images of transmission electron microscope (TEM) images were generated for nanomaterial analysis. A total of 1169 paired images with 256 × 256 pixels (high resolution: HR) from TEM measurements and 32 × 32 pixels (low resolution: LR) produced using the Python module OpenCV were used to train deep learning models. The TEM images were of DyVO4 nanomaterials synthesized by hydrothermal methods. Mean-absolute-error (MAE), peak-signal-to-noise-ratio (PSNR), and structural similarity (SSIM) were used as metrics to evaluate the performance of the models. First, a super-resolution (SR) image was obtained using the traditional interpolation method used in computer vision. In the SR image at low magnification, the shape of the nanomaterial improved. However, the SR images at medium and high magnification failed to show the characteristics of the lattice of the nanomaterials. Second, to obtain an SR image, the deep learning model includes a residual network, which reduces the loss of spatial information in the convolutional process of obtaining a feature map. In the process of optimizing the deep learning model, it was confirmed that the performance of the model improved as the number of data increased. In addition, by optimizing the deep learning model using a loss function that includes MAE and SSIM at the same time, improved results for the nanomaterial lattice in SR images were achieved at medium and high magnifications. The final proposed deep learning model used four residual blocks to obtain the characteristic map of the low-resolution image, and the super-resolution image was completed using Upsampling2D and the residual block three times.

Synthesis of T2-weighted images from proton density images using a generative adversarial network in a temporomandibular joint magnetic resonance imaging protocol

  • Chena, Lee;Eun-Gyu, Ha;Yoon Joo, Choi;Kug Jin, Jeon;Sang-Sun, Han
    • Imaging Science in Dentistry / Vol. 52, No. 4 / pp. 393-398 / 2022
  • Purpose: This study proposed a generative adversarial network (GAN) model for T2-weighted image (WI) synthesis from proton density (PD)-WI in a temporomandibular joint (TMJ) magnetic resonance imaging (MRI) protocol. Materials and Methods: From January to November 2019, MRI scans for TMJ were reviewed and 308 imaging sets were collected. For training, 277 pairs of PD- and T2-WI sagittal TMJ images were used. Transfer learning of the pix2pix GAN model was utilized to generate T2-WI from PD-WI. Model performance was evaluated with the structural similarity index map (SSIM) and peak signal-to-noise ratio (PSNR) indices for 31 predicted T2-WI (pT2). The disc position was clinically diagnosed as anterior disc displacement with or without reduction, and joint effusion as present or absent. The true T2-WI-based diagnosis was regarded as the gold standard, to which pT2-based diagnoses were compared using Cohen's κ coefficient. Results: The mean SSIM and PSNR values were 0.4781 (±0.0522) and 21.30 (±1.51) dB, respectively. The pT2 protocol showed almost perfect agreement (κ=0.81) with the gold standard for disc position. The number of discordant cases was higher for normal disc position (17%) than for anterior displacement with reduction (2%) or without reduction (10%). The effusion diagnosis also showed almost perfect agreement (κ=0.88), with higher concordance for the presence (85%) than for the absence (77%) of effusion. Conclusion: The application of pT2 images in a TMJ MRI protocol was useful for diagnosis, although the image quality of pT2 was not fully satisfactory. Further research is expected to enhance pT2 quality.
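Cohen's κ, used in this abstract to compare pT2-based diagnoses with the gold standard, corrects the observed agreement rate for the agreement expected by chance: κ = (p_o − p_e) / (1 − p_e). A minimal sketch of the unweighted coefficient follows. The function name and the two label lists are hypothetical and are not the study's data.

```python
import numpy as np

def cohens_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters labelling the same cases."""
    a = np.asarray(ratings_a)
    b = np.asarray(ratings_b)
    labels = np.unique(np.concatenate([a, b]))
    observed = np.mean(a == b)  # p_o: fraction of cases where the raters agree
    # p_e: chance agreement from each rater's marginal label frequencies.
    expected = sum(np.mean(a == lab) * np.mean(b == lab) for lab in labels)
    # Undefined (division by zero) when both raters always use one same label.
    return (observed - expected) / (1.0 - expected)

# Hypothetical effusion calls (1 = present, 0 = absent) for ten joints.
gold_standard = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
predicted = [1, 1, 0, 0, 0, 0, 1, 0, 1, 1]
print(cohens_kappa(gold_standard, predicted))
```

Here the raters agree on 8 of 10 cases (p_o = 0.8) and each labels half the cases as present (p_e = 0.5), so κ = 0.3 / 0.5 = 0.6.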

A Study on the Dose Reduction Method for Temporal Bone HRCT Scan

  • 윤준;김현주
    • 한국방사선학회논문지 / Vol. 17, No. 7 / pp. 1041-1047 / 2023
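  • Temporal bone CT, a form of high-resolution CT, uses a high tube voltage and thin slice thickness, so its scan dose is higher than that of examinations of adjacent regions. We therefore varied the reconstruction algorithm, one of the examination parameters, to find an algorithm that reduces the examination dose while remaining highly sensitive to lesions, and assessed its significance and its potential to provide basic clinical data. Lowering the tube voltage to 100 kVp reduced the dose by about 35.6%, and applying the Definition algorithm to raw data acquired at 100 kVp gave superior SNR and CNR, with statistically significant differences from the other algorithms (p<0.05). A comparison of structural similarity gave SSIM indices of 0.776, 0.813, and 0.741 for the respective ROIs. Changing the algorithm in temporal bone CT scans can therefore partially reduce the dose from CT examinations, and we consider this very meaningful in terms of basic clinical data.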

ISFRNet: A Deep Three-stage Identity and Structure Feature Refinement Network for Facial Image Inpainting

  • Yan Wang;Jitae Shin
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 17, No. 3 / pp. 881-895 / 2023
  • Modern image inpainting techniques based on deep learning have achieved remarkable performance, and more and more people are working on repairing more complex and larger missing areas, although this is still challenging, especially for facial image inpainting. For a face image with a huge missing area, there are very few valid pixels available; however, people have an ability to imagine the complete picture in their mind according to their subjective will. It is important to simulate this capability while maintaining the identity features of the face as much as possible. To achieve this goal, we propose a three-stage network model, which we refer to as the identity and structure feature refinement network (ISFRNet). ISFRNet is based on 1) a pre-trained pSp-styleGAN model that generates an extremely realistic face image with rich structural features; 2) a shallow structured network with a small receptive field; and 3) a modified U-net with two encoders and a decoder, which has a large receptive field. We choose peak signal-to-noise ratio (PSNR), structural similarity index (SSIM), L1 loss and learned perceptual image patch similarity (LPIPS) to evaluate our model. When the missing region is 20%-40%, the above four metric scores of our model are 28.12, 0.942, 0.015 and 0.090, respectively. When the lost area is between 40% and 60%, the metric scores are 23.31, 0.840, 0.053 and 0.177, respectively. Our inpainting network not only guarantees excellent face identity feature recovery but also exhibits state-of-the-art performance compared to other multi-stage refinement models.