• 제목/요약/키워드: structural similarity (SSIM) index

검색결과 52건 처리시간 0.023초

사후전산화단층촬영의 법의병리학 분야 활용을 위한 조건부 적대적 생성 신경망을 이용한 CT 영상의 해상도 개선: 팬텀 연구 (Enhancing CT Image Quality Using Conditional Generative Adversarial Networks for Applying Post-mortem Computed Tomography in Forensic Pathology: A Phantom Study)

  • 윤예빈;허진행;김예지;조혜진;윤용수
    • 대한방사선기술학회지:방사선기술과학
    • /
    • 제46권4호
    • /
    • pp.315-323
    • /
    • 2023
  • Post-mortem computed tomography (PMCT) is commonly employed in the field of forensic pathology. PMCT was mainly performed using a whole-body scan with a wide field of view (FOV), which lead to a decrease in spatial resolution due to the increased pixel size. This study aims to evaluate the potential for developing a super-resolution model based on conditional generative adversarial networks (CGAN) to enhance the image quality of CT. 1761 low-resolution images were obtained using a whole-body scan with a wide FOV of the head phantom, and 341 high-resolution images were obtained using the appropriate FOV for the head phantom. Of the 150 paired images in the total dataset, which were divided into training set (96 paired images) and validation set (54 paired images). Data augmentation was perform to improve the effectiveness of training by implementing rotations and flips. To evaluate the performance of the proposed model, we used the Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM) and Deep Image Structure and Texture Similarity (DISTS). Obtained the PSNR, SSIM, and DISTS values of the entire image and the Medial orbital wall, the zygomatic arch, and the temporal bone, where fractures often occur during head trauma. The proposed method demonstrated improvements in values of PSNR by 13.14%, SSIM by 13.10% and DISTS by 45.45% when compared to low-resolution images. The image quality of the three areas where fractures commonly occur during head trauma has also improved compared to low-resolution images.

자기 지도 학습훈련 기반의 Noise2Void 네트워크를 이용한 PET 영상의 잡음 제거 평가: 팬텀 실험 (The Evaluation of Denoising PET Image Using Self Supervised Noise2Void Learning Training: A Phantom Study)

  • 윤석환;박찬록
    • 대한방사선기술학회지:방사선기술과학
    • /
    • 제44권6호
    • /
    • pp.655-661
    • /
    • 2021
  • Positron emission tomography (PET) images is affected by acquisition time, short acquisition times results in low gamma counts leading to degradation of image quality by statistical noise. Noise2Void(N2V) is self supervised denoising model that is convolutional neural network (CNN) based deep learning. The purpose of this study is to evaluate denoising performance of N2V for PET image with a short acquisition time. The phantom was scanned as a list mode for 10 min using Biograph mCT40 of PET/CT (Siemens Healthcare, Erlangen, Germany). We compared PET images using NEMA image-quality phantom for standard acquisition time (10 min), short acquisition time (2min) and simulated PET image (S2 min). To evaluate performance of N2V, the peak signal to noise ratio (PSNR), normalized root mean square error (NRMSE), structural similarity index (SSIM) and radio-activity recovery coefficient (RC) were used. The PSNR, NRMSE and SSIM for 2 min and S2 min PET images compared to 10min PET image were 30.983, 33.936, 9.954, 7.609 and 0.916, 0.934 respectively. The RC for spheres with S2 min PET image also met European Association of Nuclear Medicine Research Ltd. (EARL) FDG PET accreditation program. We confirmed generated S2 min PET image from N2V deep learning showed improvement results compared to 2 min PET image and The PET images on visual analysis were also comparable between 10 min and S2 min PET images. In conclusion, noisy PET image by means of short acquisition time using N2V denoising network model can be improved image quality without underestimation of radioactivity.

Synthesis of T2-weighted images from proton density images using a generative adversarial network in a temporomandibular joint magnetic resonance imaging protocol

  • Chena, Lee;Eun-Gyu, Ha;Yoon Joo, Choi;Kug Jin, Jeon;Sang-Sun, Han
    • Imaging Science in Dentistry
    • /
    • 제52권4호
    • /
    • pp.393-398
    • /
    • 2022
  • Purpose: This study proposed a generative adversarial network (GAN) model for T2-weighted image (WI) synthesis from proton density (PD)-WI in a temporomandibular joint(TMJ) magnetic resonance imaging (MRI) protocol. Materials and Methods: From January to November 2019, MRI scans for TMJ were reviewed and 308 imaging sets were collected. For training, 277 pairs of PD- and T2-WI sagittal TMJ images were used. Transfer learning of the pix2pix GAN model was utilized to generate T2-WI from PD-WI. Model performance was evaluated with the structural similarity index map (SSIM) and peak signal-to-noise ratio (PSNR) indices for 31 predicted T2-WI (pT2). The disc position was clinically diagnosed as anterior disc displacement with or without reduction, and joint effusion as present or absent. The true T2-WI-based diagnosis was regarded as the gold standard, to which pT2-based diagnoses were compared using Cohen's ĸ coefficient. Results: The mean SSIM and PSNR values were 0.4781(±0.0522) and 21.30(±1.51) dB, respectively. The pT2 protocol showed almost perfect agreement(ĸ=0.81) with the gold standard for disc position. The number of discordant cases was higher for normal disc position (17%) than for anterior displacement with reduction (2%) or without reduction (10%). The effusion diagnosis also showed almost perfect agreement(ĸ=0.88), with higher concordance for the presence (85%) than for the absence (77%) of effusion. Conclusion: The application of pT2 images for a TMJ MRI protocol useful for diagnosis, although the image quality of pT2 was not fully satisfactory. Further research is expected to enhance pT2 quality.

ISFRNet: A Deep Three-stage Identity and Structure Feature Refinement Network for Facial Image Inpainting

  • Yan Wang;Jitae Shin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제17권3호
    • /
    • pp.881-895
    • /
    • 2023
  • Modern image inpainting techniques based on deep learning have achieved remarkable performance, and more and more people are working on repairing more complex and larger missing areas, although this is still challenging, especially for facial image inpainting. For a face image with a huge missing area, there are very few valid pixels available; however, people have an ability to imagine the complete picture in their mind according to their subjective will. It is important to simulate this capability while maintaining the identity features of the face as much as possible. To achieve this goal, we propose a three-stage network model, which we refer to as the identity and structure feature refinement network (ISFRNet). ISFRNet is based on 1) a pre-trained pSp-styleGAN model that generates an extremely realistic face image with rich structural features; 2) a shallow structured network with a small receptive field; and 3) a modified U-net with two encoders and a decoder, which has a large receptive field. We choose structural similarity index (SSIM), peak signal-to-noise ratio (PSNR), L1 Loss and learned perceptual image patch similarity (LPIPS) to evaluate our model. When the missing region is 20%-40%, the above four metric scores of our model are 28.12, 0.942, 0.015 and 0.090, respectively. When the lost area is between 40% and 60%, the metric scores are 23.31, 0.840, 0.053 and 0.177, respectively. Our inpainting network not only guarantees excellent face identity feature recovery but also exhibits state-of-the-art performance compared to other multi-stage refinement models.

Image Denoising Based on Adaptive Fractional Order Anisotropic Diffusion

  • Yu, Jimin;Tan, Lijian;Zhou, Shangbo;Wang, Liping;Wang, Chaomei
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권1호
    • /
    • pp.436-450
    • /
    • 2017
  • Recently, the method based on fractional order partial differential equation has been used in image processing. Usually, the optional order of fractional differentiation is determined by a lot of experiments. In this paper, a denoising model is proposed based on adaptive fractional order anisotropic diffusion. In the proposed model, the complexity of the local image texture is reflected by the local variance, and the order of the fractional differentiation is determined adaptively. In the process of the adaptive fractional order model, the discrete Fourier transform is applied to compute the fractional order difference as well as the dynamic evolution process. Experimental results show that the peak signal-to-noise ratio (PSNR) and structural similarity index measurement (SSIM) of the proposed image denoising algorithm is better than that of other some algorithms. The proposed algorithm not only can keep the detailed image information and edge information, but also obtain a good visual effect.

An Efficient Frame-Level Rate Control Algorithm for High Efficiency Video Coding

  • Lin, Yubei;Zhang, Xingming;Xiao, Jianen;Su, Shengkai
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제10권4호
    • /
    • pp.1877-1891
    • /
    • 2016
  • In video coding, the goal of rate control (RC) is not only to avoid the undesirable fluctuation in bit allocation, but also to provide a good visual perception. In this paper, a novel frame-level rate control algorithm for High Efficiency Video Coding (HEVC) is proposed. Firstly a model that reveals the relationship between bit per pixel (bpp), the bitrate of the intra frame and the bitrate of the subsequent inter frames in a group of pictures (GOP) is established, based on which the target bitrate of the first intra frame is well estimated. Then a novel frame-level bit allocation algorithm is developed, which provides a robust bit balancing scheme between the intra frame and the inter frames in a GOP to achieve the visual quality smoothness throughout the whole sequence. Our experimental results show that when compared to the RC scheme for HEVC encoder HM-16.0, the proposed algorithm can produce reconstructed frames with more consistent objective video quality. In addition, the objective visual quality of the reconstructed frames can be improved with less bitrate.

Photorealistic Ray-traced Visualization Approach for the Interactive Biomimetic Design of Insect Compound Eyes

  • Nguyen, Tung Lam;Trung, Hieu Tran Doan;Lee, Wooseok;Lee, Hocheol
    • Current Optics and Photonics
    • /
    • 제5권6호
    • /
    • pp.699-710
    • /
    • 2021
  • In this study, we propose a biomimetic optical structure design methodology for investigating micro-optical mechanisms associated with the compound eyes of insects. With these compound eyes, insects can respond fast while maintaining a wide field of view. Also, considerable research attention has been focused on the insect compound eyes to utilize these benefits. However, their nano micro-structures are complex and challenging to demonstrate in real applications. An effectively integrated design methodology is required considering the manufacturing difficulty. We show that photorealistic ray-traced visualization is an effective method for designing the biomimetic of a micro-compound eye of an insect. We analyze the image formation mechanism and create a three-dimensional computer-aided design model. Then, a ray-trace visualization is applied to observe the optical image formation. Finally, the segmented images are stitched together to generate an image with a wide-angle; the image is assessed for quality. The high structural similarity index (SSIM) value (approximately 0.84 to 0.89) of the stitched image proves that the proposed MATLAB-based image stitching algorithm performs effectively and comparably to the commercial software. The results may be employed for the understanding, researching, and design of advanced optical systems based on biological eyes and for other industrial applications.

Spatial Frequency Coverage and Image Reconstruction for Photonic Integrated Interferometric Imaging System

  • Zhang, Wang;Ma, Hongliu;Huang, Kang
    • Current Optics and Photonics
    • /
    • 제5권6호
    • /
    • pp.606-616
    • /
    • 2021
  • A photonic integrated interferometric imaging system possesses the characteristics of small-scale, low weight, low power consumption, and better image quality. It has potential application for replacing conventional large space telescopes. In this paper, the principle of photonic integrated interferometric imaging is investigated. A novel lenslet array arrangement and lenslet pairing approach are proposed, which are helpful in improving spatial frequency coverage. For the novel lenslet array arrangement, two short interference arms were evenly distributed between two adjacent long interference arms. Each lenslet in the array would be paired twice through the novel lenslet pairing approach. Moreover, the image reconstruction model for optical interferometric imaging based on compressed sensing was established. Image simulation results show that the peak signal to noise ratio (PSNR) of the reconstructed image based on compressive sensing is about 10 dB higher than that of the direct restored image. Meanwhile, the normalized mean square error (NMSE) of the direct restored image is approximately 0.38 higher than that of the reconstructed image. Structural similarity index measure (SSIM) of the reconstructed image based on compressed sensing is about 0.33 higher than that of the direct restored image. The increased spatial frequency coverage and image reconstruction approach jointly contribute to better image quality of the photonic integrated interferometric imaging system.

근육 활성화 모델 기반의 데이터 증강을 활용한 동시 동작 인식 프레임워크 (Simultaneous Motion Recognition Framework using Data Augmentation based on Muscle Activation Model)

  • 김세진;정완균
    • 로봇학회논문지
    • /
    • 제19권2호
    • /
    • pp.203-212
    • /
    • 2024
  • Simultaneous motion is essential in the activities of daily living (ADL). For motion intention recognition, surface electromyogram (sEMG) and corresponding motion label is necessary. However, this process is time-consuming and it may increase the burden of the user. Therefore, we propose a simultaneous motion recognition framework using data augmentation based on muscle activation model. The model consists of multiple point sources to be optimized while the number of point sources and their initial parameters are automatically determined. From the experimental results, it is shown that the framework has generated the data which are similar to the real one. This aspect is quantified with the following two metrics: structural similarity index measure (SSIM) and mean squared error (MSE). Furthermore, with k-nearest neighbor (k-NN) or support vector machine (SVM), the classification accuracy is also enhanced with the proposed framework. From these results, it can be concluded that the generalization property of the training data is enhanced and the classification accuracy is increased accordingly. We expect that this framework reduces the burden of the user from the excessive and time-consuming data acquisition.

방향 회전에 불변한 얼굴 영역 분할과 LBP를 이용한 얼굴 검출 (Face Detection using Orientation(In-Plane Rotation) Invariant Facial Region Segmentation and Local Binary Patterns(LBP))

  • 이희재;김하영;이다빛;이상국
    • 정보과학회 논문지
    • /
    • 제44권7호
    • /
    • pp.692-702
    • /
    • 2017
  • LBP기반 특징점 기술자를 이용한 얼굴검출은 얼굴의 형태정보 및 눈, 코, 입과 같은 얼굴 요소들 간 공간정보를 표현할 수 없는 문제가 있다. 이러한 문제를 해결하기 위해 선행 연구들은 얼굴 영상을 다수개의 사각형 부분영역들로 분할하였다. 하지만, 연구마다 서로 다른 개수와 크기로 부분 영역을 분할하였기 때문에 실험에 사용하는 데이터베이스에 적합한 부분 영역의 분할 기준이 모호하며, 부분 영역의 수에 비례하여 LBP 히스토그램 차원이 증가되고, 부분 영역의 개수가 증가함에 따라 얼굴의 방향 회전에 대한 민감도가 크게 증가한다. 본 논문은 LBP기반 특징점 기술자의 방향 회전 문제와 특징점 차원의 수 문제를 해결할 수 있는 새로운 부분 영역 분할 방법을 제안한다. 실험 결과, 제안하는 방법은 방향 회전된 단일 얼굴 영상에서 99.0278%의 검출 정확도를 보였다.