• Title/Summary/Keyword: AI-based image generation

Search Result 40, Processing Time 0.028 seconds

A Study on AI Softwear [Stable Diffusion] ControlNet plug-in Usabilities

  • Chenghao Wang;Jeanhun Chung
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.15 no.4
    • /
    • pp.166-171
    • /
    • 2023
  • With significant advancements in the field of artificial intelligence, many novel algorithms and technologies have emerged. Currently, AI painting can generate high-quality images based on textual descriptions. However, it is often challenging to control details when generating images, even with complex textual inputs. Therefore, there is a need to implement additional control mechanisms beyond textual descriptions. Based on ControlNet, this passage describes a combined utilization of various local controls (such as edge maps and depth maps) and global control within a single model. It provides a comprehensive exposition of the fundamental concepts of ControlNet, elucidating its theoretical foundation and relevant technological features. Furthermore, combining methods and applications, understanding the technical characteristics involves analyzing distinct advantages and image differences. This further explores insights into the development of image generation patterns.

A Study on the Development Direction of Medical Image Information System Using Big Data and AI (빅데이터와 AI를 활용한 의료영상 정보 시스템 발전 방향에 대한 연구)

  • Yoo, Se Jong;Han, Seong Soo;Jeon, Mi-Hyang;Han, Man Seok
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.11 no.9
    • /
    • pp.317-322
    • /
    • 2022
  • The rapid development of information technology is also bringing about many changes in the medical environment. In particular, it is leading the rapid change of medical image information systems using big data and artificial intelligence (AI). The prescription delivery system (OCS), which consists of an electronic medical record (EMR) and a medical image storage and transmission system (PACS), has rapidly changed the medical environment from analog to digital. When combined with multiple solutions, PACS represents a new direction for advancement in security, interoperability, efficiency and automation. Among them, the combination with artificial intelligence (AI) using big data that can improve the quality of images is actively progressing. In particular, AI PACS, a system that can assist in reading medical images using deep learning technology, was developed in cooperation with universities and industries and is being used in hospitals. As such, in line with the rapid changes in the medical image information system in the medical environment, structural changes in the medical market and changes in medical policies to cope with them are also necessary. On the other hand, medical image information is based on a digital medical image transmission device (DICOM) format method, and is divided into a tomographic volume image, a volume image, and a cross-sectional image, a two-dimensional image, according to a generation method. In addition, recently, many medical institutions are rushing to introduce the next-generation integrated medical information system by promoting smart hospital services. The next-generation integrated medical information system is built as a solution that integrates EMR, electronic consent, big data, AI, precision medicine, and interworking with external institutions. It aims to realize research. Korea's medical image information system is at a world-class level thanks to advanced IT technology and government policies. In particular, the PACS solution is the only field exporting medical information technology to the world. In this study, along with the analysis of the medical image information system using big data, the current trend was grasped based on the historical background of the introduction of the medical image information system in Korea, and the future development direction was predicted. In the future, based on DICOM big data accumulated over 20 years, we plan to conduct research that can increase the image read rate by using AI and deep learning algorithms.

A Study on the Work Process of Creating AI SORA Videos (AI SORA 동영상 생성 제작의 작업 과정에 관한 고찰)

  • Cho, Hyun Kyung
    • The Journal of the Convergence on Culture Technology
    • /
    • v.10 no.5
    • /
    • pp.827-832
    • /
    • 2024
  • The AI program Sora is a video production model that can be used innovatively and is the starting point of a major paradigm shift in video planning and production in the future. In this paper, through consideration of the characteristics, application, and process of the AI video production program, the characteristics of the AI design video production method were understood, and the production algorithm was considered. The detailed consideration and characteristics of the work creation process for the video graphic AI video generation program that will be intensified every year were examined. Next, the method of generating a customized video with a text prompt and the process of innovative production results different from the previous production method were considered. In addition, the design direction through the generation of AI images was studied through the review of the strengths and weaknesses of the image details of the recently announced AI music video results. By considering the security of the AI generation video Sora and looking at the internal process of the actual AI process, it will be possible to present indicators for the future direction of AI video model production and education along with the direction of the design designer and education system. In the text and conclusion, we analyzed the strengths and weaknesses and future status of OpenAI Sora image, concluded how to apply the Sora model's capabilities, limitations, quality, and human creativity, and presented problems and alternatives through examples of the Sora model's capabilities and limitations to increase human creativity.

Development of 3D Printed Fashion Jewelry Design Using Generative AI (생성형 AI를 활용한 3D 프린팅 패션 주얼리 디자인 개발)

  • Bo Ae Hwang;Jung Soo Lee
    • Journal of Fashion Business
    • /
    • v.28 no.4
    • /
    • pp.129-148
    • /
    • 2024
  • With the advent of the 4th industrial era and the development of digital technologies such as artificial intelligence (AI), metaverse, 3D printing, and 3D virtual wearing systems, the fashion industry continues to attempt to use digital technology and introduce it into various areas. The purpose of this study was to determine whether fashion and digital technology could be combined to create works and to suggest ways to apply digital technology in the fashion industry. As a research method, image generative AI, Midjourney was applied to the initial design ideation stage to derive inspiration images. 3D printing technique was then introduced as a production method to print fashion jewelry. As a result of the research, a total of six jewelry designs printed with a 3D printer were developed. One necklace, one bracelet, three earrings, and one ring were developed. This study identified the possibility of applying digital technology to real fashion jewelry design products by designing jewelry based on inspirational images derived from image generation AI and producing pieces of fashion jewelry with 3D modeling tasks and 3D printing outputs. This study is significant in that it expands the expression area of fashion jewelry design that combines digital technology.

Synthetic Infra-Red Image Dataset Generation by CycleGAN based on SSIM Loss Function (SSIM 목적 함수와 CycleGAN을 이용한 적외선 이미지 데이터셋 생성 기법 연구)

  • Lee, Sky;Leeghim, Henzeh
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.25 no.5
    • /
    • pp.476-486
    • /
    • 2022
  • Synthetic dynamic infrared image generation from the given virtual environment is being the primary goal to simulate the output of the infra-red(IR) camera installed on a vehicle to evaluate the control algorithm for various search & reconnaissance missions. Due to the difficulty to obtain actual IR data in complex environments, Artificial intelligence(AI) has been used recently in the field of image data generation. In this paper, CycleGAN technique is applied to obtain a more realistic synthetic IR image. We added the Structural Similarity Index Measure(SSIM) loss function to the L1 loss function to generate a more realistic synthetic IR image when the CycleGAN image is generated. From the simulation, it is applicable to the guided-missile flight simulation tests by using the synthetic infrared image generated by the proposed technique.

GAN-based research for high-resolution medical image generation (GAN 기반 고해상도 의료 영상 생성을 위한 연구)

  • Ko, Jae-Yeong;Cho, Baek-Hwan;Chung, Myung-Jin
    • Annual Conference of KIPS
    • /
    • 2020.05a
    • /
    • pp.544-546
    • /
    • 2020
  • 의료 데이터를 이용하여 인공지능 기계학습 연구를 수행할 때 자주 마주하는 문제는 데이터 불균형, 데이터 부족 등이며 특히 정제된 충분한 데이터를 구하기 힘들다는 것이 큰 문제이다. 본 연구에서는 이를 해결하기 위해 GAN(Generative Adversarial Network) 기반 고해상도 의료 영상을 생성하는 프레임워크를 개발하고자 한다. 각 해상도 마다 Scale 의 Gradient 를 동시에 학습하여 빠르게 고해상도 이미지를 생성해낼 수 있도록 했다. 고해상도 이미지를 생성하는 Neural Network 를 고안하였으며, PGGAN, Style-GAN 과의 성능 비교를 통해 제안된 모델이 양질의 고해상도 의료영상 이미지를 더 빠르게 생성할 수 있음을 확인하였다. 이를 통해 인공지능 기계학습 연구에 있어서 의료 영상의 데이터 부족, 데이터 불균형 문제를 해결할 수 있는 Data augmentation 이나, Anomaly detection 등의 연구에 적용할 수 있다.

Application of Deep Learning to Solar Data: 3. Generation of Solar images from Galileo sunspot drawings

  • Lee, Harim;Moon, Yong-Jae;Park, Eunsu;Jeong, Hyunjin;Kim, Taeyoung;Shin, Gyungin
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.44 no.1
    • /
    • pp.81.2-81.2
    • /
    • 2019
  • We develop an image-to-image translation model, which is a popular deep learning method based on conditional Generative Adversarial Networks (cGANs), to generate solar magnetograms and EUV images from sunspot drawings. For this, we train the model using pairs of sunspot drawings from Mount Wilson Observatory (MWO) and their corresponding SDO/HMI magnetograms and SDO/AIA EUV images (512 by 512) from January 2012 to September 2014. We test the model by comparing pairs of actual SDO images (magnetogram and EUV images) and the corresponding AI-generated ones from October to December in 2014. Our results show that bipolar structures and coronal loop structures of AI-generated images are consistent with those of the original ones. We find that their unsigned magnetic fluxes well correlate with those of the original ones with a good correlation coefficient of 0.86. We also obtain pixel-to-pixel correlations EUV images and AI-generated ones. The average correlations of 92 test samples for several SDO lines are very good: 0.88 for AIA 211, 0.87 for AIA 1600 and 0.93 for AIA 1700. These facts imply that AI-generated EUV images quite similar to AIA ones. Applying this model to the Galileo sunspot drawings in 1612, we generate HMI-like magnetograms and AIA-like EUV images of the sunspots. This application will be used to generate solar images using historical sunspot drawings.

  • PDF

Med-StyleGAN2: A GAN-Based Synthetic Data Generation for Medical Image Generation (Med-StyleGAN2: 의료 영상 생성을 위한 GAN 기반의 합성 데이터 생성)

  • Jae-Ha Choi;Sung-Yeon Kim;Hae-Rin Byeon;Se-Yeon Lee;Jung-Soo Lee
    • Annual Conference of KIPS
    • /
    • 2023.11a
    • /
    • pp.904-905
    • /
    • 2023
  • 본 논문에서는 의료 영상 생성을 위한 Med-StyleGAN2를 제안한다. 생성적 적대 신경망은 이미지 생성에는 효과적이지만, 의료 영상 생성에는 한계점을 가지고 있다. 따라서 본 연구에서는 의료 영상 생성에 특화된 StyleGAN 기반 학습 모델을 제안한다. 이는 다양한 의료 영상 어플리케이션에 활용할 수 있으며, 생성된 의료 영상에 대한 정량적, 정성적 평가를 수행함으로써 의료 영상 생성 분야의 발전 가능성에 대해 연구한다.

Research on Core patent mining methods based on key components of Generative AI (생성형 인공지능 기술의 핵심 구성 요소 기반 주요 특허 발굴 방법에 관한 연구)

  • Gayun Kim;Beom-Seok Kim;Jinhong Yang
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.16 no.5
    • /
    • pp.292-300
    • /
    • 2023
  • This paper proposes a patent discovery method and strategy for Generative AI-related patents by utilizing qualitative evaluation indicators established based on the core components of the technology. Currently, the evaluation of patent quality relies on quantitative indicators, but existing quantitative indicators cannot represent the characteristics of Generative AI technology, making it difficult to accurately evaluate. Therefore, there is a need for additional qualitative indicators that consider technical characteristics based on patent claims, which can reveal the actual strength of the patent. In this paper, we propose a new evaluation index considering the technical characteristics of Generative AI. Core patents were selected using the proposed evaluation index, and the appropriateness of the proposed index was verified through the existing quantitative evaluation method for the selected core patents.

Current Status of Development and Practice of Artificial Intelligence Solutions for Digital Transformation of Fashion Manufacturers (패션 제조 기업의 디지털 트랜스포메이션을 위한 인공지능 솔루션 개발 및 활용 현황)

  • Kim, Ha Youn;Choi, Woojin;Lee, Yuri;Jang, Seyoon
    • Journal of Fashion Business
    • /
    • v.26 no.2
    • /
    • pp.28-47
    • /
    • 2022
  • Rapid development of information and communication technology is leading the digital transformation (hereinafter, DT) of various industries. At this point in rapid online transition, fashion manufacturers operating offline-oriented businesses have become highly interested in DT and artificial intelligence (hereinafter AI), which leads DT. The purpose of this study is to examine the development status and application case of AI-based digital technology developed for the fashion industry, and to examine the DT stage and AI application status of domestic fashion manufacturers. Hence, in-depth interviews were conducted with five domestic IT companies developing AI technology for the fashion industry and six domestic fashion manufacturers applying AI technology. After analyzing interviews, study results were as follows: The seven major AI technologies leading the DT of the fashion industry were fashion image recognition, trend analysis, prediction & visualization, automated fashion design generation, demand forecast & optimizing inventory, optimizing logistics, curation, and ad-tech. It was found that domestic fashion manufacturers were striving for innovative changes through DT although the DT stage varied from company to company. This study is of academic significance as it organized technologies specialized in fashion business by analyzing AI-based digitization element technologies that lead DT in the fashion industry. It is also expected to serve as basic study when DT and AI technology development are applied to the fashion field so that traditional domestic fashion manufacturers showing low growth can rise again.