• Title/Summary/Keyword: Generative Model

Search Result 328, Processing Time 0.026 seconds

An Development of Image Retrieval Model based on Image2Vec using GAN (Generative Adversarial Network를 활용한 Image2Vec기반 이미지 검색 모델 개발)

  • Jo, Jaechoon;Lee, Chanhee;Lee, Dongyub;Lim, Heuiseok
    • Journal of Digital Convergence
    • /
    • v.16 no.12
    • /
    • pp.301-307
    • /
    • 2018
  • The most of the IR focus on the method for searching the document, so the keyword-based IR system is not able to reflect the feature information of the image. In order to overcome these limitations, we have developed a system that can search similar images based on the vector information of images, and it can search for similar images based on sketches. The proposed system uses the GAN to up sample the sketch to the image level, convert the image to the vector through the CNN, and then retrieve the similar image using the vector space model. The model was learned using fashion image and the image retrieval system was developed. As a result, the result is showed meaningful performance.

Improved Method of License Plate Detection and Recognition Facilitated by Fast Super-Resolution GAN (Fast Super-Resolution GAN 기반 자동차 번호판 검출 및 인식 성능 고도화 기법)

  • Min, Dongwook;Lim, Hyunseok;Gwak, Jeonghwan
    • Smart Media Journal
    • /
    • v.9 no.4
    • /
    • pp.134-143
    • /
    • 2020
  • Vehicle License Plate Recognition is one of the approaches for transportation and traffic safety networks, such as traffic control, speed limit enforcement and runaway vehicle tracking. Although it has been studied for decades, it is attracting more and more attention due to the recent development of deep learning and improved performance. Also, it is largely divided into license plate detection and recognition. In this study, experiments were conducted to improve license plate detection performance by utilizing various object detection methods and WPOD-Net(Warped Planar Object Detection Network) model. The accuracy was improved by selecting the method of detecting the vehicle(s) and then detecting the license plate(s) instead of the conventional method of detecting the license plate using the object detection model. In particular, the final performance was improved through the process of removing noise existing in the image by using the Fast-SRGAN model, one of the Super-Resolution methods. As a result, this experiment showed the performance has improved an average of 4.34% from 92.38% to 96.72% compared to previous studies.

A Study on Image Generation from Sentence Embedding Applying Self-Attention (Self-Attention을 적용한 문장 임베딩으로부터 이미지 생성 연구)

  • Yu, Kyungho;No, Juhyeon;Hong, Taekeun;Kim, Hyeong-Ju;Kim, Pankoo
    • Smart Media Journal
    • /
    • v.10 no.1
    • /
    • pp.63-69
    • /
    • 2021
  • When a person sees a sentence and understands the sentence, the person understands the sentence by reminiscent of the main word in the sentence as an image. Text-to-image is what allows computers to do this associative process. The previous deep learning-based text-to-image model extracts text features using Convolutional Neural Network (CNN)-Long Short Term Memory (LSTM) and bi-directional LSTM, and generates an image by inputting it to the GAN. The previous text-to-image model uses basic embedding in text feature extraction, and it takes a long time to train because images are generated using several modules. Therefore, in this research, we propose a method of extracting features by using the attention mechanism, which has improved performance in the natural language processing field, for sentence embedding, and generating an image by inputting the extracted features into the GAN. As a result of the experiment, the inception score was higher than that of the model used in the previous study, and when judged with the naked eye, an image that expresses the features well in the input sentence was created. In addition, even when a long sentence is input, an image that expresses the sentence well was created.

A GAN-based face rotation technique using 3D face model for game characters (3D 얼굴 모델 기반의 GAN을 이용한 게임 캐릭터 회전 기법)

  • Kim, Handong;Han, Jongdae;Yang, Heekyung;Min, Kyungha
    • Journal of Korea Game Society
    • /
    • v.21 no.3
    • /
    • pp.13-24
    • /
    • 2021
  • This paper shows the face rotation applicable to game character facial illustration. Existing studies limited data to human face data, required a large amount of data, and the synthesized results were not good. In this paper, the following method was introduced to solve the existing problems of existing studies. First, a 3D model with features of the input image was rotated and then rendered as a 2D image to construct a data set. Second, by designing GAN that can learn features of various poses from the data built through the 3D model, the input image can be synthesized at a desired pose. This paper presents the results of synthesizing the game character face illustration. From the synthesized result, it can be confirmed that the proposed method works well.

Dog-Species Classification through CycleGAN and Standard Data Augmentation

  • Chan, Park;Nammee, Moon
    • Journal of Information Processing Systems
    • /
    • v.19 no.1
    • /
    • pp.67-79
    • /
    • 2023
  • In the image field, data augmentation refers to increasing the amount of data through an editing method such as rotating or cropping a photo. In this study, a generative adversarial network (GAN) image was created using CycleGAN, and various colors of dogs were reflected through data augmentation. In particular, dog data from the Stanford Dogs Dataset and Oxford-IIIT Pet Dataset were used, and 10 breeds of dog, corresponding to 300 images each, were selected. Subsequently, a GAN image was generated using CycleGAN, and four learning groups were established: 2,000 original photos (group I); 2,000 original photos + 1,000 GAN images (group II); 3,000 original photos (group III); and 3,000 original photos + 1,000 GAN images (group IV). The amount of data in each learning group was augmented using existing data augmentation methods such as rotating, cropping, erasing, and distorting. The augmented photo data were used to train the MobileNet_v3_Large, ResNet-152, InceptionResNet_v2, and NASNet_Large frameworks to evaluate the classification accuracy and loss. The top-3 accuracy for each deep neural network model was as follows: MobileNet_v3_Large of 86.4% (group I), 85.4% (group II), 90.4% (group III), and 89.2% (group IV); ResNet-152 of 82.4% (group I), 83.7% (group II), 84.7% (group III), and 84.9% (group IV); InceptionResNet_v2 of 90.7% (group I), 88.4% (group II), 93.3% (group III), and 93.1% (group IV); and NASNet_Large of 85% (group I), 88.1% (group II), 91.8% (group III), and 92% (group IV). The InceptionResNet_v2 model exhibited the highest image classification accuracy, and the NASNet_Large model exhibited the highest increase in the accuracy owing to data augmentation.

Regeneration of a defective Railroad Surface for defect detection with Deep Convolution Neural Networks (Deep Convolution Neural Networks 이용하여 결함 검출을 위한 결함이 있는 철도선로표면 디지털영상 재 생성)

  • Kim, Hyeonho;Han, Seokmin
    • Journal of Internet Computing and Services
    • /
    • v.21 no.6
    • /
    • pp.23-31
    • /
    • 2020
  • This study was carried out to generate various images of railroad surfaces with random defects as training data to be better at the detection of defects. Defects on the surface of railroads are caused by various factors such as friction between track binding devices and adjacent tracks and can cause accidents such as broken rails, so railroad maintenance for defects is necessary. Therefore, various researches on defect detection and inspection using image processing or machine learning on railway surface images have been conducted to automate railroad inspection and to reduce railroad maintenance costs. In general, the performance of the image processing analysis method and machine learning technology is affected by the quantity and quality of data. For this reason, some researches require specific devices or vehicles to acquire images of the track surface at regular intervals to obtain a database of various railway surface images. On the contrary, in this study, in order to reduce and improve the operating cost of image acquisition, we constructed the 'Defective Railroad Surface Regeneration Model' by applying the methods presented in the related studies of the Generative Adversarial Network (GAN). Thus, we aimed to detect defects on railroad surface even without a dedicated database. This constructed model is designed to learn to generate the railroad surface combining the different railroad surface textures and the original surface, considering the ground truth of the railroad defects. The generated images of the railroad surface were used as training data in defect detection network, which is based on Fully Convolutional Network (FCN). To validate its performance, we clustered and divided the railroad data into three subsets, one subset as original railroad texture images and the remaining two subsets as another railroad surface texture images. In the first experiment, we used only original texture images for training sets in the defect detection model. And in the second experiment, we trained the generated images that were generated by combining the original images with a few railroad textures of the other images. Each defect detection model was evaluated in terms of 'intersection of union(IoU)' and F1-score measures with ground truths. As a result, the scores increased by about 10~15% when the generated images were used, compared to the case that only the original images were used. This proves that it is possible to detect defects by using the existing data and a few different texture images, even for the railroad surface images in which dedicated training database is not constructed.

A Study on ER Suspension System with Energy Generation (재생 에너지를 특징으로하는 ER현가장치 연구)

  • 김기선;김승환
    • Journal of the Korean Institute of Telematics and Electronics T
    • /
    • v.36T no.1
    • /
    • pp.71-78
    • /
    • 1999
  • This paper presents a new type of energy generative ER suspension system which does not require external power sources. This is accomplished by converting vibration energy(kinetic energy) into electrical energy. In order to undertake this, an appropriate size of the ER damper is manufactured by incorporation a mechanism which changes the linear motion of the ER damper to the rotary motion. This rotary motion is amplified by gears and activates a generator to produce the electrical energy. The efficiency of energy generation is evaluated and the level of damping force with generated power is also investigated. Then, the ER suspension system is applied to the quarter car model, and its vibration isolation is experimentally evaluated with respect to the piston speed.

  • PDF

The Generative Mechanism of Cloud Streets

  • Kang Sung-Dae;Kimura Fujio
    • Environmental Sciences Bulletin of The Korean Environmental Sciences Society
    • /
    • v.1 no.2
    • /
    • pp.119-124
    • /
    • 1997
  • Cloud streets were successfully simulated by numerical model (RAMS) including an isolated mountain near the coast, large sensible heat flux from the sea surface, uniform stratification and wind velocity with low Froude number (0.25) in the inflow boundary. The well developed cloud streets between a pair of convective rolls are simulated at a level of 1 km over the sea. The following five results were obtained: 1) For the formation of the pair of convective rolls, both strong static instability and a topographically induced mechanical disturbance are strongly required at the same time. 2) Strong sensible heat flux from the sea surface is the main energy source of the pair of convective rolls, and the buoyancy caused by condensation in the cloud is negligibly small. 3) The pair of convective rolls is a complex of two sub-rolls. One is the outer roll, which has a large radius, but weak circulation, and the other is the inner roll, which has a small radius, but strong circulation. The outer roll gathers a large amount of moisture by convergence in the lower marine boundary, and the inner roll transfers the convergent moisture to the upper boundary layer by strong upward motion between them. 4) The pair of inner rolls form the line-shaped cloud streets, and keep them narrow along the center-line of the domain. 5) Both by non-hydrostatic and by hydrostatic assumptions, cloud streets can be simulated. In our case, non-hydrostatic processes enhanced somewhat the formation of cloud streets. The horizontal size of the topography does not seem to be restricted to within the small scale where non-hydrostatic effects are important.

  • PDF

A Study on Problem Based Storytelling of Digital Game Modification (디지털 게임 모드의 문제 기반 스토리텔링 연구)

  • Yun, Hye-Young
    • Journal of Korea Game Society
    • /
    • v.16 no.3
    • /
    • pp.65-76
    • /
    • 2016
  • The paradigm of the human experience through computer as a medium is moving from cultural interface to cultural software. Mod represents this paradigm shift in digital game culture. Until now, Digital game play focused on the immersive agency in the process of achieving the goals of the game. On the other hand, In Mod, play focus on generative agency in the process of creating and solving problems in game world. Through Mod, original game perceived as transformable data and creative material. Transformation and generation of such a game content through the user's Mod presents a sustainable model of the digital game.

A Study on methodology of physical Fabrication & reorganization of Epidermis in Space Design - Focused on reorganization of Epidermis, Fabrication - (공간디자인에서 디지털 표피 재 조직화, 물리적 구현 방법 연구 - 표피 재 조직화, 가공 중심으로 -)

  • Park, Jeong-Joo
    • Korean Institute of Interior Design Journal
    • /
    • v.17 no.2
    • /
    • pp.150-161
    • /
    • 2008
  • It requires more close cooperation process and mediator for smooth communication in this industry structure where design and engineers are separated. The database of design integrates separate system and helps connection between organizations. The application category is utilized variously from formation to operation. Architectures addressed in this content as Frank Gehry and Nox are making differentiated design on the base of 3d digital methodology and using it widely from generation to fabrication. Especially they got to be free from the generative limit as it became available to analyse, digital surface organization, and realize the complex system form. Now more integrated and delicate works got to be affordable owing to various kinds of improved CNC, RP(rapid-prototype) machines, and architecture hardwares. With a linkage of software now at their disposal, architects can create a digital model of a building and all of its design elements, and in turn use this 3d information to construct actual building components using machines driven by CNC and other advanced manufacturing techniques. Digital technologies are enabling a direct correlation between what design and construction, thus bringing to the forefront the issue of the significance of information, the production, communication, application, and control of digital information in the industrial system. The central requirement is the clear, reliable, and consistent exchange of information among all parties involved in creating a given project.