• Title/Summary/Keyword: Train Generation

Application of Deep Learning to Solar Data: 3. Generation of Solar images from Galileo sunspot drawings

  • Lee, Harim;Moon, Yong-Jae;Park, Eunsu;Jeong, Hyunjin;Kim, Taeyoung;Shin, Gyungin
    • The Bulletin of The Korean Astronomical Society / v.44 no.1 / pp.81.2-81.2 / 2019
  • We develop an image-to-image translation model, a popular deep learning method based on conditional Generative Adversarial Networks (cGANs), to generate solar magnetograms and EUV images from sunspot drawings. We train the model using pairs of sunspot drawings from Mount Wilson Observatory (MWO) and their corresponding SDO/HMI magnetograms and SDO/AIA EUV images (512 by 512) from January 2012 to September 2014. We test the model by comparing pairs of actual SDO images (magnetograms and EUV images) and the corresponding AI-generated ones from October to December 2014. Our results show that the bipolar structures and coronal loop structures of the AI-generated images are consistent with those of the original ones. The unsigned magnetic fluxes of the generated magnetograms correlate well with those of the original ones, with a correlation coefficient of 0.86. We also compute pixel-to-pixel correlations between the SDO/AIA EUV images and the AI-generated ones. The average correlations over 92 test samples for several SDO lines are high: 0.88 for AIA 211, 0.87 for AIA 1600, and 0.93 for AIA 1700. These results imply that the AI-generated EUV images are quite similar to the AIA ones. Applying this model to the Galileo sunspot drawings of 1612, we generate HMI-like magnetograms and AIA-like EUV images of the sunspots. This application will be used to generate solar images from historical sunspot drawings.

Application of Deep Learning to Solar Data: 2. Generation of Solar UV & EUV images from magnetograms

  • Park, Eunsu;Moon, Yong-Jae;Lee, Harim;Lim, Daye
    • The Bulletin of The Korean Astronomical Society / v.44 no.1 / pp.81.3-81.3 / 2019
  • In this study, we apply a conditional Generative Adversarial Network (cGAN), a deep learning method, to image-to-image translation from solar magnetograms to solar UV and EUV images. We train a model using pairs of SDO/AIA UV and EUV images at nine wavelengths and their corresponding SDO/HMI line-of-sight magnetograms from 2011 to 2017, excluding August and September of each year. We evaluate the model by comparing pairs of SDO/AIA images and the corresponding generated ones from August and September. Our results are as follows. First, we successfully generate SDO/AIA-like solar UV and EUV images from SDO/HMI magnetograms. Second, our model has pixel-to-pixel correlation coefficients (CC) higher than 0.8 for every wavelength except 171. Third, our model slightly underestimates the pixel values in terms of relative error (RE), but the RE values are quite small. Fourth, considering CC and RE together, the 1600 and 1700 photospheric UV line images, whose structures are quite similar to those of the corresponding magnetograms, give the best results among all lines. This methodology may be applicable to many scientific fields that use images taken through several different filters.
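The two evaluation metrics named in this abstract can be illustrated with a minimal sketch. The pixel-to-pixel CC is taken here to be the Pearson correlation over all pixels. The abstract does not define RE, so the formula below (difference of summed pixel values divided by the real sum) is an assumption, as are the function names and the toy 2x2 images.

```python
import math

def pixel_correlation(real, generated):
    """Pearson correlation between two equally sized images, pixel by pixel."""
    x = [p for row in real for p in row]
    y = [p for row in generated for p in row]
    if len(x) != len(y):
        raise ValueError("images must have the same number of pixels")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def relative_error(real, generated):
    """Assumed definition: (sum(generated) - sum(real)) / sum(real)."""
    total_real = sum(p for row in real for p in row)
    total_gen = sum(p for row in generated for p in row)
    return (total_gen - total_real) / total_real

# Toy images: the "generated" image is the real one dimmed by 10%.
real = [[10.0, 20.0], [30.0, 40.0]]
generated = [[9.0, 18.0], [27.0, 36.0]]
cc = pixel_correlation(real, generated)
rel_err = relative_error(real, generated)
```

In this toy case the structure is reproduced exactly, so the CC is high, while the negative RE reflects the uniform underestimate. This shows why the two metrics are worth reading together.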

A Comparison of Deep Neural Network Structures for Learning Various Motions (다양한 동작 학습을 위한 깊은신경망 구조 비교)

  • Park, Soohwan;Lee, Jehee
    • Journal of the Korea Computer Graphics Society / v.27 no.5 / pp.73-79 / 2021
  • Recently, in the field of computer animation, methods for generating motion with deep learning have been studied as an alternative to conventional finite-state machines or graph-based methods. The expressiveness a network needs for learning motions depends more on the diversity of the motions to be learned than on their total length. This study aims to find an efficient network structure for the case where the types of motions to be learned are diverse. In this paper, we train and compare networks of the following types: a basic fully-connected structure, a mixture-of-experts structure that uses multiple fully-connected layers in parallel, a recurrent neural network, which is widely used for seq2seq tasks, and a transformer structure, which is used for processing sequence data in natural language processing.

PathGAN: Local path planning with attentive generative adversarial networks

  • Dooseop Choi;Seung-Jun Han;Kyoung-Wook Min;Jeongdan Choi
    • ETRI Journal / v.44 no.6 / pp.1004-1019 / 2022
  • For autonomous driving without high-definition maps, we present a model capable of generating multiple plausible paths from egocentric images for autonomous vehicles. Our generative model comprises two neural networks: a feature extraction network (FEN) and a path generation network (PGN). The FEN extracts meaningful features from an egocentric image, whereas the PGN generates multiple paths from the features, given a driving intention and speed. To ensure that the generated paths are plausible and consistent with the intention, we introduce an attentive discriminator and train it with the PGN under a generative adversarial network framework. Furthermore, we devise an interaction model between the positions in the paths and the intentions hidden in the positions, and we design a novel PGN architecture that reflects the interaction model to improve the accuracy and diversity of the generated paths. Finally, we introduce ETRIDriving, a dataset for autonomous driving in which the recorded sensor data are labeled with discrete high-level driving actions, and demonstrate the state-of-the-art performance of the proposed model on ETRIDriving in terms of accuracy and diversity.

Generating Synthetic Raman Spectra of DMMP and 2-CEES by Mathematical Transforms and Deep Generative Models (수학적 변환과 심층 생성 모델을 활용한 DMMP와 2-CEES의 모의 라만 분광 생성)

  • Sungwon Park;Boseong Jeong;Hongjoong Kim
    • Journal of the Korea Institute of Military Science and Technology / v.26 no.5 / pp.422-430 / 2023
  • To build an automated system that detects toxic chemicals from Raman spectra, we need sufficient data on toxic chemicals. However, gathering Raman spectra of toxic chemicals in diverse situations is usually costly. To tackle this problem, we develop methods to generate synthetic Raman spectra of DMMP and 2-CEES without actual experiments. First, we propose certain mathematical transforms to augment a small number of original Raman spectra. Then, we train deep generative models to generate more realistic and diverse data. By visualizing the synthetic Raman spectra of toxic chemicals generated by our methods, we qualitatively verify that the data are diverse and sufficiently similar to the original data. In conclusion, we obtain a synthetic dataset of DMMP and 2-CEES with the proposed algorithm.
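The abstract does not say which mathematical transforms are used, so the following is only a hypothetical illustration of transform-based augmentation of a 1-D spectrum. The three transforms (a shift along the wavenumber axis, an intensity scaling, and additive Gaussian noise), the function name, and the toy spectrum are all assumptions rather than the paper's method.

```python
import random

def augment_spectrum(intensities, shift=0, scale=1.0, noise_std=0.0, rng=None):
    """Return a shifted, scaled, optionally noisy copy of a 1-D spectrum.

    All three transforms are illustrative assumptions, not the paper's method.
    """
    rng = rng or random.Random()
    n = len(intensities)
    out = []
    for i in range(n):
        # Shift by `shift` bins, repeating the edge value where data runs out.
        src = min(max(i - shift, 0), n - 1)
        noise = rng.gauss(0.0, noise_std) if noise_std > 0 else 0.0
        out.append(scale * intensities[src] + noise)
    return out

# Toy spectrum with a single peak at index 2.
spectrum = [0.0, 1.0, 5.0, 1.0, 0.0, 0.0]
shifted = augment_spectrum(spectrum, shift=1, scale=2.0)
noisy = augment_spectrum(spectrum, noise_std=0.1, rng=random.Random(0))
```

Drawing `shift`, `scale`, and `noise_std` at random for each copy would turn a handful of measured spectra into a larger set, which could then serve as training data for a generative model.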

A Study on Korean Speech Animation Generation Employing Deep Learning (딥러닝을 활용한 한국어 스피치 애니메이션 생성에 관한 고찰)

  • Suk Chan Kang;Dong Ju Kim
    • KIPS Transactions on Software and Data Engineering / v.12 no.10 / pp.461-470 / 2023
  • While speech animation generation using deep learning has been actively researched for English, there has been no prior work for Korean. This paper is the first to employ supervised deep learning to generate Korean speech animation. In doing so, we identify a significant effect of deep learning: it lets speech animation research reduce to speech recognition research, which is the predominant technique. We also study how to make the best use of this effect for Korean speech animation generation. The effect can help revitalize the recently inactive Korean speech animation research efficiently and effectively by clarifying the top-priority research target. This paper proceeds as follows: (i) it chooses the blendshape animation technique, (ii) implements the deep learning model as a master-servant pipeline of the automatic speech recognition (ASR) module and the facial action coding (FAC) module, (iii) builds a Korean speech facial motion capture dataset, (iv) prepares two deep learning models for comparison (one adopts the English ASR module and the other adopts the Korean ASR module, but both use the same basic structure for their FAC modules), and (v) trains the FAC modules of both models dependently on their ASR modules. The user study demonstrates that the model that adopts the Korean ASR module and dependently trains its FAC module (scoring 4.2/5.0 points) generates much more natural Korean speech animations than the model that adopts the English ASR module and dependently trains its FAC module (scoring 2.7/5.0 points). The result confirms the aforementioned effect, showing that the quality of Korean speech animation comes down to the accuracy of Korean ASR.

The way to make training data for deep learning model to recognize keywords in product catalog image at E-commerce (온라인 쇼핑몰에서 상품 설명 이미지 내의 키워드 인식을 위한 딥러닝 훈련 데이터 자동 생성 방안)

  • Kim, Kitae;Oh, Wonseok;Lim, Geunwon;Cha, Eunwoo;Shin, Minyoung;Kim, Jongwoo
    • Journal of Intelligence and Information Systems / v.24 no.1 / pp.1-23 / 2018
  • Since the start of the 21st century, various high-quality services have emerged with the growth of the internet and information and communication technologies. In particular, the E-commerce industry, in which Amazon and eBay stand out, is growing explosively. As E-commerce grows, customers can easily find what they want to buy while comparing various products, because more products are registered at online shopping malls. However, this growth has also created a problem. Because so many products are registered, it has become difficult for customers to find what they really need in the flood of products. When customers search with a general keyword, too many products are returned. Conversely, few products are returned when customers type in product details, because concrete product attributes are rarely registered. In this situation, automatically recognizing text in images can be a solution. Because the bulk of product details is written in catalogs in image format, most product information cannot be found with text queries in current text-based search systems. If the information in images can be converted to text, customers can search for products by their details, which makes shopping more convenient. Various existing OCR (Optical Character Recognition) programs can recognize text in images, but they are hard to apply to catalogs because they have trouble recognizing text in certain circumstances, for example when the text is not big enough or the fonts are not consistent. Therefore, this research proposes a way to recognize keywords in catalogs with deep learning, which has been the state of the art in image recognition since the 2010s.
The Single Shot Multibox Detector (SSD), a model well regarded for its object-detection performance, can be used with a structure redesigned to account for the differences between text and objects. However, the SSD model needs a large amount of labeled training data, because deep learning models of this kind are trained by supervised learning. To collect data, one can manually label the location and class of text in catalogs, but manual collection raises several problems. Some keywords would be missed because humans make mistakes while labeling. Collecting training data also becomes too time-consuming given the scale of data needed, or too costly if many workers are hired to shorten the time. Furthermore, if specific keywords need to be trained, finding images that contain those words is also difficult. To solve the data issue, this research developed a program that creates training data automatically. The program generates catalog-like images containing various keywords and pictures and saves the location information of the keywords at the same time. With this program, data can be collected efficiently and the performance of the SSD model also improves. The SSD model achieved a recognition rate of 81.99% with 20,000 samples created by the program. Moreover, this research tested the SSD model on datasets with different properties to analyze which features of the data influence the performance of recognizing text in images. The results show that the number of labeled keywords, the addition of overlapped keyword labels, the existence of unlabeled keywords, the spacing among keywords, and the differences among background images are related to the performance of the SSD model. These findings can help improve the performance of the SSD model or other deep-learning-based text recognizers by guiding the construction of high-quality data.
The SSD model redesigned to recognize text in images and the program developed for creating training data are expected to contribute to improving search systems in E-commerce. Suppliers can spend less time registering keywords for products, and customers can search for products using the details written in the catalog.
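The core idea of the data-generation program, placing keywords on a canvas and recording their locations at the same time, can be sketched as follows. This is a hypothetical illustration, not the authors' program: it produces only the bounding-box annotations (no rendered image), and the fixed glyph size, the function names, and the rejection-sampling placement are all assumptions.

```python
import random

CHAR_W, CHAR_H = 12, 20  # assumed fixed glyph size in pixels (hypothetical)

def overlaps(a, b):
    """True if two (x, y, w, h) boxes intersect."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    return ax < bx + bw and bx < ax + aw and ay < by + bh and by < ay + ah

def make_labels(keywords, width, height, rng, max_tries=50):
    """Place each keyword at a random non-overlapping position and
    return the (keyword, box) annotations a detector would train on."""
    labels = []
    for word in keywords:
        w, h = CHAR_W * len(word), CHAR_H
        for _ in range(max_tries):
            x = rng.randint(0, width - w)
            y = rng.randint(0, height - h)
            box = (x, y, w, h)
            if not any(overlaps(box, other) for _, other in labels):
                labels.append((word, box))
                break  # keyword placed; move on to the next one
    return labels

labels = make_labels(["cotton", "waterproof", "XL"], 400, 300, random.Random(0))
```

Because the labels are produced by the same code that decides where the text goes, no keyword can be missed or mislabeled, which is the advantage over manual labeling that the abstract describes.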

Structural Analysis of Power Transmission Mechanism of Electro-Mechanical Brake Device for High Speed Train (고속열차용 전기기계식 제동장치의 동력전달 기구물에 대한 구조해석)

  • Oh, Hyuck Keun;Beak, Seung-Koo;Jeon, Chang-Sung
    • Journal of the Korea Academia-Industrial cooperation Society / v.20 no.12 / pp.237-246 / 2019
  • The Electro-Mechanical Brake (EMB) is the next-generation braking system for automobiles and railway vehicles. Current brake systems for high-speed trains generate a braking force using a pneumatic cylinder, but EMB systems produce that force through a combination of an electric motor and a gear. In this study, an EMB operation mechanism capable of generating a high braking force was proposed, and structural and vibration analyses of the gears and shafts, which are the core parts of the mechanism, were performed. Dynamic structural analysis confirmed that the maximum stress in the analysis model was within the yield strength of the material. In addition, a design that maximizes the diameter of the motor shaft was found to be advantageous in strength, and a large shear stress could be generated in the bolt fixing the gear and eccentric shaft. A test apparatus that reproduces the mechanism of the analytical model was also fabricated to measure the strain of the fixing bolt, which is the most vulnerable part. The strain measurements showed that the error between the analysis and the measurement was within 10%, which supports the accuracy of the analytical model.

A Study on the Contribution of Exterior Devices to Running Resistance in High-Speed Trains (고속열차 외부장치에 의한 주행저항 기여도 연구)

  • Oh, Hyuck Keun;Kwak, Minho;Kwon, Hyeok-bin;Kim, Sang-soo;Kim, Seogwon
    • Journal of the Korean Society for Railway / v.18 no.4 / pp.309-316 / 2015
  • The contribution of exterior devices such as bogie fairings and pantographs to running resistance was estimated from coasting tests at up to 350 km/h using the Korean next-generation high-speed train (HEMU-430X). To assess the reduction of air resistance by the nose car's bogie fairing, coasting tests were conducted with a removable bogie fairing over various speed ranges. The contribution of the pantograph to air resistance was also estimated with coasting tests that included the pantograph's raised and lowered modes. Linear regression was used to obtain decelerations from the time-velocity data, and an equation of resistance to motion is proposed from the deceleration data. From the aerodynamic term of the equation of resistance to motion, the contributions of the nose car's bogie fairing and the pantograph to air resistance were estimated. The results show that the nose car's bogie fairing reduced air resistance by about 3.8% and that the pantograph (open-knee mode) increased air resistance by 3.9%.
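The procedure described above can be sketched in two steps: a linear regression over short windows of time-velocity data gives a deceleration at each mean speed, and a quadratic fit of deceleration against speed gives a resistance equation whose v² coefficient is the aerodynamic term. The quadratic form a + b·v + c·v², the function names, the speed unit (100 km/h, chosen to keep the fit well conditioned), and all numbers below are assumptions for illustration, not the paper's data or exact method.

```python
def fit_line(ts, vs):
    """Least-squares slope and intercept of v = intercept + slope * t."""
    n = len(ts)
    mt, mv = sum(ts) / n, sum(vs) / n
    sxx = sum((t - mt) ** 2 for t in ts)
    sxy = sum((t - mt) * (v - mv) for t, v in zip(ts, vs))
    slope = sxy / sxx
    return slope, mv - slope * mt

def decelerations(ts, vs, window):
    """Return (mean speed, deceleration) for consecutive windows of samples."""
    out = []
    for i in range(0, len(ts) - window + 1, window):
        slope, _ = fit_line(ts[i:i + window], vs[i:i + window])
        out.append((sum(vs[i:i + window]) / window, -slope))
    return out

def solve3(m, rhs):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    a = [row[:] + [r] for row, r in zip(m, rhs)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, 3):
            f = a[r][col] / a[col][col]
            for c in range(col, 4):
                a[r][c] -= f * a[col][c]
    x = [0.0] * 3
    for r in (2, 1, 0):
        x[r] = (a[r][3] - sum(a[r][c] * x[c] for c in range(r + 1, 3))) / a[r][r]
    return x

def fit_resistance(points):
    """Fit deceleration = a + b*v + c*v**2 by least squares (normal equations)."""
    s = [sum(v ** k for v, _ in points) for k in range(5)]
    t = [sum(d * v ** k for v, d in points) for k in range(3)]
    m = [[s[0], s[1], s[2]], [s[1], s[2], s[3]], [s[2], s[3], s[4]]]
    return solve3(m, t)

# Step 1: deceleration from synthetic time-velocity samples (not measured data).
ts = list(range(10))
vs = [3.0 - 0.05 * t for t in ts]
windows = decelerations(ts, vs, 5)

# Step 2: compare the aerodynamic coefficient c of two synthetic configurations.
speeds = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5]  # in units of 100 km/h (assumed)
def synthetic(c):
    return [(v, 0.02 + 0.01 * v + c * v * v) for v in speeds]
_, _, c_without = fit_resistance(synthetic(0.010))  # e.g. fairing removed
_, _, c_with = fit_resistance(synthetic(0.009))     # e.g. fairing installed
reduction = (c_without - c_with) / c_without
```

Comparing only the fitted v² coefficient isolates the aerodynamic contribution from the speed-independent and linear terms, which is why the abstract attributes the fairing and pantograph effects to that term.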

A Study on the Visual Design Education System in the Digital Media Era (디지털미디어 시대의 시각디자인 교육시스템 연구)

  • 정봉금
    • Archives of design research / v.16 no.3 / pp.341-350 / 2003
  • The defining theme of 21st-century culture is the appearance of digital media. It has brought changes as big as those of the industrial revolution, and our society is now dominated by digital media. The main objective of this study is to forecast the direction of visual design education by researching and analyzing how the introduction of digital media is influencing the evolution of the identity of visual design, an ever-changing and developing discipline. Also, since the main target of digital media is the young generation, change in the way visual language is expressed is inevitable. In fact, the introduction of digital media has already changed the methods of creating and distributing visual communication considerably. In the past, most design education institutions had similar objectives, curricula, and teaching methods, aimed at preparing students for practical business. In the digital media era, however, the application and utilization of visual design are incomparably more diverse, and they are generally classified as interaction. The purpose of this study is to find a way to train visual design professionals in this digital era. To this end, this study identifies a new educational system that meets the demands of society by fusing traditional education with new digital education, and suggests what a design education institute that stays ahead of the demands of society should look like.
