• Title/Summary/Keyword: Variational Autoencoder

Search Result 35, Processing Time 0.025 seconds

The Applicability of Conditional Generative Model Generating Groundwater Level Fluctuation Corresponding to Precipitation Pattern (조건부 생성모델을 이용한 강수 패턴에 따른 지하수위 생성 및 이의 활용에 관한 연구)

  • Jeong, Jiho;Jeong, Jina;Lee, Byung Sun;Song, Sung-Ho
    • Economic and Environmental Geology
    • /
    • v.54 no.1
    • /
    • pp.77-89
    • /
    • 2021
  • In this study, a method has been proposed to improve the performance of hydraulic property estimation model developed by Jeong et al. (2020). In their study, low-dimensional features of the annual groundwater level (GWL) fluctuation patterns extracted based on a Denoising autoencoder (DAE) was used to develop a regression model for predicting hydraulic properties of an aquifer. However, low-dimensional features of the DAE are highly dependent on the precipitation pattern even if the GWL is monitored at the same location, causing uncertainty in hydraulic property estimation of the regression model. To solve the above problem, a process for generating the GWL fluctuation pattern for conditioning the precipitation is proposed based on a conditional variational autoencoder (CVAE). The CVAE trains a statistical relationship between GWL fluctuation and precipitation pattern. The actual GWL and precipitation data monitored on a total of 71 monitoring stations over 10 years in South Korea was applied to validate the effect of using CVAE. As a result, the trained CVAE model reasonably generated GWL fluctuation pattern with the conditioning of various precipitation patterns for all the monitoring locations. Based on the trained CVAE model, the low-dimensional features of the GWL fluctuation pattern without interference of different precipitation patterns were extracted for all monitoring stations, and they were compared to the features extracted based on the DAE. Consequently, it can be confirmed that the statistical consistency of the features extracted using CVAE is improved compared to DAE. Thus, we conclude that the proposed method may be useful in extracting a more accurate feature of GWL fluctuation pattern affected solely by hydraulic characteristics of the aquifer, which would be followed by the improved performance of the previously developed regression model.

Reinforcement of user authentication system of shared kick scooter using autoencoder and variational autoencoder (오토인코더와 변이형 오토인코더를 활용한 공유 킥보드 사용자 인증 시스템 강화)

  • Kang, Yea-Jun;Kim, Hyun-Ji;Lim, Se-Jin;Kim, Won-Woong;Seo, Hwa-Jeong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.11a
    • /
    • pp.643-646
    • /
    • 2021
  • 경찰청에 따르면 도로교통법이 개정된 이후 3개월단 개인형 이동장치(PM)를 단속한 결과 무면허 운전이 3199건에 달하는 것으로 나타났다. 공유 킥보드 서비스의 경우 회원가입을 할 때 운전면허증 취득 여부를 확인하긴 하지만 서비스를 이용할 때는 별도의 확인 절차 없이 대여할 수 있기 때문에 운전면허증을 취득하지 않았어도 대여하는 경우가 발생한다. 본 논문에서는 공유 킥보드 서비스의 보안 취약점을 보완하기 위해 오토인코더와 변이형 오토인코더를 사용한 딥러닝 기반의 공유 킥보드 대리 대여 방지 시스템을 제안한다. 오토인코더는 지문 데이터로부터 특징만을 추출할 수 있어, 사용자의 지문 원본을 서버에게 노출시키지 않을 수 있다. 변이형 오토인코더는 생성형 모델로써, 사용자의 지문 데이터를 증폭 시켜 합성곱 신경망의 성능을 높이는데 도움을 준다. 이러한 오토인코더와 변이형 오토인코더의 특징을 이용해 사용자의 지문을 서버에 노출시키지 않으면서 적은 데이터로 신뢰성 높은 사용자 인증이 가능한 전동 킥보드 대여 시스템을 제안한다.

Semi-supervised learning of speech recognizers based on variational autoencoder and unsupervised data augmentation (변분 오토인코더와 비교사 데이터 증강을 이용한 음성인식기 준지도 학습)

  • Jo, Hyeon Ho;Kang, Byung Ok;Kwon, Oh-Wook
    • The Journal of the Acoustical Society of Korea
    • /
    • v.40 no.6
    • /
    • pp.578-586
    • /
    • 2021
  • We propose a semi-supervised learning method based on Variational AutoEncoder (VAE) and Unsupervised Data Augmentation (UDA) to improve the performance of an end-to-end speech recognizer. In the proposed method, first, the VAE-based augmentation model and the baseline end-to-end speech recognizer are trained using the original speech data. Then, the baseline end-to-end speech recognizer is trained again using data augmented from the learned augmentation model. Finally, the learned augmentation model and end-to-end speech recognizer are re-learned using the UDA-based semi-supervised learning method. As a result of the computer simulation, the augmentation model is shown to improve the Word Error Rate (WER) of the baseline end-to-end speech recognizer, and further improve its performance by combining it with the UDA-based learning method.

Visual Object Manipulation Based on Exploration Guided by Demonstration (시연에 의해 유도된 탐험을 통한 시각 기반의 물체 조작)

  • Kim, Doo-Jun;Jo, HyunJun;Song, Jae-Bok
    • The Journal of Korea Robotics Society
    • /
    • v.17 no.1
    • /
    • pp.40-47
    • /
    • 2022
  • A reward function suitable for a task is required to manipulate objects through reinforcement learning. However, it is difficult to design the reward function if the ample information of the objects cannot be obtained. In this study, a demonstration-based object manipulation algorithm called stochastic exploration guided by demonstration (SEGD) is proposed to solve the design problem of the reward function. SEGD is a reinforcement learning algorithm in which a sparse reward explorer (SRE) and an interpolated policy using demonstration (IPD) are added to soft actor-critic (SAC). SRE ensures the training of the critic of SAC by collecting prior data and IPD limits the exploration space by making SEGD's action similar to the expert's action. Through these two algorithms, the SEGD can learn only with the sparse reward of the task without designing the reward function. In order to verify the SEGD, experiments were conducted for three tasks. SEGD showed its effectiveness by showing success rates of more than 96.5% in these experiments.

Variational Autoencoder-based Assembly Feature Extraction Network for Rapid Learning of Reinforcement Learning (강화학습의 신속한 학습을 위한 변이형 오토인코더 기반의 조립 특징 추출 네트워크)

  • Jun-Wan Yun;Minwoo Na;Jae-Bok Song
    • The Journal of Korea Robotics Society
    • /
    • v.18 no.3
    • /
    • pp.352-357
    • /
    • 2023
  • Since robotic assembly in an unstructured environment is very difficult with existing control methods, studies using artificial intelligence such as reinforcement learning have been conducted. However, since long-time operation of a robot for learning in the real environment adversely affects the robot, so a method to shorten the learning time is needed. To this end, a method based on a pre-trained neural network was proposed in this study. This method showed a learning speed about 3 times than the existing methods, and the stability of reward during learning was also increased. Furthermore, it can generate a more optimal policy than not using a pre-trained neural network. Using the proposed reinforcement learning-based assembly trajectory generator, 100 attempts were made to assemble the power connector within a random error of 4.53 mm in width and 3.13 mm in length, resulting in 100 successes.

Self-Attention-based SMILES Generationfor De Novo Drug Design (신약 디자인을 위한 Self-Attention 기반의 SMILES 생성자)

  • PIAO, SHENGMIN;Choi, Jonghwan;Kim, Kyeonghun;Park, Sanghyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.343-346
    • /
    • 2021
  • 약물 디자인이란 단백질과 같은 생물학적 표적에 작용할 수 있는 새로운 약물을 개발하는 과정이다. 전통적인 방법은 탐색과 개발 단계로 구성되어 있으나, 하나의 신약 개발을 위해서는 10 년 이상의 장시간이 요구되기 때문에, 이러한 기간을 단축하기 위한 인공지능 기반의 약물 디자인 방법들이 개발되고 있다. 하지만 많은 심층학습 기반의 약물 디자인 모델들은 RNN 기법을 활용하고 있고, RNN 은 훈련속도가 느리다는 단점이 있기 때문에 개선의 여지가 남아있다. 이런 단점을 극복하기 위해 본 연구는 self-attention 과 variational autoencoder 를 활용한 SMILES 생성 모델을 제안한다. 제안된 모델은 최신 약물 디자인 모델 대비 훈련 시간을 1/36 단축하고, 뿐만 아니라 유효한 SMILES 를 더 많이 생성하는 것을 확인하였다.

Visualization of Korean Speech Based on the Distance of Acoustic Features (음성특징의 거리에 기반한 한국어 발음의 시각화)

  • Pok, Gou-Chol
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.13 no.3
    • /
    • pp.197-205
    • /
    • 2020
  • Korean language has the characteristics that the pronunciation of phoneme units such as vowels and consonants are fixed and the pronunciation associated with a notation does not change, so that foreign learners can approach rather easily Korean language. However, when one pronounces words, phrases, or sentences, the pronunciation changes in a manner of a wide variation and complexity at the boundaries of syllables, and the association of notation and pronunciation does not hold any more. Consequently, it is very difficult for foreign learners to study Korean standard pronunciations. Despite these difficulties, it is believed that systematic analysis of pronunciation errors for Korean words is possible according to the advantageous observations that the relationship between Korean notations and pronunciations can be described as a set of firm rules without exceptions unlike other languages including English. In this paper, we propose a visualization framework which shows the differences between standard pronunciations and erratic ones as quantitative measures on the computer screen. Previous researches only show color representation and 3D graphics of speech properties, or an animated view of changing shapes of lips and mouth cavity. Moreover, the features used in the analysis are only point data such as the average of a speech range. In this study, we propose a method which can directly use the time-series data instead of using summary or distorted data. This was realized by using the deep learning-based technique which combines Self-organizing map, variational autoencoder model, and Markov model, and we achieved a superior performance enhancement compared to the method using the point-based data.

A study on the application of residual vector quantization for vector quantized-variational autoencoder-based foley sound generation model (벡터 양자화 변분 오토인코더 기반의 폴리 음향 생성 모델을 위한 잔여 벡터 양자화 적용 연구)

  • Seokjin Lee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.2
    • /
    • pp.243-252
    • /
    • 2024
  • Among the Foley sound generation models that have recently begun to be studied, a sound generation technique using the Vector Quantized-Variational AutoEncoder (VQ-VAE) structure and generation model such as Pixelsnail are one of the important research subjects. On the other hand, in the field of deep learning-based acoustic signal compression, residual vector quantization technology is reported to be more suitable than the conventional VQ-VAE structure. Therefore, in this paper, we aim to study whether residual vector quantization technology can be effectively applied to the Foley sound generation. In order to tackle the problem, this paper applies the residual vector quantization technique to the conventional VQ-VAE-based Foley sound generation model, and in particular, derives a model that is compatible with the existing models such as Pixelsnail and does not increase computational resource consumption. In order to evaluate the model, an experiment was conducted using DCASE2023 Task7 data. The results show that the proposed model enhances about 0.3 of the Fréchet audio distance. Unfortunately, the performance enhancement was limited, which is believed to be due to the decrease in the resolution of time-frequency domains in order to do not increase consumption of the computational resources.

Chart-based Stock Price Prediction by Combing Variation Autoencoder and Attention Mechanisms (변이형 오토인코더와 어텐션 메커니즘을 결합한 차트기반 주가 예측)

  • Sanghyun Bae;Byounggu Choi
    • Information Systems Review
    • /
    • v.23 no.1
    • /
    • pp.23-43
    • /
    • 2021
  • Recently, many studies have been conducted to increase the accuracy of stock price prediction by analyzing candlestick charts using artificial intelligence techniques. However, these studies failed to consider the time-series characteristics of candlestick charts and to take into account the emotional state of market participants in data learning for stock price prediction. In order to overcome these limitations, this study produced input data by combining volatility index and candlestick charts to consider the emotional state of market participants, and used the data as input for a new method proposed on the basis of combining variantion autoencoder (VAE) and attention mechanisms for considering the time-series characteristics of candlestick chart. Fifty firms were randomly selected from the S&P 500 index and their stock prices were predicted to evaluate the performance of the method compared with existing ones such as convolutional neural network (CNN) or long-short term memory (LSTM). The results indicated the method proposed in this study showed superior performance compared to the existing ones. This study implied that the accuracy of stock price prediction could be improved by considering the emotional state of market participants and the time-series characteristics of the candlestick chart.

Deep Learning-Based Personalized Recommendation Using Customer Behavior and Purchase History in E-Commerce (전자상거래에서 고객 행동 정보와 구매 기록을 활용한 딥러닝 기반 개인화 추천 시스템)

  • Hong, Da Young;Kim, Ga Yeong;Kim, Hyon Hee
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.6
    • /
    • pp.237-244
    • /
    • 2022
  • In this paper, we present VAE-based recommendation using online behavior log and purchase history to overcome data sparsity and cold start. To generate a variable for customers' purchase history, embedding and dimensionality reduction are applied to the customers' purchase history. Also, Variational Autoencoders are applied to online behavior and purchase history. A total number of 12 variables are used, and nDCG is chosen for performance evaluation. Our experimental results showed that the proposed VAE-based recommendation outperforms SVD-based recommendation. Also, the generated purchase history variable improves the recommendation performance.