• Title/Summary/Keyword: 딥러닝 융합연구 (deep learning convergence research)

A Study on Verification of Back TranScription(BTS)-based Data Construction (Back TranScription(BTS)기반 데이터 구축 검증 연구)

  • Park, Chanjun;Seo, Jaehyung;Lee, Seolhwa;Moon, Hyeonseok;Eo, Sugyeong;Lim, Heuiseok
    • Journal of the Korea Convergence Society / v.12 no.11 / pp.109-117 / 2021
  • Speech-based interfaces are increasingly used for human-computer interaction (HCI), and interest in post-processors that correct errors in speech recognition output is growing accordingly. However, building a sequence-to-sequence (S2S) speech recognition post-processor requires a great deal of human labor for data construction. To alleviate this limitation of the existing construction methodology, a new data construction method called Back TranScription (BTS) was proposed. BTS combines text-to-speech (TTS) and speech-to-text (STT) technology to create a pseudo-parallel corpus. The method eliminates the need for a phonetic transcriber and can automatically generate vast amounts of training data, reducing cost. Extending the existing BTS research, this paper verifies experimentally that data should be constructed with text style and domain in mind rather than without any criteria.
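
The BTS idea described in this abstract (synthesize speech from clean text, then transcribe it back to obtain a noisy counterpart) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: `back_transcribe`, `toy_tts`, and `toy_stt` are hypothetical names, and the toy functions only stand in for real TTS and STT engines.

```python
from typing import Callable, List, Tuple


def back_transcribe(
    sentences: List[str],
    tts: Callable[[str], bytes],
    stt: Callable[[bytes], str],
) -> List[Tuple[str, str]]:
    """Build (noisy, clean) pairs by speaking each sentence and re-transcribing it."""
    pairs = []
    for clean in sentences:
        audio = tts(clean)   # text -> synthetic speech
        noisy = stt(audio)   # speech -> transcript that may contain recognition errors
        # Source side is the recognizer output, target side is the original text.
        pairs.append((noisy, clean))
    return pairs


# Hypothetical stand-ins so the sketch runs without any speech engine.
def toy_tts(text: str) -> bytes:
    return text.encode("utf-8")


def toy_stt(audio: bytes) -> str:
    # Simulate recognizer output: lowercase text with punctuation removed.
    text = audio.decode("utf-8").lower()
    return "".join(ch for ch in text if ch.isalnum() or ch.isspace())


if __name__ == "__main__":
    print(back_transcribe(["Hello, world!"], toy_tts, toy_stt))
```

In practice the two callables would wrap actual TTS and STT systems, and the resulting pairs would serve as training data for an S2S post-processor.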

LSTM Prediction of Streamflow during Peak Rainfall of Piney River (LSTM을 이용한 Piney River유역의 최대강우시 유량예측)

  • Kareem, Kola Yusuff;Seong, Yeonjeong;Jung, Younghun
    • Journal of Korean Society of Disaster and Security / v.14 no.4 / pp.17-27 / 2021
  • Streamflow prediction is a vital disaster mitigation approach for effective flood management and water resources planning. Torrential rainfall caused by climate change has reportedly increased worldwide, causing enormous losses of infrastructure, property, and lives. This study evaluates the contribution of rainfall to streamflow prediction in normal and peak rainfall scenarios, typified by the recent flood at Piney Resort in Vernon, Hickman County, Tennessee, United States. Daily streamflow, water level, and rainfall data for 20 years (2000-2019) from two USGS gage stations (03602500 upstream and 03599500 downstream) of the Piney River watershed were obtained, preprocessed, and fitted with a long short-term memory (LSTM) model. The TensorFlow and Keras machine learning frameworks were used with Python to predict streamflow with a sequence size of 14 days, to determine whether the model could have predicted the flooding event on August 21, 2021. Model skill analysis showed that the LSTM model with full data (water level, streamflow, and rainfall) performed better than the naive model, except for some rainfall models, indicating that rainfall alone is insufficient for streamflow prediction. The final LSTM model recorded optimal NSE and RMSE values of 0.68 and 13.84 m³/s and predicted peak flow with the lowest prediction error of 11.6%, indicating that the final model could have predicted the flood on August 24, 2021 given a peak rainfall scenario. Adequate knowledge of rainfall patterns will guide hydrologists and disaster prevention managers in designing efficient early warning systems and policies aimed at mitigating flood risks.
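
For readers unfamiliar with the skill metrics quoted in this abstract, the sketch below computes RMSE and NSE, assuming NSE refers to the standard Nash-Sutcliffe efficiency used in hydrology. It also shows one common way to frame a daily series into 14-step input windows. The function names are illustrative, and the windowing scheme is an assumption rather than the paper's documented preprocessing.

```python
import math
from typing import List, Sequence, Tuple


def rmse(observed: Sequence[float], simulated: Sequence[float]) -> float:
    """Root mean squared error between observed and simulated flows."""
    n = len(observed)
    return math.sqrt(sum((o - s) ** 2 for o, s in zip(observed, simulated)) / n)


def nse(observed: Sequence[float], simulated: Sequence[float]) -> float:
    """Nash-Sutcliffe efficiency: 1 is a perfect fit, 0 matches the observed mean."""
    mean_obs = sum(observed) / len(observed)
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    variance = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - residual / variance


def make_windows(series: Sequence[float], window: int = 14) -> List[Tuple[list, float]]:
    """Pair each run of `window` consecutive values with the value that follows it."""
    return [
        (list(series[i:i + window]), series[i + window])
        for i in range(len(series) - window)
    ]


if __name__ == "__main__":
    obs = [1.0, 2.0, 3.0, 4.0]
    print(nse(obs, obs), rmse(obs, obs))
    print(len(make_windows(list(range(20)))))
```

An NSE of 0 means the model is no better than always predicting the observed mean, which gives a reference point for the 0.68 reported above.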

A Methodology for Realty Time-series Generation Using Generative Adversarial Network (적대적 생성망을 이용한 부동산 시계열 데이터 생성 방안)

  • Ryu, Jae-Pil;Hahn, Chang-Hoon;Shin, Hyun-Joon
    • Journal of the Korea Convergence Society / v.12 no.10 / pp.9-17 / 2021
  • Advances in big data analysis, artificial intelligence, and machine learning have produced data analytics technology that supports optimal decision-making. In some areas, however, a lack of data restricts the use of these techniques. For example, real estate data often have a long release cycle, because they were released only recently or because real estate is an illiquid asset. To overcome these limitations, we studied whether existing time series can be extended with the TimeGAN model. A total of 45 weekly real estate time series covering 2012 to 2021 were collected, and 15 final time series were selected by considering the correlations between them. After expanding the 15 time series with the TimeGAN model, PCA and t-SNE visualizations showed that the statistical distributions of the real data and the extended data were similar.

Summarization of Korean Dialogues through Dialogue Restructuring (대화문 재구조화를 통한 한국어 대화문 요약)

  • Kim, Eun Hee;Lim, Myung Jin;Shin, Ju Hyun
    • Smart Media Journal / v.12 no.11 / pp.77-85 / 2023
  • After COVID-19, communication through online platforms has increased, leading to an accumulation of massive amounts of conversational text data. With the growing importance of summarizing this text to extract meaningful information, deep learning-based abstractive summarization has become an active research area. However, compared with structured texts such as news articles, conversational data often contains missing or transformed information and therefore has to be considered from multiple perspectives. In particular, omitted vocabulary and unrelated expressions in a conversation can hinder effective summarization. In this study, we therefore restructured dialogues in light of the characteristics of Korean conversational data, fine-tuned a pre-trained KoBART-based text summarization model, and improved dialogue summarization performance through a refinement step that removes redundant elements from the summary. To restructure a dialogue, we combined two methods: reordering sentences by utterance order and extracting a central speaker around whom the conversation is reorganized. As a result, the ROUGE-1 score improved by about 4 points. This study demonstrates that a restructuring approach that considers the characteristics of dialogue is effective in enhancing Korean conversation summarization performance.
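
The ROUGE-1 score mentioned in this abstract measures unigram overlap between a generated summary and a reference summary. A minimal sketch follows, assuming simple whitespace tokenization. That tokenization is an assumption for illustration only, since evaluations of Korean text often tokenize differently, and `rouge1` is a hypothetical helper rather than the evaluation tool used in the paper.

```python
from collections import Counter
from typing import Tuple


def rouge1(reference: str, candidate: str) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) based on clipped unigram overlap."""
    ref_counts = Counter(reference.split())
    cand_counts = Counter(candidate.split())
    # Counter intersection keeps the minimum count of each shared token.
    overlap = sum((ref_counts & cand_counts).values())
    if overlap == 0:
        return 0.0, 0.0, 0.0
    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    print(rouge1("the cat sat", "the cat ran"))
```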

Battery charge prediction of sailing yacht regeneration system using neural networks (신경망을 이용한 세일링 요트 리제너레이션 시스템의 배터리 충전 예측)

  • Lee, Tae-Hee;Hwang, Woo-Sung;Choi, Myung-Ryul
    • Journal of Digital Convergence / v.18 no.11 / pp.241-246 / 2020
  • In this paper, we propose a neural network model that brings together a marine electric propulsion system and a deep learning algorithm to predict the DC/DC converter output current in the electric propulsion regeneration system and the battery charge during regeneration. To test the proposed neural network, the input voltage and current of the PCM were measured on a prototype PCM board to build a data set. Because the data set was small, it was enlarged through data fitting and the model was trained further on it. After training, the model's predictions were compared with the actual measurements. The proposed model effectively predicted battery charge as input voltage and current changed. In addition, because the neural network could predict changes in the characteristics of the analog circuit that makes up the DC/DC converter, we conclude that these characteristics should be considered when designing the regeneration system.

Prediction for Energy Demand Using 1D-CNN and Bidirectional LSTM in Internet of Energy (에너지인터넷에서 1D-CNN과 양방향 LSTM을 이용한 에너지 수요예측)

  • Jung, Ho Cheul;Sun, Young Ghyu;Lee, Donggu;Kim, Soo Hyun;Hwang, Yu Min;Sim, Issac;Oh, Sang Keun;Song, Seung-Ho;Kim, Jin Young
    • Journal of IKEEE / v.23 no.1 / pp.134-142 / 2019
  • As the development of Internet of Energy (IoE) technologies and the spread of various electronic devices have diversified patterns of energy consumption, the reliability of demand prediction has decreased, causing problems in optimizing power generation and stabilizing power supply. In this study, we propose a deep learning method, 1-Dimension Convolution and Bidirectional Long Short-Term Memory (1D-ConvBLSTM), that combines a convolutional neural network (CNN) and a Bidirectional Long Short-Term Memory (BLSTM) network to forecast demand with high reliability by effectively extracting energy consumption patterns. In the experiments, demand is predicted with the proposed method for various numbers of learning iterations and feature maps, and the results verify that the test data can be predicted with a small number of iterations.

Method of Extracting the Topic Sentence Considering Sentence Importance based on ELMo Embedding (ELMo 임베딩 기반 문장 중요도를 고려한 중심 문장 추출 방법)

  • Kim, Eun Hee;Lim, Myung Jin;Shin, Ju Hyun
    • Smart Media Journal / v.10 no.1 / pp.39-46 / 2021
  • This study presents a method of extracting a summary from a news article by considering the importance of each sentence in the article. We propose calculating sentence importance from three features: the probability that a sentence is a topic sentence, its similarity to the article title and to the other sentences, and its position. We hypothesize that a topic sentence has characteristics distinct from those of an ordinary sentence, and train a deep learning-based classification model to obtain a topic sentence probability for each input sentence. In addition, using the pre-trained ELMo language model, the similarity between sentences is calculated from sentence vectors that reflect context information and is extracted as a sentence feature. The LSTM and BERT topic sentence classifiers achieved 93% accuracy, 96.22% recall, and 89.5% precision. When the importance of each sentence was calculated by combining the extracted features, topic sentence extraction performance improved by about 10% compared with the existing TextRank algorithm.
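
One way to combine the three features named in this abstract (topic sentence probability, similarity to the title and to other sentences, and position) is a weighted sum, sketched below. The weights, the linear position score, and the function names are all assumptions made for illustration, as the abstract does not state how the features are combined. The vectors stand in for ELMo sentence embeddings.

```python
import math
from typing import List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity, returning 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def sentence_importance(
    topic_probs: Sequence[float],
    sentence_vecs: Sequence[Sequence[float]],
    title_vec: Sequence[float],
    weights: Sequence[float] = (0.4, 0.2, 0.2, 0.2),  # hypothetical weights
) -> List[float]:
    """Score each sentence as a weighted sum of four feature values."""
    w_prob, w_title, w_other, w_pos = weights
    n = len(sentence_vecs)
    scores = []
    for i, vec in enumerate(sentence_vecs):
        title_sim = cosine(vec, title_vec)
        others = [cosine(vec, sentence_vecs[j]) for j in range(n) if j != i]
        other_sim = sum(others) / len(others) if others else 0.0
        position = 1.0 - i / n  # assumption: earlier sentences score higher
        scores.append(
            w_prob * topic_probs[i]
            + w_title * title_sim
            + w_other * other_sim
            + w_pos * position
        )
    return scores


if __name__ == "__main__":
    scores = sentence_importance([0.9, 0.1], [[1.0, 0.0], [0.0, 1.0]], [1.0, 0.0])
    print(scores.index(max(scores)))  # index of the highest-scoring sentence
```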

Automatic Augmentation Technique of an Autoencoder-based Numerical Training Data (오토인코더 기반 수치형 학습데이터의 자동 증강 기법)

  • Jeong, Ju-Eun;Kim, Han-Joon;Chun, Jong-Hoon
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.22 no.5 / pp.75-86 / 2022
  • This study aims to solve the class imbalance problem in numerical data with a deep learning-based Variational AutoEncoder and to improve model performance by augmenting the training data. We propose 'D-VAE' to artificially increase the number of records in a given data table. The proposed technique optimizes the data during preprocessing through discretization and feature selection. In the discretization step, K-means clustering groups the values, which are then converted into one-hot vectors. Subsequently, for memory efficiency, sample data are generated with the Variational AutoEncoder using only the features that RFECV, a feature selection technique, identifies as helpful for prediction. We verify the proposed model through experiments at different data augmentation ratios.
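
The discretization step described in this abstract (K-means grouping followed by one-hot encoding) can be illustrated for a single numeric column as below. This is a simplified sketch with a deterministic initialization chosen for reproducibility. `kmeans_1d` and `one_hot` are hypothetical helpers, and the RFECV feature selection and the VAE itself are not shown.

```python
from typing import List, Sequence, Tuple


def kmeans_1d(values: Sequence[float], k: int, iters: int = 20) -> Tuple[List[int], List[float]]:
    """Cluster scalar values into k groups, returning (labels, centers)."""
    ordered = sorted(values)
    # Assumed initialization: spread the starting centers evenly over the sorted values.
    step = (len(ordered) - 1) / max(k - 1, 1)
    centers = [ordered[round(i * step)] for i in range(k)]

    def assign() -> List[int]:
        # Each value gets the index of its nearest center.
        return [min(range(k), key=lambda c: abs(v - centers[c])) for v in values]

    for _ in range(iters):
        labels = assign()
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            if members:  # keep the old center if a cluster is empty
                centers[c] = sum(members) / len(members)
    return assign(), centers


def one_hot(labels: Sequence[int], k: int) -> List[List[int]]:
    """Turn cluster labels into one-hot vectors of length k."""
    return [[1 if lab == c else 0 for c in range(k)] for lab in labels]


if __name__ == "__main__":
    column = [1.0, 1.2, 0.9, 10.0, 10.5, 9.8]
    labels, centers = kmeans_1d(column, k=2)
    print(one_hot(labels, 2))
```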

Pose Creation of Character in Two-Dimensional Cartoon through Human Pose Estimation (인간자세 추정방법에 의한 2차원 웹툰 캐릭터 포즈 생성)

  • Jeong, Hieyong;Shin, Choonsung
    • Journal of Broadcast Engineering / v.27 no.5 / pp.718-727 / 2022
  • The Korean domestic cartoon industry has grown explosively, by 65% compared to the previous year, and its market size is expected to exceed KRW 1 trillion. However, excessive workloads are damaging workers' health, and this working environment leads to a shortage of human resources, repeating a vicious cycle. Although some tasks in cartoon production require creativity, many are simple and repetitive. Therefore, this study aimed to develop a method for creating a character pose through human pose estimation (HPE), which detects key points for each joint of a user. The primary role of the proposed method is to make each joint of the character match that of the human. With these results, the proposed method made it possible to create the pose of a two-dimensional cartoon character and to save a static image of a single character pose or a video of continuous character poses.

Classification Method based on Graph Neural Network Model for Diagnosing IoT Device Fault (사물인터넷 기기 고장 진단을 위한 그래프 신경망 모델 기반 분류 방법)

  • Kim, Jin-Young;Seon, Joonho;Yoon, Sung-Hun
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.22 no.3 / pp.9-14 / 2022
  • In the Internet of Things (IoT), where various devices can be connected, the failure of essential devices may cause substantial economic losses and loss of life. To reduce such losses, fault diagnosis techniques are considered an essential part of the IoT. This paper proposes a graph neural network-based method that detects faults and classifies their types by extracting features from system vibration data. The deep learning model is trained on the fault dataset from Case Western Reserve University (CWRU). To validate its classification performance, the proposed model is compared with a conventional convolutional neural network (CNN)-based fault classification model. Simulation results confirmed that the proposed model outperformed the conventional model by up to 5% on unevenly distributed data. In future work, the classification runtime can be improved by making the proposed model more lightweight.