• Title/Summary/Keyword: Deep Learning based System

Search Result 1,194, Processing Time 0.032 seconds

Image retrieval based on a combination of deep learning and behavior ontology for reducing semantic gap (시맨틱 갭을 줄이기 위한 딥러닝과 행위 온톨로지의 결합 기반 이미지 검색)

  • Lee, Seung;Jung, Hye-Wuk
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.9 no.11
    • /
    • pp.1133-1144
    • /
    • 2019
  • Recently, the amount of image on the Internet has rapidly increased, due to the advancement of smart devices and various approaches to effective image retrieval have been researched under these situation. Existing image retrieval methods simply detect the objects in a image and carry out image retrieval based on the label of each object. Therefore, the semantic gap occurs between the image desired by a user and the image obtained from the retrieval result. To reduce the semantic gap in image retrievals, we connect the module for multiple objects classification based on deep learning with the module for human behavior classification. And we combine the connected modules with a behavior ontology. That is to say, we propose an image retrieval system considering the relationship between objects by using the combination of deep learning and behavior ontology. We analyzed the experiment results using walking and running data to take into account dynamic behaviors in images. The proposed method can be extended to the study of automatic annotation generation of images that can improve the accuracy of image retrieval results.

Performance Analysis of Wireless Communication Systems Using Deep Learning Based Transmit Power Control in Nakagami Fading Channels (나카가미 페이딩 채널에서 딥러닝 기반 송신 전력 제어 기법을 이용하는 무선통신 시스템에 대한 성능 분석)

  • Kim, Donghyeon;Kim, Dongyon;Lee, In-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.6
    • /
    • pp.744-750
    • /
    • 2020
  • In this paper, we propose a deep learning based transmit power control (TPC) scheme to improve the spectral and energy efficiency of wireless communication systems. In the wireless communication system, the positions of multiple transceivers follow a uniform distribution, and the performances of spectral and energy efficiency for the proposed TPC scheme are analyzed assuming the Nakagami fading channels. The proposed TPC scheme uses batch normalization to improve spectral and energy efficiency in deep learning based training. Through simulation, we compare the results of the spectral and energy efficiency of the proposed TPC scheme and the conventional one for various area sizes that limit the position range of the transceivers and Nakagami fading factors. Comparing the performance results, we verify that the proposed scheme provides better performance than the conventional one.

Evaluation of Criteria for Mapping Characters Using an Automated Hangul Font Generation System based on Deep Learning (딥러닝 학습을 이용한 한글 글꼴 자동 제작 시스템에서 글자 쌍의 매핑 기준 평가)

  • Jeon, Ja-Yeon;Ji, Young-Seo;Park, Dong-Yeon;Lim, Soon-Bum
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.7
    • /
    • pp.850-861
    • /
    • 2020
  • Hangul is a language that is composed of initial, medial, and final syllables. It has 11,172 characters. For this reason, the current method of designing all the characters by hand is very expensive and time-consuming. In order to solve the problem, this paper proposes an automatic Hangul font generation system and evaluates the standards for mapping Hangul characters to produce an effective automated Hangul font generation system. The system was implemented using character generation engine based on deep learning CycleGAN. In order to evaluate the criteria when mapping characters in pairs, each criterion was designed based on Hangul structure and character shape, and the quality of the generated characters was evaluated. As a result of the evaluation, the standards designed based on the Hangul structure did not affect the quality of the automated Hangul font generation system. On the other hand, when tried with similar characters, the standards made based on the shape of Hangul characters produced better quality characters than when tried with less similar characters. As a result, it is better to generate automated Hangul font by designing a learning method based on mapping characters in pairs that have similar character shapes.

Realization of home appliance classification system using deep learning (딥러닝을 이용한 가전제품 분류 시스템 구현)

  • Son, Chang-Woo;Lee, Sang-Bae
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.9
    • /
    • pp.1718-1724
    • /
    • 2017
  • Recently, Smart plugs for real time monitoring of household appliances based on IoT(Internet of Things) have been activated. Through this, consumers are able to save energy by monitoring real-time energy consumption at all times, and reduce power consumption through alarm function based on consumer setting. In this paper, we measure the alternating current from a wall power outlet for real-time monitoring. At this time, the current pattern for each household appliance was classified and it was experimented with deep learning to determine which product works. As a result, we used a cross validation method and a bootstrap verification method in order to the classification performance according to the type of appliances. Also, it is confirmed that the cost function and the learning success rate are the same as the train data and test data.

Automatic Categorization of Islamic Jurisprudential Legal Questions using Hierarchical Deep Learning Text Classifier

  • AlSabban, Wesam H.;Alotaibi, Saud S.;Farag, Abdullah Tarek;Rakha, Omar Essam;Al Sallab, Ahmad A.;Alotaibi, Majid
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.9
    • /
    • pp.281-291
    • /
    • 2021
  • The Islamic jurisprudential legal system represents an essential component of the Islamic religion, that governs many aspects of Muslims' daily lives. This creates many questions that require interpretations by qualified specialists, or Muftis according to the main sources of legislation in Islam. The Islamic jurisprudence is usually classified into branches, according to which the questions can be categorized and classified. Such categorization has many applications in automated question-answering systems, and in manual systems in routing the questions to a specialized Mufti to answer specific topics. In this work we tackle the problem of automatic categorisation of Islamic jurisprudential legal questions using deep learning techniques. In this paper, we build a hierarchical deep learning model that first extracts the question text features at two levels: word and sentence representation, followed by a text classifier that acts upon the question representation. To evaluate our model, we build and release the largest publicly available dataset of Islamic questions and answers, along with their topics, for 52 topic categories. We evaluate different state-of-the art deep learning models, both for word and sentence embeddings, comparing recurrent and transformer-based techniques, and performing extensive ablation studies to show the effect of each model choice. Our hierarchical model is based on pre-trained models, taking advantage of the recent advancement of transfer learning techniques, focused on Arabic language.

Deep Learning Frameworks for Cervical Mobilization Based on Website Images

  • Choi, Wansuk;Heo, Seoyoon
    • Journal of International Academy of Physical Therapy Research
    • /
    • v.12 no.1
    • /
    • pp.2261-2266
    • /
    • 2021
  • Background: Deep learning related research works on website medical images have been actively conducted in the field of health care, however, articles related to the musculoskeletal system have been introduced insufficiently, deep learning-based studies on classifying orthopedic manual therapy images would also just be entered. Objectives: To create a deep learning model that categorizes cervical mobilization images and establish a web application to find out its clinical utility. Design: Research and development. Methods: Three types of cervical mobilization images (central posteroanterior (CPA) mobilization, unilateral posteroanterior (UPA) mobilization, and anteroposterior (AP) mobilization) were obtained using functions of 'Download All Images' and a web crawler. Unnecessary images were filtered from 'Auslogics Duplicate File Finder' to obtain the final 144 data (CPA=62, UPA=46, AP=36). Training classified into 3 classes was conducted in Teachable Machine. The next procedures, the trained model source was uploaded to the web application cloud integrated development environment (https://ide.goorm.io/) and the frame was built. The trained model was tested in three environments: Teachable Machine File Upload (TMFU), Teachable Machine Webcam (TMW), and Web Service webcam (WSW). Results: In three environments (TMFU, TMW, WSW), the accuracy of CPA mobilization images was 81-96%. The accuracy of the UPA mobilization image was 43~94%, and the accuracy deviation was greater than that of CPA. The accuracy of the AP mobilization image was 65-75%, and the deviation was not large compared to the other groups. In the three environments, the average accuracy of CPA was 92%, and the accuracy of UPA and AP was similar up to 70%. Conclusion: This study suggests that training of images of orthopedic manual therapy using machine learning open software is possible, and that web applications made using this training model can be used clinically.

Real-Time Joint Animation Production and Expression System using Deep Learning Model and Kinect Camera (딥러닝 모델과 Kinect 카메라를 이용한 실시간 관절 애니메이션 제작 및 표출 시스템 구축에 관한 연구)

  • Kim, Sang-Joon;Lee, Yu-Jin;Park, Goo-man
    • Journal of Broadcast Engineering
    • /
    • v.26 no.3
    • /
    • pp.269-282
    • /
    • 2021
  • As the distribution of 3D content such as augmented reality and virtual reality increases, the importance of real-time computer animation technology is increasing. However, the computer animation process consists mostly of manual or marker-attaching motion capture, which requires a very long time for experienced professionals to obtain realistic images. To solve these problems, animation production systems and algorithms based on deep learning model and sensors have recently emerged. Thus, in this paper, we study four methods of implementing natural human movement in deep learning model and kinect camera-based animation production systems. Each method is chosen considering its environmental characteristics and accuracy. The first method uses a Kinect camera. The second method uses a Kinect camera and a calibration algorithm. The third method uses deep learning model. The fourth method uses deep learning model and kinect. Experiments with the proposed method showed that the fourth method of deep learning model and using the Kinect simultaneously showed the best results compared to other methods.

Two-phase flow pattern online monitoring system based on convolutional neural network and transfer learning

  • Hong Xu;Tao Tang
    • Nuclear Engineering and Technology
    • /
    • v.54 no.12
    • /
    • pp.4751-4758
    • /
    • 2022
  • Two-phase flow may almost exist in every branch of the energy industry. For the corresponding engineering design, it is very essential and crucial to monitor flow patterns and their transitions accurately. With the high-speed development and success of deep learning based on convolutional neural network (CNN), the study of flow pattern identification recently almost focused on this methodology. Additionally, the photographing technique has attractive implementation features as well, since it is normally considerably less expensive than other techniques. The development of such a two-phase flow pattern online monitoring system is the objective of this work, which seldom studied before. The ongoing preliminary engineering design (including hardware and software) of the system are introduced. The flow pattern identification method based on CNNs and transfer learning was discussed in detail. Several potential CNN candidates such as ALexNet, VggNet16 and ResNets were introduced and compared with each other based on a flow pattern dataset. According to the results, ResNet50 is the most promising CNN network for the system owing to its high precision, fast classification and strong robustness. This work can be a reference for the online monitoring system design in the energy system.

Damage Localization of Bridges with Variational Autoencoder (Variational Autoencoder를 이용한 교량 손상 위치 추정방법)

  • Lee, Kanghyeok;Chung, Minwoong;Jeon, Chanwoong;Shin, Do Hyoung
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.40 no.2
    • /
    • pp.233-238
    • /
    • 2020
  • Most deep learning (DL) approaches for bridge damage localization based on a structural health monitoring system commonly use supervised learning-based DL models. The supervised learning-based DL model requires the response data obtained from sensors on the bridge and also the label which indicates the damaged state of the bridge. However, it is impractical to accurately obtain the label data in fields, thus, the supervised learning-based DL model has a limitation in that it is not easily applicable in practice. On the other hand, an unsupervised learning-based DL model has the merit of being able to train without label data. Considering this advantage, this study aims to propose and theoretically validate a damage localization approach for bridges using a variational autoencoder, a representative unsupervised learning-based DL network: as a result, this study indicated the feasibility of VAE for damage localization.

Object Edge-based Image Generation Technique for Constructing Large-scale Image Datasets (대형 이미지 데이터셋 구축을 위한 객체 엣지 기반 이미지 생성 기법)

  • Ju-Hyeok Lee;Mi-Hui Kim
    • Journal of IKEEE
    • /
    • v.27 no.3
    • /
    • pp.280-287
    • /
    • 2023
  • Deep learning advancements can solve computer vision problems, but large-scale datasets are necessary for high accuracy. In this paper, we propose an image generation technique using object bounding boxes and image edge components. The object bounding boxes are extracted from the images through object detection, and image edge components are used as input values for the image generation model to create new image data. As results of experiments, the images generated by the proposed method demonstrated similar image quality to the source images in the image quality assessment, and also exhibited good performance during the deep learning training process.