• Title/Summary/Keyword: Learning media

Search Result 1,581, Processing Time 0.028 seconds

Comparison Analysis of Four Face Swapping Models for Interactive Media Platform COX (인터랙티브 미디어 플랫폼 콕스에 제공될 4가지 얼굴 변형 기술의 비교분석)

  • Jeon, Ho-Beom;Ko, Hyun-kwan;Lee, Seon-Gyeong;Song, Bok-Deuk;Kim, Chae-Kyu;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.5
    • /
    • pp.535-546
    • /
    • 2019
  • Recently, there have been a lot of researches on the whole face replacement system, but it is not easy to obtain stable results due to various attitudes, angles and facial diversity. To produce a natural synthesis result when replacing the face shown in the video image, technologies such as face area detection, feature extraction, face alignment, face area segmentation, 3D attitude adjustment and facial transposition should all operate at a precise level. And each technology must be able to be interdependently combined. The results of our analysis show that the difficulty of implementing the technology and contribution to the system in facial replacement technology has increased in facial feature point extraction and facial alignment technology. On the other hand, the difficulty of the facial transposition technique and the three-dimensional posture adjustment technique were low, but showed the need for development. In this paper, we propose four facial replacement models such as 2-D Faceswap, OpenPose, Deekfake, and Cycle GAN, which are suitable for the Cox platform. These models have the following features; i.e. these models include a suitable model for front face pose image conversion, face pose image with active body movement, and face movement with right and left side by 15 degrees, Generative Adversarial Network.

Efficient Inference of Image Objects using Semantic Segmentation (시멘틱 세그멘테이션을 활용한 이미지 오브젝트의 효율적인 영역 추론)

  • Lim, Heonyeong;Lee, Yurim;Jee, Minkyu;Go, Myunghyun;Kim, Hakdong;Kim, Wonil
    • Journal of Broadcast Engineering
    • /
    • v.24 no.1
    • /
    • pp.67-76
    • /
    • 2019
  • In this paper, we propose an efficient object classification method based on semantic segmentation for multi-labeled image data. In addition to various pixel unit information and processing techniques such as color information, contour, contrast, and saturation included in image data, a detailed region in which each object is located is extracted as a meaningful unit and the experiment is conducted to reflect the result in the inference. We use a neural network that has been proven to perform well in image classification to understand which object is located where image data containing various class objects are located. Based on these researches, we aim to provide artificial intelligence services that can classify real-time detailed areas of complex images containing various objects in the future.

A Study on Self-medication for Health Promotion of the Silver Generation

  • Oh, Soonhwan;Ryu, Gihwan
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.82-88
    • /
    • 2020
  • With the development of medical care in the 21st century and the rapid development of the 4th industry, electronic devices and household goods taking into account the physical and mental aging of the silver generation have been developed, and apps related to health and health are generally developed and operated. The apps currently used by the silver generation are a form that provides information on diseases by focusing on prevention rather than treatment, such as safety management apps for the elderly living alone and methods for preventing diseases. There are not many apps that provide information on foods that have a direct effect and nutrients in that food, and research on apps that can obtain information about individual foods is insufficient. In this paper, we propose an app that analyzes food factors and provides self-medication for health promotion of the silver generation. This app allows the silver generation to conveniently and easily obtain information such as nutrients, calories, and efficacy of food they need. In addition, this app collects/categorizes healthy food information through a textom solution-based crawling agent, and stores highly relevant words in a data resource. In addition, wide deep learning was applied to enable self-medication recommendations for food. When this technique is applied, the most appropriate healthy food is suggested to people with similar eating patterns and tastes in the same age group, and users can receive recommendations on customized healthy foods that they need before eating. This made it possible to obtain convenient healthy food information through a customized interface for the elderly through a smartphone.

Environmental Sound Classification for Selective Noise Cancellation in Industrial Sites (산업현장에서의 선택적 소음 제거를 위한 환경 사운드 분류 기술)

  • Choi, Hyunkook;Kim, Sangmin;Park, Hochong
    • Journal of Broadcast Engineering
    • /
    • v.25 no.6
    • /
    • pp.845-853
    • /
    • 2020
  • In this paper, we propose a method for classifying environmental sound for selective noise cancellation in industrial sites. Noise in industrial sites causes hearing loss in workers, and researches on noise cancellation have been widely conducted. However, the conventional methods have a problem of blocking all sounds and cannot provide the optimal operation per noise type because of common cancellation method for all types of noise. In order to perform selective noise cancellation, therefore, we propose a method for environmental sound classification based on deep learning. The proposed method uses new sets of acoustic features consisting of temporal and statistical properties of Mel-spectrogram, which can overcome the limitation of Mel-spectrogram features, and uses convolutional neural network as a classifier. We apply the proposed method to five-class sound classification with three noise classes and two non-noise classes. We confirm that the proposed method provides improved classification accuracy by 6.6% point, compared with that using conventional Mel-spectrogram features.

Proposal of mobile application for rounded shoulder improvement in connection with EMG sensor (근전도 센서를 연동한 둥근 어깨 개선 모바일 어플리케이션 제안)

  • Park, So-Mi;Kay, Yoonshin;Im, Hee-Su;Park, Su-E
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.5
    • /
    • pp.667-676
    • /
    • 2021
  • Recently, adolescents in Korea are exposed to the risk of postural imbalance due to overuse of smartphones and lack of physical activity due to the amount of learning. In addition, the need for effective non-face-to-face exercise services is increasing due to Corona 19. With this in mind, this study proposes an exercise service using an EMG sensor to overcome the limitations of non-face-to-face services while providing the effect of improving round shoulders for adolescents. An exercise program that can improve round shoulders was constructed, and an application in conjunction with an EMG sensor was implemented to exercise effectively. The exercise program was configured to alternately exercise the target muscle area for 4 weeks, and the function to provide feedback was added by measuring the EMG values that change accordingly. Through this study, we intend to provide the basis for exercise-based posture correction digital service, and improve the unbalanced body through this, thereby promoting the possibility of health promotion.

Design and Implementation of Observation Manipulation Model for Creating Kids Contents Based on Augmented Reality (증강현실 기반의 키즈 콘텐츠 제작을 위한 관찰 조작형 모델의 설계 및 구현)

  • Oh, Am-Suk
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.3
    • /
    • pp.339-345
    • /
    • 2021
  • With the development of online education due to COVID-19, the EduTech market, which combines new technologies such as AI and AR/VR in education is rapidly growing. In addition, the children's industry is steadily growing despite the decreasing birth rate every year as more and more families with one child per household are investing in their children. However, supply of contents to EduTech market is slow compared to demands that are increasing. Therefore, the purpose of this paper is to help solve these problems by developing and supporting AR kids contents with convenience, practicality, and efficiency using AR technology. AR content for supporting vocabulary learning for infants is not just an end to watching and listening, but an observation-driven model that can manipulate content directly, which attracts children's interest and helps children learn words. This paper is intended for infants from 15 months to 36 months old when full-fledged language development occurs.

Youtube Influencer's Startup Strategy Using Lean Startup Technique (린스타트업 기법을 활용한 유튜브 인플루언서의 창업전략)

  • Park, Jeong Sun;Park, Sang Hyeok;Kim, Young Lag
    • The Journal of Information Systems
    • /
    • v.31 no.1
    • /
    • pp.147-173
    • /
    • 2022
  • Purpose As the use of social network services has become common, it has become possible to freely communicate and establish relationships with other people anytime, anywhere for communication and information sharing. Influencers who have a strong influence on consumers' perceptions and attitudes through their own opinions and stories have appeared on various social media channels such as YouTube. Recently, companies utilize influencers with a large number of followers to check interactions with customers to understand customer attitudes and opinions about products in real time. Start-ups with insufficient resources need to quickly examine customer responses to reduce the probability of failure after product planning. The Lean process of creating an MVP and quickly confirming and learning the market response should be repeated over and over again. Findings In this paper, we try to suggest that the YouTube platform can play a sufficient role as a customer experiment space through examples. The case company is a company that has successfully commercialized products by continuously interacting with customers through the YouTube platform for the first four months of its founding. This paper is expected to be helpful in the experimental process for prospective founders and early founders to examine customer responses to reduce the probability of market failure before commercialization. Design/methodology/approach This paper analyzed the YouTube channel data of case companies based on the netnography methodology and presented the contents of the lean process management carried out in the experimental stage and the post-production stage through interview research.

Facial Image Synthesis by Controlling Skin Microelements (피부 미세요소 조절을 통한 얼굴 영상 합성)

  • Kim, Yujin;Park, In Kyu
    • Journal of Broadcast Engineering
    • /
    • v.27 no.3
    • /
    • pp.369-377
    • /
    • 2022
  • Recent deep learning-based face synthesis research shows the result of generating a realistic face including overall style or elements such as hair, glasses, and makeup. However, previous methods cannot create a face at a very detailed level, such as the microstructure of the skin. In this paper, to overcome this limitation, we propose a technique for synthesizing a more realistic facial image from a single face label image by controlling the types and intensity of skin microelements. The proposed technique uses Pix2PixHD, an Image-to-Image Translation method, to convert a label image showing the facial region and skin elements such as wrinkles, pores, and redness to create a facial image with added microelements. Experimental results show that it is possible to create various realistic face images reflecting fine skin elements corresponding to this by generating various label images with adjusted skin element regions.

Compression of DNN Integer Weight using Video Encoder (비디오 인코더를 통한 딥러닝 모델의 정수 가중치 압축)

  • Kim, Seunghwan;Ryu, Eun-Seok
    • Journal of Broadcast Engineering
    • /
    • v.26 no.6
    • /
    • pp.778-789
    • /
    • 2021
  • Recently, various lightweight methods for using Convolutional Neural Network(CNN) models in mobile devices have emerged. Weight quantization, which lowers bit precision of weights, is a lightweight method that enables a model to be used through integer calculation in a mobile environment where GPU acceleration is unable. Weight quantization has already been used in various models as a lightweight method to reduce computational complexity and model size with a small loss of accuracy. Considering the size of memory and computing speed as well as the storage size of the device and the limited network environment, this paper proposes a method of compressing integer weights after quantization using a video codec as a method. To verify the performance of the proposed method, experiments were conducted on VGG16, Resnet50, and Resnet18 models trained with ImageNet and Places365 datasets. As a result, loss of accuracy less than 2% and high compression efficiency were achieved in various models. In addition, as a result of comparison with similar compression methods, it was verified that the compression efficiency was more than doubled.

Image Enhancement based on Piece-wise Linear Enhancement Curves for Improved Visibility under Sunlight (햇빛 아래에서 향상된 시인성을 위한 Piece-wise Linear Enhancement Curves 기반 영상 개선)

  • Lee, Junmin;Song, Byung Cheol
    • Journal of Broadcast Engineering
    • /
    • v.27 no.5
    • /
    • pp.812-815
    • /
    • 2022
  • Images displayed on a digital devices under the sunlight are generally perceived to be darker than the original images, which leads to a decrease in visibility. For better visibility, global luminance compensation or tone mapping adaptive to ambient lighting is required. However, the existing methods have limitations in chrominance compensation and are difficult to use in real world due to their heavy computational cost. To solve these problems, this paper propose a piece-wise linear curves (PLECs)-based image enhancement method to improve both luminance and chrominance. At this time, PLECs are regressed through deep learning and implemented in the form of a lookup table to real-time operation. Experimental results show that the proposed method has better visibility compared to the original image with low computational cost.