• Title/Summary/Keyword: 이미지 예측 모델

Search Result 212, Processing Time 0.027 seconds

Image Mood Classification Using Deep CNN and Its Application to Automatic Video Generation (심층 CNN을 활용한 영상 분위기 분류 및 이를 활용한 동영상 자동 생성)

  • Cho, Dong-Hee;Nam, Yong-Wook;Lee, Hyun-Chang;Kim, Yong-Hyuk
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.9
    • /
    • pp.23-29
    • /
    • 2019
  • In this paper, the mood of images was classified into eight categories through a deep convolutional neural network and video was automatically generated using proper background music. Based on the collected image data, the classification model is learned using a multilayer perceptron (MLP). Using the MLP, a video is generated by using multi-class classification to predict image mood to be used for video generation, and by matching pre-classified music. As a result of 10-fold cross-validation and result of experiments on actual images, each 72.4% of accuracy and 64% of confusion matrix accuracy was achieved. In the case of misclassification, by classifying video into a similar mood, it was confirmed that the music from the video had no great mismatch with images.

Development of RVE Reconstruction Algorithm for SMC Multiscale Modeling (SMC 복합재료 멀티스케일 모델링을 위한 RVE 재구성 알고리즘 개발)

  • Lim, Hyoung Jun;Choi, Ho-Il;Yoon, Sang Jae;Lim, Sang Won;Choi, Chi Hoon;Yun, Gun Jin
    • Composites Research
    • /
    • v.34 no.1
    • /
    • pp.70-75
    • /
    • 2021
  • This paper presents a novel algorithm to reconstruct meso-scale representative volume elements (RVE), referring to experimentally observed features of Sheet Molding Compound (SMC) composites. Predicting anisotropic mechanical properties of SMC composites is challenging in the multiscale virtual test using finite element (FE) models. To this end, an SMC RVE modeler consisting of a series of image processing techniques, the novel reconstruction algorithm, and a FE mesh generator for the SMC composites are developed. First, micro-CT image processing is conducted to estimate probabilistic distributions of two critical features, such as fiber chip orientation and distribution that are highly related to mechanical performance. Second, a reconstruction algorithm for 3D fiber chip packing is developed in consideration of the overlapping effect between fiber chips. Third, the macro-scale behavior of the SMC is predicted by the multiscale analysis.

Forest Change Detection Service Based on Artificial Intelligence Learning Data (인공지능 학습용 데이터 기반의 산림변화탐지 서비스)

  • Chung, Hankun;Kim, Jong-in;Ko, Sun Young;Chai, Seunggi;Shin, Youngtae
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.8
    • /
    • pp.347-354
    • /
    • 2022
  • Since the era of the 4th industrial revolution has been ripe, the use of artificial intelligence(AI) based on massive data is beginning to be actively applied in various fields. However, as the process of analyzing forest species is carried out manually, many errors are occurring. Therefore, in this paper, about 60,000 pieces of AI learning data were automatically analyzed for pine, larch, conifer, and broadleaf trees of aerial photographs and pseudo images in the metropolitan area, and an AI model was developed to distinguish tree species. Through this, it is expected to increase in work efficiency by using the tree species division image as basic data when producing forest change detection and forest field topics.

Development of Virtual Makeup Tool based on Mobile Augmented Reality

  • Song, Mi-Young;Kim, Young-Sun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.1
    • /
    • pp.127-133
    • /
    • 2021
  • In this study, an augmented reality-based make-up tool was built to analyze the user's face shape based on face-type reference model data and to provide virtual makeup by providing face-type makeup. To analyze the face shape, first recognize the face from the image captured by the camera, then extract the features of the face contour area and use them as analysis properties. Next, the feature points of the extracted face contour area are normalized to compare with the contour area characteristics of each face reference model data. Face shape is predicted and analyzed using the distance difference between the feature points of the normalized contour area and the feature points of the each face-type reference model data. In augmented reality-based virtual makeup, in the image input from the camera, the face is recognized in real time to extract the features of each area of the face. Through the face-type analysis process, you can check the results of virtual makeup by providing makeup that matches the analyzed face shape. Through the proposed system, We expect cosmetics consumers to check the makeup design that suits them and have a convenient and impact on their decision to purchase cosmetics. It will also help you create an attractive self-image by applying facial makeup to your virtual self.

Automatic Video Editing Technology based on Matching System using Genre Characteristic Patterns (장르 특성 패턴을 활용한 매칭시스템 기반의 자동영상편집 기술)

  • Mun, Hyejun;Lim, Yangmi
    • Journal of Broadcast Engineering
    • /
    • v.25 no.6
    • /
    • pp.861-869
    • /
    • 2020
  • We introduce the application that automatically makes several images stored in user's device into one video by using the different climax patterns appearing for each film genre. For the classification of the genre characteristics of movies, a climax pattern model style was created by analyzing the genre of domestic movie drama, action, horror and foreign movie drama, action, and horror. The climax pattern was characterized by the change in shot size, the length of the shot, and the frequency of insert use in a specific scene part of the movie, and the result was visualized. The model visualized by genre developed as a template using Firebase DB. Images stored in the user's device were selected and matched with the climax pattern model developed as a template for each genre. Although it is a short video, it is a feature of the proposed application that it can create an emotional story video that reflects the characteristics of the genre. Recently, platform operators such as YouTube and Naver are upgrading applications that automatically generate video using a picture or video taken by the user directly with a smartphone. However, applications that have genre characteristics like movies or include video-generation technology to show stories are still insufficient. It is predicted that the proposed automatic video editing has the potential to develop into a video editing application capable of transmitting emotions.

Business Application of Convolutional Neural Networks for Apparel Classification Using Runway Image (합성곱 신경망의 비지니스 응용: 런웨이 이미지를 사용한 의류 분류를 중심으로)

  • Seo, Yian;Shin, Kyung-shik
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.1-19
    • /
    • 2018
  • Large amount of data is now available for research and business sectors to extract knowledge from it. This data can be in the form of unstructured data such as audio, text, and image data and can be analyzed by deep learning methodology. Deep learning is now widely used for various estimation, classification, and prediction problems. Especially, fashion business adopts deep learning techniques for apparel recognition, apparel search and retrieval engine, and automatic product recommendation. The core model of these applications is the image classification using Convolutional Neural Networks (CNN). CNN is made up of neurons which learn parameters such as weights while inputs come through and reach outputs. CNN has layer structure which is best suited for image classification as it is comprised of convolutional layer for generating feature maps, pooling layer for reducing the dimensionality of feature maps, and fully-connected layer for classifying the extracted features. However, most of the classification models have been trained using online product image, which is taken under controlled situation such as apparel image itself or professional model wearing apparel. This image may not be an effective way to train the classification model considering the situation when one might want to classify street fashion image or walking image, which is taken in uncontrolled situation and involves people's movement and unexpected pose. Therefore, we propose to train the model with runway apparel image dataset which captures mobility. This will allow the classification model to be trained with far more variable data and enhance the adaptation with diverse query image. To achieve both convergence and generalization of the model, we apply Transfer Learning on our training network. As Transfer Learning in CNN is composed of pre-training and fine-tuning stages, we divide the training step into two. First, we pre-train our architecture with large-scale dataset, ImageNet dataset, which consists of 1.2 million images with 1000 categories including animals, plants, activities, materials, instrumentations, scenes, and foods. We use GoogLeNet for our main architecture as it has achieved great accuracy with efficiency in ImageNet Large Scale Visual Recognition Challenge (ILSVRC). Second, we fine-tune the network with our own runway image dataset. For the runway image dataset, we could not find any previously and publicly made dataset, so we collect the dataset from Google Image Search attaining 2426 images of 32 major fashion brands including Anna Molinari, Balenciaga, Balmain, Brioni, Burberry, Celine, Chanel, Chloe, Christian Dior, Cividini, Dolce and Gabbana, Emilio Pucci, Ermenegildo, Fendi, Giuliana Teso, Gucci, Issey Miyake, Kenzo, Leonard, Louis Vuitton, Marc Jacobs, Marni, Max Mara, Missoni, Moschino, Ralph Lauren, Roberto Cavalli, Sonia Rykiel, Stella McCartney, Valentino, Versace, and Yve Saint Laurent. We perform 10-folded experiments to consider the random generation of training data, and our proposed model has achieved accuracy of 67.2% on final test. Our research suggests several advantages over previous related studies as to our best knowledge, there haven't been any previous studies which trained the network for apparel image classification based on runway image dataset. We suggest the idea of training model with image capturing all the possible postures, which is denoted as mobility, by using our own runway apparel image dataset. Moreover, by applying Transfer Learning and using checkpoint and parameters provided by Tensorflow Slim, we could save time spent on training the classification model as taking 6 minutes per experiment to train the classifier. This model can be used in many business applications where the query image can be runway image, product image, or street fashion image. To be specific, runway query image can be used for mobile application service during fashion week to facilitate brand search, street style query image can be classified during fashion editorial task to classify and label the brand or style, and website query image can be processed by e-commerce multi-complex service providing item information or recommending similar item.

The Development of the Prototype for Electronic Journal (전자저널 개발모형에 관한 연구)

  • 정준민
    • Journal of the Korean Society for information Management
    • /
    • v.18 no.3
    • /
    • pp.203-218
    • /
    • 2001
  • The prototype of electronic journal is proposed on the notion of functional properties of internet or web. Additionally, the methodology for the friendly relationship between print oriented journal and electronic one is also suggested. The electronic journal is consisted of 6 menus, administration area, community service, what\`s up & current papers. category service, searching and extend retrieval service. And the electronic journal is designed not only on the succession of the attributes of printed journal, but also on tole emphasis of its electrical properties. But at the end of document, it is mentioned that in the near future all the information service related media or mechanisms will be synthesized and uniquely evolved on the succession of those properties.

  • PDF

Estimating the Direction and Distance of an Unknown Radiation Source Using RMC (RMC를 이용한 미지 선원의 방향, 거리 예측)

  • Shin, Youngjun;Kim, Geehyun;Lee, Gyemin
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.53 no.9
    • /
    • pp.118-125
    • /
    • 2016
  • Rotating modulation collimator(RMC) is a remote sensing technique for a radiation source. This paper introduces an RMC system model and its image reconstruction algorithm based on Kowash's research. The reconstructed image can show the direction of a source. However, the distance to the source cannot be recovered. Moreover, the RMC image suffers from $180^{\circ}$ ambiguity. In this paper, we propose a distance estimation method using two RMCs together with a solution to the ambiguity. We also demonstrate its performance using simulated RMC data.

A Pilot Study on Automatic Diagnosis of Cancer Cells Metastasis in Frozen Section Using Convolutional Neural Network (합성곱 신경망을 이용한 동결절편의 암세포 전이 여부 자동진단에 관한 예비연구)

  • Jung, Dae-Il;Kang, Jae-Ku;Jeon, Hye-Lynn;Oh, Se-Jong;Kim, Sungchul;Kim, Young-Gon;Gong, Gyungyub;Song, In Hye;Park, So Yeon;Ahn, Soomin;Lee, Hyunna;Yang, Dong Hyun;You, Wonsang
    • Annual Conference of KIPS
    • /
    • 2020.05a
    • /
    • pp.480-482
    • /
    • 2020
  • 동결절편검사는 수술과 연계하여 암 전이 여부를 판단하기 위한 응급한 병리검사가 필요할 때 이용된다. 합성곱 신경망은 이미지 분류에 뛰어난 성능을 보이는 딥러닝 기법으로 본 논문에서는 이를 이용하여 유방암 전이 여부를 자동적으로 진단하는 방법을 제안한다. 실험과정은 전처리, 학습, 후처리의 과정으로 구성되어 있으며, 합성곱 신경망으로는 Resnet-18 모델을 사용하였다. 실험결과 예측 정확도 및 종양의 최대 길이 정합 여부를 점수로 환산하여 약 0.514 의 결과를 보였다.

Hyperparameter Search for Facies Classification with Bayesian Optimization (베이지안 최적화를 이용한 암상 분류 모델의 하이퍼 파라미터 탐색)

  • Choi, Yonguk;Yoon, Daeung;Choi, Junhwan;Byun, Joongmoo
    • Geophysics and Geophysical Exploration
    • /
    • v.23 no.3
    • /
    • pp.157-167
    • /
    • 2020
  • With the recent advancement of computer hardware and the contribution of open source libraries to facilitate access to artificial intelligence technology, the use of machine learning (ML) and deep learning (DL) technologies in various fields of exploration geophysics has increased. In addition, ML researchers have developed complex algorithms to improve the inference accuracy of various tasks such as image, video, voice, and natural language processing, and now they are expanding their interests into the field of automatic machine learning (AutoML). AutoML can be divided into three areas: feature engineering, architecture search, and hyperparameter search. Among them, this paper focuses on hyperparamter search with Bayesian optimization, and applies it to the problem of facies classification using seismic data and well logs. The effectiveness of the Bayesian optimization technique has been demonstrated using Vincent field data by comparing with the results of the random search technique.