• Title/Summary/Keyword: 이미지 기반 모델링

Search Result 137, Processing Time 0.032 seconds

Color-based Emotion Analysis Using Fuzzy Logic (퍼지 논리를 이용한 색채 기반 감성 분석)

  • Woo, Young-Woon;Kim, Chang-Kyu;Kim, Chee-Yong
    • Journal of Digital Contents Society
    • /
    • v.9 no.2
    • /
    • pp.245-250
    • /
    • 2008
  • Psychology of color is a research field of psychology for studying human's behavior connected with color. Color carries symbolism and image while sharing psychological consensus with human. Each color has a respective image such as hope, passion, love, life, death, and so on. Peculiar stimuli by colors on these images have great influence on human's emotion and psychology. We therefore proposed a method for understanding human's state of emotion based on colors in this paper. In order to understand human's state of emotion, we analyzed color information used to model a room by a user and then described frequencies of each color as percent using fuzzy inference rules by membership values of fuzzy membership functions for colors used for modeling the room. When we applied the proposed color-based emotion analysis method to emotional state based on colors of Alschuler and Hattwick, we could see the proposed method is efficient.

  • PDF

A Study on the Production of 3D Datasets for Stone Pagodas by Period in Korea

  • Byong-Kwon Lee;Eun-Ji Kim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.9
    • /
    • pp.105-111
    • /
    • 2023
  • Currently, most of content restoration using artificial intelligence learning is 2D learning. However, 3D form of artificial intelligence learning is in an incomplete state due to the disadvantage of requiring a lot of computation and learning speed from the existing 2 axes (X, Y) to 3 axes (X, Y, Z). The purpose of this paper is to secure a data-set for artificial intelligence learning by analyzing and 3D modeling the stone pagodas of ourinari by era based on the two-dimensional information (image) of cultural assets. In addition, we analyzed the differences and characteristics of towers in each era in Korea, and proposed a feature modeling method suitable for artificial intelligence learning. Restoration of cultural properties relies on a variety of materials, expert techniques and historical archives. By recording and managing the information necessary for the restoration of cultural properties through this study, it is expected that it will be used as an important documentary heritage for restoring and maintaining Korean traditional pagodas in the future.

An Efficient Window Sliding Method for On-road Vehicle License Plate Detection (도로 상 차량 번호판 검출을 위한 효율적인 윈도우 슬라이딩 기법)

  • Mo, Hong-Chul;Nang, Jong-Ho
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06a
    • /
    • pp.450-453
    • /
    • 2011
  • 고화질의 디지털 카메라 및 스마트폰, 감시용 카메라의 보급 등으로 인해 최근 패턴 인식 및 이미지 프로세싱 분야에서 고화질의 이미지 및 비디오를 처리해야 하는 경우가 많아지고 있다. 특히 차량 번호판 감지 등과 같은 객체 인식 분야의 경우, 고화질의 이미지로 인해 그만큼 인식에 필요한 계산 비용이 증가하게 되었는데 따라서 이러한 계산 비용을 효율적으로 줄이기 위한 기법이 요구되고 있다. 또한 기존의 차량 번호판 감지의 도메인과는 다르게 도로 상에서의 실시간 차량 번호판 감지의 필요성이 대두되고 있기에 본 논문에서는 도로 상에서의 실시간 번호판 감지 시스템을 위한 차량 번호판 주변정보 기반의 효율적인 윈도우 슬라이딩(window sliding) 방법을 제안한다. 본 논문의 시스템은 총 3단계로, (1) SVM(Supported Vector Machine) 을 통한 차량 번호판 주위 정보에 대한 학습, (2) 도로 상의 번호판 위치 확률 모델링을 통한 탐색 공간의 감소, (3) $context_{plate}$분류기를 통한 OCS(operator context scanning)의 수행이다. 이와 같은 $context_{plate}$분류기와 OCS를 통해 번호판 검출을 위한 윈도우 슬라이딩의 수가 크게 줄었음을 알 수 있었으며, 또한 번호판의 정보를 건너뛰지 않고, 신뢰성 있게 접근함을 알 수 있었다.

Web-based 3D Face Modeling System for Hairline Modification Surgery (헤어라인 교정 시술을 위한 웹기반 얼굴 3D 모델링)

  • Lee, Sang-Wook;Jang, Yoon-Hee;Jeong, Eun-Young
    • The Journal of the Korea Contents Association
    • /
    • v.11 no.11
    • /
    • pp.91-101
    • /
    • 2011
  • This research aims to suggest web-based 3D face modeling system for hairline modification surgery. As public interests in beauty regarding face escalate with era of wide persoanl mobile smart iCT devices, need for medical information system is urgent and increasing demand. This research attempted to build 3D facing modeling library deploying conventional technology and proprietary software available. Implications from the our experiment found that problems and requirement for developing new web based standard. We suggest new system from our experiment and literature review regarding relevant technologies. Main features of our suggested systems is based on studies regarding hair loss treatment such as medical science, beauty studies and information technology. This system processes input images of 2D frontal and profile pictures of face into 3D face modeling with mesh-data. The mesh data is compatible with web standard technology including SVG and Canvas Tag supported natively by HTML5.

Deep Learning based Singing Voice Synthesis Modeling (딥러닝 기반 가창 음성합성(Singing Voice Synthesis) 모델링)

  • Kim, Minae;Kim, Somin;Park, Jihyun;Heo, Gabin;Choi, Yunjeong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.127-130
    • /
    • 2022
  • This paper is a study on singing voice synthesis modeling using a generator loss function, which analyzes various factors that may occur when applying BEGAN among deep learning algorithms optimized for image generation to Audio domain. and we conduct experiments to derive optimal quality. In this paper, we focused the problem that the L1 loss proposed in the BEGAN-based models degrades the meaning of hyperparameter the gamma(𝛾) which was defined to control the diversity and quality of generated audio samples. In experiments we show that our proposed method and finding the optimal values through tuning, it can contribute to the improvement of the quality of the singing synthesis product.

  • PDF

Character-based Subtitle Generation by Learning of Multimodal Concept Hierarchy from Cartoon Videos (멀티모달 개념계층모델을 이용한 만화비디오 컨텐츠 학습을 통한 등장인물 기반 비디오 자막 생성)

  • Kim, Kyung-Min;Ha, Jung-Woo;Lee, Beom-Jin;Zhang, Byoung-Tak
    • Journal of KIISE
    • /
    • v.42 no.4
    • /
    • pp.451-458
    • /
    • 2015
  • Previous multimodal learning methods focus on problem-solving aspects, such as image and video search and tagging, rather than on knowledge acquisition via content modeling. In this paper, we propose the Multimodal Concept Hierarchy (MuCH), which is a content modeling method that uses a cartoon video dataset and a character-based subtitle generation method from the learned model. The MuCH model has a multimodal hypernetwork layer, in which the patterns of the words and image patches are represented, and a concept layer, in which each concept variable is represented by a probability distribution of the words and the image patches. The model can learn the characteristics of the characters as concepts from the video subtitles and scene images by using a Bayesian learning method and can also generate character-based subtitles from the learned model if text queries are provided. As an experiment, the MuCH model learned concepts from 'Pororo' cartoon videos with a total of 268 minutes in length and generated character-based subtitles. Finally, we compare the results with those of other multimodal learning models. The Experimental results indicate that given the same text query, our model generates more accurate and more character-specific subtitles than other models.

Performance Criterion-based Polynomial Calibration Model for Laser Scan Camera (레이저 스캔 카메라 보정을 위한 성능지수기반 다항식 모델)

  • Baek, Gyeong-Dong;Cheon, Seong-Pyo;Kim, Su-Dae;Kim, Sung-Shin
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.21 no.5
    • /
    • pp.555-563
    • /
    • 2011
  • The goal of image calibration is to find a relation between image and world coordinates. Conventional image calibration uses physical camera model that is able to reflect camera's optical properties between image and world coordinates. In this paper, we try to calibrate images distortion using performance criterion-based polynomial model which assumes that the relation between image and world coordinates can be identified by polynomial equation and its order and parameters are able to be estimated with image and object coordinate values and performance criterion. In order to overcome existing limitations of the conventional image calibration model, namely, over-fitting feature, the performance criterion-based polynomial model is proposed. The efficiency of proposed method can be verified with 2D images that were taken by laser scan camera.

Realistic 3D Scene Reconstruction from an Image Sequence (연속적인 이미지를 이용한 3차원 장면의 사실적인 복원)

  • Jun, Hee-Sung
    • The KIPS Transactions:PartB
    • /
    • v.17B no.3
    • /
    • pp.183-188
    • /
    • 2010
  • A factorization-based 3D reconstruction system is realized to recover 3D scene from an image sequence. The image sequence is captured from uncalibrated perspective camera from several views. Many matched feature points over all images are obtained by feature tracking method. Then, these data are supplied to the 3D reconstruction module to obtain the projective reconstruction. Projective reconstruction is converted to Euclidean reconstruction by enforcing several metric constraints. After many triangular meshes are obtained, realistic reconstruction of 3D models are finished by texture mapping. The developed system is implemented in C++, and Qt library is used to implement the system user interface. OpenGL graphics library is used to realize the texture mapping routine and the model visualization program. Experimental results using synthetic and real image data are included to demonstrate the effectiveness of the developed system.

Design and Implementation of a WEB Based Courseware for Geometric Solids Using VRML (VRML을 이용한 웹 기반 입체도형학습 코스웨어의 설계 및 구현)

  • Kim, Joung-Hwa;Woo, Jong-Jung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2003.11a
    • /
    • pp.219-222
    • /
    • 2003
  • 웹 코스웨어의 대부분은 2 차원적인 텍스트와 이미지를 이용한 것으로 설계되어 있으나 3 차원의 입체개념 형성이 필요한 입체도형 학습에서는 효과적인 학습이 되기 어렵다. 본 논문은 WWW에서 3차원 가상현실을 적용하여 구현한 웹 코스웨어로 중학생을 위한 입체도형 학습을 주제로 하였다. 2 차원 평면공간에서는 설명하기 어려운 입체도형의 성질을 3 차원의 가상현실의 공간에서 학습자 스스로 다양한 경험을 통해 이를 이해하고 학습의 개별화 요구를 충족시키는데 그 목적이 있다. 이를 위해 학습자가 주도적으로 학습을 조작, 진행해 나갈 수 있는 구성주의 학습이론을 기반으로 웹에서 3 차원 가상공간을 제공하는 스크립트 언어인 VRML2.0 을 이용하여 모델링하여 동적인 학습과 상호작용성을 높일 수 있도록 구현하였다.

  • PDF

Study for Injurious Multimedia Contents Analysis Mechanism in Smart Devices (스마트 기기에서 유해 멀티미디어 콘텐츠 판별 메커니즘 및 성능 분석)

  • Min, Sun-Ho;Kim, Seok-Woo;Ha, Kyeoung-Ju;Seo, Chang-Ho
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.23 no.6
    • /
    • pp.1001-1006
    • /
    • 2013
  • In this paper, Recently, we describe the distinction mechanism analysis and injurious distinction mechanism performance analysis in order to determine harmfulness of the injurious multimedia which is being rapidly spread in the smart phone and Intelligent Robots. Based on the injurious mechanism distinction technologies, We defined individual injurious characteristics elements of multimedia(images and videos). Also, We analyze harmfulness of the injurious multimedia content by the visual characteristics modeling.