• Title/Summary/Keyword: 영상 특징추출 (image feature extraction)


Design of Narrative Text Visualization Through Character-net (캐릭터 넷을 통한 내러티브 텍스트 시각화 디자인 연구)

  • Jeon, Hea-Jeong;Park, Seung-Bo;Lee, O-Joun;You, Eun-Soon
    • The Journal of the Korea Contents Association / v.15 no.2 / pp.86-100 / 2015
  • Advances driven by the Internet and the Smart Revolution have increased the amount and diversified the types of data generated by users. Big Data, a concept for analyzing enormous amounts of data and deriving new value from them, is now at the center of attention. In particular, searching content stored as Big Data requires efforts to analyze the narratives within video clips and to study how to visualize them. As part of this research, this paper analyzes the dialogues exchanged among characters and presents an interface named "Character-net" developed for modelling narratives. Character-net automatically extracts characters by analyzing narrative videos and models the relationships between them. This suggests a tool that can visualize a narrative using an approach different from those of existing studies. However, it has drawbacks: its applications are limited, and a narrative's features are difficult to grasp at a glance. We assumed that Character-net could be improved by introducing information design. Against this backdrop, the paper first briefly explains visualization design in the field of data information design and reviews research cases on visualizing narratives in videos. It then introduces the key ideas of Character-net and its technical differences from existing studies, and suggests design-side methods for improving it.
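
The abstract describes modelling character relationships from the dialogues exchanged among characters. The following is a minimal sketch of one way such a network could be represented, assuming each dialogue is already available as a (speaker, listener) pair and that edge weights are simple utterance counts. The function name `build_character_net` and the sample data are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def build_character_net(dialogues):
    """Build a weighted directed graph from (speaker, listener) pairs.

    Assumption: the edge weight is the number of utterances from the
    speaker to the listener. The paper may weight edges differently.
    """
    graph = defaultdict(lambda: defaultdict(int))
    for speaker, listener in dialogues:
        graph[speaker][listener] += 1
    # Convert nested defaultdicts to plain dicts for easier inspection.
    return {speaker: dict(targets) for speaker, targets in graph.items()}


# Hypothetical dialogue list for illustration only.
dialogues = [
    ("Alice", "Bob"),
    ("Bob", "Alice"),
    ("Alice", "Bob"),
    ("Alice", "Carol"),
]
character_net = build_character_net(dialogues)
```

A nested dictionary keeps the sketch dependency-free. A graph library would be a natural substitute if layout or centrality measures were needed for visualization.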

A Study on Lip-reading Enhancement Using Time-domain Filter (시간영역 필터를 이용한 립리딩 성능향상에 관한 연구)

  • 신도성;김진영;최승호
    • The Journal of the Acoustical Society of Korea / v.22 no.5 / pp.375-382 / 2003
  • Bimodal lip-reading is used to improve the speech recognition rate in noisy environments. Detecting the lip image correctly is the most important step, but stable performance is hard to achieve in dynamic environments because many factors degrade lip-reading performance, including illumination changes, the speaker's pronunciation habits, the variability of lip shape, and rotation or size changes of the lips. In this paper, we propose IIR filtering in the time domain for stable performance. Digital filtering in the time domain is well suited to removing noise from speech and improving recognition performance. Whereas lip-reading on the whole lip image produces a massive amount of data, applying Principal Component Analysis (PCA) as a pre-processing step reduces the data quantity by extracting features without losing image information. To observe speech recognition performance using image information only, we ran a recognition experiment on 22 words chosen from an available car service. We used a Hidden Markov Model as the speech recognition algorithm to compare recognition performance on these words. As a result, whereas the recognition rate of lip-reading using PCA is 64%, applying the time-domain filter to lip-reading raises the recognition rate to 72.4%.
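
To illustrate time-domain IIR filtering of a feature sequence, here is a minimal sketch of a first-order low-pass IIR filter applied independently to each feature dimension across frames. The abstract does not give the filter order or coefficients, so the first-order form, the `alpha` parameter, and the sample feature values are assumptions chosen only for illustration.

```python
def iir_smooth(sequence, alpha=0.5):
    """First-order IIR low-pass filter applied along the time axis.

    y[t] = alpha * x[t] + (1 - alpha) * y[t-1], per feature dimension.
    Assumption: the paper's actual filter design is not stated here.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    smoothed = []
    previous = None
    for frame in sequence:
        if previous is None:
            # Initialise the filter state with the first frame.
            current = list(frame)
        else:
            current = [alpha * x + (1.0 - alpha) * y
                       for x, y in zip(frame, previous)]
        smoothed.append(current)
        previous = current
    return smoothed


# Hypothetical per-frame feature vectors (for example, PCA coefficients).
features = [[1.0, 0.0], [3.0, 0.0], [1.0, 4.0]]
filtered = iir_smooth(features, alpha=0.5)
```

Because the output depends on the previous output, sudden frame-to-frame jumps in a feature are damped, which is the general motivation for filtering features over time.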

Application of Eco-friendly Planning of Sinseo Innovation City in Daegu using the Analysis of Satellite Image and Field Survey (위성영상 분석과 현장조사를 통한 대구 신서혁신도시의 친환경적 도시계획의 적용 검토)

  • Kim, Jiyeong;Kim, Eun Jung
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.37 no.3 / pp.143-156 / 2019
  • The purpose of this study is to examine whether the Sinseo Innovation City of Daegu has been developed in an eco-friendly manner, by analyzing changes in NDVI (Normalized Difference Vegetation Index) and LST (Land Surface Temperature) and by conducting field surveys. Using Landsat satellite images, it compares NDVI and LST between 2008 and 2018. The results are as follows. First, the NDVI decreased by 0.07 and the zLST increased by $0.85^{\circ}C$ over the 10 years. Second, districts with lower NDVI and higher zLST had a concentration of infrastructure built with impermeable materials, whereas districts with higher NDVI and lower zLST used urban design techniques such as permeable parking lots, green roofs, and permeable pavement. Third, districts with higher NDVI and lower zLST had properly applied the eco-friendly planning items in the district unit plan guideline. The study is meaningful in that it suggests planning directions and urban planning elements for environmentally friendly development.
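
NDVI is conventionally computed per pixel as (NIR - Red) / (NIR + Red). The sketch below shows that formula and a comparison of the mean NDVI between two dates. The reflectance values are made up for illustration, and the helper names `ndvi` and `mean_ndvi` are hypothetical. The study's own preprocessing of the Landsat bands is not described in the abstract.

```python
def ndvi(nir, red):
    """Normalized Difference Vegetation Index for one pixel."""
    total = nir + red
    if total == 0:
        # Avoid division by zero for pixels with no signal.
        return 0.0
    return (nir - red) / total


def mean_ndvi(pixels):
    """Mean NDVI over a list of (nir, red) reflectance pairs."""
    values = [ndvi(nir, red) for nir, red in pixels]
    return sum(values) / len(values)


# Hypothetical (nir, red) reflectances for the same district in two years.
pixels_2008 = [(0.50, 0.10), (0.40, 0.20)]
pixels_2018 = [(0.45, 0.15), (0.30, 0.30)]
change = mean_ndvi(pixels_2018) - mean_ndvi(pixels_2008)
```

A negative `change` indicates a loss of vegetation signal between the two dates, which is the direction of the 0.07 decrease reported in the abstract.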

A preliminary study for development of an automatic incident detection system on CCTV in tunnels based on a machine learning algorithm (기계학습(machine learning) 기반 터널 영상유고 자동 감지 시스템 개발을 위한 사전검토 연구)

  • Shin, Hyu-Soung;Kim, Dong-Gyou;Yim, Min-Jin;Lee, Kyu-Beom;Oh, Young-Sup
    • Journal of Korean Tunnelling and Underground Space Association / v.19 no.1 / pp.95-107 / 2017
  • This preliminary study was undertaken for the development of an automatic tunnel incident detection system based on a machine learning algorithm, which detects incidents taking place in a tunnel in real time and identifies the type of incident. Two road sites where CCTVs are operating were selected, and part of the CCTV footage was processed to produce sets of training data. The data sets consist of the position and time information of moving objects on the CCTV screen, extracted by detecting and tracking objects entering the screen with a conventional image processing technique. The data sets are matched with 6 categories of events, such as lane change and stopping, which are also included in the training data. The training data are learnt by a resilience neural network with two hidden layers. Nine architectural models were set up for parametric studies, from which the 300 (first hidden layer)-150 (second hidden layer) model was found to be optimal, giving the highest accuracy on both the training data and testing data not used for training. The study showed that highly variable and complex traffic and incident features could be identified well by machine learning, without any predefined feature rules. In addition, the detection capability and accuracy of the machine learning based system will improve automatically as more CCTV image data from tunnels accumulate.

Selective Incremental Learning for Face Tracking Using Staggered Multi-Scale LBP (얼굴 추적에서의 Staggered Multi-Scale LBP를 사용한 선택적인 점진 학습)

  • Lee, Yonggeol;Choi, Sang-Il
    • Journal of the Institute of Electronics and Information Engineers / v.52 no.5 / pp.115-123 / 2015
  • The incremental learning method performs well in face tracking. However, it is sensitive to tracking errors in the previous frame caused by environmental changes. In this paper, we propose a selective incremental learning method to track a face more reliably under various conditions. The proposed method is robust to illumination variation because it uses LBP (Local Binary Pattern) features for each individual frame. We select the patches to be used in incremental learning by using Staggered Multi-Scale LBP, which prevents the propagation of tracking errors that occurred in the previous frame. The experimental results show that the proposed method improves face tracking performance on videos with environmental changes such as illumination variation.
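
The robustness of LBP to illumination variation can be seen in a small sketch of the basic 3x3 operator, which encodes only whether each neighbor is at least as bright as the center pixel. This shows the standard single-scale LBP, not the Staggered Multi-Scale variant proposed in the paper. The neighbor ordering and the sample patch are assumptions for illustration.

```python
# Neighbor offsets, clockwise from the top-left pixel (an assumed ordering).
OFFSETS = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
           (1, 1), (1, 0), (1, -1), (0, -1)]


def lbp_code(image, row, col):
    """Basic 3x3 LBP code for the pixel at (row, col)."""
    center = image[row][col]
    code = 0
    for bit, (dr, dc) in enumerate(OFFSETS):
        # Set the bit when the neighbor is at least as bright as the center.
        if image[row + dr][col + dc] >= center:
            code |= 1 << bit
    return code


def lbp_image(image):
    """LBP codes for all interior pixels of a 2D grayscale image."""
    rows, cols = len(image), len(image[0])
    return [[lbp_code(image, r, c) for c in range(1, cols - 1)]
            for r in range(1, rows - 1)]


# Hypothetical grayscale patch, and the same patch under brighter lighting.
patch = [[10, 20, 30], [40, 50, 60], [70, 80, 90]]
brighter = [[value + 100 for value in row] for row in patch]
same_codes = lbp_image(patch) == lbp_image(brighter)
```

Adding a constant brightness offset leaves every comparison with the center unchanged, so the two patches produce identical codes.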

Neural-network based Computerized Emotion Analysis using Multiple Biological Signals (다중 생체신호를 이용한 신경망 기반 전산화 감정해석)

  • Lee, Jee-Eun;Kim, Byeong-Nam;Yoo, Sun-Kook
    • Science of Emotion and Sensibility / v.20 no.2 / pp.161-170 / 2017
  • Emotion affects many parts of human life, such as learning ability, behavior, and judgment, and it is important for understanding human nature. What an emotion actually is can only be inferred from facial expressions or gestures. In particular, emotion is difficult to classify, not only because individuals feel emotions differently but also because a visually induced emotion is not sustained throughout the whole testing period. To solve this problem, we acquired bio-signals and extracted features from them, which offer objective information about the emotional stimulus. The emotion pattern classifier was composed of an unsupervised learning algorithm with hidden nodes and feature vectors. A restricted Boltzmann machine (RBM) based on probability estimation was used for the unsupervised learning and maps emotion features to transformed dimensions. The emotion was characterized by non-linear classifiers with the hidden nodes of a multi-layer neural network, named a deep belief network (DBN). The accuracy of the DBN (about 94%) was better than that of a back-propagation neural network (about 40%). The DBN showed good performance as the emotion pattern classifier.

An Improved RSR Method to Obtain the Sparse Projection Matrix (희소 투영행렬 획득을 위한 RSR 개선 방법론)

  • Ahn, Jung-Ho
    • Journal of Digital Contents Society / v.16 no.4 / pp.605-613 / 2015
  • This paper addresses the problem of making the projection matrix sparse in pattern recognition methods. The size of a computer program is often restricted in embedded systems, and developed programs often include constant data. For example, many pattern recognition programs use a projection matrix for dimension reduction. To improve recognition performance, very high-dimensional feature vectors are often extracted, in which case the projection matrix can be very large. Recently, the RSR (rotated sparse regression) method [1] was proposed and has proved to be one of the best algorithms for obtaining a sparse matrix. We propose three methods to improve RSR: outlier removal, sampling, and elastic net RSR (E-RSR), in which the penalty term in the RSR optimization function is replaced by that of elastic net regression. The experimental results show that the proposed methods are very effective and improve the sparsity rate dramatically without sacrificing the recognition rate, compared to the original RSR method.
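
The elastic net penalty combines an L1 term, which encourages zeros, with a squared L2 term. The sketch below computes that standard penalty for a matrix, together with a simple sparsity measure. The weights `l1_weight` and `l2_weight`, the definition of sparsity rate as the fraction of zero entries, and the sample matrix are assumptions. The full RSR and E-RSR objectives are not given in the abstract and are not reproduced here.

```python
def elastic_net_penalty(matrix, l1_weight, l2_weight):
    """Elastic net penalty: l1_weight * sum|w| + l2_weight * sum(w^2)."""
    l1_term = sum(abs(value) for row in matrix for value in row)
    l2_term = sum(value * value for row in matrix for value in row)
    return l1_weight * l1_term + l2_weight * l2_term


def sparsity_rate(matrix, tol=1e-12):
    """Fraction of entries that are (numerically) zero.

    Assumption: this is one common way to define sparsity, not
    necessarily the paper's definition.
    """
    entries = [value for row in matrix for value in row]
    zeros = sum(1 for value in entries if abs(value) <= tol)
    return zeros / len(entries)


# Hypothetical 2x2 projection matrix for illustration only.
projection = [[0.0, 2.0], [-1.0, 0.0]]
penalty = elastic_net_penalty(projection, l1_weight=0.5, l2_weight=0.25)
rate = sparsity_rate(projection)
```

A sparser projection matrix needs fewer stored constants, which is the motivation the abstract gives for embedded systems with restricted program size.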

Generalized Steganalysis using Deep Learning (딥러닝을 이용한 범용적 스테그아날리시스)

  • Kim, Hyunjae;Lee, Jaekoo;Kim, Gyuwan;Yoon, Sungroh
    • KIISE Transactions on Computing Practices / v.23 no.4 / pp.244-249 / 2017
  • Steganalysis detects information hidden by steganography inside general data such as images. Some steganalysis techniques use machine learning (ML). Existing ML approaches to steganalysis are based on extracting features from stego images and modeling them. Recently, deep learning-based methodologies have shown significant improvements in detection accuracy. However, all the existing methods, including deep learning-based ones, have a critical limitation in that they can only detect stego images created by a specific steganography method. In this paper, we propose a generalized steganalysis method that can model multiple types of stego images using deep learning. Through various experiments, we confirm the effectiveness of our approach and envision directions for future research. In particular, we show that our method can detect each type of steganography with the same level of accuracy as a steganalysis method dedicated to that type, thereby demonstrating the general applicability of our approach to multiple types of stego images.

A Study on Cross-Association between UCI Identification System and Content-based Identifier for Copyright Identification and Management of Broadcasting Content (방송콘텐츠 저작권 식별관리를 위한 UCI 표준식별체계와 내용기반 식별정보의 상호연계 연구)

  • Kim, Joo-Sub;Nam, Je-Ho
    • Journal of Broadcast Engineering / v.14 no.3 / pp.288-298 / 2009
  • In this paper, we propose a scheme to associate a content-based video signature with the Universal Content Identifier (UCI) system for broadcast content, for copyright identification and management. A content-based video signature can identify previously distributed content because it is extracted directly from the content itself, without the process of allocating an identifier such as UCI. We therefore design a schema for UCI application metadata that provides a video signature, in order to consistently maintain a systemic link between UCI and the video signature. We also present scenarios for copyright identification, management, and additional services, based on a mechanism for transmitting and managing the video signature with the UCI identification system.

Cyberspace Coordinate Create for Augmented Reality (증강현실을 위한 가상 공간좌표 생성)

  • Ban, KyeongJin;Ryu, NamHoon;Kim, KyeongOk;Han, JeaJung;Kim, EungKon
    • Proceedings of the Korea Contents Association Conference / 2009.05a / pp.765-769 / 2009
  • Existing augmented reality systems use a data glove or markers to achieve smooth interaction between objects and the background. These are inconvenient to use and reduce the sense of immersion. To strengthen immersion in augmented reality, additional input devices need to be removed, and accurate spatial coordinates must be recognized even when no markers are attached. To improve immersion, this paper proposes a technique for generating virtual space coordinates that allows interaction without wearing additional input devices. The proposed method extracts feature lines from the acquired image, calculates spatial coordinates from them, and projects the virtual object onto the 2D space accordingly. It is applicable to markerless augmented reality and mobile augmented reality.
