• Title/Summary/Keyword: 동영상 정보 (video information)

Search Result 2,241, Processing Time 0.025 seconds

A Quantification Method of Human Body Motion Similarity using Dynamic Time Warping for Keypoints Extracted from Video Streams (동영상에서 추출한 키포인트 정보의 동적 시간워핑(DTW)을 이용한 인체 동작 유사도의 정량화 기법)

  • Im, June-Seok;Kim, Jin-Heon
    • Journal of IKEEE / v.24 no.4 / pp.1109-1116 / 2020
  • A matching score that evaluates how well a person copies a movement can be a useful measure for checking children's developmental stages or for assessing sports and dance movements such as a golf swing. It can also serve as an HCI mechanism for AR and VR applications. This paper presents a method for evaluating the motion similarity between a demonstrator, who initiates a movement, and a participant, who follows the demonstrator's action. We present a method for quantifying this similarity based on the Euclidean (L2) distance between OpenPose keypoint vectors. The proposed method adopts DTW and can therefore cope flexibly with time-delayed motions.
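
As a rough illustration of the idea in this abstract, the sketch below computes a DTW alignment cost between two keypoint sequences, using the Euclidean (L2) distance between per-frame keypoint vectors as the local cost. The function names and the flat per-frame vector layout are assumptions made for illustration; this is not the authors' implementation.

```python
import math

def l2_distance(a, b):
    """Euclidean (L2) distance between two flattened keypoint vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def dtw_cost(seq_a, seq_b):
    """Classic O(n*m) dynamic time warping cost between two frame sequences."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    # cost[i][j] = best cumulative cost aligning seq_a[:i] with seq_b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = l2_distance(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

# Toy data (assumed): one keypoint (x, y) per frame; the participant starts one frame late.
demonstrator = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0], [3.0, 0.0]]
participant = [[0.0, 0.0], [0.0, 0.0], [1.0, 0.0], [2.0, 0.0], [3.0, 0.0]]
print(dtw_cost(demonstrator, participant))
```

Because DTW may map several participant frames onto one demonstrator frame, the delayed start in the toy data does not add to the cost, which is the behaviour the abstract describes as coping with time-delayed motions.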

A Design and Implementation of Multimedia Retrieval System based on MAF(Multimedia Application File Format) (MAF(Multimedia Application File Format) 기반 멀티미디어 검색 시스템의 설계 및 구현)

  • Gang Young-Mo;Park Joo-Hyoun;Bang Hyung-Gin;Nang Jong-Ho;Kim Hyung-Chul
    • Journal of KIISE:Computer Systems and Theory / v.33 no.9 / pp.574-584 / 2006
  • Recently, ISO/IEC 23000 (also known as 'MPEG-A') proposed a new file format called 'MAF (Multimedia Application File Format)'[1], which can integrate and store widely used audio and video compression standards together with metadata in MPEG-7 form in a single file. However, it is still very hard to verify the usefulness of MPEG-A in real applications because no real system yet fully implements the standard. This paper presents the design and implementation of a multimedia retrieval system based on the MPEG-A standard for PCs and mobile devices. Furthermore, an extension of MPEG-A for describing video metadata is proposed. It is selected and defined as a subset of MPEG-7 MDS[4] and TV-Anytime[5] for video that is useful and manageable in mobile environments. To design the multimedia retrieval system based on MPEG-A, we define the system requirements in terms of portability, extensibility, compatibility, adaptability, and efficiency. Based on these requirements, we design a system composed of three layers: the Application Layer, the Middleware Layer, and the Platform Layer. The proposed system consists of two sub-parts, a client part and a server part. The client part consists of a MAF authoring tool, a MAF player tool, and a MAF searching tool, which allow users to create, play, and search MAF files, respectively. The server part is composed of modules that store and manage the MAF files and the metadata extracted from them. We show the usefulness of the proposed system by implementing the client system on both the MS-Windows platform on a desktop computer and the WIPI platform on a mobile phone, and we validate that it satisfies all the system requirements. The proposed system can be used to verify the MPEG-A specification and to prove the usefulness of MPEG-A in real applications.

A Study on Integrated Visualization and Mapping Techniques using the Geophysical Results of the Coastal Area of the Dokdo in the East Sea (독도 연안 해저 지구물리 자료의 통합 중첩 주제도 작성 연구)

  • Lee, Myoung Hoon;Kim, Chang Hwan;Park, Chan Hong;Rho, Hyun Soo;Kim, Dae Choul
    • Economic and Environmental Geology / v.49 no.5 / pp.381-388 / 2016
  • The purpose of this study is to integrate and visualize, using mapping techniques, precise seabed geomorphology, seafloor backscattering images, and high-resolution underwater images of the nearshore area around Dokdo in the East Sea. We obtained a precise topographic map of the nearshore area (~50 m) of the southern part of Seodo using a multibeam echosounder system. A side-scan sonar survey for analyzing seafloor backscattering images was carried out over the same area as the topographic data. High-resolution underwater images (zone (a), zone (b), zone (c)) were taken within the significant habitat range of the nearshore area of the southern part of Seodo. Using the bathymetry, the seafloor backscattering images, and the high-resolution underwater images, we produced an integrated visualization of the nearshore area of Dokdo. The integrated visualization techniques make it possible to map the seabed characteristics of the nearshore area of Dokdo. The integrated visualization results present more complex and reliable information than separate geological products for seabed environmental mapping, and they are useful for understanding the relation between seafloor characteristics and the topographic environment of the study area. The integrated visualization techniques and mapping analysis need to be applied continuously and periodically for effective monitoring of the nearshore ecosystem of Dokdo.

Multimedia Information Retrieval Using Semantic Relevancy (의미적 연관성을 이용한 멀티미디어 정보 검색)

  • Park, Chang-Sup
    • Journal of Internet Computing and Services / v.8 no.5 / pp.67-79 / 2007
  • As Web technologies and wired/wireless networks improve and various new multimedia services are introduced, the need to search multimedia, including video data, has been increasing rapidly. Previous approaches to multimedia retrieval, however, do not make efficient use of the relationships among the semantic concepts contained in multimedia content and provide only restricted search results. This paper proposes a multimedia retrieval system that exploits the semantic relevancy of multimedia content based on a domain ontology. We show the effectiveness of the proposed system through experiments on a prototype system we have developed. The proposed system can extend a given search keyword based on the relationships among the semantic concepts in the ontology and can find a wide range of multimedia content semantically relevant to the input keyword. It also presents the results categorized by semantic meaning and by relevancy to the keyword, as derived from the ontology. The proposed system architecture keeps the domain ontology independent of the metadata on the multimedia content.

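
The keyword extension described in the abstract above can be pictured with a small sketch: a search keyword is expanded by walking the relationships in an ontology, and matching items are returned grouped by the concept that matched, nearest concepts first. The toy ontology, the metadata index, the relation names, and the hop-count notion of relevancy are all hypothetical stand-ins, not the system described in the paper.

```python
from collections import deque

# Hypothetical toy ontology: concept -> list of (relation, related concept).
ONTOLOGY = {
    "sports": [("hasSubclass", "soccer"), ("hasSubclass", "golf")],
    "soccer": [("hasEvent", "goal"), ("hasParticipant", "player")],
    "golf": [("hasEvent", "swing")],
}

# Hypothetical metadata index: concept -> annotated multimedia items.
INDEX = {"soccer": ["match.mp4"], "goal": ["highlight.mp4"], "swing": ["lesson.mp4"]}

def expand_keyword(keyword, ontology, max_depth=2):
    """Breadth-first expansion; returns {concept: hop distance from keyword}."""
    distance = {keyword: 0}
    queue = deque([keyword])
    while queue:
        concept = queue.popleft()
        if distance[concept] == max_depth:
            continue  # do not expand beyond the depth limit
        for _relation, related in ontology.get(concept, []):
            if related not in distance:
                distance[related] = distance[concept] + 1
                queue.append(related)
    return distance

def search(keyword, ontology, index, max_depth=2):
    """Return (concept, hops, item) tuples, closest concepts first."""
    expanded = expand_keyword(keyword, ontology, max_depth)
    results = []
    for concept, hops in sorted(expanded.items(), key=lambda kv: (kv[1], kv[0])):
        for item in index.get(concept, []):
            results.append((concept, hops, item))
    return results

for row in search("sports", ONTOLOGY, INDEX):
    print(row)
```

The ontology and the index are kept in separate structures here, loosely mirroring the independence between the domain ontology and the content metadata that the abstract mentions.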

A Study on the Improvement of Military Information Communication Network Efficiency Using CCN (CCN을 활용한 군 정보통신망 효율성 향상 방안)

  • Kim, Hui-Jung;Kwon, Tae-Wook
    • The Journal of the Korea Institute of Electronic Communication Sciences / v.15 no.5 / pp.799-806 / 2020
  • The rapid growth of connections from smartphones to Internet of Things (IoT) devices and the explosive demand for data centered on mobile video are increasing data usage day by day, and this increase creates many problems in the IP system. In a full-based environment, in which information requesters depend on information providers and receive information from specific servers, bottlenecks and large-volume data processing become problems. To address this, CCN, a future network technology, has emerged as an alternative. CCN reduces the bottlenecks that occur when popular content is requested by caching it at intermediate nodes, and thereby increases network efficiency. It can be applied to military information and communication networks to address the problems of traffic concentration and of operating various surveillance equipment, such as scientific monitoring systems, in full-based networks, and to deliver content more efficiently.
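
To make the caching argument concrete, the following sketch models a single intermediate node that keeps a small store of recently requested content and only forwards a request upstream on a cache miss. The class, the LRU eviction policy, the capacity, and the content names are hypothetical choices for illustration; the abstract does not specify a replacement policy or a node design.

```python
from collections import OrderedDict

class CachingNode:
    """Intermediate node with a small LRU content store (hypothetical model)."""

    def __init__(self, upstream, capacity=2):
        self.upstream = upstream          # callable: content name -> data
        self.capacity = capacity
        self.store = OrderedDict()        # content name -> cached data
        self.upstream_requests = 0

    def request(self, name):
        if name in self.store:
            self.store.move_to_end(name)  # mark as most recently used
            return self.store[name]
        self.upstream_requests += 1       # cache miss: forward toward the producer
        data = self.upstream(name)
        self.store[name] = data
        if len(self.store) > self.capacity:
            self.store.popitem(last=False)  # evict the least recently used entry
        return data

def origin_server(name):
    """Stand-in for the content producer."""
    return f"data for {name}"

node = CachingNode(origin_server, capacity=2)
for name in ["/video/a", "/video/a", "/video/b", "/video/a"]:
    node.request(name)
print(node.upstream_requests)
```

In this toy run, repeated requests for the same popular item are answered from the node's store, so fewer requests reach the origin than were issued by consumers.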

Audio Segmentation and Classification Using Support Vector Machine and Fuzzy C-Means Clustering Techniques (서포트 벡터 머신과 퍼지 클러스터링 기법을 이용한 오디오 분할 및 분류)

  • Nguyen, Ngoc;Kang, Myeong-Su;Kim, Cheol-Hong;Kim, Jong-Myon
    • The KIPS Transactions:PartB / v.19B no.1 / pp.19-26 / 2012
  • The rapid increase of information imposes new demands on content management. The purpose of automatic audio segmentation and classification is to meet the rising need for efficient content management. For this reason, this paper proposes a high-accuracy algorithm that segments audio signals and classifies them into classes such as speech, music, silence, and environmental sounds. The proposed algorithm uses a support vector machine (SVM) to detect audio-cuts, which are boundaries between different kinds of sounds, from the parameter sequence. We then extract feature vectors composed of statistical data and use them as input to a fuzzy c-means (FCM) classifier, which partitions the audio segments into different classes. To evaluate the segmentation and classification performance of the proposed SVM-FCM based algorithm, we use precision and recall rates for segmentation and classification accuracy for classification. Furthermore, we compare the proposed algorithm with other methods, including binary and FCM classifiers, in terms of segmentation performance. Experimental results show that the proposed algorithm outperforms the other methods in both precision and recall.
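
For readers unfamiliar with fuzzy c-means, the sketch below shows the standard FCM membership formula, which assigns a feature vector a degree of membership in every cluster rather than a single hard label. It covers only this one step, with cluster centers assumed to be given; the feature values, the two-cluster setup, and the fuzzifier `m = 2` are illustrative assumptions, not details from the paper.

```python
import math

def fcm_memberships(point, centers, m=2.0):
    """Fuzzy c-means membership of one feature vector in each cluster.

    u_i = 1 / sum_k (d_i / d_k) ** (2 / (m - 1)), where d_i is the distance
    to center i and m > 1 is the fuzzifier.
    """
    dists = [math.dist(point, c) for c in centers]
    # A point that coincides with a center belongs fully to that cluster
    # (split evenly if several centers coincide with it).
    zero = [i for i, d in enumerate(dists) if d == 0.0]
    if zero:
        return [1.0 / len(zero) if i in zero else 0.0 for i in range(len(centers))]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((d_i / d_k) ** power for d_k in dists) for d_i in dists]

# Assumed toy centers, e.g. one per audio class in a 2-D feature space.
centers = [[0.0, 0.0], [4.0, 0.0]]
u = fcm_memberships([1.0, 0.0], centers)
print(u)
```

The memberships of a point always sum to one, and a segment would typically be assigned to the class with the largest membership.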

Scene Change Detection with Sequential Access Method in Compressed MPEG Videos (순차접근법을 이용한 MPEG 압축영역에서의 장면전환점 검출)

  • Ahn, Eui-Sub;Song, Hyun-Soo;Lee, Jae-Dong;Kim, Sung-Un
    • The KIPS Transactions:PartB / v.11B no.3 / pp.353-360 / 2004
  • Scene change detection in compressed MPEG videos has been studied using various approaches. However, most of these approaches detect scene changes by decoding the video and then comparing frames pixel by pixel. This is not suitable for real-time applications because decoding takes considerable computing time. Recently, scene change detection algorithms that use only compressed-domain information have become increasingly important. In this paper, we propose a sequential access method as an efficient scene change detection algorithm in the compressed domain. According to the type of pictures in the compressed MPEG video stream (divided into I-blocks, and each I-block into P-blocks), the proposed algorithm detects scene changes effectively by applying sequential access and a block-by-block mechanism. The proposed sequential access method provides fast and accurate detection because most operations are performed in the compressed domain and checking is done in block units, which reduces the checking of unnecessary pictures. This approach also uses an optimal algorithm to provide fast and accurate detection.

An Analysis of the Awareness of Undergraduate Students Toward Educational Methods Utilizing SNS (SNS의 교육적 활용 방법에 대한 대학생들의 인식 분석)

  • Chung, Mihyun;Gim, Yeongrok;Kim, Jaehyoun
    • Journal of Internet Computing and Services / v.15 no.5 / pp.33-41 / 2014
  • The purpose of this study is to suggest, based on statistical analysis, effective methods for using social network services (SNS) for educational purposes, drawing on a survey of undergraduate students who use SNS heavily. To this end, we conducted an open-ended questionnaire survey and qualitatively analyzed the data using the NVivo 10 program. Additionally, to visualize the analysis of SNS educational methods by educational subject, we divided the data into three categories: teacher, student, and common areas. Our analysis showed that the common area included exchange of opinions and discussion, information sharing and learning of content, questions and answers, surveys, intimacy formation, counseling, etc. The teacher area included notice announcements, sharing of teaching materials, real-time assessment (quizzes), attendance checks, assignment grading and feedback, etc. Finally, the student area included project activities (group/individual), preparation and review using shared data, video self-learning, peer review and feedback on projects, lecture evaluation, etc.

Recognition of Human Facial Expressions using Optical Flow of Feature Regions (얼굴 특징영역상의 광류를 이용한 표정 인식)

  • Lee Mi-Ae;Park Ki-Soo
    • Journal of KIISE:Software and Applications / v.32 no.6 / pp.570-579 / 2005
  • Facial expression recognition technology has potential applications in various fields and is being applied to man-machine interface development, human identification tests, and the restoration of facial expressions by virtual models. Using sequential facial images, this study proposes a simpler method for detecting human facial expressions such as happiness, anger, surprise, and sadness. Moreover, the proposed method can detect facial expressions even when the motion in the sequential facial images is not rigid. We identify the face and the elements that determine facial expressions, and then estimate the feature regions of these elements using information about color, size, and position. In the next step, the direction patterns of the feature regions of each element are determined using optical flows estimated by gradient methods. Using the direction model proposed in this study, we match each direction pattern. The method identifies a facial expression based on the minimum score of the combination values between the direction model and the pattern matching for each facial expression. Experiments verify the validity of the proposed method.
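
One plausible way to picture a "direction pattern" is as a histogram of quantized optical-flow directions within a feature region, compared against a per-expression model by a simple distance. The sketch below follows that reading; the eight-sector quantization, the magnitude threshold, the absolute-difference score, and all function names are assumptions for illustration and are not taken from the paper.

```python
import math

def direction_bin(dx, dy, bins=8):
    """Quantize a flow vector into one of `bins` equal angular sectors.

    Bin 0 is centered on the +x axis; bin indices increase with the
    atan2 angle (counter-clockwise when the y axis points up).
    """
    angle = math.atan2(dy, dx) % (2 * math.pi)
    width = 2 * math.pi / bins
    return int((angle + width / 2) // width) % bins

def direction_pattern(flows, bins=8, min_magnitude=0.5):
    """Histogram of quantized directions, ignoring near-zero flow vectors."""
    hist = [0] * bins
    for dx, dy in flows:
        if math.hypot(dx, dy) >= min_magnitude:
            hist[direction_bin(dx, dy, bins)] += 1
    return hist

def pattern_distance(pattern, model):
    """Sum of absolute differences between a pattern and a direction model."""
    return sum(abs(p - q) for p, q in zip(pattern, model))

# Assumed flow vectors (dx, dy) for one feature region, e.g. a mouth corner.
flows = [(1.0, 0.0), (2.0, 0.1), (0.0, 1.0), (0.1, 0.1)]
pattern = direction_pattern(flows)
print(pattern)
```

Under this reading, the expression whose model yields the smallest `pattern_distance` across the feature regions would be selected, which corresponds to the minimum-score matching mentioned in the abstract.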

Dynamic Hand Gesture Recognition Using CNN Model and FMM Neural Networks (CNN 모델과 FMM 신경망을 이용한 동적 수신호 인식 기법)

  • Kim, Ho-Joon
    • Journal of Intelligence and Information Systems / v.16 no.2 / pp.95-108 / 2010
  • In this paper, we present a hybrid neural network model for dynamic hand gesture recognition. The model consists of two modules: a feature extraction module and a pattern classification module. We first propose a modified CNN (convolutional neural network), a pattern recognition model, for the feature extraction module. Then we introduce a weighted fuzzy min-max (WFMM) neural network for the pattern classification module. The data representation proposed in this research is a spatiotemporal template based on the motion information of the target object. To minimize the influence of spatial and temporal variation in the feature points, we extend the receptive field of the CNN model to a three-dimensional structure. We discuss the learning capability of the WFMM neural network, in which a weight concept is added to represent the frequency factor in the training pattern set. The model can overcome the performance degradation that may be caused by the hyperbox contraction process of conventional FMM neural networks. The validity of the proposed models is discussed based on experimental results for human action recognition and for dynamic hand gesture recognition for remote control of electric home appliances.