• 제목/요약/키워드: Video Data

검색결과 3,487건 처리시간 0.047초

요약 비디오 영상과 PCA를 이용한 유사비디오 검출 기법 (Similar Video Detection Method with Summarized Video Image and PCA)

  • 유재만;김우생
    • 한국멀티미디어학회논문지
    • /
    • 제8권8호
    • /
    • pp.1134-1141
    • /
    • 2005
  • 웹 상의 출판이 보편화 될수록 많은 데이터의 내용물들이 압축, 포맷, 편집 등 변형된 상태로 중복해서 존재하게 된다. 이러한 유사한 데이터들은 검색 시 속도나 검색률 등에 문제를 야기 시킬 수도 있으며, 반면에 특정 사이트에 문제가 발생할 경우 다른 사이트의 중복된 데이터를 제공해 줄 수도 있게 된다. 따라서 본 논문에서는 대규모 데이터베이스 상에 존재하는 비디오들 중에서 유사한 데이터들에 대한 정보를 사전에 감지할 수 있는 효율적인 방법을 제안한다. 본 연구에서는 비디오들을 직접 비교하는 대신 비디오를 대표하는 요약 비디오 영상을 만들고, 주성분 분석(PCA-principle component analysis) 기법을 적용하여 저차원 특징벡터 상에 군집화를 통해 유사 비디오들을 검출하였다. 실험을 통하여 제안하는 방법의 효율성과 정확성이 우수함을 보였다.

  • PDF

Automatic Poster Generation System Using Protagonist Face Analysis

  • Yeonhwi You;Sungjung Yong;Hyogyeong Park;Seoyoung Lee;Il-Young Moon
    • Journal of information and communication convergence engineering
    • /
    • 제21권4호
    • /
    • pp.287-293
    • /
    • 2023
  • With the rapid development of domestic and international over-the-top markets, a large amount of video content is being created. As the volume of video content increases, consumers tend to increasingly check data concerning the videos before watching them. To address this demand, video summaries in the form of plot descriptions, thumbnails, posters, and other formats are provided to consumers. This study proposes an approach that automatically generates posters to effectively convey video content while reducing the cost of video summarization. In the automatic generation of posters, face recognition and clustering are used to gather and classify character data, and keyframes from the video are extracted to learn the overall atmosphere of the video. This study used the facial data of the characters and keyframes as training data and employed technologies such as DreamBooth, a text-to-image generation model, to automatically generate video posters. This process significantly reduces the time and cost of video-poster production.

Study on 3 DoF Image and Video Stitching Using Sensed Data

  • Kim, Minwoo;Chun, Jonghoon;Kim, Sang-Kyun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권9호
    • /
    • pp.4527-4548
    • /
    • 2017
  • This paper proposes a method to generate panoramic images by combining conventional feature extraction algorithms (e.g., SIFT, SURF, MPEG-7 CDVS) with sensed data from inertia sensors to enhance the stitching results. The challenge of image stitching increases when the images are taken from two different mobile phones with no posture calibration. Using inertia sensor data obtained by the mobile phone, images with different yaw, pitch, and roll angles are preprocessed and adjusted before performing stitching process. Performance of stitching (e.g., feature extraction time, inlier point numbers, stitching accuracy) between conventional feature extraction algorithms is reported along with the stitching performance with/without using the inertia sensor data. In addition, the stitching accuracy of video data was improved using the same sensed data, with discrete calculation of homograph matrix. The experimental results for stitching accuracies and speed using sensed data are presented in this paper.

시험자료 획득을 위한 영상 송수신 시스템 구현 (Implementation of Video Transmitting and Receiving System for Acquisition of Test Data)

  • 류상규
    • 한국군사과학기술학회지
    • /
    • 제20권5호
    • /
    • pp.681-687
    • /
    • 2017
  • This paper presents about an implementation of Video Transmitting and Receiving System(VTRS) for acquiring test data. The VTRS consists of two parts. The first is Transmitter Unit(TU) that is installed on a missile to acquire various kinds of data and transmit the data to the ground through RF signals. The second is Receiver Unit(RU) that receives the transmitted RF signals and reconstruct those to the original data. To gather a high speed data reliably and securely on the ground, the TU is designed by considering data transfer scheme, data compression, modulation method, encryption technic, link budget, and antenna radiation pattern. Further, a placement method of multiple receiving stations is suggested. The VTRS has been tested on a field to check the link margins and maximum receiving distance in a real environment. Finally, the VTRS is applied to a missile flight test and gathered high speed data reliably.

공유메모리를 이용한 효율적인 감시 영상 표출 방안 (A Plan of Efficient Images Display Using Shared Memory)

  • 이원재;안태기;신정렬
    • 한국철도학회:학술대회논문집
    • /
    • 한국철도학회 2011년도 정기총회 및 추계학술대회 논문집
    • /
    • pp.3306-3311
    • /
    • 2011
  • Last Subway video surveillance system consists of a network device that is used. Through the network to transmit video data to digital conversion of analog video via a process server or a PC video to a split-screen in various forms is expressed. In recent years, multi-monitor video cameras from the same pop-up or more, such as history, structure expressed on a variety of video is required by express. The problem with these systems, video compression and transmission of many cameras, and this image data received from the server or PC to take out all the images you want to watch to occur when in order to express all of the images because of the need to decode most of the program per limit of number of channels is positioned. This limited number of channels to have a video that nothing forced, but it is likely to do so in the future performance of the hardware evolves gradually channeled images available number of channels will increase proportionately. However, as the development of hardware required for a single screen video channel will be more gradual capital. The hardware rather than relying solely on the performance of the decoded video data on the screen in order to express a more efficient utilization of shared memory for video surveillance software will provide the operating plan.

  • PDF

Visual-Attention-Aware Progressive RoI Trick Mode Streaming in Interactive Panoramic Video Service

  • Seok, Joo Myoung;Lee, Yonghun
    • ETRI Journal
    • /
    • 제36권2호
    • /
    • pp.253-263
    • /
    • 2014
  • In the near future, traditional narrow and fixed viewpoint video services will be replaced by high-quality panorama video services. This paper proposes a visual-attention-aware progressive region of interest (RoI) trick mode streaming service (VA-PRTS) that prioritizes video data to transmit according to the visual attention and transmits prioritized video data progressively. VA-PRTS enables the receiver to speed up the time to display without degrading the perceptual quality. For the proposed VA-PRTS, this paper defines a cutoff visual attention metric algorithm to determine the quality of the encoded video slice based on the capability of visual attention and the progressive streaming method based on the priority of RoI video data. Compared to conventional methods, VA-PRTS increases the bitrate saving by over 57% and decreases the interactive delay by over 66%, while maintaining a level of perceptual video quality. The experiment results show that the proposed VA-PRTS improves the quality of the viewer experience for interactive panoramic video streaming services. The development results show that the VA-PRTS has highly practical real-field feasibility.

Low-Complexity MPEG-4 Shape Encoding towards Realtime Object-Based Applications

  • Jang, Euee-Seon
    • ETRI Journal
    • /
    • 제26권2호
    • /
    • pp.122-135
    • /
    • 2004
  • Although frame-based MPEG-4 video services have been successfully deployed since 2000, MPEG-4 video coding is now facing great competition in becoming a dominant player in the market. Object-based coding is one of the key functionalities of MPEG-4 video coding. Real-time object-based video encoding is also important for multimedia broadcasting for the near future. Object-based video services using MPEG-4 have not yet made a successful debut due to several reasons. One of the critical problems is the coding complexity of object-based video coding over frame-based video coding. Since a video object is described with an arbitrary shape, the bitstream contains not only motion and texture data but also shape data. This has introduced additional complexity to the decoder side as well as to the encoder side. In this paper, we have analyzed the current MPEG-4 video encoding tools and proposed efficient coding technologies that reduce the complexity of the encoder. Using the proposed coding schemes, we have obtained a 56 percent reduction in shape-coding complexity over the MPEG-4 video reference software (Microsoft version, 2000 edition).

  • PDF

Method of extracting context from media data by using video sharing site

  • Kondoh, Satoshi;Ogawa, Takeshi
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 2009년도 IWAIT
    • /
    • pp.709-713
    • /
    • 2009
  • Recently, a lot of research that applies data acquired from devices such as cameras and RFIDs to context aware services is being performed in the field on Life-Log and the sensor network. A variety of analytical techniques has been proposed to recognize various information from the raw data because video and audio data include a larger volume of information than other sensor data. However, manually watching a huge amount of media data again has been necessary to create supervised data for the update of a class or the addition of a new class because these techniques generally use supervised learning. Therefore, the problem was that applications were able to use only recognition function based on fixed supervised data in most cases. Then, we proposed a method of acquiring supervised data from a video sharing site where users give comments on any video scene because those sites are remarkably popular and, therefore, many comments are generated. In the first step of this method, words with a high utility value are extracted by filtering the comment about the video. Second, the set of feature data in the time series is calculated by applying functions, which extract various feature data, to media data. Finally, our learning system calculates the correlation coefficient by using the above-mentioned two kinds of data, and the correlation coefficient is stored in the DB of the system. Various other applications contain a recognition function that is used to generate collective intelligence based on Web comments, by applying this correlation coefficient to new media data. In addition, flexible recognition that adjusts to a new object becomes possible by regularly acquiring and learning both media data and comments from a video sharing site while reducing work by manual operation. As a result, recognition of not only the name of the seen object but also indirect information, e.g. the impression or the action toward the object, was enabled.

  • PDF

Energy-Aware Video Coding Selection for Solar-Powered Wireless Video Sensor Networks

  • Yi, Jun Min;Noh, Dong Kun;Yoon, Ikjune
    • 한국컴퓨터정보학회논문지
    • /
    • 제22권7호
    • /
    • pp.101-108
    • /
    • 2017
  • A wireless image sensor node collecting image data for environmental monitoring or surveillance requires a large amount of energy to transmit the huge amount of video data. Even though solar energy can be used to overcome the energy constraint, since the collected energy is also limited, an efficient energy management scheme for transmitting a large amount of video data is needed. In this paper, we propose a method to reduce the number of blackout nodes and increase the amount of gathered data by selecting an appropriate video coding method according to the energy condition of the node in a solar-powered wireless video sensor network. This scheme allocates the amount of energy that can be used over time in order to seamlessly collect data regardless of night or day, and selects a high compression coding method when the allocated energy is large and a low compression coding when the quota is low. Thereby, it reduces the blackout of the relay node and increases the amount of data obtained at the sink node by allowing the data to be transmitted continuously. Also, if the energy is lower than operating normaly, the frame rate is adjusted to prevent the energy exhaustion of nodes. Simulation results show that the proposed scheme suppresses the energy exhaustion of the relay node and collects more data than other schemes.

비디오 프록시 서버에서의 시간 제약 다중 요청 기법 기반 동영상 데이터 관리 (Video Data Management based on Time Constraint Multiple Access Technique in Video Proxy Server)

  • 이준표;조철영;권철희;이종순;김태영
    • 한국컴퓨터정보학회논문지
    • /
    • 제15권10호
    • /
    • pp.113-120
    • /
    • 2010
  • 본 논문에서는 비디오 프록시 서버의 제한된 저장 공간을 효율적으로 활용하기 위한 시간 제약 다중 요청 기법을 제안한다. 제안하는 기법은 요청된 동영상 데이터를 전송받아 사용자에게 전송하고 비디오 프록시 서버에 일시적으로 저장한다. 이때 일시적으로 저장된 동영상 데이터는 설정된 시간 내에서 발생되는 사용자의 요청의 상태에 따라 저장장치에서 삭제되거나 저장된다. 또한 새롭게 요청된 동영상의 저장 공간을 확보하기 위해서 저장장치에 저장되어 있는 동영상 세그먼트 중 요청 가능성이 가장 낮은 세그먼트를 선정하고 제거한다. 이를 위해 사용자에 의해 주로 요청되는 동영상 세그먼트 부분인 전방 클래스와 요청되지 않았거나 요청될 가능성이 적은 세그먼트 부분인 후방 클래스로 분리한다. 분리된 클래스 중 후방 클래스에서 가장 오래전에 요청된 세그먼트를 선정하여 삭제함으로써 제한된 공간을 효율적으로 활용한다. 실험을 통해 제안하는 방법이 기존의 방법들 보다 높은 적중률을 보이는 동시에 보다 적은 삭제 횟수를 보인다는 것을 확인한다.