• Title/Summary/Keyword: Video Identification

Search Result 177, Processing Time 0.028 seconds

Object Tracking Algorithm using Feature Map based on Siamese Network (Siamese Network의 특징맵을 이용한 객체 추적 알고리즘)

  • Lim, Su-Chang;Park, Sung-Wook;Kim, Jong-Chan;Ryu, Chang-Su
    • Journal of Korea Multimedia Society
    • /
    • v.24 no.6
    • /
    • pp.796-804
    • /
    • 2021
  • In computer vision, visual tracking method addresses the problem of localizing an specific object in video sequence according to the bounding box. In this paper, we propose a tracking method by introducing the feature correlation comparison into the siamese network to increase its matching identification. We propose a way to compute location of object to improve matching performance by a correlation operation, which locates parts for solving the searching problem. The higher layer in the network can extract a lot of object information. The lower layer has many location information. To reduce error rate of the object center point, we built a siamese network that extracts the distribution and location information of target objects. As a result of the experiment, the average center error rate was less than 25%.

YOLO-based Video Non-identification Tool Development (YOLO기반 영상 비식별화 도구 개발)

  • Shin, Hyeong-Hwan;Park, Sung-Wan;Park, Sang-Hyun;Oh, Chi-Min;Kim, Seungwon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.11a
    • /
    • pp.875-877
    • /
    • 2021
  • 영상 매체의 발달과 영상 미디어의 쉬운 공유는 많은 이점을 가지고 왔다. 하지만 영상이 인터넷 상에서 쉽게 공유되면서 개인이 원치 않는 모습 및 정보가 자신도 모르게 공개되는 초상권 문제나 사생활 침해 문제가 발생하고 있다. 이를 막기 위해 영상의 인물을 비식별화 하고 있지만 수작업으로 진행되는 영상의 비식별화는 많은 시간과 비용이 들어간다. 이에 본 논문에서는 자동으로 영상의 인물을 탐지, 추적하여 비식별화 영상처리를 진행할 수 있는 YOLO 기반 비식별화 시스템을 제안한다.

Scene extraction technology on deep learning for media production (미디어 제작을 위한 씬 검출 기법)

  • Song, Hyok;Ko, Min-Soo;Yoo, Jisang
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.184-185
    • /
    • 2022
  • 인터넷 환경의 변화에 따라 텍스트 기반의 정보 전달에서 멀티미디어 기반의 스트리밍 방식으로 바뀌어가고 있다. 또한 대용량의 동영상 데이터뿐 아니라 Shorts, Clip Reels 또는 등 다양한 방식의 동영상 형태로 배포되고 있으며 서비스 플랫폼에서는 손쉽게 편집할 수 있도록 기능을 제공하고 있다. 대용량 콘텐츠, TV, Youtue 콘텐츠를 포함하여 소용량 동영상 편집에 필요한 영상 제작 기술에서 가장 인력과 시간이 많이 소요되는 부분은 편집 단계로 딥러닝 기반 인공지능 기술을 활용하여 자동화하고 있으며 영상편집에서 가장 기본이 되는 단위인 씬검출 기법을 개발하였다. 키프레임 검출 기법과 유사도 기법을 이용하여 씬을 추출하였으며 블록 Cost Function을 이용하여 최적화하여 0.5214의 정확도를 도출하였다.

  • PDF

The depth quality enhancement algorithm for Autostereoscopic 3D Monitor (무안경 3D 모니터를 위한 Depth 화질 향상 Algorithm)

  • Song, Sung-Ho;Lee, Kyoung-Il;Lee, Dong-Ha;Park, Jong-Cheol;Lee, Jea-Jun;Kim, Young-Kil
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2012.05a
    • /
    • pp.133-136
    • /
    • 2012
  • In this paper, we found the many effective ways and apply for improve the 3D quality of Autostereoscopic 3D display products. Autostereoscopic products compared to traditional 3D glasses, the disadvantage is the poor depth of 3D picture quality and it only can see the fixed distance and position. So, for the compensate this disadvantage, we use the Head tracking technology and video placement algorithms and several techniques. In this paper, the will report on how to improve the Parallax Barrier Autostereoscopic 3D quality through the Head tracking of the user identification, video replacement algorithms and crosstalk improving method.

  • PDF

Three Dimensional Tracking of Road Signs based on Stereo Vision Technique (스테레오 비전 기술을 이용한 도로 표지판의 3차원 추적)

  • Choi, Chang-Won;Choi, Sung-In;Park, Soon-Yong
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.20 no.12
    • /
    • pp.1259-1266
    • /
    • 2014
  • Road signs provide important safety information about road and traffic conditions to drivers. Road signs include not only common traffic signs but also warning information regarding unexpected obstacles and road constructions. Therefore, accurate detection and identification of road signs is one of the most important research topics related to safe driving. In this paper, we propose a 3-D vision technique to automatically detect and track road signs in a video sequence which is acquired from a stereo vision camera mounted on a vehicle. First, color information is used to initially detect the sign candidates. Second, the SVM (Support Vector Machine) is employed to determine true signs from the candidates. Once a road sign is detected in a video frame, it is continuously tracked from the next frame until it is disappeared. The 2-D position of a detected sign in the next frame is predicted by the 3-D motion of the vehicle. Here, the 3-D vehicle motion is acquired by using the 3-D pose information of the detected sign. Finally, the predicted 2-D position is corrected by template-matching of the scaled template of the detected sign within a window area around the predicted position. Experimental results show that the proposed method can detect and track many types of road signs successfully. Tracking comparisons with two different methods are shown.

An Efficient Car Management System based on an Object-Oriented Modeling using Car Number Recognition and Smart Phone (자동차 번호판 인식 및 스마트폰을 활용한 객체지향 설계 기반의 효율적인 차량 관리 시스템)

  • Jung, Se-Hoon;Kwon, Young-Wook;Sim, Chun-Bo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.7 no.5
    • /
    • pp.1153-1164
    • /
    • 2012
  • In this paper, we propose an efficient car management system based on object-oriented modeling using car number recognition and smart phone. The proposed system perceives car number of repair vehicle after recognizing the licence plate using an IP camera in real time. And then, existing repair history information of the recognized car is be displayed in DID. In addition, maintenance process is shooting video while auto maintenance mechanic repairs car through IP-camera. That will be provide customer car identification and repairs history management function by sending key frames extracted from recorded video automatically. We provide user graphic interface based on web and mobile for your convenience. The module design of the proposed system apply software design modeling based on granular object-oriented considering reuse and extensibility after implementation. Car repairs center and maintenance companies can improve business efficiency, as well as the requested vehicle repair can increase customer confidence.

Copyright Protection for the Video image with Coded Watermarking (암호화 워터마킹을 사용한 비디오 영상의 저작권 보호)

  • Park, Young;Kim, Hang-Rae;Rhu, Ho-Joon;Kim, Jae-Won
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2003.11a
    • /
    • pp.120-123
    • /
    • 2003
  • In this paper, a digital watermarking scheme whichis effective in protecting a copyright of video image under an image transformation and impulse noise is proposed. The proposing scheme is to use a coded watermark that insert the personal ID of copyrighter. The recovery ability is improved by the coded watermark. Also the coded watermark is abel to trace the illegal distributors. Binary image is used as watermark image, the value of PSNR and recovered rates of watermark are obtained in order to confirm the required invisibility and robustness in watermark system. The experimental results show that image quality is less degraded as the PSNR of 98.21 ㏈. It is also observed that excellent watermark recovery is achieved under the image transformation and impulse noise.

  • PDF

A Study on ERP and Behavior Responses in Emotion Regulation (정서조절에 관한 Event related potentials 및 행동학적 반응 연구)

  • Seo, Ssang-Hee
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.10
    • /
    • pp.5003-5011
    • /
    • 2013
  • This paper measured whether neural and behavior responses to attention-emotion task were reflected to emotion regulation capacities. For this purpose, Nineteen healthy right-handed graduates participated in the emotion-attention task three times for three days. Before and after the negative and positive video clips were shown, the participants performed emotion-attention task. EEG and response time were recorded during emotion-attention task. There was positive correlation between ERP P100 and P300 component. The larger the P100 amplitudes at the specific positions, the longer the P300 latencies at these same positions during attention-emotion task. The longer the P300 latencies at the specific positions, the longer the delay in response time. Also, there is and individual differences in ERP components and response time during attention-emotion integration task. Individuals who had lower amplitude and shorter latency of ERP showed faster response time during attention-emotion task, regardless of the type of video clips. This characteristic was interpreted to the lower emotional controls due to premature response for target identification.

Identification of Design Attributes of the Affective Expressions for Movie Making (영화의 감성만족도 측정을 위한 시.청각적 영향 요인의 체계적 도출)

  • Kim, In-Ki;Kim, Ji-Ho;Chang, Woo-Jin;Lee, Cheol;Yun, Myung-Hwan
    • 한국HCI학회:학술대회논문집
    • /
    • 2007.02a
    • /
    • pp.143-149
    • /
    • 2007
  • 영상은 동적인 시각 이미지와 청각의 결합에 의해 감성적인 반응을 유도한다. 다양한 영상 기법을 통하여 감성적 반응의 극대화를 추구하는 영화는 영상의 시청각적 요소들을 감성의 관점에서 효과적으로 설계하는데 본보기가 된다. 그러나, 제품의 설계속성들에 대한 감성적 평가결과를 모형화하는 감성공학적 관점에서 볼 때 영화는 시청각적 자극의 수준이 극히 다양하고 동적인 경험재로 모형화의 어려움이 있다. 본 연구에서는 영화의 감성 모형을 구축하기 위한 사전연구의 단계로 영화에서의 시청각적 요인들을 문헌조사를 통해 수집, 정리, 선별하고 이러한 시청각적 요인들 중에 영화를 관람하는 관객의 감성적, 인지적 반응에 영향을 주는 유효한 요인들을 객관적이고 체계적으로 탐색하고자 하였다. 이를 위해, 감성 및 인지적 반응의 변화를 생체신호를 통해 측정하는 한편, 생체신호의 측정 시 사용된 영화의 시청각적 자극요인을 Video/Audio Processing방법에 의해 연속적인 수치로 정량화하였다. 생체신호와 정량화된 시청각적 자극요인을 동기화하고 통계적으로 분석함으로써, 생체신호의 반응과 시청각적 자극요인과의 인과관계를 통계적으로 신뢰성있는 수준에서 검증하고자 하였다. 생체신호를 종속변수로, 시청각적 자극요인을 독립변수로 하는 896개의 부분선형회귀모형(Partial Linear Regression Model)들 중 통계적으로 유의한 선형관계에 있는 경우의 빈도분석에 의하면, 시각적 요인들 중에는 밝기(Brightness), 대비(Contrast), 색상(Color), 움직임(Motion), 장면전환속도(Shot change Rate), 주요대상의 상대적 크기가, 청각적 요인들 중에는 Peak주파수, Peak주파수의 음량, 평균음량, 소음비(Sound-to-Noise Ratio)가 생체신호의 변화에 통계적으로 유의한 영향을 주는 것으로 나타났다. 이는, 위의 시청각적 자극 요인들은 특히 관객의 감성 및 인지적인 반응에 유의한 영향을 주는 요소로 작용할 수 있음을 시사하고 있다. 이를 토대로, 위의 시청각적 자극 요인들이 가지는 다양한 조합들을 설명변수로 하는 통계적인 영화의 감성 모형을 구축할 수 있을 것으로 기대한다.

  • PDF

Efficient Frame Synchronization Detector and Low Complexity Automatic Gain Controller for DVB-S2 (효율적인 디지털 위성 방송 프레임 동기 검출 회로 및 낮은 복잡도의 자동 이득 제어 회로)

  • Choi, Jin-Kyu;Sunwoo, Myung-Hoon;Kim, Pan-Soo;Chang, Dae-Ig
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.46 no.2
    • /
    • pp.31-37
    • /
    • 2009
  • This paper presents an efficient frame synchronization strategy with the identification of modulation type for Digital Video Broadcasting-Satellite second generation (DVB-S2). To detect the Start Of Frame (SOF) and identify a modulation mode at low SNR, we propose a new correlator structure and a low complexity Automatic Gain Controller (AGC). The proposed frame synchronization architecture can reduce about 93% multipliers and 89% adders compared with the direct implementation of the Differential - Generalized Post Detection Integration (D-GPDI) algorithm which is very complex and the proposed a low complexity AGC consists of only 5 multipliers and 3 adders. The proposed architecture has been thoroughly verified on the Xilinx Virtex II FPGA board.