• Title/Summary/Keyword: AI Video

Search Result 157, Processing Time 0.028 seconds

Method of Automatically Generating Metadata through Audio Analysis of Video Content (영상 콘텐츠의 오디오 분석을 통한 메타데이터 자동 생성 방법)

  • Sung-Jung Young;Hyo-Gyeong Park;Yeon-Hwi You;Il-Young Moon
    • Journal of Advanced Navigation Technology
    • /
    • v.25 no.6
    • /
    • pp.557-561
    • /
    • 2021
  • A meatadata has become an essential element in order to recommend video content to users. However, it is passively generated by video content providers. In the paper, a method for automatically generating metadata was studied in the existing manual metadata input method. In addition to the method of extracting emotion tags in the previous study, a study was conducted on a method for automatically generating metadata for genre and country of production through movie audio. The genre was extracted from the audio spectrogram using the ResNet34 artificial neural network model, a transfer learning model, and the language of the speaker in the movie was detected through speech recognition. Through this, it was possible to confirm the possibility of automatically generating metadata through artificial intelligence.

Early Termination of Block Vector Search for Fast Encoding of HEVC Screen Content Coding

  • Ma, Jonghyun;Sim, Donggyu
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.3 no.6
    • /
    • pp.388-392
    • /
    • 2014
  • This paper proposes an early termination method of a block vector search for fast encoding of high efficiency video coding (HEVC) screen content coding (SCC). In the proposed algorithm, two blocks indicated by two block vector predictors (BVPs) were first employed as an intra block copy (IBC) search. If the sum of absolute difference (SAD) value of the block is less than a threshold defined empirically, an IBC BV search is terminated early. The initial threshold for early termination is derived by statistical analysis and it can be modified adaptively based on a quantization parameter (QP). The proposed algorithm is evaluated on SCM-2.0 under all intra (AI) coding configurations. Experimental results show that the proposed algorithm reduces IBC BV search time by 29.23% on average while the average BD-rate loss is 0.41% under the HEVC SCC common test conditions (CTC).

Best Practices on Improving the Virtual Reality (VR) Content Development Process with EPIC's Unreal Engine

  • Kong, Ji Hoon;Kim, Ki Du;Kim, R. Young Chul
    • International Journal of Advanced Culture Technology
    • /
    • v.9 no.4
    • /
    • pp.417-423
    • /
    • 2021
  • Recently, in the Game industries, they are increasing to use of game engines to reduce the development cost of 3D content and software. In particular, Unreal Engine provides a blueprint visual scripting function that enables software production without programming (coding). Although High-end video content can be produced, the problem is that content development is complicated and requires advanced manpower. To solve this problem, we propose an optimized VR game context process. This is because 1) a Blueprint visual script is used, 2) VR games with various interactions can be produced, 3) Non-majors in the software field (or groups) can develop advanced content. In various related industries such as defense, medical care, manufacturing, and construction, we may easily develop any game content without programming with our refined VR rhythm action game development process. We expect to reduce the development cost with the process advantages in the game industries.

Compression method of feature based on CNN image classification network using Autoencoder (오토인코더를 이용한 CNN 이미지 분류 네트워크의 feature 압축 방안)

  • Go, Sungyoung;Kwon, Seunguk;Kim, Kyuheon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.11a
    • /
    • pp.280-282
    • /
    • 2020
  • 최근 사물인터넷(IoT), 자율주행과 같이 기계 간의 통신이 요구되는 서비스가 늘어감에 따라, 기계 임무 수행에 최적화된 데이터의 생성 및 압축에 대한 필요성이 증가하고 있다. 또한, 사물인터넷과 인공지능(AI)이 접목된 기술이 주목을 받으면서 딥러닝 모델에서 추출되는 특징(feature)을 디바이스에서 클라우드로 전송하는 방안에 관한 연구가 진행되고 있으며, 국제 표준화 기구인 MPEG에서는 '기계를 위한 부호화(Video Coding for Machine: VCM)'에 대한 표준 기술 개발을 진행 중이다. 딥러닝으로 특징을 추출하는 가장 대표적인 방법으로는 합성곱 신경망(Convolutional Neural Network: CNN)이 있으며, 오토인코더는 입력층과 출력층의 구조를 동일하게 하여 출력을 가능한 한 입력에 근사시키고 은닉층을 입력층보다 작게 구성하여 차원을 축소함으로써 데이터를 압축하는 딥러닝 기반 이미지 압축 방식이다. 이에 본 논문에서는 이러한 오토인코더의 성질을 이용하여 CNN 기반의 이미지 분류 네트워크의 합성곱 신경망으로부터 추출된 feature에 오토인코더를 적용하여 압축하는 방안을 제안한다.

  • PDF

Automatic Video Editing Application based on Climax Pattern Classified by Genre (장르별 클라이맥스 패턴 적용 자동 영상편집 어플리케이션)

  • Im, Hyejeong;Mun, Hyejun;Park, Gaeun;Lim, Yangmi
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.07a
    • /
    • pp.611-612
    • /
    • 2020
  • 최근 유튜브, 네이버와 같은 플랫폼 사업자들은 다양하고 많은 동영상확보를 위해 최대한 시간을 적게 들이고 좋은 퀄리티의 영상을 자동으로 생성해주는 어플리케이션을 개발하는데 AI 기술을 적극적으로 사용하고 있다. 가장 주도적으로 진행하는 곳은 IBM 의 왓슨의 인지하이라이트 기술이다. 관중의 함성소리와 스포츠특성 데이터들을 활용하여 하이라이트 부분의 영상만 자동 생성하고 있다. 하지만 현재까지의 기술은 인간의 감성을 자극하는 스토리 전개방식의 자동영상 생성에 있어서는 부족한 부분이 많이 존재한다.이 에 본 논문은 영화의 클라이맥스 부분의 영상편집방식을 분석하여 이에 대한 장르별 샷 사이즈 변화패턴을 시각화한 후, 장르간 편집 차이점을 패턴화한 템플릿을 구축하여 사용자의 이미지 데이터들을 장르별 클라이맥스 패턴의 특성에 맞게 추천하여 짧은 영상을 자동 생성하는 어플리케이션을 개발하였다. 향후 본 연구는 1 인 미디어 산업 및 사이버교육 분야에서 가장 많이 소요되는 영상편집 시간을 단축하는데 큰 효율이 있을 것이라 기대한다.

  • PDF

Using Ensemble Learning Algorithm and AI Facial Expression Recognition, Healing Service Tailored to User's Emotion (앙상블 학습 알고리즘과 인공지능 표정 인식 기술을 활용한 사용자 감정 맞춤 힐링 서비스)

  • Yang, seong-yeon;Hong, Dahye;Moon, Jaehyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.11a
    • /
    • pp.818-820
    • /
    • 2022
  • The keyword 'healing' is essential to the competitive society and culture of Koreans. In addition, as the time at home increases due to COVID-19, the demand for indoor healing services has increased. Therefore, this thesis analyzes the user's facial expression so that people can receive various 'customized' healing services indoors, and based on this, provides lighting, ASMR, video recommendation service, and facial expression recording service.The user's expression was analyzed by applying the ensemble algorithm to the expression prediction results of various CNN models after extracting only the face through object detection from the image taken by the user.

Block Position Adaptive Intra Mode Coding (블록 위치에 따른 적응적 화면 내 예측 모드 부호화)

  • Cheon, Muho;Kim, Bumyoon;Jeon, Byeungwoo
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.201-202
    • /
    • 2022
  • 본 논문에서는 VVC(Versatile Video Coding)의 화면 내 예측 수행 시 픽처의 좌측 상단 블록에서 고정적으로 Planar 를 사용하도록 하여 부호화 성능을 향상시킬 수 있는 방법을 제안한다. VVC 의 화면 내 예측 기술은 픽처의 좌측 상단 블록의 참조 화소가 모두 패딩되어 동일한 값을 가짐에도 불구하고 다른 블록들과 동일하게 화면 내 예측모드를 탐색 및 신호하는 비효율성을 갖는다. 본 논문에서는 이 경우 화면 내 예측 모드에 관한 탐색과 신호를 생략하고 고정적으로 Planar 모드를 사용하도록 하고, 실험을 통하여 VTM-16.0 대비 BDBR(Bjøntegaard Delta Bit Rate) 측면에서 AI(All Intra) 구성하에 Y(-0.004%), Cb(-0.010%), Cr(0.023%)의 결과를 얻을 수 있음을 보인다.

  • PDF

Trends and Development Prospects in Broadcasting Technology (방송 기술 동향 및 발전 전망)

  • J.S. Um;B.M. Lim;H.Y. Jung;S.K. Ahn;H.J. Yim;J.H. Seo
    • Electronics and Telecommunications Trends
    • /
    • v.39 no.2
    • /
    • pp.43-53
    • /
    • 2024
  • The media environment is rapidly evolving to be tailored to viewers using personal mobile devices in accordance with technological evolution and changes in social structures. Broadcast media technology is also advancing to enable new services, including data casting, in various reception environments beyond the existing fixed environment and one-way audio/video content services. In addition, technologies to increase the transmission capacity to accommodate next-generation large-capacity media content as well as communication network utilization and convergence technologies are being developed to facilitate interactive services and expand the broadcasting coverage. We discuss the current status and future prospects in broadcasting technology for terrestrial and mobile communication systems and analyze broadcasting technology elements for upcoming media environments relying on generative artificial intelligence.

A Methodology for Making Military Surveillance System to be Intelligent Applied by AI Model (AI모델을 적용한 군 경계체계 지능화 방안)

  • Changhee Han;Halim Ku;Pokki Park
    • Journal of Internet Computing and Services
    • /
    • v.24 no.4
    • /
    • pp.57-64
    • /
    • 2023
  • The ROK military faces a significant challenge in its vigilance mission due to demographic problems, particularly the current aging population and population cliff. This study demonstrates the crucial role of the 4th industrial revolution and its core artificial intelligence algorithm in maximizing work efficiency within the Command&Control room by mechanizing simple tasks. To achieve a fully developed military surveillance system, we have chosen multi-object tracking (MOT) technology as an essential artificial intelligence component, aligning with our goal of an intelligent and automated surveillance system. Additionally, we have prioritized data visualization and user interface to ensure system accessibility and efficiency. These complementary elements come together to form a cohesive software application. The CCTV video data for this study was collected from the CCTV cameras installed at the 1st and 2nd main gates of the 00 unit, with the cooperation by Command&Control room. Experimental results indicate that an intelligent and automated surveillance system enables the delivery of more information to the operators in the room. However, it is important to acknowledge the limitations of the developed software system in this study. By highlighting these limitations, we can present the future direction for the development of military surveillance systems.

A neck healthy warning algorithm for identifying text neck posture prevention (거북목 자세를 예방하기 위한 목 건강 경고 알고리즘)

  • Jae-Eun Lee;Jong-Nam Kim;Hong-Seok Choi;Young-Bong Kim
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.23 no.3
    • /
    • pp.115-122
    • /
    • 2022
  • With the outbreak of COVID-19 a few years ago, video conferencing and electronic document work have increased, and for this reason, the proportion of computer work among modern people's daily routines is increasing. However, as more and more people work on computers in the wrong posture for a long time, the number of patients with poor eyesight and text neck is increasing. Until recently, many studies have been published to correct posture, but most of them have limitations that users may experience discomfort because they have to correct posture by wearing equipment. A posture correction sensor algorithm is proposed to prevent access to the minimum distance between a computer monitor and a person using an ultrasonic sensor device. At this time, an algorithm for minimizing false alarms among warning alarms that sound at the minimum distance is also proposed. Because the ultrasonic sensor device is used, posture correction can be performed without attaching a device to the body, and the user can relieve discomfort. In addition, experimental results showed that accuracy can be improved by reducing false alarms by removing more than half of the noise generated during distance measurement.