• Title/Summary/Keyword: Video Software Method


Acquiring 3-dimensional data of a human face using a laser slit-ray projection method

  • Ishimatsu, T.;Taguchi, N.;Kawasue, K.;Kumon, K.
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집)
    • /
    • 1988.10b
    • /
    • pp.816-821
    • /
    • 1988
  • This paper describes a system for fast 3-dimensional measurement of a human face using a slit-ray projection method. One distinctive feature of the system is a real-time video signal processor, which reduces the amount of image data to be processed and enables fast measurement. Another feature is its dedicated calibration software, which frees operators from cumbersome setup of the measuring system.


A Study on Implementation for Real-time Lane Departure Warning System & Smart Night Vision Based on HDR Camera Platform (실시간 차선 이탈 경고 및 Smart Night Vision을 위한 HDR Camera Platform 구현에 관한 연구)

  • Park, Hwa-Beom;Park, Ge-O;Kim, Young-kil
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2017.05a
    • /
    • pp.123-126
    • /
    • 2017
  • Recent advances in information and communication technology have greatly influenced the automobile market. In recent years, IT-equipped devices have been installed in vehicles for the safety and convenience of the driver. These devices increase convenience, but they also increase traffic accidents caused by driver distraction. Preventing such accidents requires safety systems of various types. In this paper, we propose a method for implementing a multi-function camera-based driving safety system that issues pedestrian and lane departure warnings without using a radar sensor or stereo video, and we analyze the results of the lane departure warning software.


Audio and Video Bimodal Emotion Recognition in Social Networks Based on Improved AlexNet Network and Attention Mechanism

  • Liu, Min;Tang, Jun
    • Journal of Information Processing Systems
    • /
    • v.17 no.4
    • /
    • pp.754-771
    • /
    • 2021
  • In continuous dimensional emotion recognition, the parts that highlight emotional expression differ between modalities, and each modality influences the emotional state differently. This paper therefore studies the fusion of the two most important modalities in emotion recognition (voice and visual expression) and proposes a bimodal emotion recognition method that combines an improved AlexNet network with an attention mechanism. After simple preprocessing of the audio and video signals, audio features are first extracted using prior knowledge. Facial expression features are then extracted by the improved AlexNet network. Finally, a multimodal attention mechanism fuses the facial expression and audio features, and an improved loss function is used to address the missing-modality problem, improving the robustness of the model and the performance of emotion recognition. Experimental results show that the concordance correlation coefficients of the proposed model in the arousal and valence dimensions were 0.729 and 0.718, respectively, which are superior to those of several comparison algorithms.
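
The concordance correlation coefficient reported above can be illustrated with a minimal sketch of Lin's standard definition. The function name `concordance_cc` and the toy inputs are assumptions made for illustration; the abstract does not describe the authors' evaluation code.

```python
def concordance_cc(pred, gold):
    """Lin's concordance correlation coefficient for two equal-length sequences.

    Hypothetical helper for illustration; not taken from the paper.
    """
    n = len(pred)
    if n != len(gold) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_p = sum(pred) / n
    mean_g = sum(gold) / n
    # Population (1/n) variances and covariance.
    var_p = sum((p - mean_p) ** 2 for p in pred) / n
    var_g = sum((g - mean_g) ** 2 for g in gold) / n
    cov = sum((p - mean_p) * (g - mean_g) for p, g in zip(pred, gold)) / n
    denom = var_p + var_g + (mean_p - mean_g) ** 2
    if denom == 0:
        raise ValueError("both sequences are constant and identical")
    return 2 * cov / denom


if __name__ == "__main__":
    # A constant offset lowers the score even though the correlation is perfect.
    print(concordance_cc([2, 3, 4], [1, 2, 3]))
```

Unlike Pearson's correlation, this measure penalizes a shift or scale mismatch between predictions and annotations, which is why it is a common choice for arousal and valence regression.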

Non-fixed Quantization Considering Entropy Encoding in HEVC (HEVC 엔트로피 부호화를 고려한 비균등 양자화 방법)

  • Gweon, Ryeong-Hee;Han, Woo-Jin;Lee, Yung-Lyul
    • Journal of Broadcast Engineering
    • /
    • v.16 no.6
    • /
    • pp.1036-1046
    • /
    • 2011
  • MPEG and VCEG formed a collaborative team called JCT-VC (Joint Collaborative Team on Video Coding) and have been developing the HEVC (High Efficiency Video Coding) standard. In the quantization and inverse quantization method used in the HEVC standard, all transform coefficients in a TU (Transform Unit) are quantized equally. Such uniform quantization is not efficient because the transform coefficients in the TU are not equally distributed. Furthermore, quantized coefficients positioned later in the scanning order are coded less efficiently because of the entropy coding scan. We propose an algorithm that quantizes transform coefficients with different values according to their position in the TU, taking the scanning order of entropy encoding into account, to improve coding efficiency. The principle of the algorithm is that quantization and inverse quantization follow the scanning order, which reflects the statistical distribution of the quantized transform coefficients. The proposed algorithm achieves an average Y BD-rate improvement of 0.34%.

Generating Augmented Lifting Player using Pose Tracking

  • Choi, Jong-In;Kim, Jong-Hyun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.5
    • /
    • pp.19-26
    • /
    • 2020
  • This paper proposes a framework for creating acrobatic scenes, such as soccer ball lifting, from videos of various users. The proposed method can generate the desired result within a few seconds from an ordinary video of the user recorded with a mobile phone. The framework is divided into three parts. The first analyzes the user's posture from the input video: the pose is estimated with a deep learning technique, and the movement of a selected body part is tracked. The second analyzes the movement trajectory of the selected body part and calculates the location and time at which it hits the object. Finally, the trajectory of the object is generated from the analyzed hitting information, producing natural object-lifting scenes synchronized with the user's input video. Physically based optimization is used to generate a realistically moving object. The method can be used to produce various augmented reality applications.
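
As a rough illustration of generating an object trajectory between two analyzed hits, the sketch below uses a closed-form ballistic arc rather than the physically based optimization mentioned in the abstract. The function names, the 2D coordinates in meters, and the gravity vector are all assumptions for illustration.

```python
def launch_velocity(p0, p1, duration, gravity=(0.0, -9.81)):
    """Initial velocity that carries a ballistic object from p0 to p1 in `duration` seconds.

    Solves p1 = p0 + v0*T + 0.5*g*T^2 for v0, per axis (hypothetical helper).
    """
    if duration <= 0:
        raise ValueError("duration must be positive")
    return tuple(
        (b - a) / duration - 0.5 * g * duration
        for a, b, g in zip(p0, p1, gravity)
    )


def position_at(p0, v0, t, gravity=(0.0, -9.81)):
    """Position of the object t seconds after leaving p0 with velocity v0."""
    return tuple(a + v * t + 0.5 * g * t * t for a, v, g in zip(p0, v0, gravity))


if __name__ == "__main__":
    # Two consecutive hits one second apart, one meter to the right.
    v0 = launch_velocity((0.0, 0.0), (1.0, 0.0), 1.0)
    for step in range(5):
        print(position_at((0.0, 0.0), v0, step * 0.25))
```

Chaining one such arc per pair of consecutive hit events gives a trajectory that meets the tracked body part at each hit time.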

Motion Estimation Method by Using Depth Camera (깊이 카메라를 이용한 움직임 추정 방법)

  • Kwon, Soon-Kak;Kim, Seong-Woo
    • Journal of Broadcast Engineering
    • /
    • v.17 no.4
    • /
    • pp.676-683
    • /
    • 2012
  • Motion estimation in video coding greatly affects implementation complexity. In this paper, we propose a method that reduces the complexity of motion estimation by using both depth and color cameras. Object information for the video sequence is obtained from the distance information calculated by the depth camera, and pixels at similar distances are labeled and grouped as the same object. Three search regions (background, inside-object, and boundary) are determined adaptively for each motion estimation block in the current and reference pictures. If the current block lies in the inside-object region, motion is searched within the inside-object region of the reference picture. Likewise, if the current block lies in the background region, motion is searched within the background region of the reference picture. Simulation results show that the proposed method yields almost the same motion-estimated difference signal as the full search method while significantly reducing search complexity.
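
The region-restricted search described above can be sketched as block matching that skips candidates whose region differs from that of the current block. This is a minimal sketch under several assumptions: frames are 2D lists of luma values, the depth labeling has already been reduced to a 0/1 object mask, the cost is the sum of absolute differences, and boundary blocks fall back to a full search (the abstract does not say how they are handled). All function names are hypothetical.

```python
def block_region(labels, top, left, size):
    """Classify a block as 'background', 'inside' or 'boundary' from a 0/1 mask."""
    values = {
        labels[y][x]
        for y in range(top, top + size)
        for x in range(left, left + size)
    }
    if values == {0}:
        return "background"
    if values == {1}:
        return "inside"
    return "boundary"


def sad(cur, ref, cur_top, cur_left, ref_top, ref_left, size):
    """Sum of absolute differences between a current and a reference block."""
    return sum(
        abs(cur[cur_top + y][cur_left + x] - ref[ref_top + y][ref_left + x])
        for y in range(size)
        for x in range(size)
    )


def search_motion(cur, ref, cur_labels, ref_labels, top, left, size=2, radius=2):
    """Return (dy, dx, cost, candidates_tested), or None if no candidate qualifies."""
    height, width = len(ref), len(ref[0])
    region = block_region(cur_labels, top, left, size)
    best = None
    tested = 0
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            ref_top, ref_left = top + dy, left + dx
            if ref_top < 0 or ref_left < 0:
                continue
            if ref_top + size > height or ref_left + size > width:
                continue
            # Skip candidates lying in a different region (assumed rule).
            if region != "boundary" and block_region(ref_labels, ref_top, ref_left, size) != region:
                continue
            tested += 1
            cost = sad(cur, ref, top, left, ref_top, ref_left, size)
            if best is None or cost < best[2]:
                best = (dy, dx, cost)
    return None if best is None else best + (tested,)
```

The `candidates_tested` count makes the saving visible: a block inside a small object typically evaluates only a handful of positions instead of all `(2 * radius + 1) ** 2`.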

Spatiotemporal Saliency-Based Video Summarization on a Smartphone (스마트폰에서의 시공간적 중요도 기반의 비디오 요약)

  • Lee, Won Beom;Williem, Williem;Park, In Kyu
    • Journal of Broadcast Engineering
    • /
    • v.18 no.2
    • /
    • pp.185-195
    • /
    • 2013
  • In this paper, we propose a video summarization technique for smartphones based on spatiotemporal saliency. The proposed technique detects scene changes by computing the difference between color histograms, which is robust to camera and object motion. The similarity between adjacent frames, the face region, and the frame saliency are then computed to analyze the spatiotemporal saliency of a video clip. An over-segmented hierarchical tree is created from the scene changes and is updated iteratively using mergence and maintenance energies computed during the analysis. In the updated hierarchical tree, segmented frames are extracted by applying a greedy algorithm to nodes with high saliency, subject to the reduction ratio and the minimum interval requested by the user. Experimental results show that the proposed method summarizes a 2-minute video in about 10 seconds on a commercial smartphone. The summarization quality is superior to that of the commercial video editing software Muvee.
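
The histogram-based scene change detection step can be sketched as follows. The bin count, the L1 distance, the threshold, and the representation of a frame as a list of (r, g, b) tuples are assumptions for illustration; the abstract does not give the authors' exact measure.

```python
def color_histogram(frame, bins=4):
    """Normalized joint RGB histogram of a frame given as (r, g, b) tuples in 0..255."""
    hist = [0] * (bins ** 3)
    count = 0
    for r, g, b in frame:
        # Map each channel value to a bin index in 0..bins-1.
        index = (r * bins // 256) * bins * bins + (g * bins // 256) * bins + (b * bins // 256)
        hist[index] += 1
        count += 1
    if count == 0:
        raise ValueError("empty frame")
    return [c / count for c in hist]


def histogram_distance(h1, h2):
    """Half the L1 distance between two normalized histograms, in the range 0..1."""
    return 0.5 * sum(abs(a - b) for a, b in zip(h1, h2))


def detect_scene_changes(frames, threshold=0.5):
    """Indices i where frame i differs strongly from frame i-1 (hypothetical threshold)."""
    hists = [color_histogram(frame) for frame in frames]
    return [
        i
        for i in range(1, len(hists))
        if histogram_distance(hists[i - 1], hists[i]) > threshold
    ]
```

Because a histogram discards pixel positions, an object that moves within the frame leaves the distance near zero, which matches the robustness to camera and object motion noted in the abstract.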

Fast Distributed Network File System using State Transition Model in the Media Streaming System (미디어 스트리밍 시스템에서의 상태 천이 모델을 활용한 고속 분산 네트워크 파일 시스템)

  • Woo, Soon;Lee, Jun-Pyo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.17 no.6
    • /
    • pp.145-152
    • /
    • 2012
  • Because streaming media files are large, previous delivery techniques do not provide optimal performance. A video proxy server is therefore employed to reduce bandwidth consumption, network congestion, and network traffic. This paper proposes a fast distributed network file system that uses a state transition model in the media streaming system to utilize the video proxy server efficiently. The proposed method consists of three steps: (1) training using the state transition model, (2) generation of base and decision probabilities, and (3) probability-based storing and deletion. In addition, the storage space of the video proxy server is divided into segment areas in order to store segments efficiently and avoid fragmentation. Simulation results show that the proposed method performs better than other methods in terms of hit rate and number of deletions. The proposed method therefore provides the lowest user start-up latency and the highest bandwidth saving.

Half-Pixel Correction for MPEG-2/H.264 Transcoding (DCT 기반 MPEG-2/H.264 변환을 위한 1/2 화소 보정)

  • Kwon Soon-young;Lee Joo-kyong;Chung Ki-dong
    • Journal of KIISE:Software and Applications
    • /
    • v.32 no.10
    • /
    • pp.956-962
    • /
    • 2005
  • To improve video quality and coding efficiency, H.264/AVC adopts a half-pixel calculation method different from that of previous standards. A transcoder therefore requires additional work to transcode video content pre-coded with a previous standard to H.264/AVC in the DCT domain. In this paper, we propose the first half-pixel correction method for MPEG-2 to H.264 transcoding in the DCT domain. In the proposed method, the MPEG-2 block is added to a correction block obtained by calculating the difference between the half-pixel values of the two standards using a DCT reference frame. Experimental results show that the proposed method achieves better quality than the pixel-based cascaded transcoding method.
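
The difference between the two half-pixel calculations can be illustrated in the pixel domain. MPEG-2 averages the two neighboring samples, whereas H.264/AVC applies the 6-tap filter (1, -5, 20, 20, -5, 1)/32 for luma half-sample positions. The sketch below works on a single row of 8-bit samples and is only an illustration: the paper performs the correction in the DCT domain, and the function names and sample row are assumptions.

```python
H264_TAPS = (1, -5, 20, 20, -5, 1)


def half_pel_mpeg2(row, i):
    """Bilinear half-sample between row[i] and row[i + 1] with rounding."""
    return (row[i] + row[i + 1] + 1) >> 1


def half_pel_h264(row, i):
    """6-tap half-sample between row[i] and row[i + 1], clipped to 8 bits.

    Samples outside the row are replaced by the nearest edge sample.
    """
    last = len(row) - 1
    acc = sum(
        tap * row[min(last, max(0, i - 2 + k))]
        for k, tap in enumerate(H264_TAPS)
    )
    return min(255, max(0, (acc + 16) >> 5))


def half_pel_correction(row, i):
    """Value to add to the MPEG-2 half-sample to obtain the H.264 half-sample."""
    return half_pel_h264(row, i) - half_pel_mpeg2(row, i)


if __name__ == "__main__":
    row = [10, 20, 40, 80, 160, 80, 40, 20]
    print(half_pel_mpeg2(row, 3), half_pel_h264(row, 3), half_pel_correction(row, 3))
```

On flat or slowly varying content the two interpolations agree, so the correction is mostly nonzero around edges and peaks.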

The Examination of Reliability of Lower Limb Joint Angles with Free Software ImageJ

  • Kim, Heung Youl
    • Journal of the Ergonomics Society of Korea
    • /
    • v.34 no.6
    • /
    • pp.583-595
    • /
    • 2015
  • Objective: The purpose of this study was to determine the reliability of lower limb joint angles computed with the software ImageJ during jumping movements. Background: Kinematics is the study of bodies in motion without regard to the forces or torques that may produce the motion. The most common method for collecting motion data uses an imaging and motion-capture system to record the 2D or 3D coordinates of markers attached to a moving object, followed by manual or automatic digitizing software. In particular, passive optical motion capture systems (e.g., the Vicon system) have been regarded as the gold standard for collecting motion data. ImageJ, on the other hand, is free software widely used for image analysis, and it can collect the 2D coordinates of markers. Although much research has used ImageJ, little is known about its reliability. Method: Seven healthy female students participated as subjects in this study. Seventeen reflective markers were attached to the right and left lower limbs to measure two- and three-dimensional joint angular motions. Jump performance was recorded by a ten-camera Vicon system (250 Hz) and one digital video camera (240 Hz). The joint angles of the ankle and knee joints were calculated using 2D (ImageJ) and 3D (Vicon-MX) motion data, respectively. Results: Pearson's correlation coefficients between the two methods were calculated, and significance tests were conducted ($\alpha = 1\%$). Correlation coefficients between the two methods were over 0.98. A validity examination using the Bland-Altman method found no systematic error between Vicon-MX and ImageJ, and all data were within the 95% limits of agreement. Conclusion: In this study, the correlation coefficients were generally high, and the regression line was close to the line of identity. Motion analysis using ImageJ is therefore considered a useful tool for evaluating human movements in various research areas. Application: This result can be utilized as a practical tool to analyze human performance in various fields.
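
The Bland-Altman check mentioned in the results can be sketched with the standard formulation: the bias is the mean of the paired differences, and the 95% limits of agreement are the bias plus or minus 1.96 sample standard deviations. The function names and the joint-angle values below are made up for illustration and are not data from the study.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower_limit, upper_limit) for paired measurements."""
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need two paired sequences of length >= 2")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - spread, bias + spread


def fraction_within_limits(method_a, method_b):
    """Share of paired differences that fall inside the 95% limits of agreement."""
    _, lower, upper = bland_altman(method_a, method_b)
    diffs = [a - b for a, b in zip(method_a, method_b)]
    return sum(lower <= d <= upper for d in diffs) / len(diffs)


if __name__ == "__main__":
    # Hypothetical knee angles in degrees from a 2D and a 3D measurement.
    angles_2d = [10.0, 20.0, 30.0, 40.0]
    angles_3d = [9.0, 21.0, 29.0, 41.0]
    print(bland_altman(angles_2d, angles_3d))
    print(fraction_within_limits(angles_2d, angles_3d))
```

A bias close to zero indicates no systematic offset between the two measurement methods, which corresponds to the abstract's statement that no systematic error was found.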