• Title/Summary/Keyword: pose regression

Search Result 19, Processing Time 0.013 seconds

Head Pose Estimation Based on Perspective Projection Using PTZ Camera (원근투영법 기반의 PTZ 카메라를 이용한 머리자세 추정)

  • Kim, Jin Suh;Lee, Gyung Ju;Kim, Gye Young
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.7
    • /
    • pp.267-274
    • /
    • 2018
  • This paper describes a head pose estimation method using PTZ(Pan-Tilt-Zoom) camera. When the external parameters of a camera is changed by rotation and translation, the estimated face pose for the same head also varies. In this paper, we propose a new method to estimate the head pose independently on varying the parameters of PTZ camera. The proposed method consists of 3 steps: face detection, feature extraction, and pose estimation. For each step, we respectively use MCT(Modified Census Transform) feature, the facial regression tree method, and the POSIT(Pose from Orthography and Scaling with ITeration) algorithm. The existing POSIT algorithm does not consider the rotation of a camera, but this paper improves the POSIT based on perspective projection in order to estimate the head pose robustly even when the external parameters of a camera are changed. Through experiments, we confirmed that RMSE(Root Mean Square Error) of the proposed method improve $0.6^{\circ}$ less then the conventional method.

Object Tracking with the Multi-Templates Regression Model Based MS Algorithm

  • Zhang, Hua;Wang, Lijia
    • Journal of Information Processing Systems
    • /
    • v.14 no.6
    • /
    • pp.1307-1317
    • /
    • 2018
  • To deal with the problems of occlusion, pose variations and illumination changes in the object tracking system, a regression model weighted multi-templates mean-shift (MS) algorithm is proposed in this paper. Target templates and occlusion templates are extracted to compose a multi-templates set. Then, the MS algorithm is applied to the multi-templates set for obtaining the candidate areas. Moreover, a regression model is trained to estimate the Bhattacharyya coefficients between the templates and candidate areas. Finally, the geometric center of the tracked areas is considered as the object's position. The proposed algorithm is evaluated on several classical videos. The experimental results show that the regression model weighted multi-templates MS algorithm can track an object accurately in terms of occlusion, illumination changes and pose variations.

Study of the Gaussian Mixture Joint-Adaptive Heatmap Regression for Top-Down Human Pose Estimation (관절 적응형 Gaussian Mixture 히트맵 회귀법을 이용한 하향식 사람 자세 추정에 관한 연구)

  • Ong, Zhun-Gee;Cho, Jungchan;Choi, Sang-il
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.07a
    • /
    • pp.35-36
    • /
    • 2022
  • 본 논문은 딥러닝 사람 자세 추정 모델이 사람의 관절 키포인트를 예측하는데 관절의 2차원 면적에 의해 키포인트별 𝜎, 즉, 표준 편차를 가지는 가우시안 커널(Gaussian Kernel)을 예측하는 방법을 제안한다. 각 관절 키포인트에 대해 다른 𝜎를 가지는 정답 히트맵(Ground Truth Heatmap)과 제안한 Gaussian Mixture Block를 모델에 추가해서 관절의 크기를 맞는 히트맵을 예측한다.

  • PDF

A Study on Recognition of the Eroticism in Fashion Advertisement

  • Lim, Mi-Ae;Choi, In-Ryu
    • The International Journal of Costume Culture
    • /
    • v.12 no.1
    • /
    • pp.13-25
    • /
    • 2009
  • This research is progressed to look out for efficient expression-elements of eroticism used in advertisements. Since these expressions of eroticism appealing to sex which is one of the primitive instincts of mankind are increasing in advertisements of cosmetic products which are used more often by recent high-rate-growth and the elevation of living conditions. The most usual expression-elements of eroticism in advertisement are exposure, pose, fashion style, make up, hair style and color. To analyze those expression-elements we made four pieces of fashion advertisement photos with four different types and surveyed both fashion majored students and non-fashion majored students. We applied regression analysis, ANOVA, and frequency analysis to verify the hypothesis. We found that in eroticism, the pose was the most important cognitive feature among the expression-elements and degree of cognition are varied according to major field and sexual interest. As a result, degree of cognition which effected by expression-elements will be varied even in same advertisement. In particular, convincing that the pose was the significant factor of eroticism cognition, expression of eroticism in advertisement would be more diverse and daring.

  • PDF

A Deep Convolutional Neural Network Based 6-DOF Relocalization with Sensor Fusion System (센서 융합 시스템을 이용한 심층 컨벌루션 신경망 기반 6자유도 위치 재인식)

  • Jo, HyungGi;Cho, Hae Min;Lee, Seongwon;Kim, Euntai
    • The Journal of Korea Robotics Society
    • /
    • v.14 no.2
    • /
    • pp.87-93
    • /
    • 2019
  • This paper presents a 6-DOF relocalization using a 3D laser scanner and a monocular camera. A relocalization problem in robotics is to estimate pose of sensor when a robot revisits the area. A deep convolutional neural network (CNN) is designed to regress 6-DOF sensor pose and trained using both RGB image and 3D point cloud information in end-to-end manner. We generate the new input that consists of RGB and range information. After training step, the relocalization system results in the pose of the sensor corresponding to each input when a new input is received. However, most of cases, mobile robot navigation system has successive sensor measurements. In order to improve the localization performance, the output of CNN is used for measurements of the particle filter that smooth the trajectory. We evaluate our relocalization method on real world datasets using a mobile robot platform.

Noisy label based discriminative least squares regression and its kernel extension for object identification

  • Liu, Zhonghua;Liu, Gang;Pu, Jiexin;Liu, Shigang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.5
    • /
    • pp.2523-2538
    • /
    • 2017
  • In most of the existing literature, the definition of the class label has the following characteristics. First, the class label of the samples from the same object has an absolutely fixed value. Second, the difference between class labels of the samples from different objects should be maximized. However, the appearance of a face varies greatly due to the variations of the illumination, pose, and expression. Therefore, the previous definition of class label is not quite reasonable. Inspired by discriminative least squares regression algorithm (DLSR), a noisy label based discriminative least squares regression algorithm (NLDLSR) is presented in this paper. In our algorithm, the maximization difference between the class labels of the samples from different objects should be satisfied. Meanwhile, the class label of the different samples from the same object is allowed to have small difference, which is consistent with the fact that the different samples from the same object have some differences. In addition, the proposed NLDLSR is expanded to the kernel space, and we further propose a novel kernel noisy label based discriminative least squares regression algorithm (KNLDLSR). A large number of experiments show that our proposed algorithms can achieve very good performance.

A Study of Weighing System to Apply into Hydraulic Excavator with CNN (CNN기반 굴삭기용 부하 측정 시스템 구현을 위한 연구)

  • Hwang Hun Jeong;Young Il Shin;Jin Ho Lee;Ki Yong Cho
    • Journal of Drive and Control
    • /
    • v.20 no.4
    • /
    • pp.133-139
    • /
    • 2023
  • A weighing system calculates the bucket's excavation amount of an excavator. Usually, the excavation amount is computed by the excavator's motion equations with sensing data. But these motion equations have computing errors that are induced by assumptions to the linear systems and identification of the equation's parameters. To reduce computing errors, some commercial weighing system incorporates particular motion into the excavation process. This study introduces a linear regression model on an artificial neural network that has fewer predicted errors and doesn't need a particular pose during an excavation. Time serial data were gathered from a 30tons excavator's loading test. Then these data were preprocessed to be adjusted by MPL (Multi Layer Perceptron) or CNN (Convolutional Neural Network) based linear regression models. Each model was trained by changing hyperparameter such as layer or node numbers, drop-out rate, and kernel size. Finally ID-CNN-based linear regression model was selected.

3D Human Reconstruction from Video using Quantile Regression (분위 회귀 분석을 이용한 비디오로부터의 3차원 인체 복원)

  • Han, Jisoo;Park, In Kyu
    • Journal of Broadcast Engineering
    • /
    • v.24 no.2
    • /
    • pp.264-272
    • /
    • 2019
  • In this paper, we propose a 3D human body reconstruction and refinement method from the frames extracted from a video to obtain natural and smooth motion in temporal domain. Individual frames extracted from the video are fed into convolutional neural network to estimate the location of the joint and the silhouette of the human body. This is done by projecting the parameter-based 3D deformable model to 2D image and by estimating the value of the optimal parameters. If the reconstruction process for each frame is performed independently, temporal consistency of human pose and shape cannot be guaranteed, yielding an inaccurate result. To alleviate this problem, the proposed method analyzes and interpolates the principal component parameters of the 3D morphable model reconstructed from each individual frame. Experimental result shows that the erroneous frames are corrected and refined by utilizing the relation between the previous and the next frames to obtain the improved 3D human reconstruction result.

Facial Feature Extraction with Its Applications

  • Lee, Minkyu;Lee, Sangyoun
    • Journal of International Society for Simulation Surgery
    • /
    • v.2 no.1
    • /
    • pp.7-9
    • /
    • 2015
  • Purpose In the many face-related application such as head pose estimation, 3D face modeling, facial appearance manipulation, the robust and fast facial feature extraction is necessary. We present the facial feature extraction method based on shape regression and feature selection for real-time facial feature extraction. Materials and Methods The facial features are initialized by statistical shape model and then the shape of facial features are deformed iteratively according to the texture pattern which is selected on the feature pool. Results We obtain fast and robust facial feature extraction result with error less than 4% and processing time less than 12 ms. The alignment error is measured by average of ratio of pixel difference to inter-ocular distance. Conclusion The accuracy and processing time of the method is enough to apply facial feature based application and can be used on the face beautification or 3D face modeling.

Panel Study on the Environmental Kuznets Hypothesis in the Case of OECD 17 Countries (비정태적 패널자료를 이용한 환경 쿠즈네츠가설에 대한 실증분석 - OECD 17 개국 사례분석 -)

  • Cho, Sang-Sup;Kang, Shin-Won;Kim, Dong-Yeub
    • Environmental and Resource Economics Review
    • /
    • v.10 no.4
    • /
    • pp.619-632
    • /
    • 2001
  • The purpose of this study is to test the Kuznets Hypothesis on the relationship between environmental pollution and economic growth by using the panel data. The major results of the study can be summarized threefold as follows. First, previous studies can pose the risk of spurious regression because of the nature of non-stationery of the data used. Second, the result of the co-integration test indicates that the emission of $CO_2$ and per capita income are co-integrated. Finally, according to the results of OLS and DOLS estimation, the turning point in this study is set in far higher level of per capita income compared with those in previous studies.

  • PDF