• Title/Summary/Keyword: keypoints

Search Result 71, Processing Time 0.021 seconds

Improved Triangle Keypoints matching system for efficient generation (효율적인 계산을 위한 개선된 삼각형 닮음 조건 기반 영상 간 유사 공간 계산 알고리즘)

  • Lee, Inhong;Kang, Jeonho;Nam, Kwijung;Kim, KyuHeon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.07a
    • /
    • pp.236-238
    • /
    • 2020
  • 기존에 개발한 삼각형 닮음 조건 기반 영상 간 유사 공간 계산 알고리즘은 근접 거리에 과도하게 많은 특징점이 추출되면 정확도가 낮아지는 점, 계산 과정에서의 Threshold를 주관적으로 설정해 주어야 해 정확한 Threshold를 찾기 위하여 전체 알고리즘을 여러번 반복하여 실행시켜야 하는 점에서 비효율적인 측면이 있다. 이를 해결하기 위하여 본 논문에서는 기존의 삼각형 닮음 조건 기반 영상 간 유사 공간 계산 알고리즘에 근접 거리 내의 특징점을 제거하는 알고리즘과 서로 다른 Threshold를 가진 유사 공간 계산 알고리즘들을 병렬적으로 계산해 한 번의 알고리즘 실행만으로 자동적으로 적절한 Threshold를 찾을 수 있도록 하는 모듈을 추가하여 기존의 알고리즘과 비교하여 더 효율적으로 영상 간 유사 공간을 계산해낼 수 있도록 개선된 삼각형 닮음 조건 기반 영상 간 유사 공간 계산 알고리즘을 제안한다.

  • PDF

Comparison of Fall Detection Systems Based on YOLOPose and Long Short-Term Memory

  • Seung Su Jeong;Nam Ho Kim;Yun Seop Yu
    • Journal of information and communication convergence engineering
    • /
    • v.22 no.2
    • /
    • pp.139-144
    • /
    • 2024
  • In this study, four types of fall detection systems - designed with YOLOPose, principal component analysis (PCA), convolutional neural network (CNN), and long short-term memory (LSTM) architectures - were developed and compared in the detection of everyday falls. The experimental dataset encompassed seven types of activities: walking, lying, jumping, jumping in activities of daily living, falling backward, falling forward, and falling sideways. Keypoints extracted from YOLOPose were entered into the following architectures: RAW-LSTM, PCA-LSTM, RAW-PCA-LSTM, and PCA-CNN-LSTM. For the PCA architectures, the reduced input size stemming from a dimensionality reduction enhanced the operational efficiency in terms of computational time and memory at the cost of decreased accuracy. In contrast, the addition of a CNN resulted in higher complexity and lower accuracy. The RAW-LSTM architecture, which did not include either PCA or CNN, had the least number of parameters, which resulted in the best computational time and memory while also achieving the highest accuracy.

Improving Detection Range for Short Baseline Stereo Cameras Using Convolutional Neural Networks and Keypoint Matching (컨볼루션 뉴럴 네트워크와 키포인트 매칭을 이용한 짧은 베이스라인 스테레오 카메라의 거리 센싱 능력 향상)

  • Byungjae Park
    • Journal of Sensor Science and Technology
    • /
    • v.33 no.2
    • /
    • pp.98-104
    • /
    • 2024
  • This study proposes a method to overcome the limited detection range of short-baseline stereo cameras (SBSCs). The proposed method includes two steps: (1) predicting an unscaled initial depth using monocular depth estimation (MDE) and (2) adjusting the unscaled initial depth by a scale factor. The scale factor is computed by triangulating the sparse visual keypoints extracted from the left and right images of the SBSC. The proposed method allows the use of any pre-trained MDE model without the need for additional training or data collection, making it efficient even when considering the computational constraints of small platforms. Using an open dataset, the performance of the proposed method was demonstrated by comparing it with other conventional stereo-based depth estimation methods.

A comparative study on keypoint detection for developmental dysplasia of hip diagnosis using deep learning models in X-ray and ultrasound images (X-ray 및 초음파 영상을 활용한 고관절 이형성증 진단을 위한 특징점 검출 딥러닝 모델 비교 연구)

  • Sung-Hyun Kim;Kyungsu Lee;Si-Wook Lee;Jin Ho Chang;Jae Youn Hwang;Jihun Kim
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.5
    • /
    • pp.460-468
    • /
    • 2023
  • Developmental Dysplasia of the Hip (DDH) is a pathological condition commonly occurring during the growth phase of infants. It acts as one of the factors that can disrupt an infant's growth and trigger potential complications. Therefore, it is critically important to detect and treat this condition early. The traditional diagnostic methods for DDH involve palpation techniques and diagnosis methods based on the detection of keypoints in the hip joint using X-ray or ultrasound imaging. However, there exist limitations in objectivity and productivity during keypoint detection in the hip joint. This study proposes a deep learning model-based keypoint detection method using X-ray and ultrasound imaging and analyzes the performance of keypoint detection using various deep learning models. Additionally, the study introduces and evaluates various data augmentation techniques to compensate the lack of medical data. This research demonstrated the highest keypoint detection performance when applying the residual network 152 (ResNet152) model with simple & complex augmentation techniques, with average Object Keypoint Similarity (OKS) of approximately 95.33 % and 81.21 % in X-ray and ultrasound images, respectively. These results demonstrate that the application of deep learning models to ultrasound and X-ray images to detect the keypoints in the hip joint could enhance the objectivity and productivity in DDH diagnosis.

Comparison of Feature Point Extraction Algorithms Using Unmanned Aerial Vehicle RGB Reference Orthophoto (무인항공기 RGB 기준 정사영상을 이용한 특징점 추출 알고리즘 비교)

  • Lee, Kirim;Seong, Jihoon;Jung, Sejung;Shin, Hyeongil;Kim, Dohoon;Lee, Wonhee
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.44 no.2
    • /
    • pp.263-270
    • /
    • 2024
  • As unmanned aerial vehicles(UAVs) and sensors have been developed in a variety of ways, it has become possible to update information on the ground faster than existing aerial photography or remote sensing. However, acquisition and input of ground control points(GCPs) UAV photogrammetry takes a lot of time, and geometric distortion occurs if measurement and input of GCPs are incorrect. In this study, RGB-based orthophotos were generated to reduce GCPs measurment and input time, and comparison and evaluation were performed by applying feature point algorithms to target orthophotos from various sensors. Four feature point extraction algorithms were applied to the two study sites, and as a result, speeded up robust features(SURF) was the best in terms of the ratio of matching pairs to feature points. When compared overall, the accelerated-KAZE(AKAZE) method extracted the most feature points and matching pairs, and the binary robust invariant scalable keypoints(BRISK) method extracted the fewest feature points and matching pairs. Through these results, it was confirmed that the AKAZE method is superior when performing geometric correction of the objective orthophoto for each sensor.

Fall Detection Based on Human Skeleton Keypoints Using GRU

  • Kang, Yoon-Kyu;Kang, Hee-Yong;Weon, Dal-Soo
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.12 no.4
    • /
    • pp.83-92
    • /
    • 2020
  • A recent study to determine the fall is focused on analyzing fall motions using a recurrent neural network (RNN), and uses a deep learning approach to get good results for detecting human poses in 2D from a mono color image. In this paper, we investigated the improved detection method to estimate the position of the head and shoulder key points and the acceleration of position change using the skeletal key points information extracted using PoseNet from the image obtained from the 2D RGB low-cost camera, and to increase the accuracy of the fall judgment. In particular, we propose a fall detection method based on the characteristics of post-fall posture in the fall motion analysis method and on the velocity of human body skeleton key points change as well as the ratio change of body bounding box's width and height. The public data set was used to extract human skeletal features and to train deep learning, GRU, and as a result of an experiment to find a feature extraction method that can achieve high classification accuracy, the proposed method showed a 99.8% success rate in detecting falls more effectively than the conventional primitive skeletal data use method.

Antiblurry Dejitter Image Stabilization Method of Fuzzy Video for Driving Recorders

  • Xiong, Jing-Ying;Dai, Ming;Zhao, Chun-Lei;Wang, Ruo-Qiu
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.6
    • /
    • pp.3086-3103
    • /
    • 2017
  • Video images captured by vehicle cameras often contain blurry or dithering frames due to inadvertent motion from bumps in the road or by insufficient illumination during the morning or evening, which greatly reduces the perception of objects expression and recognition from the records. Therefore, a real-time electronic stabilization method to correct fuzzy video from driving recorders has been proposed. In the first stage of feature detection, a coarse-to-fine inspection policy and a scale nonlinear diffusion filter are proposed to provide more accurate keypoints. Second, a new antiblurry binary descriptor and a feature point selection strategy for unintentional estimation are proposed, which brought more discriminative power. In addition, a new evaluation criterion for affine region detectors is presented based on the percentage interval of repeatability. The experiments show that the proposed method exhibits improvement in detecting blurry corner points. Moreover, it improves the performance of the algorithm and guarantees high processing speed at the same time.

A Method of Constructing Robust Descriptors Using Scale Space Derivatives (스케일 공간 도함수를 이용한 강인한 기술자 생성 기법)

  • Park, Jongseung;Park, Unsang
    • Journal of KIISE
    • /
    • v.42 no.6
    • /
    • pp.764-768
    • /
    • 2015
  • Requirement of effective image handling methods such as image retrieval has been increasing with the rising production and consumption of multimedia data. In this paper, a method of constructing more effective descriptor is proposed for robust keypoint based image retrieval. The proposed method uses information embedded in the first order and second order derivative images, in addition to the scale space image, for the descriptor construction. The performance of multi-image descriptor is evaluated in terms of the similarities in keypoints with a public domain image database that contains various image transformations. The proposed descriptor shows significant improvement in keypoint matching with minor increase of the length.

Remote Sensing of Nearshore Currents using Coastal Optical Imagery (해안 광학영상 자료를 이용한 쇄파지역 연안류 측정기술)

  • Yoo, Jeseon;Kim, Sun-Sin
    • Ocean and Polar Research
    • /
    • v.37 no.1
    • /
    • pp.11-22
    • /
    • 2015
  • In-situ measurements are labor-intensive, time-consuming, and limited in their ability to observe currents with spatial variations in the surf zone. This paper proposes an optical image-based method of measurement of currents in the surf zone. This method measures nearshore currents by tracking in time wave breaking-induced foam patches from sequential images. Foam patches in images tend to be arrayed with irregular pixel intensity values, which are likely to remain consistent for a short period of time. This irregular intensity feature of a foam patch is characterized and represented as a keypoint using an image-based object recognition method, i.e., Scale Invariant Feature Transform (SIFT). The keypoints identified by the SIFT method are traced from time sequential images to produce instantaneous velocity fields. In order to remove erroneous velocities, the instantaneous velocity fields are filtered by binding them within upper and lower limits, and averaging the velocity data in time and space with a certain interval. The measurements that are obtained by this method are comparable to the results estimated by an existing image-based method of observing currents, named the Optical Current Meter (OCM).

An Approach for Localization Around Indoor Corridors Based on Visual Attention Model (시각주의 모델을 적용한 실내 복도에서의 위치인식 기법)

  • Yoon, Kook-Yeol;Choi, Sun-Wook;Lee, Chong-Ho
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.17 no.2
    • /
    • pp.93-101
    • /
    • 2011
  • For mobile robot, recognizing its current location is very important to navigate autonomously. Especially, loop closing detection that robot recognize location where it has visited before is a kernel problem to solve localization. A considerable amount of research has been conducted on loop closing detection and localization based on appearance because vision sensor has an advantage in terms of costs and various approaching methods to solve this problem. In case of scenes that consist of repeated structures like in corridors, perceptual aliasing in which, the two different locations are recognized as the same, occurs frequently. In this paper, we propose an improved method to recognize location in the scenes which have similar structures. We extracted salient regions from images using visual attention model and calculated weights using distinctive features in the salient region. It makes possible to emphasize unique features in the scene to classify similar-looking locations. In the results of corridor recognition experiments, proposed method showed improved recognition performance. It shows 78.2% in the accuracy of single floor corridor recognition and 71.5% for multi floor corridors recognition.