• Title/Abstract/Keywords: camera-based recognition

Staff-line Detection and Removal Algorithm for Mobile Phone-based Recognition of Musical Images (카메라 기반 악보 영상 인식을 위한 오선 검출 및 삭제 알고리즘)

  • Son, Hwa-Jeong;Kim, Soo-Hyung;Oh, Sung-Ryul
    • The Journal of the Korea Contents Association
    • /
    • v.7 no.11
    • /
    • pp.34-42
    • /
    • 2007
  • In this paper, we propose a staff-line detection and removal algorithm for music score images captured by a mobile phone camera. As a preprocessing step for music score recognition, staff-line detection and removal must work efficiently on skewed or curved images. The proposed method detects a staff line by dividing a staff according to the degree of distortion. The number of divisions is determined by dividing the staff repeatedly until the average of the differences of the y coordinates at every divided position is smaller than a threshold. The number of divisions can therefore be estimated adaptively according to the degree of distortion. For the experiment, we generated various images by rotating an image by $1^{\circ}$ to $3^{\circ}$ or curving it slightly upward. The results show that the proposed method performed well on the experimental images.
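
The adaptive division described in this abstract could be sketched roughly as follows. This is only one reading of the abstract, not the authors' implementation: the function name `estimate_divisions`, the doubling strategy, the default threshold, and the use of a per-column list of staff-line y coordinates (`line_y`) are all assumptions made for illustration.

```python
def estimate_divisions(line_y, threshold=1.0, max_divisions=64):
    """Double the number of segments until the mean |dy| per segment is below threshold.

    line_y: hypothetical list of y coordinates of one staff line, one per image column.
    """
    width = len(line_y)
    n = 1
    while n <= max_divisions:
        # Evenly spaced division positions across the staff width.
        bounds = [i * (width - 1) // n for i in range(n + 1)]
        # Difference of y coordinates between consecutive division positions.
        diffs = [abs(line_y[b] - line_y[a]) for a, b in zip(bounds, bounds[1:])]
        if sum(diffs) / len(diffs) < threshold:
            return n
        n *= 2
    return max_divisions


flat = [0.0] * 101                       # undistorted staff line
skewed = [0.05 * x for x in range(101)]  # line rising 5 pixels over 100 columns
print(estimate_divisions(flat), estimate_divisions(skewed))
```

Under these assumptions a flat line needs no division, while a more strongly skewed or curved line is split into more segments, each of which is closer to horizontal.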

Development of a Blocks Recognition Application for Children's Education using a Smartphone Camera (스마트폰 카메라 기반 아동 교육용 산수 블록 인식 애플리케이션 개발)

  • Park, Sang-A;Oh, Ji-Won;Hong, In-Sik;Nam, Yunyoung
    • Journal of Internet Computing and Services
    • /
    • v.20 no.4
    • /
    • pp.29-38
    • /
    • 2019
  • Today's information society is changing rapidly and demands innovation and creativity in various fields. The importance of mathematics, which can be the basis of creativity and logic, is therefore emphasized. The purpose of this paper is to develop a math education application that further expands logical mathematical thinking and encourages voluntary learning by using a readily available teaching aid to build children's motivation and interest in learning. The paper presents a math education application for children that uses a smartphone and blocks. The main function of the application is to take a picture with the camera and show the calculated value: when a child builds a formula from blocks and photographs it, the child can immediately see the calculated value of the formula. Preprocessing, text extraction, and character recognition of the photographed images were implemented using the OpenCV and Tesseract-OCR libraries.
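
The abstract does not say how the recognized formula is turned into a value. As a hedged illustration only, the step after OCR might look like the sketch below, which takes the recognized text as a plain string and evaluates basic arithmetic without calling `eval`. The function name `evaluate_formula` and the symbol normalization are assumptions; the OpenCV and Tesseract-OCR stages are not shown.

```python
import ast
import operator

# Only the four basic operators are allowed (an assumption about the block set).
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def evaluate_formula(text):
    """Evaluate an arithmetic string such as '3 + 4 × 2' recognized from blocks."""
    # Map symbols an OCR engine may return for block glyphs onto Python operators.
    cleaned = text.replace("×", "*").replace("÷", "/").replace(" ", "")
    tree = ast.parse(cleaned, mode="eval")

    def walk(node):
        if isinstance(node, ast.Constant):
            if isinstance(node.value, (int, float)) and not isinstance(node.value, bool):
                return node.value
        elif isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.left), walk(node.right))
        # Anything else (names, calls, unary operators) is rejected.
        raise ValueError("unsupported expression: " + text)

    return walk(tree.body)


print(evaluate_formula("3 + 4 × 2"))
```

Walking the syntax tree rejects anything other than numbers and the four operators, which seems a reasonable precaution when the input comes from OCR.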

Manipulator with Camera for Mobile Robots (모바일 로봇을 위한 카메라 탑재 매니퓰레이터)

  • Lee, Jun-Woo;Choe, Kyoung-Geun;Cho, Hun-Hee;Jeong, Seong-Kyun;Bong, Jae-Hwan
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.17 no.3
    • /
    • pp.507-514
    • /
    • 2022
  • Mobile manipulators are gaining attention in the field of home automation because of their mobility and manipulation capabilities. In this paper, we developed a small manipulator system that can be mounted on a mobile robot, as a preliminary study toward a mobile manipulator. The developed manipulator has four degrees of freedom. A camera and a gripper at the end-effector recognize and manipulate the object. One of the four degrees of freedom is linear motion in the vertical direction, for better interaction with human hands, which are located higher than the mobile manipulator. The four actuators are placed close to the base of the manipulator to reduce its rotational inertia, which improves manipulation stability and reduces the risk of rollover. The developed manipulator repeatedly performed a pick-and-place task and successfully manipulated the object within its workspace.

3D Fingertip Estimation based on the TOF Camera for Virtual Touch Screen System (가상 터치스크린 시스템을 위한 TOF 카메라 기반 3차원 손 끝 추정)

  • Kim, Min-Wook;Ahn, Yang-Keun;Jung, Kwang-Mo;Lee, Chil-Woo
    • The KIPS Transactions:PartB
    • /
    • v.17B no.4
    • /
    • pp.287-294
    • /
    • 2010
  • Time-of-flight (TOF) is one technique for obtaining an object's 3D depth information. However, the depth image has low resolution and the fingertip occupies a very small region, so it is difficult to find precise 3D information for the fingertip using only the depth image from a TOF camera. In this paper, we estimate the fingertip's 3D location using an arm model and reliable 3D location information for the hand, which is corrected using a hexahedron as the hand model. The proposed method yields more precise 3D fingertip information than the depth image alone.

Design of IR Camera based Wearable Keyboard for Smart UX (스마트 UX용 IR 카메라 기반의 웨어러블 키보드 설계)

  • Park, H.Y.;Lee, E.S.;Nam, G.S.;Kang, M.G.;Jung, S.M.;Yeo, H.G.
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2015.05a
    • /
    • pp.501-502
    • /
    • 2015
  • In this paper, an infrared (IR) camera-based wearable keyboard with finger tracking is proposed to support varied UX (User eXperience) on smart platforms and portable devices used anywhere. The virtual wearable keyboard was designed with OpenCV and MFC, applying 'fingertip tracking' and the 'k-curvature' algorithm with two IR cameras.
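
For readers unfamiliar with k-curvature, the general idea is to measure, at each contour point, the angle between the vectors to the points k steps behind and k steps ahead; sharp angles are fingertip candidates. The sketch below is a generic pure-Python illustration, not the paper's OpenCV/MFC code. The function name `k_curvature_peaks`, the value of `k`, and the angle threshold are assumptions.

```python
import math


def k_curvature_peaks(contour, k=5, max_angle_deg=60.0):
    """Return indices of closed-contour points whose k-curvature angle is at most max_angle_deg."""
    n = len(contour)
    peaks = []
    for i, (x, y) in enumerate(contour):
        ax, ay = contour[(i - k) % n]  # point k steps behind (wraps around)
        bx, by = contour[(i + k) % n]  # point k steps ahead (wraps around)
        v1 = (ax - x, ay - y)
        v2 = (bx - x, by - y)
        norm = math.hypot(*v1) * math.hypot(*v2)
        if norm == 0:
            continue  # degenerate case: a neighbour coincides with the point
        cos_t = max(-1.0, min(1.0, (v1[0] * v2[0] + v1[1] * v2[1]) / norm))
        if math.degrees(math.acos(cos_t)) <= max_angle_deg:
            peaks.append(i)
    return peaks


# A narrow "finger": up the left side, over the tip at (1, 5), down the right side.
finger = [(0, 0), (0, 1), (0, 2), (0, 3), (0, 4), (1, 5),
          (2, 4), (2, 3), (2, 2), (2, 1), (2, 0), (1, 0)]
print(k_curvature_peaks(finger, k=2))
```

The angle test alone also fires on the sharp valleys between fingers, so practical systems usually add a convexity check to keep only the tips.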

Design of Real-time Face Recognition Systems Based on Data-Preprocessing and Neuro-Fuzzy Networks for the Improvement of Recognition Rate (인식률 향상을 위한 데이터 전처리와 Neuro-Fuzzy 네트워크 기반의 실시간 얼굴 인식 시스템 설계)

  • Yoo, Sung-Hoon;Oh, Sung-Kwun;Kim, Hyun-Ki
    • Proceedings of the KIEE Conference
    • /
    • 2011.07a
    • /
    • pp.1952-1953
    • /
    • 2011
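  • In this paper, we design a polynomial-based Radial Basis Function (RBF) neural network and apply it to n-class pattern classification problems. The proposed polynomial-based RBF neural network consists of an input layer, a hidden layer, and an output layer. The input layer passes the values of the input vector to the hidden layer. The connection weights between the hidden layer and the output layer take the form of a constant, a linear polynomial, or a quadratic polynomial and are learned by gradient descent. The final output of the network is obtained as the result of fuzzy inference, from the product of the connection weights and the hidden-layer outputs. The pattern classifier is optimized with the Particle Swarm Optimization (PSO) algorithm. The proposed classifier is then applied to a real face recognition system: data captured directly from a CCD camera pass through processing steps such as image correction, face detection, and feature extraction, and the classifier is applied to and evaluated on the n-class problem of distinguishing registered individuals in order to analyze its performance.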
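
The forward pass described in this abstract might be sketched as below for a single output with linear connection weights. This is an assumption-laden illustration rather than the authors' network: Gaussian hidden nodes, normalization of the activations, and the names `prbf_output`, `centers`, `widths`, and `coeffs` are all choices made here. Gradient-descent training and PSO optimization are not shown.

```python
import math


def prbf_output(x, centers, widths, coeffs):
    """Output of one RBF network node with linear (first-order polynomial) connection weights.

    centers[j], widths[j]: Gaussian centre and spread of hidden node j (assumed form).
    coeffs[j]: [a0, a1, ..., ad] so that the weight of node j is a0 + a1*x1 + ... + ad*xd.
    """
    # Hidden layer: Gaussian activation of each node for input x.
    acts = [
        math.exp(-sum((xi - ci) ** 2 for xi, ci in zip(x, c)) / (2.0 * w ** 2))
        for c, w in zip(centers, widths)
    ]
    # Connection weights: one linear polynomial in x per hidden node.
    weights = [a[0] + sum(aj * xj for aj, xj in zip(a[1:], x)) for a in coeffs]
    # Weighted average of the polynomial weights (normalization is an assumption).
    return sum(act * w for act, w in zip(acts, weights)) / sum(acts)


centers = [[0.0], [10.0]]
widths = [1.0, 1.0]
coeffs = [[1.0, 2.0], [5.0, 0.0]]  # weights 1 + 2x and 5
print(prbf_output([5.0], centers, widths, coeffs))
```

With a constant polynomial this reduces to an ordinary RBF network; for n classes, one such output per class could be computed and the largest taken as the predicted class.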

A method of improving the quality of 3D images acquired from RGB-depth camera (깊이 영상 카메라로부터 획득된 3D 영상의 품질 향상 방법)

  • Park, Byung-Seo;Kim, Dong-Wook;Seo, Young-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.5
    • /
    • pp.637-644
    • /
    • 2021
  • In computer vision, robotics, and augmented reality, technology for detecting and recognizing 3D space and 3D objects has grown in importance. In particular, because RGB images and depth images can now be acquired in real time through image sensors such as the Microsoft Kinect, research on object detection, tracking, and recognition has changed considerably. In this paper, we propose a method to improve the quality of 3D reconstructed images by processing images acquired through depth-based (RGB-Depth) cameras in a multi-view camera system. We propose removing noise outside an object by applying a mask acquired from the color image, and applying a combined filtering operation that uses the difference in depth information between pixels inside the object. The experimental results confirm that the proposed method effectively removes noise and improves the quality of the 3D reconstructed image.
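
The first step in this abstract, removing noise outside the object with a mask from the color image, can be illustrated with a few lines of plain Python. This is a minimal sketch under stated assumptions: the depth image and the mask are nested lists of equal shape, the mask is already computed, and `mask_depth` and `background` are hypothetical names. The combined filtering inside the object is not specified in the abstract and is not attempted here.

```python
def mask_depth(depth, mask, background=0):
    """Keep depth values where the mask is set and replace everything else with background."""
    same_rows = len(depth) == len(mask)
    if not same_rows or any(len(d) != len(m) for d, m in zip(depth, mask)):
        raise ValueError("depth and mask must have the same shape")
    return [
        [d if m else background for d, m in zip(depth_row, mask_row)]
        for depth_row, mask_row in zip(depth, mask)
    ]


depth = [[5, 900, 6],
         [7, 8, 850]]   # 900 and 850 stand in for stray readings outside the object
mask = [[1, 0, 1],
        [1, 1, 0]]      # 1 marks object pixels found in the color image
print(mask_depth(depth, mask))
```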

Gesture Spotting by Web-Camera in Arbitrary Two Positions and Fuzzy Garbage Model (임의 두 지점의 웹 카메라와 퍼지 가비지 모델을 이용한 사용자의 의미 있는 동작 검출)

  • Yang, Seung-Eun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.1 no.2
    • /
    • pp.127-136
    • /
    • 2012
  • Much research on vision-based hand gesture recognition has been conducted to let users operate various electronic devices more easily. To recognize hand gestures accurately, a system must calculate 3D positions and separate meaningful gestures from similar ones. This paper describes a simple, cost-effective method for 3D position calculation and gesture spotting (the task of recognizing a meaningful gesture among similar meaningless gestures). The 3D position is obtained by calculating the relative position of two cameras with a pan/tilt module and a marker, regardless of where the cameras are placed. A fuzzy garbage model is proposed to provide a variable reference value for deciding whether a user gesture is a command gesture. The reference is derived from a fuzzy command gesture model and a fuzzy garbage model, which return scores indicating the degree of membership in the command gesture and the garbage gesture, respectively. A two-stage user adaptation is proposed to enhance performance: off-line (batch) adaptation for inter-personal differences and on-line (incremental) adaptation for intra-personal differences. Experiments were conducted with 5 users. The command recognition rate (discriminating the command gesture) is more than 95% when only one command-like meaningless gesture exists and more than 85% when the command is mixed with many other similar gestures.

Reconstruction of Partially Occluded Facial Image Utilizing KPCA-based Denoising Method (KPCA 기반 노이즈 제거 기법을 이용한 부분 손상된 얼굴 영상의 복원)

  • Kang Daesung;Kim Jongho;Park Jooyoung
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2005.04a
    • /
    • pp.247-250
    • /
    • 2005
  • On numerous occasions, there is a need to reconstruct a partially occluded facial image. Typical examples include the recognition of criminals whose facial images are captured by surveillance cameras. In such cases, a significant part of the face is occluded, making identification extremely difficult for both automatic face recognition systems and human observers. To overcome these difficulties, this paper considers applying a Kernel PCA-based denoising method to partially occluded facial images.

Context-Aware Interactive Telescreen Technology (상황인지형 인터랙티브 텔레스크린 기술)

  • Lee, Hyeon-Jin;Eom, Tae-Won;Jo, Gi-Seong;Lee, Hyeon-U;Ryu, Won
    • Information and Communications Magazine
    • /
    • v.30 no.8
    • /
    • pp.69-75
    • /
    • 2013
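  • The telescreen, a next-generation form of digital signage that interactively delivers information and advertising in public places in conjunction with surrounding context information, is one of the methods of mutual communication that began with Paleolithic cave paintings. Early telescreens offered one-way advertising or simple information delivery services, but recent ones encourage consumer participation by using cameras, sensors, NFC (Near Field Communication), and smartphones, or by linking with two-way UI/UX (User Interface/User eXperience) and face recognition technology. In addition, by collecting and analyzing information about the user's surroundings and the user's state, telescreens are evolving into interactive telescreens capable of context-aware two-way communication. Because 3D technology and recognition technologies that sense human reactions continue to advance, telescreen services that respond to the user's emotions are expected to be offered in the near future [1]. As a result, users will be naturally exposed, without noticing, to customized information expected to be most effective given their surroundings and emotions.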