• 제목/요약/키워드: Computer vision technology

검색결과 674건 처리시간 0.029초

소형 휴머노이드 로봇 시스템 개발 (The Development of a Miniature Humanoid Robot System)

  • 성영휘;이수영
    • 제어로봇시스템학회논문지
    • /
    • 제7권5호
    • /
    • pp.420-426
    • /
    • 2001
  • In this paper, we introduce a case study of developing a miniature humanoid robot that has 16 degrees of freedom and is able to perform statically stable walking. The developed humanoid robot is 37cm tall and weighs 1,200g. RC servo motors are used as actuators. The robot can walk forward and turn to any direction on an even surface. It equipped with a small digital camera, so it can transmit vision data to a remote host computer via wireless modem. The robot can be operated in two modes: One is a remote-controlled mode, in which the robot behaves according to the command given by a human operator through the user-interface program running on a remote host computer, the other is a stand-alone mode, in which the robot behaves autonomously according the pre-programmed strategy. The user-interface program also contains a robot graphic simulator that is used to produce and verify the robot\`s gait motion. In our walking algorithm, the ankle joint is mainly used for balancing the robot. The experimental results shows that the developed robot can perform statically stable walking on an even surface.

  • PDF

위성 영상의 효과적인 분석을 위한 밝기와 크로스 엔트로피 기반의 그림자 검출 (Shadow Detection Based Intensity and Cross Entropy for Effective Analysis of Satellite Image)

  • 박기홍
    • 한국항행학회논문지
    • /
    • 제20권4호
    • /
    • pp.380-385
    • /
    • 2016
  • 그림자는 자연 영상에서 관찰되는 물리적인 현상이지만 위성 영상 분석에 부정적인 영향을 미치는 요소로 컴퓨터 비전의 전처리 과정에서 그림자 검출 과정은 매우 중요하다. 본 논문에서는 싱글 영상 기반의 위성 영상에서 효과적인 영상 분석을 위해 그림자를 검출하는 방법으로 크로스 엔트로피와 밝기 영상을 이용해 그림자를 검출하는 방법을 제안하였다. 칼라 영상을 그레이 레벨 영상으로 변환한 후 크로스 엔트로피를 기반으로 최적의 임계값을 추정하여 첫 번째 그림자 후보 영역으로 판별하였고, 칼라 영상의 밝기 영상을 이용해 최종 그림자 영역을 검출하였다. 제안하는 방법의 타당성을 위해 위성 영상들을 대상으로 실험하였고, 실험 결과 제안하는 그림자를 검출 방법이 효과적으로 수행됨을 확인하였다.

Interactive Typography System using Combined Corner and Contour Detection

  • Lim, Sooyeon;Kim, Sangwook
    • International Journal of Contents
    • /
    • 제13권1호
    • /
    • pp.68-75
    • /
    • 2017
  • Interactive Typography is a process where a user communicates by interacting with text and a moving factor. This research covers interactive typography using real-time response to a user's gesture. In order to form a language-independent system, preprocessing of entered text data presents image data. This preprocessing is followed by recognizing the image data and the setting interaction points. This is done using computer vision technology such as the Harris corner detector and contour detection. User interaction is achieved using skeleton information tracked by a depth camera. By synchronizing the user's skeleton information acquired by Kinect (a depth camera,) and the typography components (interaction points), all user gestures are linked with the typography in real time. An experiment was conducted, in both English and Korean, where users showed an 81% satisfaction level using an interactive typography system where text components showed discrete movements in accordance with the users' gestures. Through this experiment, it was possible to ascertain that sensibility varied depending on the size and the speed of the text and interactive alteration. The results show that interactive typography can potentially be an accurate communication tool, and not merely a uniform text transmission system.

인공지능 프로세서 기술 동향 (AI Processor Technology Trends)

  • 권영수
    • 전자통신동향분석
    • /
    • 제33권5호
    • /
    • pp.121-134
    • /
    • 2018
  • The Von Neumann based architecture of the modern computer has dominated the computing industry for the past 50 years, sparking the digital revolution and propelling us into today's information age. Recent research focus and market trends have shown significant effort toward the advancement and application of artificial intelligence technologies. Although artificial intelligence has been studied for decades since the Turing machine was first introduced, the field has recently emerged into the spotlight thanks to remarkable milestones such as AlexNet-CNN and Alpha-Go, whose neural-network based deep learning methods have achieved a ground-breaking performance superior to existing recognition, classification, and decision algorithms. Unprecedented results in a wide variety of applications (drones, autonomous driving, robots, stock markets, computer vision, voice, and so on) have signaled the beginning of a golden age for artificial intelligence after 40 years of relative dormancy. Algorithmic research continues to progress at a breath-taking pace as evidenced by the rate of new neural networks being announced. However, traditional Von Neumann based architectures have proven to be inadequate in terms of computation power, and inherently inefficient in their processing of vastly parallel computations, which is a characteristic of deep neural networks. Consequently, global conglomerates such as Intel, Huawei, and Google, as well as large domestic corporations and fabless companies are developing dedicated semiconductor chips customized for artificial intelligence computations. The AI Processor Research Laboratory at ETRI is focusing on the research and development of super low-power AI processor chips. In this article, we present the current trends in computation platform, parallel processing, AI processor, and super-threaded AI processor research being conducted at ETRI.

향상된 트래킹 시스템과 실시간 수성 사인펜을 위한 사실적 드로잉 (Improved Tracking System and Realistic Drawing for Real-Time Water-Based Sign Pen)

  • 허혜정;이주영
    • 한국컴퓨터정보학회논문지
    • /
    • 제19권2호
    • /
    • pp.125-132
    • /
    • 2014
  • 본 논문에서는 저가의 웹 카메라를 사용하여 마커 없이 손끝과 붓을 트래킹 하는 시스템을 제시한다. 트래킹 시스템은 CUDA를 사용하여 병렬처리를 적용했다. 이 트래킹 시스템은 노트북이나 데스크탑과 같은 환경에서 수행이 가능하고, 실시간 애플리케이션에 사용 가능한 성능을 가진다. 또한 본 논문에서는 사적인 수성 사인펜 드로잉 모델을 제시하고 구현된 결과를 보여준다. 제안하는 시스템은 손끝과 붓을 트래킹 하는 저가의 실시간 트래킹 시스템으로 사실적 드로잉 애플리케이션과 연동하여 미래 최첨단 교육 환경 구축의 테스트베드로의 활용을 기대한다.

신경망과 비젼 시스템을 이용한 영상의 이진화에서 동적 임계값 설정 (Dynamic Threshold Value Decision in Image Binarization using Neural Network and Vi sion System)

  • 김영탁;문희근;김수정;김관형;탁한호;이상배
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2002년도 추계학술대회 및 정기총회
    • /
    • pp.313-316
    • /
    • 2002
  • 이동 물체의 이동 거리 추적이나 대상 물체의 인식과 판별 물체의 특징 추출과 같은 응용분야에서 컴퓨터(Computer)와 비젼시스템(vision system)을 이용한 영상 데이터 처리 분야에 대한 이용률이 증가하면서, 그에 따른 연구가 활발히 진행되고 있다. 따라서 CCD 카메라(Charge-Couple Device Camera)로부터 입력된 그레이 레벨(Gray Level)의 영상을 입력받아 처리과정을 거쳐 위치정보를 전송하는 과정에서 정확한 정보를 얻기 위한 전처리 과정 방법을 제안하고, 실제 시스템에 적용한 결과를 제시한다. 여기서 영상의 전처리 과정 중 입력 영상에서 불필요한 부분을 제거하거나, 배경과 대상물의 분리, 내포된 잡음을 없애기 위하여 흔히 이진화 방법을 많이 사용한다 특히 이진화 과정에서 그레이 레벨의 입력영상에서 히스토그램(histogram) 정보를 이용하여 영상의 이진화시의 임계값을 찾는 것은 아주 중요한 요인이다 따라서 본 논문에서는 신경회로망을 이용하여 실시간으로 CCD 카메라를 통하여 입력되는 그레이 레벨의 입력 영상에 대하여 동적으로 적당한 임계값을 .찾는 방법을 제안하고자한다. 또한 제안한 신경회로망을 이용한 임계값 추출 알고리즘(algorithms)을 구현한 시스템(system)에 적용하여 일반적인 방법과 비교 검토하고 응용 가능성을 확인한다.

차선 추적을 이용한 환경변화에 강인한 차선 검출 방법 (A Method of Lane Marker Detection Robust to Environmental Variation Using Lane Tracking)

  • 이지혜;이강
    • 한국멀티미디어학회논문지
    • /
    • 제21권12호
    • /
    • pp.1396-1406
    • /
    • 2018
  • Lane detection is a key function in developing autonomous vehicle technology. In this paper, we propose a lane marker detection algorithm robust to environmental variation targeting low cost embedded computing devices. The proposed algorithm consists of two phases: initialization phase which is slow but has relatively higher accuracy; and the tracking phase which is fast and has the reliable performance in a limited condition. The initialization phase detects lane markers using a set of filters utilizing the various features of lane markers. The tracking phase uses Kalman filter to accelerate the lane marker detection processing. In a tracking phase, we measure the reliability of the detection results and switch it to initialization phase if the confidence level becomes below a threshold. By combining the initialization and tracking phases we achieved high accuracy and acceptable computing speed even under a low cost computing resources in which we cannot use the computing intensive algorithm such as deep learning approach. Experimental results show that the detection accuracy is about 95% on average and the processing speed is about 20 frames per second with Raspberry Pi 3 which is low cost device.

후보 단어 리스트와 확률 점수에 기반한 한국어 문자 인식 모델 (Candidate Word List and Probability Score Guided for Korean Scene Text Recognition)

  • 이윤지;이종민
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2022년도 춘계학술대회
    • /
    • pp.73-75
    • /
    • 2022
  • 글자 인식 시스템은 무인 로봇, 자율 주행 자동차 등 자동화를 필요로 하는 인공지능 분야에서 사용되는 기술로, 주변 환경에 여러 장애물이 있음에도 글자를 정확하게 인식하는 것을 말한다. 영어만 인식했던 기존의 연구와 달리, 본 논문은 영어, 한국어, 특수문자와 숫자를 포함한 다양한 문자가 혼재되어 있는 경우에도 강한 인식률을 보여준다. 가장 높은 확률 값을 갖는 클래스 하나 만을 선택하는 것이 아닌 차 순위의 확률도 함께 고려하여 후보 단어 리스트를 생성하고, 이로 인해 기존에 오인식되는 단어를 교정할 수 있는 방법을 제안한다.

  • PDF

치아 보철물 디자인을 위한 이미지 대 이미지 변환 GAN 모델 (An Image-to-Image Translation GAN Model for Dental Prothesis Design)

  • 김태민;김재곤
    • 한국IT서비스학회지
    • /
    • 제22권5호
    • /
    • pp.87-98
    • /
    • 2023
  • Traditionally, tooth restoration has been carried out by replicating teeth using plaster-based materials. However, recent technological advances have simplified the production process through the introduction of computer-aided design(CAD) systems. Nevertheless, dental restoration varies among individuals, and the skill level of dental technicians significantly influences the accuracy of the manufacturing process. To address this challenge, this paper proposes an approach to designing personalized tooth restorations using Generative Adversarial Network(GAN), a widely adopted technique in computer vision. The primary objective of this model is to create customized dental prosthesis for each patient by utilizing 3D data of the specific teeth to be treated and their corresponding opposite tooth. To achieve this, the 3D dental data is converted into a depth map format and used as input data for the GAN model. The proposed model leverages the network architecture of Pixel2Style2Pixel, which has demonstrated superior performance compared to existing models for image conversion and dental prosthesis generation. Furthermore, this approach holds promising potential for future advancements in dental and implant production.

Color Image Query Using Hierachical Search by Region of Interest with Color Indexing

  • Sombutkaew, Rattikorn;Chitsobhuk, Orachat
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2004년도 ICCAS
    • /
    • pp.810-813
    • /
    • 2004
  • Indexing and Retrieving images from large and varied collections using image content as a key is a challenging and important problem in computer vision application. In this paper, a color Content-based Image Retrieval (CBIR) system using hierarchical Region of Interest (ROI) query and indexing is presented. During indexing process, First, The ROIs on every image in the image database are extracted using a region-based image segmentation technique, The JSEG approach is selected to handle this problem in order to create color-texture regions. Then, Color features in form of histogram and correlogram are then extracted from each segmented regions. Finally, The features are stored in the database as the key to retrieve the relevant images. As in the retrieval system, users are allowed to select ROI directly over the sample or user's submission image and the query process then focuses on the content of the selected ROI in order to find those images containing similar regions from the database. The hierarchical region-of-interest query is performed to retrieve the similar images. Two-level search is exploited in this paper. In the first level, the most important regions, usually the large regions at the center of user's query, are used to retrieve images having similar regions using static search. This ensures that we can retrieve all the images having the most important regions. In the second level, all the remaining regions in user's query are used to search from all the retrieved images obtained from the first level. The experimental results using the indexing technique show good retrieval performance over a variety of image collections, also great reduction in the amount of searching time.

  • PDF