• Title/Summary/Keyword: Object Manipulation

Search Result 173, Processing Time 0.022 seconds

Domain Adaptive Fruit Detection Method based on a Vision-Language Model for Harvest Automation (작물 수확 자동화를 위한 시각 언어 모델 기반의 환경적응형 과수 검출 기술)

  • Changwoo Nam;Jimin Song;Yongsik Jin;Sang Jun Lee
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.19 no.2
    • /
    • pp.73-81
    • /
    • 2024
  • Recently, mobile manipulators have been utilized in agriculture industry for weed removal and harvest automation. This paper proposes a domain adaptive fruit detection method for harvest automation, by utilizing OWL-ViT model which is an open-vocabulary object detection model. The vision-language model can detect objects based on text prompt, and therefore, it can be extended to detect objects of undefined categories. In the development of deep learning models for real-world problems, constructing a large-scale labeled dataset is a time-consuming task and heavily relies on human effort. To reduce the labor-intensive workload, we utilized a large-scale public dataset as a source domain data and employed a domain adaptation method. Adversarial learning was conducted between a domain discriminator and feature extractor to reduce the gap between the distribution of feature vectors from the source domain and our target domain data. We collected a target domain dataset in a real-like environment and conducted experiments to demonstrate the effectiveness of the proposed method. In experiments, the domain adaptation method improved the AP50 metric from 38.88% to 78.59% for detecting objects within the range of 2m, and we achieved 81.7% of manipulation success rate.

Edge based Interactive Segmentation (경계선 기반의 대화형 영상분할 시스템)

  • Yun, Hyun Joo;Lee, Sang Wook
    • Journal of the Korea Computer Graphics Society
    • /
    • v.8 no.2
    • /
    • pp.15-22
    • /
    • 2002
  • Image segmentation methods partition an image into meaningful regions. For image composition and analysis, it is desirable for the partitioned regions to represent meaningful objects in terms of human perception and manipulation. Despite the recent progress in image understanding, however, most of the segmentation methods mainly employ low-level image features and it is still highly challenging to automatically segment an image based on high-level meaning suitable for human interpretation. The concept of HCI (Human Computer Interaction) can be applied to operator-assisted image segmentation in a manner that a human operator provides guidance to automatic image processing by interactively supplying critical information about object boundaries. Intelligent Scissors and Snakes have demonstrated the effectiveness of human-assisted segmentation [2] [1]. This paper presents a method for interactive image segmentation for more efficient and effective detection and tracking of object boundaries. The presented method is partly based on the concept of Intelligent Scissors, but employs the well-established Canny edge detector for stable edge detection. It also uses "sewing method" for including weak edges in object boundaries, and 5-direction search to promote more efficient and stable linking of neighboring edges than the previous methods.

  • PDF

Three-Dimensional Processing of Ultrasonic Pulse-Echo Signal (초음파 펄스에코 신호의 3차원 처리)

  • Song, Moon-Ho;Song, Sang-Rock;Cho, Jung-Ho;Sung, Je-Joong;Ahn, Hyung-Keun;Jang, Soon-Jae
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.23 no.5
    • /
    • pp.464-474
    • /
    • 2003
  • Ultrasonic imaging of 3-D structures for nondestructive evaluation must provide readily recognizable images with enough details to clearly show various flaws that may or may not be present. Typical flaws that need to be detected are miniature cracks, for instance, in metal pipes having aged over years of operation in nuclear power plants; and these sub-millimeter cracks or flaws must be depicted in the final 3-D image for a meaningful evaluation. As a step towards improving conspicuity and thus detection of flaws, we propose a pulse-echo ultrasonic imaging technique to generate various 3-D views of the 3-D object under evaluation through strategic scanning and processing of the pulse-echo data. We employ a 2-D Wiener filter that filters the pulse-echo data along the plane orthogonal to the beam propagation so that ultrasonic beams can be sharpened. This three-dimensional processing and display coupled with 3-D manipulation capabilities by which users are able to pan and rotate the 3-D structure improve conspicuity of flaws. Providing such manipulation operations allow a clear depiction of the size and the location of various flaws in 3-D.

Design of Vision-based Interaction Tool for 3D Interaction in Desktop Environment (데스크탑 환경에서의 3차원 상호작용을 위한 비전기반 인터랙션 도구의 설계)

  • Choi, Yoo-Joo;Rhee, Seon-Min;You, Hyo-Sun;Roh, Young-Sub
    • The KIPS Transactions:PartB
    • /
    • v.15B no.5
    • /
    • pp.421-434
    • /
    • 2008
  • As computer graphics, virtual reality and augmented reality technologies have been developed, in many application areas based on those techniques, interaction for 3D space is required such as selection and manipulation of an 3D object. In this paper, we propose a framework for a vision-based 3D interaction which enables to simulate functions of an expensive 3D mouse for a desktop environment. The proposed framework includes a specially manufactured interaction device using three-color LEDs. By recognizing position and color of the LED from video sequences, various events of the mouse and 6 DOF interactions are supported. Since the proposed device is more intuitive and easier than an existing 3D mouse which is expensive and requires skilled manipulation, it can be used without additional learning or training. In this paper, we explain methods for making a pointing device using three-color LEDs which is one of the components of the proposed framework, calculating 3D position and orientation of the pointer and analyzing color of the LED from video sequences. We verify accuracy and usefulness of the proposed device by showing a measurement result of an error of the 3D position and orientation.

Hardware Architecture for PC-based MPEG-4 Video CODEC (PC 기반 MPEG-4 비디오 코덱 구현을 위한 하드웨어 아키텍쳐)

  • 곽진석;임영권;박상규;김진웅
    • Journal of Broadcast Engineering
    • /
    • v.2 no.2
    • /
    • pp.86-93
    • /
    • 1997
  • Fast growth of multimedia applications requires new functions for video data processing. such as obj;cted-based video representation and manipulation. which are not supported by 11PEG-l and 11PEG-2. To support these requirements. 11PEG-4 video coding allows users to manipulate every video object easily by decomposing a scene into several video objects and coding each of them independently. However. the large amount of computations and flexible structure of 11PEG-4 video CODEC make it difficult to be implemented by either the general purpose DSP or a dedicated VLSI. In this paper, we propose a hardware architecture using a hybrid of a high performance programmable DSP and an application specific IC to implement a flexible 11PEG-4 video codec requiring the large amount of computations. The application specific IC has the functions of motion estimation and compensation.

  • PDF

A Study on Infra-Technology of RCP Mobility System

  • Kim, Seung-Woo;Choe, Jae-Il;Im, Chan-Young
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.1435-1439
    • /
    • 2004
  • Most recently, CP(Cellular Phone) has been one of the most important technologies in the IT(Information Tech-nology) field, and it is situated in a position of great importance industrially and economically. To produce the best CP in the world, a new technological concept and its advanced implementation technique is required, due to the extreme level of competition in the world market. The RT(Robot Technology) has been developed as the next generation of a future technology. Current robots require advanced technology, such as soft computing, human-friendly interface, interaction technique, speech recognition, object recognition etc. unlike the industrial robots of the past. Therefore, this paper explains conceptual research for development of the RCP(Robotic Cellular Phone), a new technological concept, in which a synergy effect is generated by the merging of IT & RT. RCP infra consists of $RCP^{Mobility}$ $RCP^{Interaction}$, $RCP^{Integration}$ technologies. For $RCP^{Mobility}$, human-friendly motion automation and personal service with walking and arming ability are developed. $RCP^{Interaction}$ ability is achieved by modeling an emotion-generating engine and $RCP^{Integration}$ that recognizes environmental and self conditions is developed. By joining intelligent algorithms and CP communication network with the three base modules, a RCP system is constructed. Especially, the RCP mobility system is focused in this paper. $RCP^{Mobility}$ is to apply a mobility technology, which is popular robot technology, to CP and combine human-friendly motion and navigation function to CP. It develops a new technological application system of auto-charging and real-world entertainment function etc. This technology can make a CP companion pet robot. It is an automation of human-friendly motions such as opening and closing of CPs, rotation of antenna, manipulation and wheel-walking. It's target is the implementation of wheel and manipulator functions that can give service to humans with human-friendly motion. So, this paper presents the definition, the basic theory and experiment results of the RCP mobility system. We confirm a good performance of the RCP mobility system through the experiment results.

  • PDF

Change of Lumbar Lordotic angle by Taping Therapy on Low Back Pain Patient with Lumbar Hyperlordosis ; A Case Report (테이핑 요법으로 호전된 요통환자의 요추전만도 변화 1례)

  • Youn, Yu-Suck;Lee, Jong-Soo;Moon, Sang-Hyun
    • The Journal of Korea CHUNA Manual Medicine
    • /
    • v.4 no.1
    • /
    • pp.157-165
    • /
    • 2003
  • Low back pain (LBP) is a significant in today's society, with lifetime include factors associated with LBP ar reporter. Among the causes, aberration of posture may play a role in the development of LBP. Many investigators have assessed the curvature of spine in standing posture. But LBP is associated with Lumber Hyperlordosis of Hyperlordosis is controversial Subjects: In conservative treatment(acupuncture, herb med, manipulation & TENS. exercise, potural correction) for a 40 years old woman who had low back pain(V AS) be caused by decrease lumbar lordotic angie. Objectives: The object is change of lumbar lordotic angle of a 40 years old woman who had low back pain with Lumbar hyperlordosis, In conservative treatment. Method: In conservative treatment, We added taping therapy(mechanical correction taping of Kinesio Taping) about Lumbar Lordosis. Conclusion: We experienced a 40 years old woman who had love pack pain with Lumbar hyperlordosis. In conservative treatment, Her pain was Improved by additional taping therapy In company with decrease of Lumbar Lordosis. 1. abnormal spinal curvature, specially lumbar hyperlordosis act on induction & perpetuation agent for low back pain 2. In a patient had low back pain with lumbar hyperlordosis, change of lumbar lordotic angle is of utility value for the effect of treatment and assessment of prognosis. 3. pain control is more relative with change of lumbosacral angle than lumbar lordotic angle, in patient had low back pain with lumbar hyperlordosis. 4. mechanical taping therapy with elastic adhesive tape is effective for patient had low back pain with lumbar hyperlordosis

  • PDF

Direct Manipulation based Trajectory Inserting and Editing Methods for ARtalet Authoring Tool (디지로그 북 저작을 위한 감각형 조작 도구를 이용한 직조작 기반의 3D 객체의 이동궤적 삽입 및 편집 기술)

  • Ha, Tae-Jin;Lee, Young-Ho;Woo, Woon-Tack
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.497-501
    • /
    • 2009
  • 'Digilog Book' integrates advantages of existing paper book and immersive digital contents in augmented reality environment, which enables users to feel physical touch and get additional multisensory feedback. As a high level authoring user interface, 'ARtalet' provides an intuitive way to make Digilog Book through 3D user interface in augmented reality environment. This paper mentions trajectory inserting and editing methods of 3D objects, then combining method of the trajectory. 3D object is selected by camera tracked prop, and then transformation matrix relative to book plane is stored in real time based on timeframe. The saved trajectory is managed as templates, and user can make various compositions of trajectories. We expect that suggested methods can enhance interest of readers.

  • PDF

Intelligent Character System using Emotion Metadata (감성 메타데이터를 활용한 지능형 캐릭터 시스템)

  • Han, Jong-Sung;Lee, Wan-Bok;Kyung, Byung-Pyo;Lee, Dong-Lyeor;Ryu, Seuc-Ho;Lee, Kyoung-Jae
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.3
    • /
    • pp.99-107
    • /
    • 2009
  • As the information and the network technology are improved, the system which can express the interactions between the individuals becomes to play more important roles in these days. In fact, that tendency is especially well shown in the community area of P2P and social network service programs. This paper suggest an intelligent character manipulation system which can be effectively used to express emotional representation in an intelligent way in spite of many constraints. The system employs an emotion searching mechanism by attaching emotional information to each object in the database and defining a function of emotional similarity. It is expected that the system can be successfully used not only to find and represent the suitable emotional character representations but also to provide brand new services in the area of mobile platform based contents.

Accuracy of Mid Point Computation for Boundary Delimitation on Ellipsoid (타원체상에서 경계획선을 위한 중간점계산의 정확도)

  • 김병국;이종기;김정기
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.19 no.4
    • /
    • pp.365-372
    • /
    • 2001
  • The general rule of boundary delimitation is a the principle of equidistant. The principle of equidistant is a method that determine boundary delimitation from fixed distant of baseline or basepoint. In this paper, study Two-Point Algorithm and Three-Point Algorithm that are widely used. and developed the Boundary Delimitation Program to verify the result and error. This program is specially useful for maritime boundary delimitation problem because there is no artificial and natural object in sea to determine boundary. As a result The mid-points computed on Ellipsoid have small error rather than mid-points on plane or sphere without any distortion by map projection. Through developing boundary delimitation program, can eliminate the various manipulation error using paper map, and quickly cope with maritime boundary delimitation negotiation. Also, verify that the error of basepoint in baseline is propagate the mid-point in mid-line, and determine suitable reference plane.

  • PDF