• Title/Summary/Keyword: voice command


Real time instruction classification system

  • Sang-Hoon Lee;Dong-Jin Kwon
    • International Journal of Internet, Broadcasting and Communication / v.16 no.3 / pp.212-220 / 2024
  • With the recent advancement of society, AI technology has made significant strides, especially in the fields of computer vision and voice recognition. This study introduces a system that leverages these technologies to recognize users through a camera and relay commands within a vehicle based on voice commands. The system uses the YOLO (You Only Look Once) machine learning algorithm, widely used for object and entity recognition, to identify specific users. For voice command recognition, a machine learning model based on spectrogram voice analysis is employed to identify specific commands. This design aims to enhance security and convenience by preventing anyone other than registered users from accessing vehicles and IoT devices. The system converts camera input data into YOLO inputs to determine whether a person is present. It also collects voice data through a microphone embedded in the device or computer and converts the time-domain signal into spectrogram data, which serves as input to the voice recognition model. The camera image data and voice data are passed through pre-trained models for inference, and the inference results enable the recognition of simple commands within a limited space. This study demonstrates the feasibility of constructing a device management system for a confined space that enhances security and user convenience through a simple real-time system model. Finally, our work aims to provide practical solutions in application fields such as smart homes and autonomous vehicles.
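
The voice front end described in this abstract (microphone signal converted to a spectrogram that feeds a classifier) can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `spectrogram`, the 256-sample frame, the 128-sample hop, the Hann window, and the 16 kHz test tone are all hypothetical choices that the abstract does not specify.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Return a magnitude spectrogram of shape (frames, frame_len // 2 + 1).

    Frame length, hop size and window are illustrative assumptions.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping frames and apply the window.
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # One real FFT per frame gives the time-frequency magnitude grid.
    return np.abs(np.fft.rfft(frames, axis=1))

# Hypothetical input: one second of a 1 kHz tone sampled at 16 kHz.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)

spec = spectrogram(tone)
# Frequency bin with the most energy, averaged over all frames.
peak_bin = int(spec.mean(axis=0).argmax())
peak_hz = peak_bin * sample_rate / 256
```

In a pipeline like the one described, an array such as `spec` would be the input to the command classifier; the classifier itself is outside the scope of this sketch.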

Two-way Interactive Algorithms Based on Speech and Motion Recognition with Generative AI Technology (생성형 AI 기술을 적용한 음성 및 모션 인식 기반 양방향 대화형 알고리즘)

  • Dae-Sung Jang;Jong-Chan Kim
    • The Journal of the Korea Institute of Electronic Communication Sciences / v.19 no.2 / pp.397-402 / 2024
  • Speech recognition and motion recognition technologies are applied in various smart devices, but they are limited to simple command recognition and serve only simple functions. Beyond simple recognition functions, professional command execution capabilities based on data learned in various fields are required. Research is being conducted on a system platform that uses generative AI, an area of intense worldwide competition, to provide optimal data to users and to interact through voice recognition and motion recognition. The main technical processes in this study were designed around voice and motion recognition functions, the application of AI technology, and two-way communication. In this paper, two-way communication between a device and a user is achieved through various input methods using voice recognition and motion recognition technology combined with AI.

A Design and Implementation of the VoiceXML Multiple-View Editor Using MVC Framework (MVC 프레임 워크를 사용한 VoiceXML 다중 뷰 편집기의 설계 및 구현)

  • 유재우;염세훈
    • The Journal of the Acoustical Society of Korea / v.23 no.5 / pp.390-399 / 2004
  • In this paper, we design and implement a multiple-view VoiceXML editor to improve the efficiency of editing VoiceXML. The editor uses an MVC framework to support multiple views and paradigms, and consists of a model, a view, and a controller. The model, the core data structure, is built from an abstract syntax tree and an abstract grammar. The view, the user interface, is formalized as unparsing rules and an unparser. The controller, which controls the model and the view, consists of a command interpreter and a tree handler. The VoiceXML multiple-view editor overcomes the drawbacks of existing XML editors by showing document structure and context concurrently, as well as document flows. Our editor, to which the MVC framework has been applied, provides various editing views to users concurrently. It thereby supports an efficient and convenient editing environment for voice-web documents and guarantees the transparency of the editor, since the various views share one consistent model.
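
The model/view/controller split described in this abstract can be illustrated with a small sketch. It is not the authors' editor: the names `Node`, `xml_view`, `outline_view`, and `Controller`, and the single `add` command, are hypothetical. The point it shows is that two views unparse the same tree, so an edit made through the controller appears in both.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    """Model: one node of the abstract syntax tree."""
    tag: str
    children: list = field(default_factory=list)

def xml_view(node, depth=0):
    """View 1: unparse the tree as indented markup text."""
    pad = "  " * depth
    if not node.children:
        return f"{pad}<{node.tag}/>"
    inner = "\n".join(xml_view(child, depth + 1) for child in node.children)
    return f"{pad}<{node.tag}>\n{inner}\n{pad}</{node.tag}>"

def outline_view(node, depth=0):
    """View 2: unparse the same tree as a structure outline."""
    lines = ["  " * depth + node.tag]
    for child in node.children:
        lines.append(outline_view(child, depth + 1))
    return "\n".join(lines)

class Controller:
    """Interprets edit commands and applies them to the model tree."""
    def __init__(self, root):
        self.root = root

    def execute(self, command, parent, tag):
        if command == "add":
            parent.children.append(Node(tag))
        else:
            raise ValueError(f"unknown command: {command}")

# Build a tiny document through the controller only.
root = Node("vxml")
controller = Controller(root)
controller.execute("add", root, "form")
controller.execute("add", root.children[0], "prompt")

markup = xml_view(root)
outline = outline_view(root)
```

Because both views are computed from the one tree, they cannot drift apart, which is the consistency property the abstract attributes to the shared model.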

A Study of Hybrid Automatic Interpret Support System (하이브리드 자동 통역지원 시스템에 관한 연구)

  • Lim, Chong-Gyu;Gang, Bong-Gyun;Park, Ju-Sik;Kang, Bong-Kyun
    • Journal of Korean Society of Industrial and Systems Engineering / v.28 no.3 / pp.133-141 / 2005
  • Previous research has focused mainly on the individual technologies of voice recognition, voice synthesis, translation, and bone transmission. Recently, commercial models have been produced using these technologies. In this research, a new automated interpretation support system concept is proposed that combines established bone transmission and wireless technologies. The proposed system has three major components. First, the hybrid system, consisting of a headset, bone transmission, and other technologies, recognizes the user's voice. Second, the recognized voice is converted into a digital signal by a small server attached to the user and is then translated into the other user's language by a translation algorithm. Third, the translated language is wirelessly transmitted to the other party, where the signal is converted into voice by the other party's computer using the hybrid system. The hybrid system transmits a clear message regardless of the ambient noise level or the user's hearing ability. By using network technology, communication between users can also be transmitted clearly regardless of distance.

Design of Voice Control Solution for Industrial Articulated Robot (산업용 다관절로봇 음성제어솔루션 설계)

  • Kwak, Kwang-Jin;Kim, Dae-Yeon;Park, Jeongmin
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.21 no.2 / pp.55-60 / 2021
  • As smart factories advance, the use of automation facilities and robots is increasing. With the development of IT, the use of systems based on voice recognition is also increasing. Voice recognition stands out among smart home and IoT technologies, but it is difficult to apply in factories because of their specific conditions. Therefore, in this study, a method for controlling an industrial articulated robot was designed using voice recognition technology that takes the conditions at the manufacturing site into account. It was confirmed that the robot could be controlled through a network protocol and command conversion after voice commands for robot operation were received through a mobile device.

QPSK Modem Design of Satellite Air-defence Warning System (위성 전군방공경보체계 QPSK 모뎀 설계)

  • Kim, Younghun
    • Journal of the Korea Institute of Military Science and Technology / v.18 no.6 / pp.755-761 / 2015
  • The Satellite Air-defence Warning System receives aircraft/ballistic track information and air defense control commands from the Master Control & Reporting Center (MCRC) and Air Missile Defence Cell (AMD Cell) systems. It consists of terminals and a control system that propagate track information and air defense control commands via military satellite communications. This paper describes the track information, the air defense control commands, the modem frame structure for transmitting voice information, the modulator/demodulator design, and the network synchronization methods used over the satellite network.
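
As background for the modulator/demodulator design mentioned in this abstract, the following sketch shows generic baseband QPSK mapping and hard-decision demapping. It is an illustration only and does not reproduce the paper's modem: the Gray-coded bit mapping, the unit-energy scaling, the noise level, and the function names `qpsk_modulate` and `qpsk_demodulate` are assumptions, and framing and network synchronization are not modelled.

```python
import numpy as np

def qpsk_modulate(bits):
    """Map bit pairs to unit-energy QPSK symbols (Gray-coded, assumed)."""
    pairs = np.asarray(bits).reshape(-1, 2)
    in_phase = 1 - 2 * pairs[:, 0]    # bit 0 -> +1, bit 1 -> -1
    quadrature = 1 - 2 * pairs[:, 1]
    return (in_phase + 1j * quadrature) / np.sqrt(2)

def qpsk_demodulate(symbols):
    """Hard-decision demapping: the sign of each axis gives one bit."""
    symbols = np.asarray(symbols)
    bits = np.empty((symbols.size, 2), dtype=int)
    bits[:, 0] = symbols.real < 0
    bits[:, 1] = symbols.imag < 0
    return bits.reshape(-1)

# Hypothetical test data: 200 random bits become 100 symbols.
rng = np.random.default_rng(0)
tx_bits = rng.integers(0, 2, size=200)
symbols = qpsk_modulate(tx_bits)

# Add a small amount of complex Gaussian noise (illustrative level).
noise = 0.05 * (rng.standard_normal(100) + 1j * rng.standard_normal(100))
rx_bits = qpsk_demodulate(symbols + noise)
```

With Gray coding, neighbouring constellation points differ in a single bit, so the most likely symbol error costs only one bit error.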

Design of a Compact Laparoscopic Assistant Robot: KaLAR

  • Lee, Yun-Ju;Kim, Jona-Than;Ko, Seong-Young;Lee, Woo-Jung;Kwon, Dong-Soo
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 2003.10a / pp.2648-2653 / 2003
  • This paper describes the development of a 3-DOF laparoscopic assistant robot system with motor-controlled bending and zooming mechanisms that uses voice command motion control and auto-tracking control. The system is designed around two major criteria: safety and adaptability. To satisfy the safety criterion, we designed the robot with an optimized range of motion. For adaptability, the robot is compact in order to minimize interference with staff in the operating room. The required external motions are replaced by a bending mechanism inside the abdomen using a flexible laparoscope. Zooming is achieved through in-and-out motion at the port where the laparoscope is inserted. The robot can be attached to the bedside using a conventional laparoscope holder with multiple-DOF joints and is compact enough to be carried by hand. The voice-controlled command input and auto-tracking control are expected to enhance the overall performance of the system while reducing the control load imposed on the surgeon during laparoscopic surgery. The proposed system is expected to offer sufficient safety features and an easy-to-use interface to enhance the overall performance of current laparoscopy.


A Study on Obstacle Avoidance and Autonomous Travelling of Mobile Robot in Manufacturing Process for Smart Factory (스마트 팩토리를 위한 제조공정내에서 모바일 로봇의 장애물 회피 및 자율주행에 관한 연구)

  • Kim, D.B.;Kim, H.J.;Moon, J.C.;Bae, H.Y.;Han, S.H.
    • Journal of the Korean Society of Industry Convergence / v.21 no.6 / pp.379-388 / 2018
  • In this study, we propose a new approach to implementing autonomous travelling of a mobile robot based on obstacle avoidance and voice commands. Obstacle avoidance technology for mobile robots has been used in a wide range of robotics areas to minimize the risk of collisions. It is mostly applied in transportation systems such as aircraft traffic control and autonomous cars. Collision avoidance is an important requirement in mobile robot systems, all of which feature some kind of obstacle detection technique in order to avoid collisions. In this paper, the reliability of voice command and obstacle avoidance for autonomous travelling of a two-wheeled mobile robot, intended for application to the manufacturing process, is illustrated through simulation and experiments.

Study of Speech Recognition System Operation for Voice-driven UAV Control (음성 기반 무인 항공기 제어를 위한 음성인식 시스템 운용 체계 연구)

  • Park, Jeong-Sik
    • Journal of the Korean Society for Aeronautical & Space Sciences / v.47 no.3 / pp.212-219 / 2019
  • As unmanned aerial vehicles (UAVs) have come into use for military operations, efficient ways of controlling them have become necessary. In particular, in military environments that require rapid command operation, speech recognition based UAV control is essential as an alternative to the conventional console-based approach. However, this novel approach has not yet been actively studied. In this study, we introduce efficient ways of operating a speech recognition system for voice-driven UAV control, focusing on mission command control from a manned aircraft rather than from a ground control center. We propose an efficient system operation scheme for UAV control through cooperation between the aircraft and the UAV, and verify its efficiency through speech recognition experiments.

Probabilistic Neural Network Based Learning from Fuzzy Voice Commands for Controlling a Robot

  • Jayawardena, Chandimal;Watanabe, Keigo;Izumi, Kiyotaka
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 2004.08a / pp.2011-2016 / 2004
  • The study of human-robot communication is one of the most important research areas. Among the various communication media, any useful law found in voice communication between humans is also significant for human-robot interaction. The control strategy of most such systems available at present is on/off control: these robots activate a function if a particular word or phrase associated with that function is recognized in the user's utterance. Recently, there has been some research on controlling robots using information-rich fuzzy commands such as "go little slowly". However, although those works considered voice command interpretation, they did not address learning from such commands. In this paper, learning from such information-rich voice commands for controlling a robot is studied. The new concepts of the coach-player model and the sub-coach are proposed and demonstrated on a PA-10 redundant manipulator.
