• Title/Summary/Keyword: voice image

Search Result 296, Processing Time 0.023 seconds

Implementation of a Refusable Human-Robot Interaction Task with Humanoid Robot by Connecting Soar and ROS (Soar (State Operator and Result)와 ROS 연계를 통해 거절가능 HRI 태스크의 휴머노이드로봇 구현)

  • Dang, Chien Van;Tran, Tin Trung;Pham, Trung Xuan;Gil, Ki-Jong;Shin, Yong-Bin;Kim, Jong-Wook
    • The Journal of Korea Robotics Society
    • /
    • v.12 no.1
    • /
    • pp.55-64
    • /
    • 2017
  • This paper proposes combination of a cognitive agent architecture named Soar (State, operator, and result) and ROS (Robot Operating System), which can be a basic framework for a robot agent to interact and cope with its environment more intelligently and appropriately. The proposed Soar-ROS human-robot interaction (HRI) agent understands a set of human's commands by voice recognition and chooses to properly react to the command according to the symbol detected by image recognition, implemented on a humanoid robot. The robotic agent is allowed to refuse to follow an inappropriate command like "go" after it has seen the symbol 'X' which represents that an abnormal or immoral situation has occurred. This simple but meaningful HRI task is successfully experimented on the proposed Soar-ROS platform with a small humanoid robot, which implies that extending the present hybrid platform to artificial moral agent is possible.

A Study on Traffic Light Detection (TLD) as an Advanced Driver Assistance System (ADAS) for Elderly Drivers

  • Roslan, Zhafri Hariz;Cho, Myeon-gyun
    • International Journal of Contents
    • /
    • v.14 no.2
    • /
    • pp.24-29
    • /
    • 2018
  • In this paper, we propose an efficient traffic light detection (TLD) method as an advanced driver assistance system (ADAS) for elderly drivers. Since an increase in traffic accidents is associated with the aging population and an increase in elderly drivers causes a serious social problem, the provision of ADAS for older drivers via TLD is becoming a necessary(Ed: verify word choice: necessary?) public service. Therefore, we propose an economical TLD method that can be implemented with a simple black box (built in camera) and a smartphone in the near future. The system utilizes a color pre-processing method to differentiate between the stop and go signals. A mathematical morphology algorithm is used to further enhance the traffic light detection and a circular Hough transform is utilized to detect the traffic light correctly. From the simulation results of the computer vision and image processing based on a proposed algorithm on Matlab, we found that the proposed TLD method can detect the stop and go signals from the traffic lights not only in daytime, but also at night. In the future, it will be possible to reduce the traffic accident rate by recognizing the traffic signal and informing the elderly of how to drive by voice.

Senior Emergency Management System Using Self-Learning Information Analysis (자가 학습 행동 분석 기반의 시니어 응급관리시스템)

  • Lee, Duk-Hee;Lee, Young-Sik;Kim, Chong-Kyen;Choi, Chul-Jae
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.16 no.5
    • /
    • pp.1011-1018
    • /
    • 2021
  • With the continuous increase of the senior population, it is necessary to introduce the 4th industrial revolution applied technology into the health and welfare field. In addition, a senior emergency management system centered on Nono Care is established due to the shortage of young people, which requires strategization of a welfare delivery system in which senior colleagues notify emergency relief facilities directly in case of an emergency. In this paper, senior emergency management system is designed to collect and analyze individual activities and inactivity information through senior self-learning through smartphone app and to predict emergency situations with voice and image registration information through smartphone app menu.

Recent Advances in Examination of Vocal Fold Vibration (성대진동검사의 최신 지견)

  • Lee, Jin-Choon;Bae, Inho
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.32 no.1
    • /
    • pp.1-8
    • /
    • 2021
  • Human vocal cords vibrate as quickly as 100-250 times per second, so it is impossible to observe them with normal endoscopic diagnostic equipment. High-speed videolaryngoscopy (HSV) allows the visualization of non-periodic vibratory motion of vocal fold beyond the limitation of videostroboscopy. New developed post-processing methods that converts HSV to two-dimensional videokymography (2D VKG) using U-medical image-processing software can provide quantitative information on vocal fold mucosa vibration. Multifunctional laryngeal examination system is composed of 3 kinds of examinations such as HSV, 2D scanning digital kymography (2D DKG) and line scanning digital kymography (DKG). Evaluation of entire vocal cord vibratory pattern in each cord is possible using 2D DKG and a faster and more reliable quantitative information can be obtained. As this system is used in clinical and research, it is expected to bring much advances to the diagnosis of voice disorders. In this review, I will introduce the principles and advantages on examination of the vocal fold vibration, which is in the spotlight recently, and proceed with the literature review.

Expression Analysis of Schizophrenia Symptoms : Focusing on the Movies and (조현병 증상의 표현 분석 : 영화 <더 보이스>,<블랙 스완>을 중심으로)

  • Choi, Ji-Won
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.9
    • /
    • pp.510-519
    • /
    • 2022
  • This study aims to examine how delusions and hallucinations are expressed as representative symptoms of schizophrenia in the two films and . We look at how the two films show the appearance and overall image of the schizophrenic patient, the emotional expression of the schizophrenic patient, and how family and colleagues express the perceptions of the schizophrenic patient's symptoms. Finally, we will examine the perception of treatment measures for schizophrenia. Comprehensively, through this process, the schizophrenia symptoms expressed in the two films will have some effect on social perception of schizophrenia and the social implications of therapeutic intervention for schizophrenia recovery.

Structural live load surveys by deep learning

  • Li, Yang;Chen, Jun
    • Smart Structures and Systems
    • /
    • v.30 no.2
    • /
    • pp.145-157
    • /
    • 2022
  • The design of safe and economical structures depends on the reliable live load from load survey. Live load surveys are traditionally conducted by randomly selecting rooms and weighing each item on-site, a method that has problems of low efficiency, high cost, and long cycle time. This paper proposes a deep learning-based method combined with Internet big data to perform live load surveys. The proposed survey method utilizes multi-source heterogeneous data, such as images, voice, and product identification, to obtain the live load without weighing each item through object detection, web crawler, and speech recognition. The indoor objects and face detection models are first developed based on fine-tuning the YOLOv3 algorithm to detect target objects and obtain the number of people in a room, respectively. Each detection model is evaluated using the independent testing set. Then web crawler frameworks with keyword and image retrieval are established to extract the weight information of detected objects from Internet big data. The live load in a room is derived by combining the weight and number of items and people. To verify the feasibility of the proposed survey method, a live load survey is carried out for a meeting room. The results show that, compared with the traditional method of sampling and weighing, the proposed method could perform efficient and convenient live load surveys and represents a new load research paradigm.

A Multi-Sensor Module of Snake Robot for Searching Survivors in Narrow Space (협소 공간 생존자 탐색을 위한 뱀형 로봇의 다중 센서 모듈)

  • Kim, Sungjae;Shin, Dong-Gwan;Pyo, Juhyun;Shin, Juseong;Jin, Maolin;Suh, Jinho
    • The Journal of Korea Robotics Society
    • /
    • v.16 no.4
    • /
    • pp.291-298
    • /
    • 2021
  • In this study, we present a multi-sensor module for snake robot searching survivors in a narrow space. To this end, we integrated five sensor systems by considering the opinions of the first responders: a gas sensor to detect CO2 gases from the exhalation of survivors, a CMOS camera to provide the image of survivors, an IR camera to see in the dark & smoky environment, two microphones to detect the voice of survivors, and an IMU to recognize the approximate location and direction of the robot and survivors. Furthermore, we integrated a speaker into the sensor module system to provide a communication channel between the first responders and survivors. To integrated all these mechatronics systems in a small, compact snake head, we optimized the positions of the sensors and designed a stacked structure for the whole system. We also developed a user-friendly GUI to show the information from the proposed sensor systems visually. Experimental results verified the searching function of the proposed sensor module system.

CutPaste-Based Anomaly Detection Model using Multi Scale Feature Extraction in Time Series Streaming Data

  • Jeon, Byeong-Uk;Chung, Kyungyong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.8
    • /
    • pp.2787-2800
    • /
    • 2022
  • The aging society increases emergency situations of the elderly living alone and a variety of social crimes. In order to prevent them, techniques to detect emergency situations through voice are actively researched. This study proposes CutPaste-based anomaly detection model using multi-scale feature extraction in time series streaming data. In the proposed method, an audio file is converted into a spectrogram. In this way, it is possible to use an algorithm for image data, such as CNN. After that, mutli-scale feature extraction is applied. Three images drawn from Adaptive Pooling layer that has different-sized kernels are merged. In consideration of various types of anomaly, including point anomaly, contextual anomaly, and collective anomaly, the limitations of a conventional anomaly model are improved. Finally, CutPaste-based anomaly detection is conducted. Since the model is trained through self-supervised learning, it is possible to detect a diversity of emergency situations as anomaly without labeling. Therefore, the proposed model overcomes the limitations of a conventional model that classifies only labelled emergency situations. Also, the proposed model is evaluated to have better performance than a conventional anomaly detection model.

Analyzing performance of time series classification using STFT and time series imaging algorithms

  • Sung-Kyu Hong;Sang-Chul Kim
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.4
    • /
    • pp.1-11
    • /
    • 2023
  • In this paper, instead of using recurrent neural network, we compare a classification performance of time series imaging algorithms using convolution neural network. There are traditional algorithms that imaging time series data (e.g. GAF(Gramian Angular Field), MTF(Markov Transition Field), RP(Recurrence Plot)) in TSC(Time Series Classification) community. Furthermore, we compare STFT(Short Time Fourier Transform) algorithm that can acquire spectrogram that visualize feature of voice data. We experiment CNN's performance by adjusting hyper parameters of imaging algorithms. When evaluate with GunPoint dataset in UCR archive, STFT(Short-Time Fourier transform) has higher accuracy than other algorithms. GAF has 98~99% accuracy either, but there is a disadvantage that size of image is massive.

A Study on Color Image of TV News Anchor Woman's Jackets (TV 뉴스 여성앵커 재킷의 색상 이미지 연구)

  • Lee, Eun-Kyung
    • Korean Journal of Human Ecology
    • /
    • v.19 no.1
    • /
    • pp.149-156
    • /
    • 2010
  • TV news anchor woman's appearance, voice, expression, and clothing, etc., have an influence on the reliability of the article to be reported. Among these, clothing is the most crucial factor in forming an anchor woman's image, especially the clothing color factor. This study is aimed at providing the basic foundation for anchor woman when they select the clothing color by analyzing the clothing color image on the screen. For this purpose, the KBS and MBC 9 o'clock news desk and SBS 8 o'clock news of the local major news programs were selected. With the collection of 300 pieces of news clips related to anchor woman's clothing from January to December 2008, they were classified into F/W seasons and analyzed by the clothing color. The surveying method of clothing color was to capture the anchor woman's clothing among the news clips, then pick the representing color by applying Adobe Photoshop, and researching the formed $L^*a^*b^*$ value of color chips. The surveyed color was transformed into value of distant cell, H V/C, and the results were analyzed. As a result, it showed that the White system for anchor woman's clothing during the S/S seasons is most frequently picked, followed by the Red system. In F/W seasons, Gray system is the most favored, then White and Red, respectively. It was revealed that the most frequently selected colors for upper-wear by anchor women in the three broadcasting stations was an achromatic color, such as White or Gray, and then the chromatic color, Red. It shows that there is no big difference in season. The Inner-wear color matched the jackets which were also achromatic in color, white and black being the most favored in the S/S seasons, and in the case of chromatic colors, Red was the most favored. In addition to this, identical coloration with jacket, coloration with similar color, or single color as clothing color were no less frequently adopted. During the F/W seasons, identical coloration accounts for 26%, the most popular colored being White and Red. It was found that the coloration with achromatic colors are highly favored in the three major broadcasting stations alike.