• Title/Summary/Keyword: voice image

Green Six Sigma for Green Growth Implementation (녹색성장 실행을 위한 그린 6시그마)

  • Kim, Dong-Chun;Hong, Sung-Hoon;Shin, Wan-Seon
    • Journal of Korean Society for Quality Management / v.38 no.4 / pp.521-530 / 2010
  • Global regulatory pressure related to climate change and environmental responsibility is pushing companies to find the best way to sustain continuous business growth. Inadequate management of environmental issues is bad for business: it damages brand image and causes unnecessary losses and environmental preservation costs. For this reason, environmentally conscious green business growth has been recognized as an essential requirement for a company to stay in business. Many companies are looking for green business opportunities that improve both their environmental and financial results, and are struggling with how green fits into their business. In this paper, Green Six Sigma, an environmentally conscious Six Sigma methodology, is presented as a way to find solutions for green growth. Six Sigma is known as a disciplined, data-driven approach and methodology for achieving world-class performance in any process, from manufacturing to transactional. Chronologically, Six Sigma has evolved from Motorola's quality-oriented methodology to GE's cost-oriented lean approach, and is now being developed as an environment-oriented green growth approach. There is no doubt that Green Six Sigma, as an engine of green growth, is a powerful tool for achieving competitive business performance and reducing environmental impact.

Synthesis of Expressive Talking Heads from Speech with Recurrent Neural Network (RNN을 이용한 Expressive Talking Head from Speech의 합성)

  • Sakurai, Ryuhei;Shimba, Taiki;Yamazoe, Hirotake;Lee, Joo-Ho
    • The Journal of Korea Robotics Society / v.13 no.1 / pp.16-25 / 2018
  • A talking head (TH) is a face animation of an utterance generated from text and voice input. In this paper, we propose a method for generating a TH with facial expression and intonation from speech input only. Generating a TH from speech can be regarded as a regression problem from an acoustic feature sequence to a facial code sequence, where a facial code is a low-dimensional vector representation that can efficiently encode and decode a face image. This regression was modeled with a bidirectional RNN and trained on the SAVEE database, a database of frontal utterance face animations. The proposed method is able to generate a TH with facial expression and intonation by using acoustic features such as MFCC, dynamic elements of MFCC, energy, and F0. In the experiments, the configuration that used BLSTM layers for the first and second layers of the bidirectional RNN predicted the facial code best. For the evaluation, a questionnaire survey was conducted with 62 persons who watched TH animations generated by the proposed method and by the previous method. As a result, 77% of the respondents answered that the TH generated by the proposed method matched the speech well.

Smart Portable Navigation System Development and Implementation of 1:N service for Visually impaired person (Smart Portable Navigation System 개발 및 1:N 서비스 구현)

  • Kim, Jae-Kyung;Seo, Jae-Gil;Kim, Young-Kil
    • Journal of the Korea Institute of Information and Communication Engineering / v.16 no.11 / pp.2424-2430 / 2012
  • Current navigation systems for visually impaired persons have a short, limited communication distance and cannot receive enough information from the visually impaired person for an assistant to help directly. In addition, because paths are dangerous and incomplete for visually impaired persons, moving with a white cane is still inconvenient and dangerous. To solve this problem, we implement communication that can send and receive video, voice, and location information between the visually impaired person's Smart Portable Navigation System and the assistant's PC.

A Study on the Design of Cyber lecture Component (가상강의 Component 설계에 관한 연구)

  • 강정배;김선경
    • Proceedings of the Korea Society for Industrial Systems Conference / 2002.11a / pp.171-177 / 2002
  • E-learning is a main modern teaching method that grew out of the concept of remote education. This research aims to propose a cyber education library system and to design a cyber education component that serves as a basis for an e-learning system. The cyber education library is a storage system for cyber lectures that can supply high-quality data to developers who need it. The cyber education component consists of five categories: text, voice, image, animation, and Flash. By using this system, developers can save time and effort in developing educational content. The system also helps students, who can access various lecture data on a given subject and select the best fit for them.

2-Layer Fuzzy Controller for Behavior Control of Mobile Robot (이동로봇의 행동제어를 위한 2-Layer Fuzzy Controller)

  • Sim, Kwee-Bo;Byun, Kwang-Sub;Park, Chang-Hyun
    • Journal of the Korean Institute of Intelligent Systems / v.13 no.3 / pp.287-292 / 2003
  • Robot capabilities are becoming more varied and complex. A robot uses distance, image, and voice data to sense its surroundings. This paper proposes a 2-layer fuzzy controller as an algorithm that controls a robot using information from various sensors. For obstacle avoidance, it uses many range finders and classifies them into three parts (front, left, right). In the first step, each of three sub-controllers executes fuzzy inference. In the second step, the controller executes combined control by combining the outputs of the three sub-controllers. The paper compares the 2-layer fuzzy controller with a hierarchical fuzzy controller that has an analogous structure. The performance of the 2-layer fuzzy controller is confirmed by applying it to robot following in simulation and in a real experiment.
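
The two-step structure described in this abstract (three per-direction sub-controllers, then a combining step) might be sketched roughly as follows. This is only an illustration under stated assumptions: the membership function `near_far`, its 0.2 and 1.0 limits, the two rules, and the arithmetic used in the second layer are all hypothetical and are not taken from the paper.

```python
def near_far(distance, near=0.2, far=1.0):
    """Membership degrees (mu_near, mu_far) for one distance reading.

    The linear ramp and the 0.2 / 1.0 limits are illustrative assumptions.
    """
    if distance <= near:
        mu_near = 1.0
    elif distance >= far:
        mu_near = 0.0
    else:
        mu_near = (far - distance) / (far - near)
    return mu_near, 1.0 - mu_near


def sub_controller(distances):
    """Layer 1: fuzzy inference for one sensor group (front, left, or right).

    Hypothetical rules: IF near THEN urgency = 1; IF far THEN urgency = 0.
    The result is defuzzified as a weighted average and lies in [0, 1].
    """
    mu_near, mu_far = near_far(min(distances))
    return (mu_near * 1.0 + mu_far * 0.0) / (mu_near + mu_far)


def two_layer_controller(front, left, right):
    """Layer 2: combine the three sub-controller outputs into (speed, turn).

    The combination below is plain arithmetic chosen for illustration only.
    """
    u_front = sub_controller(front)
    u_left = sub_controller(left)
    u_right = sub_controller(right)
    speed = 1.0 - u_front    # slow down as the front obstacle gets closer
    turn = u_left - u_right  # positive: steer right, away from a left obstacle
    return speed, turn


# Example: free space ahead, obstacle close on the left.
print(two_layer_controller(front=[2.0, 3.0], left=[0.1], right=[2.0]))
```

Each sensor group is reduced to a single urgency value before anything is combined, so the second layer only has to deal with three inputs regardless of how many range finders feed each group.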

Monosyllable Speech Recognition through Facial Movement Analysis (안면 움직임 분석을 통한 단음절 음성인식)

  • Kang, Dong-Won;Seo, Jeong-Woo;Choi, Jin-Seung;Choi, Jae-Bong;Tack, Gye-Rae
    • The Transactions of The Korean Institute of Electrical Engineers / v.63 no.6 / pp.813-819 / 2014
  • The purpose of this study was to extract accurate parameters of facial movement features using a 3-D motion capture system for lip-reading-based speech recognition. Instead of features obtained from traditional camera images, the 3-D motion system was used to obtain quantitative data on actual facial movements and to analyze 11 variables that exhibit particular patterns, such as nose, lip, jaw, and cheek movements, in monosyllable vocalizations. Fourteen subjects, all in their 20s, were asked to vocalize 11 types of Korean vowel monosyllables three times each with 36 reflective markers on their faces. The obtained facial movement data were then converted into 11 parameters and presented as patterns for each monosyllable vocalization. The parameter patterns were learned and recognized for each monosyllable with a speech recognition algorithm based on a Hidden Markov Model (HMM) and the Viterbi algorithm. The recognition accuracy for the 11 monosyllables was 97.2%, which suggests the possibility of Korean speech recognition through quantitative facial movement analysis.
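
For readers unfamiliar with the decoding step named in this abstract, a minimal Viterbi sketch over a toy HMM is given below. The two hidden states, the two observation symbols, and every probability are made-up values for illustration; they are not the paper's 11 facial-movement parameters or its trained model.

```python
import math


def viterbi(observations, states, start_p, trans_p, emit_p):
    """Return (most likely state path, log probability of that path)."""
    # best[t][s] = (log probability of the best path ending in s at time t,
    #               previous state on that path)
    first = observations[0]
    best = [{s: (math.log(start_p[s]) + math.log(emit_p[s][first]), None)
             for s in states}]
    for obs in observations[1:]:
        column = {}
        for s in states:
            prev, score = max(
                ((p, best[-1][p][0] + math.log(trans_p[p][s])) for p in states),
                key=lambda item: item[1],
            )
            column[s] = (score + math.log(emit_p[s][obs]), prev)
        best.append(column)
    # Backtrack from the best final state.
    last = max(states, key=lambda s: best[-1][s][0])
    path = [last]
    for t in range(len(best) - 1, 0, -1):
        path.append(best[t][path[-1]][1])
    path.reverse()
    return path, best[-1][last][0]


# Toy model (all values hypothetical): hidden mouth state vs. observed lip width.
states = ["open", "closed"]
start_p = {"open": 0.5, "closed": 0.5}
trans_p = {"open": {"open": 0.7, "closed": 0.3},
           "closed": {"open": 0.3, "closed": 0.7}}
emit_p = {"open": {"wide": 0.9, "narrow": 0.1},
          "closed": {"wide": 0.2, "narrow": 0.8}}

path, log_p = viterbi(["wide", "wide", "narrow"], states, start_p, trans_p, emit_p)
print(path)  # ['open', 'open', 'closed']
```

Working in log probabilities avoids numerical underflow on long observation sequences, which is the usual reason to prefer sums of logs over products here.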

Fast Link-Setup Protocol for Wireless Multimedia Sensor Networks (무선 멀티미디어 센서 네트워크를 위한 고속 링크 설정 프로토콜)

  • Pak, Wooguil
    • The Journal of Korean Institute of Communications and Information Sciences / v.39C no.3 / pp.247-254 / 2014
  • For a wireless multimedia sensor network (WMSN), it is crucial to select appropriate channels to deliver the multimedia data streams generated periodically or continuously by image and voice sensors. Although most previous studies assume that fixed channels are used for wireless sensor networks, this assumption limits support for various application areas. In this paper, we apply link-setup algorithms developed for wireless cognitive radio networks to search for common channels between two nodes without a common control channel. We also show that these algorithms suffer serious performance degradation as the total number of used channels increases, and then propose a new link-setup algorithm to resolve the problem. The proposed algorithm shows 30% higher performance than existing algorithms.

A Study on the Framework Construction of Disaster Monitoring and Transmitting System based on Smart-Phone (스마트 폰(Smart-Phone)기반의 재난 감시 및 상황전달시스템 프레임워크(Framework) 구축에 관한 연구)

  • Jeong, Duk-Hoon;Min, Geum-Young;An, Chang-Keun;Lee, Hoon-Seok
    • Journal of the Korea Safety Management & Science / v.13 no.2 / pp.31-42 / 2011
  • Smartphones are used in the disaster management field because they can deliver disaster information to a large population simultaneously and quickly, and can provide accurate information through situation-based services using LBS (Location Based Service). To examine how smartphones can be used for collecting and disseminating disaster information, this study proposes a framework that connects smartphones loaded with a disaster-reporting application. The disaster monitoring and situation dissemination system framework is composed of four parts. First, a smartphone application takes as input image, video, voice, and text information and the location of the disaster. Second, the disaster report reception and situation dissemination server receives the information, saves it in the DB, and sends it out via smartphone SMS. Third, a disaster information database stores the reports. Fourth, a disaster management web service displays the disaster report and management information on a 2D GIS, supports the decision on whether to manage the event as a disaster, and disseminates the situation.

Implementation of a Refusable Human-Robot Interaction Task with Humanoid Robot by Connecting Soar and ROS (Soar (State Operator and Result)와 ROS 연계를 통해 거절가능 HRI 태스크의 휴머노이드로봇 구현)

  • Dang, Chien Van;Tran, Tin Trung;Pham, Trung Xuan;Gil, Ki-Jong;Shin, Yong-Bin;Kim, Jong-Wook
    • The Journal of Korea Robotics Society / v.12 no.1 / pp.55-64 / 2017
  • This paper proposes a combination of a cognitive agent architecture named Soar (State, Operator, and Result) and ROS (Robot Operating System), which can serve as a basic framework for a robot agent to interact and cope with its environment more intelligently and appropriately. The proposed Soar-ROS human-robot interaction (HRI) agent, implemented on a humanoid robot, understands a set of human commands by voice recognition and chooses how to react to a command according to the symbol detected by image recognition. The robotic agent is allowed to refuse to follow an inappropriate command such as "go" after it has seen the symbol 'X', which represents that an abnormal or immoral situation has occurred. This simple but meaningful HRI task was successfully tested on the proposed Soar-ROS platform with a small humanoid robot, which implies that extending the present hybrid platform to an artificial moral agent is possible.

A Study on Traffic Light Detection (TLD) as an Advanced Driver Assistance System (ADAS) for Elderly Drivers

  • Roslan, Zhafri Hariz;Cho, Myeon-gyun
    • International Journal of Contents / v.14 no.2 / pp.24-29 / 2018
  • In this paper, we propose an efficient traffic light detection (TLD) method as an advanced driver assistance system (ADAS) for elderly drivers. Since the aging population is associated with an increase in traffic accidents and the growing number of elderly drivers is causing a serious social problem, the provision of ADAS for older drivers via TLD is becoming a necessary public service. Therefore, we propose an economical TLD method that can be implemented with a simple black box (built-in camera) and a smartphone in the near future. The system uses a color pre-processing method to differentiate between the stop and go signals. A mathematical morphology algorithm is used to further enhance the traffic light detection, and a circular Hough transform is used to detect the traffic light correctly. From simulation results of the proposed computer vision and image processing algorithm in MATLAB, we found that the proposed TLD method can detect the stop and go signals of traffic lights not only in the daytime but also at night. In the future, it will be possible to reduce the traffic accident rate by recognizing the traffic signal and informing elderly drivers by voice of how to drive.
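
As a rough illustration of the color pre-processing step mentioned in this abstract, the sketch below labels pixels as stop (red) or go (green) by thresholding in HSV space and takes a majority vote. The use of HSV, the hue ranges, and the saturation and brightness thresholds are assumptions made for this example; the abstract does not state which color space or thresholds the authors used, and the morphology and circular Hough transform stages are not shown.

```python
import colorsys


def classify_pixel(r, g, b, min_sat=0.5, min_val=0.5):
    """Label one 8-bit RGB pixel as 'stop', 'go', or None.

    Hue ranges and the saturation/value thresholds are illustrative guesses.
    """
    h, s, v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
    if s < min_sat or v < min_val:
        return None  # too gray or too dark to be a lit lamp
    hue_deg = h * 360.0
    if hue_deg < 20.0 or hue_deg > 340.0:
        return "stop"  # reddish hues
    if 90.0 <= hue_deg <= 180.0:
        return "go"    # greenish hues
    return None


def classify_signal(pixels):
    """Majority vote over an iterable of (r, g, b) pixels; None on a tie."""
    counts = {"stop": 0, "go": 0}
    for r, g, b in pixels:
        label = classify_pixel(r, g, b)
        if label is not None:
            counts[label] += 1
    if counts["stop"] == counts["go"]:
        return None
    return max(counts, key=counts.get)


# Example: two saturated red pixels and one dark background pixel.
print(classify_signal([(255, 0, 0), (250, 10, 10), (20, 20, 20)]))  # stop
```

Thresholding on hue rather than on raw RGB values is one common way to reduce sensitivity to brightness changes, which is relevant to the day and night conditions the abstract mentions.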