• Title/Abstract/Keywords: voice problem

Customer Satisfaction Measurement Model Based on QFD

  • Liu, Yumin;Xu, Jichao
    • International Journal of Quality Innovation / v.4 no.2 / pp.101-122 / 2003
  • With the development of the American Customer Satisfaction Index (ACSI), research on customer satisfaction measurement and evaluation methods has become significant in the last decade. Most international customer satisfaction barometers or indices have evolved from the cause-and-effect relationship model of the ACSI. Of critical importance to the validity of customer satisfaction indices is how to construct a measurement attribute or indicator model and provide an effective implementation method. Quality Function Deployment (QFD) is a very useful tool for translating the customer voice into product design through quality engineering. In fact, it is a methodology for measuring and analyzing evaluation indicators by their relationship matrix. In this paper, we integrate the framework of QFD into the measurement problem of customer satisfaction and develop a new multi-phase QFD model for evaluation of the Customer Satisfaction Index (CSI). From the houses of quality in this model, the evaluation indicators that affect the customer's global satisfaction are identified by means of their relationship matrix. Then the evaluation indicator hierarchy and its measurement method for the customer satisfaction index are presented graphically. Furthermore, survey data from the Chinese automobile maintenance sector and a relevant case study are used to show how the QFD model is implemented to measure and analyze customer satisfaction.
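
As an illustration of how a relationship matrix can rank evaluation indicators, the sketch below uses the common house-of-quality weighted sum. It is not the authors' multi-phase model: the function name, the example weights, and the 9/3/1 strength scale are assumptions chosen only for the example.

```python
def indicator_importance(weights, relationship):
    """Normalized importance of each indicator (column) in a house of quality.

    weights[i]         -- importance of customer requirement i (assumed given)
    relationship[i][j] -- strength linking requirement i to indicator j
                          (the 9/3/1/0 scale used below is an assumption)
    """
    n_indicators = len(relationship[0])
    # Raw score of indicator j: sum over requirements of weight * strength.
    raw = [
        sum(w * row[j] for w, row in zip(weights, relationship))
        for j in range(n_indicators)
    ]
    total = sum(raw)
    # Normalize so the scores sum to 1 and can be compared as shares.
    return [r / total for r in raw]


# Hypothetical example: two customer requirements, three indicators.
example = indicator_importance([5, 3], [[9, 1, 0], [3, 9, 1]])
```

Here the raw column scores are 54, 32, and 3, so the first indicator carries the largest share of the total.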

A Study on the Educational Uses of Smart Speaker (스마트 스피커의 교육적 활용에 관한 연구)

  • Chang, Jiyeun
    • Journal of the Korea Convergence Society / v.10 no.11 / pp.33-39 / 2019
  • Edutech, which combines education and information technology, is in the spotlight. Core technologies of the 4th Industrial Revolution have been actively used in education. Students use an AI-based learning platform to self-diagnose their needs and get personalized training online through a cloud learning platform. Recently, a new educational medium called the smart speaker, which combines artificial intelligence technology and voice recognition technology, has emerged and provides various educational services. The purpose of this study is to suggest ways to use smart speakers educationally to overcome the limitations of existing education. To this end, the concept and characteristics of smart speakers were analyzed, and implications were derived by analyzing the content that smart speakers provide. The problems of using smart speakers were also considered.

Audio and Video Bimodal Emotion Recognition in Social Networks Based on Improved AlexNet Network and Attention Mechanism

  • Liu, Min;Tang, Jun
    • Journal of Information Processing Systems / v.17 no.4 / pp.754-771 / 2021
  • In the task of continuous dimensional emotion recognition, the parts that highlight emotional expression are not the same in each modality, and the influence of each modality on the emotional state also differs. Therefore, this paper studies the fusion of the two most important modalities in emotion recognition (voice and visual expression) and proposes a bimodal emotion recognition method that combines an improved AlexNet network with an attention mechanism. After simple preprocessing of the audio and video signals, the first step is to use prior knowledge to extract audio features. Then, facial expression features are extracted by the improved AlexNet network. Finally, the multimodal attention mechanism is used to fuse the facial expression features and audio features, and an improved loss function is used to address the missing-modality problem, so as to improve the robustness of the model and the performance of emotion recognition. The experimental results show that the concordance correlation coefficients of the proposed model in the two dimensions of arousal and valence were 0.729 and 0.718, respectively, which are superior to those of several comparative algorithms.
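
The concordance correlation coefficient named in this abstract can be computed as in the minimal sketch below, which follows Lin's standard definition with population variances. The function name and the sample values are illustrative assumptions; nothing here reproduces the paper's data or evaluation code.

```python
from statistics import fmean


def concordance_cc(pred, gold):
    """Lin's concordance correlation coefficient for two equal-length sequences.

    Raises ZeroDivisionError if both sequences are constant and equal.
    """
    if len(pred) != len(gold) or len(pred) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    mean_p, mean_g = fmean(pred), fmean(gold)
    # Population (divide-by-n) variances and covariance.
    var_p = fmean([(p - mean_p) ** 2 for p in pred])
    var_g = fmean([(g - mean_g) ** 2 for g in gold])
    cov = fmean([(p - mean_p) * (g - mean_g) for p, g in zip(pred, gold)])
    # Unlike Pearson's r, the squared mean difference penalizes a constant offset.
    return 2 * cov / (var_p + var_g + (mean_p - mean_g) ** 2)


# Hypothetical predictions that track the labels but are shifted by +1.
shifted = concordance_cc([2, 3, 4], [1, 2, 3])
```

The shifted example scores 4/7 even though its Pearson correlation is 1, which is why the coefficient suits continuous arousal and valence prediction, where offsets matter.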

Zigbee-based Local Army Strategy Network Configurations for Multimedia Military Service

  • Je, Seung-Mo
    • Journal of Multimedia Information System / v.6 no.3 / pp.131-138 / 2019
  • With the rapid evolution of communication technology, it has become possible to overcome, to some extent, the spatial and temporal limitations faced by humans. Furthermore, the quality of personal life was revolutionized with the emergence of the personal communication device commonly known as the smart phone. In terms of defense networks, however, due to restrictions from the military and security perspectives, the use of smart phones has been prohibited and controlled in the army; thus, they are not yet being used for any defense strategy purposes. Although smart phones are currently being considered for military communication, the difficulties of network configuration and the high cost of the necessary communication devices mean that the main tools of communication between soldiers are limited to flag, voice, or hand signals, which are all very primitive. Although these primitive tools can be very effective in certain cases, they cannot overcome temporal and spatial limitations. Likewise, communication efficiency can vary significantly depending on the communication skills of each individual. As the term of military service continues to be shortened, however, communication whose efficiency varies with the skill level of each individual newly added to the military is not desirable at all. To address this problem, it is essential to prepare an intuitive network configuration that soldiers can learn to use in a short period of time and that allows the strategy network to be configured easily at a low cost while maintaining its security. Therefore, in this article, the author proposes a Zigbee-based local strategic network using Opnet and performs a simulation accordingly.

Real-time Multi-device Control System Implementation for Natural User Interactive Platform

  • Kim, Myoung-Jin;Hwang, Tae-min;Chae, Sung-Hun;Kim, Min-Joon;Moon, Yeon-Kug;Kim, SeungJun
    • Journal of Internet Computing and Services / v.23 no.1 / pp.19-29 / 2022
  • A natural user interface (NUI) provides a natural motion interface without using a specific device or tool such as a mouse, keyboard, or pen. Recently, as non-contact sensor-based interaction technologies for recognizing human motion, gestures, voice, and gaze have been actively studied, an environment has emerged that can provide more diverse content based on various interaction methods than existing methods allow. However, as the number of sensor devices is rapidly increasing, a system that uses many sensors can suffer from a lack of computational resources. To address this problem, we propose a real-time multi-device control system for a natural interactive platform. In the proposed system, we classify devices into two types: HC devices, such as high-end commercial sensors, and LC devices, such as traditional low-cost monitoring sensors. We adopt a device manager for each to control them efficiently. We demonstrate that the proposed system works properly with user behavior such as gestures, motions, gazes, and voices.

Effect of Collaborative Problem-Solving for Competency Instruction Strategy Using Science Reading Text on Elementary School Students' Science Reading Ability (과학 읽기 자료를 이용한 협력적 문제해결 중심 과학 수업이 초등학교 학생들의 과학 읽기 능력에 미치는 영향)

  • Park, Jihun;Jun, Jaekyoung;Lee, Sujin;Nam, Jeonghee
    • Journal of Korean Elementary Science Education / v.41 no.4 / pp.642-657 / 2022
  • This study aimed to investigate how elementary school students' science reading ability is influenced by collaborative problem-solving for competency instruction strategy using science reading text. This study recruited two groups of elementary students in fifth grade. The experimental group underwent an instruction strategy using science reading text, while the comparative group experienced a science class using a textbook. Afterward, data from the science reading ability tests, voice recordings of the discussion process involving each group, and class videos were collected and analyzed. The results showed that science classes that used collaborative problem-solving for their competency instruction strategy via science reading text were effective in enhancing elementary school students' science reading ability. Meanwhile, the science reading ability test results indicated that the experimental group had statistically higher total scores than the comparative group in the three subelements, especially "introspection and evaluation" and "integration and interpretation" owing to their significant improvement in high-level cognitive processes. In these classes, the students read the materials that the teacher provided, participated in the discussion based on what they have read, and had the chance to reflect on their reading processes. Overall, students' science reading ability was enhanced through this process.

Effective Feature Vector for Isolated-Word Recognizer using Vocal Cord Signal (성대신호 기반의 명령어인식기를 위한 특징벡터 연구)

  • Jung, Young-Giu;Han, Mun-Sung;Lee, Sang-Jo
    • Journal of KIISE:Software and Applications / v.34 no.3 / pp.226-234 / 2007
  • In this paper, we develop a speech recognition system using a throat microphone. The use of this kind of microphone minimizes the impact of environmental noise. However, because of the absence of high frequencies and the partial loss of formant frequencies, previous systems developed with such devices have shown a lower recognition rate than systems that use standard microphone signals. This problem has led researchers to use throat microphone signals as supplementary data sources supporting standard microphone signals. In this paper, we present a high-performance ASR system that we developed using only a throat microphone by taking advantage of Korean Phonological Feature Theory and a detailed throat signal analysis. Analyzing the spectrum and the FFT result of the throat microphone signal, we find that the conventional MFCC feature vector that uses a critical pass filter does not characterize the throat microphone signals well. We also describe the conditions that make a feature extraction algorithm best suited for throat microphone signal analysis. The conditions involve (1) a sensitive band-pass filter and (2) use of a feature vector that is suitable for voice/non-voice classification. We experimentally show that the ZCPA algorithm designed to meet these conditions improves the recognizer's performance by approximately 16%. We also find that an additional noise-canceling algorithm such as RASTA yields a further 2% performance improvement.

Efficient Design of a Disaster Broadcasting System using LTE Modem (이동 LTE모뎀을 활용한 재난방송시스템 설계)

  • Moon, Chaeyoung;Kim, Semin;Ryoo, Kwangki
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2018.10a / pp.292-294 / 2018
  • Recently, damage caused by natural disasters such as fires, earthquakes, heavy rain, and heavy snow has been increasing. In addition, traffic accidents due to freezing, fog, and fire in tunnels and on bridges occur frequently. In such a disaster situation, it is very important that the person in charge of managing the facility or area take prompt action. To this end, a disaster broadcasting system is used, but in the existing system, the broadcasting room and the speakers are connected by wire. Also, the person in charge has to be in the broadcasting room to broadcast, which delays the broadcast. In this paper, we design a disaster broadcasting system using an LTE modem. The designed system enables a broadcaster to call the broadcasting system from anywhere using a cellular phone or a public telephone. Broadcasting by telephone is possible only from telephone numbers pre-registered in the system, which the administrator can register or delete. The registered telephone numbers, incoming voice files, and announcement voice for automatic broadcasting are stored in the system's internal SD memory for convenient management. This disaster broadcasting system is expected to contribute to quick and convenient disaster broadcasting.

A Study on Development of Robot for Mutual Communication and Education of Students with Health Impairments (건강장애 학생의 상호소통 및 교육을 위한 로봇 개발에 대한 연구)

  • Ryu, Gun Jae;Kang, Jung Bae;Kim, Chang Geol;Kim, Kyung Sik;Song, Beong Seop
    • Journal of Korea Society of Industrial Information Systems / v.19 no.5 / pp.15-24 / 2014
  • In 2005, the Act on the Promotion of Education for the Handicapped was partially revised so that students with health impairments could receive special education support. Since the amendment, education support systems have been proposed and established so that these students, who are classified as educationally vulnerable, can receive support for free. According to a review of prior studies, since the amendment there have been many studies on the forms of educational service that support these students, and recently much research has investigated their satisfaction with the current services and identified problems. However, these earlier studies stop at identifying problems and are limited in presenting fundamental countermeasures. Therefore, this study sought to understand the meaning of health impairment through prior studies, investigate the forms of the services that currently support these students, and analyze the problems of each service. In addition, a new support system was proposed to solve the identified problems. To confirm the performance of the system, we designed a user satisfaction survey in which each question was rated on a 5-point Likert scale, and we set two tasks, comparing stories and clapping, to improve the quality of users' subjective evaluation of image and voice transmission while using the system. As a result, in the overall evaluation of the robot system, the average score per question was 4.31 points, and the two tasks showed that image and voice data were transmitted effectively.

DNN based Robust Speech Feature Extraction and Signal Noise Removal Method Using Improved Average Prediction LMS Filter for Speech Recognition (음성 인식을 위한 개선된 평균 예측 LMS 필터를 이용한 DNN 기반의 강인한 음성 특징 추출 및 신호 잡음 제거 기법)

  • Oh, SangYeob
    • Journal of Convergence for Information Technology / v.11 no.6 / pp.1-6 / 2021
  • In the field of speech recognition, the use of speech recognition is increasing as DNNs are applied, but the amount of computation for parallel training is larger than that of the conventional GMM, and overfitting occurs when the amount of data is small. To solve this problem, we propose an efficient method for robust voice feature extraction and voice signal noise removal even when the amount of data is small. Speech feature extraction efficiently extracts speech energy by applying the difference in frame energy for speech together with the zero-crossing ratio and level-crossing ratio, which are affected by the speech signal. In addition, the noise of the speech signal is removed with an improved average-prediction LMS filter that loses little speech information while maintaining the intrinsic characteristics of speech during speech signal detection. The improved LMS filter processes noise in the input speech signal by adjusting the active parameter threshold for the input signal. As a result of comparing the method proposed in this paper with the conventional frame energy method, it was confirmed that the error rate at the start point of speech is 7% and the error rate at the end point is improved by 11%.
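
Two of the cues named in this abstract, frame energy difference and the zero-crossing ratio, can be sketched as below. This is a generic textbook formulation, not the author's method: the level-crossing ratio and the LMS filter are omitted, and the function names, frame length, and sample values are assumptions for illustration.

```python
def frame_features(samples, frame_len=4, hop=4):
    """Return a list of (mean energy, zero-crossing ratio) tuples, one per frame.

    frame_len must be at least 2; trailing samples that do not fill a
    whole frame are ignored.
    """
    feats = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        # Count sign changes between neighbouring samples (zero counts as positive).
        crossings = sum(
            1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
        )
        feats.append((energy, crossings / (frame_len - 1)))
    return feats


def energy_deltas(feats):
    """Difference in energy between consecutive frames."""
    return [nxt[0] - cur[0] for cur, nxt in zip(feats, feats[1:])]


# Hypothetical signal: one silent frame followed by one oscillating frame.
signal = [0.0, 0.0, 0.0, 0.0, 1.0, -1.0, 1.0, -1.0]
feats = frame_features(signal)
deltas = energy_deltas(feats)
```

A large positive energy delta together with a change in the zero-crossing ratio is the kind of evidence an endpoint detector can use to mark the start of speech.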