• Title/Summary/Keyword: Human voice

Search Result 358, Processing Time 0.05 seconds

A Mobile Stress Management System utilizing Variable Voice Information According to the Wearing Area

  • Kang, Byeongsoo;Vannroath, Ky;Kang, Hyun-syug
    • Journal of the Korea Society of Computer and Information
    • /
    • v.22 no.6
    • /
    • pp.95-100
    • /
    • 2017
  • Recently, as stress has become a major threat to people's health, there is a growing interest in wearable stress management services for stress relief. In this paper, we developed a wearable device(Care-on) capable of extracting changeable human voice information at each site and a Healthcare App(S-Manager) that enables stress management in real time using the wearable device. It collects and analyzes variable real-time voice information for each part of the person's body. And It also provides the ability to monitor stress conditions in a mobile environment and provide feedback on the analysis results in step by step in the mobile environment. We tested the developed wearable devices and app in a mobile environment and analyzed the results to confirm their usefulness.

A Study on Audio/Voice Color Processing Technique (오디오/음성 컬러 처리 기술 연구)

  • Kim Kwangki;Kim Sang-Jin;Son BeakKwon;Hahn Minsoo
    • Proceedings of the KSPS conference
    • /
    • 2003.05a
    • /
    • pp.153-156
    • /
    • 2003
  • In this paper, we studied advanced audio/ voice information processing techniques, and trying to introduce more human friendly audio/voice. It is just in the beginning stage. Firstly, we approached in well-known time-domain methods such as moving average, differentiation, interpolation, and decimation. Moreover, some variation of them and envelope contour modification are utilized. We also suggested the MOS test to evaluate subjective listening factors. In the long term viewpoint, user's preference, mood, and environmental conditions will be considered and according to them, we hope our future technique can adapt speech and audio signals automatically.

  • PDF

Chest Girth Prediction Method Using Voice Signals Analysis Technology : Focusing on Men in the 20's (음성신호 분석 기술을 이용한 흉위 예측 기법 : 20대 남성을 대상으로)

  • Kim, Bong-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.9
    • /
    • pp.2031-2036
    • /
    • 2012
  • There is body type that physique classified by apparent characteristics as shape of human body. Chest girth circumference and body type statistically has been look into correlative disposition, character etc. In this paper, we carried out study about prediction of chest girth as voice that interrelationship drew to analyze voice of disposition, character etc. in personal character. With this in mind, we measured intensity, spectrum about laughter by chest girth to classify composition group of subjects and then we would like to extract experiment result to predict chest girth by reciprocal comparison.

Surgery of Benign Laryngeal Mucosal Lesions (후두 양성점막 병변의 수술적 치료)

  • Jin, Sung Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.24 no.2
    • /
    • pp.83-87
    • /
    • 2013
  • The term "phonosurgery," coined in the early 1960s, refers to surgical procedures that maintain, restore, or enhance the human voice. Phonosurgery includes phonomicrosurgery (endoscopic microsurgery of the vocal folds), laryngoplastic phonosurgery (open-neck surgery that restructures the cartilaginous framework of the larynx and the soft tissues), laryngeal injection (injection of medications as well as synthetic and organic biologic substances), and reinnervation of the larynx. Phonomicrosurgery is a means of maximally preserving the layered microstructure of the vocal fold, that is, the epithelium and lamina propria. The purpose of the surgery is usually to improve the vibratory characteristics of the layered microstructure of the vocal folds. Phonomicrosurgery has developed from convergence of microlaryngoscopic surgical technique theory and the mucosal wave theory of laryngeal sound production. Improvements in technology (i.e., laryngoscopes, handled instruments, and lasers), which in part arise from developments in more frequently performed minimally invasive surgical procedures, will probably facilitate the next generation of procedural innovations. The best methods of optimizing phonosurgical outcomes include making an accurate diagnosis, completing a comprehensive voice evaluation, providing sufficient preoperative therapy, carefully selecting patients to undergo phonomicrosurgical procedures, and requiring sufficient postoperative rest and therapy. Phonomicrosurgery will continue to evolve as a result of the interdependent collaboration of surgeons with voice scientists, speech pathologist, and other voice professionals.

  • PDF

Design and Implementation of Context-aware Application on Smartphone Using Speech Recognizer

  • Kim, Kyuseok
    • Journal of Advanced Information Technology and Convergence
    • /
    • v.10 no.2
    • /
    • pp.49-59
    • /
    • 2020
  • As technologies have been developing, our lives are getting easier. Today we are surrounded by the new technologies such as AI and IoT. Moreover, the word, "smart" is a very broad one because we are trying to change our daily environment into smart one by using those technologies. For example, the traditional workplaces have changed into smart offices. Since the 3rd industrial revolution, we have used the touch interface to operate the machines. In the 4th industrial revolution, however, we are trying adding the speech recognition module to the machines to operate them by giving voice commands. Today many of the things are communicated with human by voice commands. Many of them are called AI things and they do tasks which users request and do tasks more than what users request. In the 4th industrial revolution, we use smartphones all the time every day from the morning to the night. For this reason, the privacy using phone is not guaranteed sometimes. For example, the caller's voice can be heard through the phone speaker when accepting a call. So, it is needed to protect privacy on smartphone and it should work automatically according to the user context. In this aspect, this paper proposes a method to adjust the voice volume for call to protect privacy on smartphone according to the user context.

Development of An Ergonomic Product Development Process Reflecting Quantified Customer Preference (정량화된 고객 선호도를 체계적으로 반영하기 위한 인간공학적 제품 개발 프로세스)

  • Im, YoungJae;Jung, Eui S.;Park, SungJoon
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.34 no.1
    • /
    • pp.66-78
    • /
    • 2008
  • In the past, Manufacturers used to determine the quality of products, but the trend of today's market becomesmore into customer-driven. As a result, demands from customers are becoming more diverse and complicated,and most companies are obligated to meet their needs. As one of the effort to achieve their satisfaction,companies are now emphasizing activities to find out what customers specifically want and extract voice ofcustomer(VOC). This study attempts to develop an ergonomic product development process as a method tomaximally reflect the VOC. In order to meet this goal, ergonomic design guidelines, which are possible to beclassified according that user's human characteristics, will be recommended. Even now, there are numerousdesign guidelines already existing in the ergonomics literature. However, it is not realistically feasible to reviewall of those guidelines, and some of them are even conflicting with each other. Therefore, in this paper, theproduct development process, which prioritizes the human characteristics that reflect customer needs and appliesthe design guidelines that meet the most important ones, will be suggested. Finally, the research was described toshow the validity of the product development process through an example of a mobile phone development case.

Sound Source Localization and Separation for Emotional Robot (감성로봇을 위한 음원의 위치측정 및 분리)

  • 김경환;김연훈;곽윤근
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.20 no.5
    • /
    • pp.116-123
    • /
    • 2003
  • These days, the researches related with the emotional robots are actively investigated and in progress. And human language, expression, action etc. are merged in the emotional robot to understand the human emotion. However, there are so many sound sources and background noise around the robot, that the robots should be able to separate the mixture of these sound sources into the original sound sources, moreover to understand the meaning of voice of a specific person. Also they should be able to turn or move to the direction of a specific person to observe his expression or action effectively. Until now, the researches on the localization and separation of sound sources have been so theoretical and computative that real-time processing is hardly possible. In this reason for the practical emotional robot, fast computation should be realized by using simple principle. In this paper the methods for detecting the direction of sound sources by using the phase difference between peaks on spectrums, and the separating the sound sources by using fundamental frequency and its overtones of human voice, are proposed. Also by using these methods, it is shown that the effective and real-time localization and separation of sound sources in living room are possible.

Virtual Human Authoring ToolKit for a Senior Citizen Living Alone (독거노인용 가상 휴먼 제작 툴킷)

  • Shin, Eunji;Jo, Dongsik
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.9
    • /
    • pp.1245-1248
    • /
    • 2020
  • Elderly people living alone need smart care for independent living. Recent advances in artificial intelligence have allowed for easier interaction by a computer-controlled virtual human. This technology can realize services such as medicine intake guide for the elderly living alone. In this paper, we suggest an intelligent virtual human and present our virtual human toolkit for controlling virtual humans for a senior citizen living alone. To make the virtual human motion, we suggest our authoring toolkit to map gestures, emotions, voices of virtual humans. The toolkit configured to create virtual human interactions allows the response of a suitable virtual human with facial expressions, gestures, and voice.

Data augmentation in voice spoofing problem (데이터 증강기법을 이용한 음성 위조 공격 탐지모형의 성능 향상에 대한 연구)

  • Choi, Hyo-Jung;Kwak, Il-Youp
    • The Korean Journal of Applied Statistics
    • /
    • v.34 no.3
    • /
    • pp.449-460
    • /
    • 2021
  • ASVspoof 2017 deals with detection of replay attacks and aims to classify real human voices and fake voices. The spoofed voice refers to the voice that reproduces the original voice by different types of microphones and speakers. data augmentation research on image data has been actively conducted, and several studies have been conducted to attempt data augmentation on voice. However, there are not many attempts to augment data for voice replay attacks, so this paper explores how audio modification through data augmentation techniques affects the detection of replay attacks. A total of 7 data augmentation techniques were applied, and among them, dynamic value change (DVC) and pitch techniques helped improve performance. DVC and pitch showed an improvement of about 8% of the base model EER, and DVC in particular showed noticeable improvement in accuracy in some environments among 57 replay configurations. The greatest increase was achieved in RC53, and DVC led to an approximately 45% improvement in base model accuracy. The high-end recording and playback devices that were previously difficult to detect were well identified. Based on this study, we found that the DVC and pitch data augmentation techniques are helpful in improving performance in the voice spoofing detection problem.

An Implementation of Lip Print Recognition system using VHDL (VHDL을 이용한 구순문 인식 시스템의 구현 연구)

  • Choi, Woo-Jin;Chung, Chin-Hyun
    • Proceedings of the KIEE Conference
    • /
    • 1999.07g
    • /
    • pp.2935-2937
    • /
    • 1999
  • The human has recognizable part of body such as a fingerprint, a crimson, a blood vessel. This part has been investigated constantly, its confidence for personal recognition is high. In spite of specialized part of human body, a lip print recognition is developed less than the other physical attribute that is a fingerprint. a voice pattern, a retinal blood-vessel pattern, or a facial recognition. This paper is to implement hardware for lip print recognition system using VHDL.

  • PDF