• Title/Summary/Keyword: Digital Voice

Search Result 386, Processing Time 0.022 seconds

Gender Analysis in Elderly Speech Signal Processing (노인음성신호처리에서의 젠더 분석)

  • Lee, JiYeoun
    • Journal of Digital Convergence
    • /
    • v.16 no.10
    • /
    • pp.351-356
    • /
    • 2018
  • Changes in vocal cords due to aging can change the frequency of speech, and the speech signals of the elderly can be automatically distinguished from normal speech signals through various analyzes. The purpose of this study is to provide a tool that can be easily accessed by the elderly and disabled people who can be excluded from the rapidly changing technological society and to improve the voice recognition performance. In the study, the gender of the subjects was reported as sex analysis, and the number of female and male voice samples was used equally. In addition, the gender analysis was applied to set the voices of the elderly without using voices of all ages. Finally, we applied a review methodology of standards and reference models to reduce gender difference. 10 Korean women and 10 men aged 70 to 80 years old are used in this study. Comparing the F0 value extracted directly with the waveform and the F0 extracted with TF32 and the Wavesufer speech analysis program, Wavesufer analyzed the F0 of the elderly voice better than TF32. However, there is a need for a voice analysis program for elderly people. In conclusions, analyzing the voice of the elderly will improve speech recognition and synthesis capabilities of existing smart medical systems.

The Design and Implement a Healthcare Alert App to Prevent Dementia (치매예방을 위한 헬스케어 알리미 앱 설계 및 구현)

  • Pi, SU-Young
    • Journal of Digital Convergence
    • /
    • v.16 no.10
    • /
    • pp.59-67
    • /
    • 2018
  • There are not that many m-health related services limited to the elderly. Many of the elderly who are at risk of dementia are unfamiliar to smart devices, so it is required to design an user-customized App. Therefore, I design and embody a mobile voice alert integrated app, which enables voice input to increase the accessibility of the elderly, so as to prevent diseases caused by declined cognitive function such as dementia. I conducted interviews and questionnaire after having the students use the app in Lifelong Education Center in H region of Gyeongbuk, and the analysis result has showed the high satisfaction. It is expected that it will be able to play a key role for M-Health service for the elderly since it is possible to prevent dementia through the voice health care alert app. I would like to learn deep learning in the future to predict the life patterns and the possibility of dementia of the elderly.

Design of FIR filter using direct memory access for voice signal processing module in implantable middle ear hearing devices (이식형 인공중이용 음성신호 처리 모듈을 위한 직접 메모리 억세스 기반의 FIR 필터 설계)

  • Kim, Jong-Min;Park, Il-Yong;Yoon, Young-Ho;Kim, Min-Kyu;Lim, Hyung-Gyu;Han, Ji-Hun;Kim, Myoung-Nam;Cho, Jin-Ho
    • Journal of Sensor Science and Technology
    • /
    • v.15 no.4
    • /
    • pp.223-230
    • /
    • 2006
  • An FIR filter for digital voice signal processing has been designed and implemented using a microcontroller in implantable middle ear hearing devices (IMEHDs). The designed digital voice signal processing filter which has fast and accurate filtering operation and controllable filter characteristics has been implemented using a hardware multiplier and a direct memory access (DMA) in the low power microcontroller, MSP430F169. It has been confirmed that each of the implemented 6-orders Remez FIR filters with 1 channel and 2 channels can be applied to the voice signal processing module of IMEHDs based on the evaluation results of the filtering performance experiment.

Study on User Experience design in Gesture Interaction as a Product Trigger - Focusing on Product Design - (제품 트리거로서 행동인식의 사용자 경험 디자인 연구 - 제품디자인을 중심으로 -)

  • Min, Sae-yan;Lee, Cathy Yeonchoo
    • Journal of Digital Convergence
    • /
    • v.17 no.5
    • /
    • pp.379-384
    • /
    • 2019
  • The purpose of this study is to investigate the problems of the rapidly increasing voice interface and to find out what results will be obtained when the new gesture interaction is applied to the product, and to suggest the improvement method for a better user experience. Through the literature review, I have conducted a theoretical review on the changes in the product interface used in the product and the difference between them, and then conducted in-depth interviews on the 20-30 users who used voice recognition as a product trigger. As a result, it was concluded that the decline in the reliability of accuracy leads to a decrease in the preference of voice recognition interactions and an needs of appropriate interface for the functional aspect of non-relavancy in physical distance as a product trigger. This study is meaningful in that it has found a problem with the study of the product trigger interface and suggested improvement measures, and hope to be helpful in follow-up study.

The Voice Template based User Authentication Scheme Suitable for Mobile Commerce Platform (모바일 상거래 플랫폼에 적합한 음성 템플릿 기반의 사용자 인증 기법)

  • Yun, Sung-Hyun;Koh, Hoon
    • Journal of Digital Convergence
    • /
    • v.10 no.5
    • /
    • pp.215-222
    • /
    • 2012
  • A smart phone has functions of both telephone and computer. The wide spread use of smart phones has sharply increased the demand for mobile commerce. The smart phone based mobile services are available anytime, anywhere. In commercial transactions, a digital signature scheme is used to make legally binding signature to prove both integrity of commercial document and verification of the signer. Smart phones are more risky compared with personal computers on the problems of how to protect privacy information. It's also easy to let proxy user to authenticate instead of the smart phone owner. In existing password or token based schemes, the ID is not physically bound to the owner. Thus, those schemes can not solve the problem of proxy authentication. To utilize the smart phone as the platform of mobile commerce, a study on the new type of authentication scheme is needed where the scheme should provide protocol to get legally binding signature and not to authenticate proxy user. In this paper, we create the mobile ID by using both the USIM and voice template of the smart phone owner. We also design and implement the user authentication scheme based on the mobile ID.

A Comparative Study on the Public Speech Spectrum between ROK and USA Politicians (한국과 미국 정치인 대중연설 음성의 스펙트럼 비교 연구)

  • Chung, Eun-Ee;Lee, Sang-Ho
    • Journal of Digital Contents Society
    • /
    • v.17 no.3
    • /
    • pp.143-155
    • /
    • 2016
  • In this study, we focused on the importance of politicians' voices in sending a message. Different factors for a voice may play different roles in sending a message and affect message recipients' responsiveness, understanding, and so on. For this reason, it can be said that an analytical study on voices in sending a diversity of messages is a meaningful attempt. We took interest in politicians' voices because we determined that a voice should be very important to politicians frequently sending a message through speech to the nation and others. This study aimed to investigate the voices of politicians, who represent their nation. We intended to select politicians representing ROK(Republic of Korea; South Korean) and USA(United States of America), choose representative speeches to the nation, make a comparative analysis of their voices in the speeches, and draw implications. We analyzed a total of eight voices - four ROK politicians and four USA ones, male and female - to characterize them and suggest guidelines for a voice with clearer message delivery. We analyzed the politicians' voices on the basis of such vocal properties as vocal pitch, accuracy of pronunciation, resonance, and intonation variation and found that the ROK politicians were somewhat poorer at utilizing their voice than the US ones. In particular, they were remarkably poorer at accurate pronunciation, which exerts a significant impact on message passing.

A Study on Forgery Techniques of Smartphone Voice Recording File Structure and Metadata (스마트폰 음성녹음 파일 구조 및 메타데이터의 위변조 기법에 관한 연구)

  • Park, Jae Wan;Kwak, Won Jun;Lee, John Sanghyun
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.6
    • /
    • pp.807-812
    • /
    • 2022
  • Recently, as the number of voice recording files submitted as court evidence increases, the number of cases claiming forgery is also increasing. If the audio recording file structure and metadata, which are objective grounds, are completely forged, it is actually impossible to detect forgery of the sophisticated audio recording file. It is extremely rare for the court to reject the file structure and metadata analysis performed with the forged audio recording file. The purpose of this study is to prove that forgery of voice recording file structure and metadata is easily possible. To this end, in this study, it was introduced that forgery detection is impossible when the 'mixed paste' function, which enables sophisticated editing based on the typification of the editing method of voice recording files, is applied. Moreover, it has been proven through experiments that forgery of file structure and metadata is possible. Therefore, a stricter standard for judging the admissibility of evidence is required when the audio recording file is adopted as digital evidence. This study will not only contribute to the standard of integrity in the adoption of digital evidence by judges, but will also contribute to the method of constructing a dataset for artificial intelligence in detecting forgery of recorded files that is expected to be developed in the future.

AN ANALYSIS OF MMPP/D1, D2/1/B QUEUE FOR TRAFFIC SHAPING OF VOICE IN ATM NETWORK

  • CHOI, DOO IL
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.3 no.2
    • /
    • pp.69-80
    • /
    • 1999
  • Recently in telecommunication, BISDN ( Broadband Integrated Service Digital Network ) has received considerable attention for its capability of providing a common interface for future communication needs including voice, data and video. Since all information in BISDN are statistically multiplexed and are transported in high speed by means of discrete units of 53-octet ATM ( asynchronous Transfer Mode ) cells, appropriate traffic control needs. For traffic shaping of voice, the output cell discarding scheme has been proposed. We analyze the scheme with a MMPP/$D_1$, $D_2$/1/B queueing system to obtain performance measures such as loss probability and waiting time distribution.

  • PDF

A Correlation Study between Acoustic and Perceptual Parameters of the Singing Voice in Singing Students (성악 전공 학생의 가창 시 음성의 음향학적 매개 변수와 지각적 매개 변수사이의 상관 연구)

  • Jo, Sung-Mi;Lee, Sang-Ouk;Jeong, Ok-Ran
    • Proceedings of the KSPS conference
    • /
    • 2004.05a
    • /
    • pp.219-222
    • /
    • 2004
  • The purpose of this study was to determine a correlation between acoustic and perceptual parameters of the singing voice in singing students and compare them with the results with previous studies, and a more sensitive parameters in analyzing professional vocal usage. This study measured acoustic and perceptual parameters in 41 singing students. Digital audio recordings were made in sung vowels acoustic analysis. Each sample was judged by 1 experienced singing teacher and 1 voice pathologist on two semantic bipolar 7-point scales (ringing-dull, rich-thin). The results showed that SPP1 (p<0.01), SPP2 (p<0.01), and P1(p<0.01) had significant correlations with ringing and richness quality.

  • PDF