• Title/Summary/Keyword: e-Voice system

A Threshold Adaptation based Voice Query Transcription Scheme for Music Retrieval (음악검색을 위한 가변임계치 기반의 음성 질의 변환 기법)

  • Han, Byeong-Jun;Rho, Seung-Min;Hwang, Een-Jun
    • The Transactions of The Korean Institute of Electrical Engineers / v.59 no.2 / pp.445-451 / 2010
  • This paper presents a threshold adaptation based voice query transcription scheme for music information retrieval. The proposed scheme analyzes a monophonic voice signal and generates its transcription for diverse music retrieval applications. For accurate transcription, we propose several advanced features: (i) Energetic Feature eXtractor (EFX) for onset, peak, and transient area detection; (ii) Modified Windowed Average Energy (MWAE), which defines multiple small but coherent windows with local threshold values and serves as an offset detector; and (iii) Circular Average Magnitude Difference Function (CAMDF) for accurate acquisition of the fundamental frequency (F0) of each frame. To evaluate the performance of the proposed scheme, we implemented a prototype music transcription system called AMT2 (Automatic Music Transcriber version 2) and carried out various experiments. In the experiments, we used the QBSH corpus [1], which was adopted as the MIREX 2006 contest data set. Experimental results show that the proposed scheme can improve transcription performance.
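To make the F0 step concrete, here is a minimal sketch of pitch estimation with a circular AMDF, using the commonly cited definition D(lag) = sum over n of |x((n + lag) mod N) - x(n)|. It is not the AMT2 implementation: the function names, the search range defaults, and the test tone are all assumptions made for illustration.

```python
import math

def camdf(frame, lag):
    """Circular AMDF of one frame at a given lag; indices wrap modulo the frame length."""
    n = len(frame)
    return sum(abs(frame[(i + lag) % n] - frame[i]) for i in range(n))

def estimate_f0(frame, sample_rate, f_min=80.0, f_max=500.0):
    """Estimate F0 (Hz) as sample_rate / lag at the deepest CAMDF valley.

    f_min and f_max bound the lag search; both defaults are illustrative assumptions.
    """
    lag_lo = max(1, int(sample_rate / f_max))
    lag_hi = min(len(frame) // 2, int(sample_rate / f_min))
    best_lag = min(range(lag_lo, lag_hi + 1), key=lambda lag: camdf(frame, lag))
    return sample_rate / best_lag

def make_tone(freq, sample_rate, n_samples):
    """Hypothetical helper: a pure sine tone standing in for one voiced frame."""
    return [math.sin(2 * math.pi * freq * i / sample_rate) for i in range(n_samples)]

if __name__ == "__main__":
    frame = make_tone(200.0, 8000, 400)
    print(estimate_f0(frame, 8000, f_min=150.0, f_max=400.0))
```

A periodic signal also produces valleys at integer multiples of the true period, so the search range should be kept narrow enough (or a tie-breaking rule added) to avoid octave errors.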

The Study of Web-tool for Scholarly Discussion and Publishing : The Case of KIPS Cyber Forum (WWW에서의 학술토론과 출판에 관한 연구 - KIPS의 사례를 중심으로 -)

  • 김재관
    • Journal of Korea Technology Innovation Society / v.2 no.1 / pp.44-57 / 1999
  • KIPS is a net-world, a cyberspace on the WWW for scholars in Public Administration and Policy Sciences. All knowledge-intensive work has at its core the publishing and debating of documents. We have created a cyber forum for that work. KIPS Cyber Forum has adapted ‘D3E’, a web toolkit that lets non-technical users easily debate and publish documents that fully exploit networked, interactive web media. For real-time communication, we added a voice conferencing system to it. KIPS opened the Cyber Forum service in November 1998. The number of visitors to KIPS Cyber Forum is growing, but participants in the debates are few. This means that the problems of the Cyber Forum service are not technical but a matter of participation. The results imply that, at present, higher participation of scholars in the debates must first be encouraged through detailed guides to the Internet, the WWW, and relevant technical information. After that, a more expertly designed interface will become important.

The Effects of Multi-Modality on the Use of Smart Phones

  • Lee, Gaeun;Kim, Seongmin;Choe, Jaeho;Jung, Eui Seung
    • Journal of the Ergonomics Society of Korea / v.33 no.3 / pp.241-253 / 2014
  • Objective: The objective of this study was to examine the multi-modal interaction effects of input-mode switching on the use of smart phones. Background: Multi-modality is considered an efficient alternative for the input and output of information in mobile environments. However, current mobile UI (User Interface) systems have various limitations because they overlook the transition between different modes and the usability of combined multi-modal use. Method: A pre-survey determined five representative smart phone tasks by function. The first experiment involved the use of a uni-mode for five single tasks; the second experiment involved the use of a multi-mode for three dual tasks. The dependent variables were user preference and task completion time. The independent variable in the first experiment was the type of mode (i.e., Touch, Pen, or Voice), while the variable in the second experiment was the type of task (i.e., internet searching, subway map, memo, gallery, and application store). Results: In the first experiment, there was no difference between the use of pen and touch devices. However, a specific mode type was preferred depending on the functional characteristics of the task. In the second experiment, the analysis showed that user preference depended on the order and combination of modes. Even with mode transitions, users preferred multi-modes that included voice. Conclusion: The order and combination of modes may affect the usability of multi-modes. Therefore, when designing a multi-modal system, the frequent transitions between various mobile contents in different modes should be properly considered. Application: The results may be used as a user-centered design guideline for mobile multi-modal UI systems.

Performance Analysis of MC DS-CDMA System using Turbo Code in Multipath Rayleigh Fading Channel (다중경로 레일리 페이딩 채널에서 Turbo 부호를 적용한 MC DS-CDMA 시스템의 성능 분석)

  • 박기식
    • Journal of the Korea Institute of Information and Communication Engineering / v.5 no.5 / pp.902-907 / 2001
  • In this paper, we analyze the BER performance of the MC DS-CDMA system and evaluate the degree of performance improvement obtained by adopting turbo code, which has recently received much attention for its powerful coding capability. The analysis shows that the MC DS-CDMA system without a powerful coding scheme cannot provide voice quality (BER: $10^{-3}$) in a Rayleigh fading channel, regardless of the number of users and the value of $E_b/N_0$. In contrast, the MC DS-CDMA system adopting turbo code shows improved BER performance and can provide voice quality in the same channel regardless of the number of users and the value of $E_b/N_0$. For example, when $E_b/N_0$ is 10 dB and the number of users is 10, the MC DS-CDMA system adopting turbo code shows improved BER performance of about $5\times10^{-3}$.
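As a rough illustration of why an uncoded link struggles to reach a BER of 10^-3 under Rayleigh fading, the sketch below evaluates the textbook closed form for single-user coherent BPSK over flat Rayleigh fading, P_b = 0.5 * (1 - sqrt(g / (1 + g))), where g is the average signal-to-noise ratio per bit. This is a deliberately simplified stand-in, not the paper's multi-user MC DS-CDMA model, and the function name is hypothetical.

```python
import math

def bpsk_rayleigh_ber(avg_snr_db):
    """Average BER of uncoded coherent BPSK over flat Rayleigh fading.

    avg_snr_db is the average SNR per bit in dB (single user, no coding, no diversity).
    """
    g = 10 ** (avg_snr_db / 10)  # dB -> linear
    return 0.5 * (1 - math.sqrt(g / (1 + g)))

if __name__ == "__main__":
    for snr_db in (0, 10, 20, 30):
        print(snr_db, bpsk_rayleigh_ber(snr_db))
```

At 10 dB this simplified model gives a BER of roughly 2.3e-2, well above the 10^-3 voice-quality target, which is the gap that coding or diversity has to close.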

A Study on the Correlation Between Sasang Constitution and Sound Characteristics Used Harmonics and Formant Bandwidth (Harmonics(배음)와 Formant Bandwidth(포먼트 폭)를 이용한 음성특성(音聲特性)과 사상체질간(四象體質間)의 상관성(相關性) 연구(硏究))

  • Park, Sung-Jin;Kim, Dal-Rae
    • Journal of Sasang Constitutional Medicine / v.16 no.1 / pp.61-73 / 2004
  • This study investigated the correlation between Sasang constitutional groups and voice characteristics using a voice analysis system (CSL in this study). I focused on voice characteristics in terms of harmonics, formant frequency, and formant bandwidth. The subjects were 71 males, classified into three groups: a Soeumin group, a Soyangin group, and a Taeumin group. Constitution was classified in two ways: QSCCII (Questionnaire for the Sasang Constitution Classification II) and an interview with a specialist in Sasang Constitution. As a result, the 71 people were categorized into 31 Soeumin, 18 Soyangin, and 22 Taeumin. Pitch is approximately the fundamental frequency (F0) of the voice. Shimmer in dB evaluates the period-to-period variability of the peak-to-peak amplitude within the analyzed voice sample. The FFT (Fast Fourier Transform) method in CSL can display sampled voices as harmonics. h1 is the first peak and h2 is the second peak in the harmonics. The amplitude difference between h1 and h2 (h1-h2) reflects the speaker's phonation type, while formant frequency and bandwidth reflect the speaker's vocal tract. I therefore used the harmonics, formant frequency, and formant bandwidth as the voice parameters. First, I recorded /e/ voices from all subjects using a microphone, and then analyzed them with CSL. Power Spectrum and Formant History are the CSL menus that display harmonics, formant frequency, and bandwidth. The results on the correlation between Sasang constitutional groups and voice parameters are as follows: 1. There is no significant difference in the harmonic amplitude difference (h1-h2) among the three groups. 2. There is a significant difference between the Soeumin group and the Soyangin group in Formant Frequency 1 and Formant Bandwidth 1 (p<0.05). No other parameter shows a significant difference. Based on the formant bandwidth difference, I assume that the Soyangin group has a clearer and brighter voice than the Soeumin group, and I think this result is consistent with "Dongyi Suse Bowon" and "Sasangimhejinam".
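The shimmer measure mentioned in this abstract can be sketched as follows, assuming the widely used definition of shimmer in dB: the mean absolute value of 20*log10(A[i+1]/A[i]) over consecutive periods. How CSL computes it internally is not stated in the abstract, so the function name and the input format (a list of per-period peak-to-peak amplitudes) are assumptions.

```python
import math

def shimmer_db(peak_amplitudes):
    """Mean absolute period-to-period amplitude change, in dB.

    peak_amplitudes: positive peak-to-peak amplitudes, one per glottal period.
    """
    if len(peak_amplitudes) < 2:
        raise ValueError("need at least two periods")
    changes = [
        abs(20 * math.log10(nxt / cur))
        for cur, nxt in zip(peak_amplitudes, peak_amplitudes[1:])
    ]
    return sum(changes) / len(changes)

if __name__ == "__main__":
    # Hypothetical amplitudes for five consecutive periods of a sustained vowel.
    print(shimmer_db([0.80, 0.82, 0.79, 0.81, 0.80]))
```

A perfectly steady voice gives 0 dB; larger values indicate more cycle-to-cycle amplitude instability.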

Design and Implementation of Voice Usenet Newsgroup Service System for Visual Disabilities (시각장애인을 위한 음성낭독 유즈넷 뉴스그룹 서비스 시스템의 설계 및 구현)

  • 조철환;장영건;박찬곤;홍승홍
    • Proceedings of the IEEK Conference / 2000.06c / pp.129-132 / 2000
  • Despite the rapid growth of Internet users and infrastructure and advances in computer technology, it is difficult to find web content for the visually impaired. In the case of usenet newsgroups concerning disabilities, the hit rate is even lower than for e-mail because of the lack of accessibility, educational support, and the economic cost of Internet access. This paper addresses a voice usenet newsgroup service for the visually impaired that is accessed by telephone or web browser without additional S/W support such as 775, a usenet program, or an installation program, and suggests a design method and an implementation example for it. Its main features are an easy man-machine interface, widely available access devices such as the telephone or web browser, and independence from any particular news server through the use of NNTP. The system supports the general MIME format, has been implemented for the usenet server of the Korean Social Worker's Community, and will be implemented for the Gomduri InfoNet BBS of the Korean Society for Rehabilitation of Persons with Disabilities.

Performance analysis of cellular CDMA networks with power control error in nakagami fading channel (Nakagami 페이딩 채널에서 전력 제어 오차를 고려한 셀룰라 CDMA 네트워크의 성능 분석)

  • 이동도;김동희;박용서;황금찬
    • The Journal of Korean Institute of Communications and Information Sciences / v.22 no.1 / pp.1-11 / 1997
  • We examine a DS/SSMA system employing coherent BPSK with a RAKE receiver. We adopt the Nakagami m-distribution as the multipath fading model. First, we analyze the performance of the system in a single-cell environment and obtain the other-cell interference as a function of power control error. Then, incorporating the other-cell interference into the single-cell analysis, we examine the cellular CDMA network. The average BER and outage probability are the figures of merit that characterize the system performance. A required BER of 1E-3 and a required outage probability of 1% for voice transmission are used to obtain the system capacity.

A Study on the Quality Improvement of Mechanical Drawing Notes Using Lean 6 Sigma Analysis (린 6시그마 분석을 통한 도면 주기 품질 향상 방안 연구)

  • Jeon, Yong Gu;Huh, Hyoung Jo;Lee, Seong Bae;Park, Hun Hyuk;An, Byung Guk
    • Journal of Korean Society for Quality Management / v.48 no.3 / pp.381-393 / 2020
  • Purpose: The purpose of this study was to find useful solutions by analyzing the causes and effects of defects in mechanical drawing notes and to provide mechanical engineers with an automated tool that incorporates those solutions. Methods: Data on defects in mechanical drawing notes were collected from ongoing development and mass production projects. Various Lean 6 Sigma methods were used, such as process analysis, C&E diagrams, and statistical analysis. Results: The Lean 6 Sigma analysis verified the validity of the selected indicators for improving drawing note quality through verification of the cause variables. The strategy established to improve the mechanical drawing notes was implemented as an automated program, and the defects fell within a manageable range and achieved the target Sigma level. Conclusion: Applying the "Mechanical drawing notes automation tool" is expected to address the "Voice of Customer, VOC" and "Voice of Business, VOB".

Performance Analysis of Multimedia CDMA Mobile Communication System Considering Diverse Qos Requirements (멀티미디어 CDMA 이동통신 시스템에서의 다양한 QoS 요구조건을 고려한 성능 분석)

  • Kim, Baek-Hyun;Shin, Seung-Hoon;Kwak Kyung-Sup
    • The Journal of Korean Institute of Communications and Information Sciences / v.27 no.1B / pp.1-12 / 2002
  • A multimedia CDMA mobile communication service is required to support various applications, such as voice, video, file transfer, e-mail, and Internet access, with guaranteed QoS. In a mixed traffic environment consisting of voice, stream data, and packet data, we analyze a network in which preemptive priority is granted to the delay-intolerant voice service and a buffer is offered to the delay-tolerant stream data service. For the best-effort packet data service, access control by transmission permission probability is applied to obtain high throughput. To analyze the multimedia CDMA mobile communication system, we build a 2-dimensional Markov chain model of the prioritized voice and stream data services and carry out a numerical analysis in combination with packet data traffic based on a residual capacity equation.

Convergence Development of Video and E-learning System for Education Disabled Students (장애학생의 학습을 위한 화상과 이러닝 시스템의 융합 개발)

  • Son, Yeob-Myeong;Jung, Byeong-Soo
    • Journal of the Korea Convergence Society / v.6 no.4 / pp.113-119 / 2015
  • Currently, alternative educational environments are offered only to general students who fail to adapt to the regular school system. This study targets students with disabilities and is designed in particular so that students who have difficulty using their hands can use it. The development objective of the video e-learning system is to enable self-directed learning by disabled students. The e-learning system consists of a web-based multimedia system and a video conferencing system; for hearing-impaired students, a system that converts voice to text is used through the chat system, so that students and teachers can communicate one-to-one in both directions. With the e-learning system developed in this paper, 1:1 training between teachers and students with disabilities is conducted using a two-way communication algorithm.