• Title/Summary/Keyword: Speech sound

Search Result 628, Processing Time 0.038 seconds

A Study on the Acoustic Characteristics of the Pansori by Voice Signals Analysis (음성신호 분석에 의한 판소리의 음성학적 특징 연구)

  • Kim, HyunSook
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.7
    • /
    • pp.3218-3222
    • /
    • 2013
  • Pansori is our traditional vocal sound, originality and excellence in the art of conversation, gesture general became a globally recognized world intangible heritage. Especially, Pansori as shrews and humorous representation of audience participation with a high degree of artistic value and enjoy the arts throughout all layers to be responsible for the social integration of functions is evaluated. Therefore, in this paper, Pansori five yard target speech signal analysis techniques applied to analyze the Pansori acoustic features of a representation of a society and era correlation extraction studies were performed. Pansori on the five yard spectrogram, pitch, stability and strength analysis for this experiment. Pansori through experimental results Comical story while keeping the audience focused and interested to better reflect the characteristics of energy for the wave of voice and vocal cord tremor change the width of a large, stable and voice with a loud voice, that expresses were analyzed.

Sound Enhancement of low Sample rate Audio Using LMS in DWT Domain (DWT영역에서 LMS를 이용한 저 샘플링 비율 오디오 신호의 음질 향상)

  • 백수진;윤원중;박규식
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.1
    • /
    • pp.54-60
    • /
    • 2004
  • In order to mitigate the problems in storage space and network bandwidth for the full CD quality audio, current digital audio is always restricted by sampling rate and bandwidth. This restriction normally results in low sample rate audio or calls for the data compression scheme such as MP3. However, they can only reproduce a lower frequency range than a regular CD quality because of the Nyquist sampling theory. Consequently they lose rich spatial information embedded in high frequency. The propose of this paper is to propose efficient high frequency enhancement of low sample rate audio using n adaptive filtering and DWT analysis and synthesis. The proposed algorithm uses the LMS adaptive algorithm to estimate the missing high frequency contents in DWT domain and it then reconstructs the spectrally enhanced audio by using the DWT synthesis procedure. Several experiments with real speech and audio are performed and compared with other algorithm. From the experimental results of spectrogram and sonic test, we confirm that the proposed algorithm outperforms the other algorithm and reasonably works well for the most of audio cases.

A Comparing and Analysis of Bel canto and Seth Riggs vocalization methods (세스릭스 발성법과 벨칸토의 비교분석)

  • Seo, Jeong Hwan
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.17 no.6
    • /
    • pp.262-268
    • /
    • 2016
  • The in-country interest in vocalists is worthy of notice. However, the public, including experts, don't have sufficient information for their interest. In this situation, the popularity of Seth Riggs' books is surprising, because his major is vocal music, not popular music, although he is known as 'a great vocal trainer'. Therefore, we should analyze his vocalization methods and compare them with Bel Canto. His skills are (focused on) SLS (Speech-level singing). The unique styles he used are related to Bel Canto in vocal music, but with some differences. Nevertheless, both styles have something in common. For example, they are similar in that they continue to keep up with the times, so as to improve and evolve. Therefore, it is important for the selected passages to be investigated and presented, based on the individual body conditions and properties. Seth Riggs' vocalization method should be systematically examined too. This is because it has great influence on the public.

Structural Analysis Algorithm for Automatic Transcription 'Pansori' (판소리 자동채보를 위한 구조분석 알고리즘)

  • Ju, Young-Ho;Kim, Joon-Cheol;Seo, Kyoung-Suk;Lee, Joon-Whoan
    • The Journal of the Korea Contents Association
    • /
    • v.14 no.2
    • /
    • pp.28-38
    • /
    • 2014
  • For western music there has been a volume of researches on music information analysis for automatic transcription or content-based music retrieval. But it is hard to find the similar research on Korean traditional music. In this paper we propose several algorithms to automatically analyze the structure of Korean traditional music 'Pansori'. The proposed algorithm automatically distinguishes between the 'sound' part and 'speech' part which are named 'sori' and 'aniri', respectively, using the ratio of phonetic and pause time intervals. For rhythm called 'jangdan' classification the algorithm makes the robust decision using the majority voting process based on template matching. Also an algorithm is suggested to detect the bar positions in the 'sori' part based on Kalman filter. Every proposed algorithm in the paper works so well enough for the sample music sources of 'Pansori' that the results may be used to automatically transcribe the 'Pansori'.

The impact of technology on resource sharing (정보기술이 자원공유에 미치는 영향)

  • 이영자
    • Journal of Korean Library and Information Science Society
    • /
    • v.22
    • /
    • pp.205-244
    • /
    • 1995
  • Originally the concept of the traditional resource sharing has been discussed in the context of bibliographic materials, and has been labor-intensive and high-cost activities. The technology has had a great impact on such pattern of the resource sharing, and has expanded the limited scope of the traditional concept into the sharing of library information in the levels of local, regional and national systems, and expertise, materials, facilities, equipments and personnels of the library system. While the traditional circulation service as a basic method to share library materials by users can provide the resource to a single person at a time, the electronic resource can be shared, by multi-users at a time anytime anywhere. The maximization of the electronic resource sharing requires that publishing process should be fundamentally changed and articles, books, chapters, speech manuscripts, music scores, maps, sound, and other formats of materials should be prepared in machine readable format. This study examined the positive effects of the technology on the resource sharing, but not investigate the concrete and complex problems as to the cost, guidelines, detailed procedures, design details, and intellectual properties and protection involved in the resource sharing. Some findings extracted from the study can be summarized as follows; (1) ILL will lose its meaning as a method to share the materials if they are all in the electronic format and the phrase 'networked information resource' becomes omnipresent. (2) The technology keeps on changing the concept of resource sharing. Today, the scope of resource sharing not only encompasses the sharing of the primary and secondary materials but also the sharing of the processings(eg. cataloging), expertise, user education, special facilities, and the integrated automated library systems. (3) The sharing of the networked resource will be a method to provide better services for library users in the low cost. (4) The a n.0, pplication of the technology to the resource sharing should be focus on the method which allows an end-users to do the direct access to the needed materials, and to be delivered the primary document as soon as possible.

  • PDF

A Study on Classification of Waveforms Using Manifold Embedding Based on Commute Time (컴뮤트 타임 기반의 다양체 임베딩을 이용한 파형 신호 인식에 관한 연구)

  • Hahn, Hee-Il
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.2
    • /
    • pp.148-155
    • /
    • 2014
  • In this paper a commute time embedding is implemented by organizing patches according to the graph-based metric, and its properties are investigated via changing the number of nodes on the graph.. It is shown that manifold embedding methods generate the intrinsic geometric structures when waveforms such as speech or music instrumental sound signals are embedded on the low dimensional Euclidean space. Basically manifold embedding algorithms only project the training samples on the graph into an embedding subspace but can not generalize the learning results to test samples. They are very effective for data clustering but are not appropriate for classification or recognition. In this paper a commute time guided transform is adopted to enhance the generalization ability and its performance is analyzed by applying it to the classification of 6 kinds of music instrumental sounds.

The Prosodic Changes of Korean English Learners in Robot Assisted Learning (로봇보조언어교육을 통한 초등 영어 학습자의 운율 변화)

  • In, Jiyoung;Han, JeongHye
    • Journal of The Korean Association of Information Education
    • /
    • v.20 no.4
    • /
    • pp.323-332
    • /
    • 2016
  • A robot's recognition and diagnosis of pronunciation and its speech are the most important interactions in RALL(Robot Assisted Language Learning). This study is to verify the effectiveness of robot TTS(Text to Sound) technology in assisting Korean English language learners to acquire a native-like accent by correcting the prosodic errors they commonly make. The child English language learners' F0 range and speaking rate in the 4th grade, a prosodic variable, will be measured and analyzed for any changes in accent. We compare whether robot with the currently available TTS technology appeared to be effective for the 4th graders and 1st graders who were not under the formal English learning with native speaker from the acoustic phonetic viewpoint. Two groups by repeating TTS of RALL responded to the speaking rate rather than F0 range.

Voice Personality Transformation Using an Optimum Classification and Transformation (최적 분류 변환을 이용한 음성 개성 변환)

  • 이기승
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.5
    • /
    • pp.400-409
    • /
    • 2004
  • In this paper. a voice personality transformation method is proposed. which makes one person's voice sound like another person's voice. To transform the voice personality. vocal tract transfer function is used as a transformation parameter. Comparing with previous methods. the proposed method makes transformed speech closer to target speaker's voice in both subjective and objective points of view. Conversion between vocal tract transfer functions is implemented by classification of entire vector space followed by linear transformation for each cluster. LPC cepstrum is used as a feature parameter. A joint classification and transformation method is proposed, where optimum clusters and transformation matrices are simultaneously estimated in the sense of a minimum mean square error criterion. To evaluate the performance of the proposed method. transformation rules are generated from 150 sentences uttered by three male and on female speakers. These rules are then applied to another 150 sentences uttered by the same speakers. and objective evaluation and subjective listening tests are performed.

The identification of /I/ in Spanish and French

  • Jorge A. Gurlekian;Benoit Jacques;Miguelina Guirao
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.521-528
    • /
    • 1996
  • This presentation explores on the perceptual characteristics of the lateral sound /l/ in CV syllables. At initial position we found that /l/ has well marked formant transitions. Then several questions arise: 1) are these formant structures dependent on the following vowel\ulcorner. 2) Are the formant transitions giving an additional cue for the identification\ulcorner Considering that the French vocalic system presents a greater variety of vowels than Spanish, several experiments were designed to verify to what extent a more extensive range of vocalic timbres contribute to the perception of /l/. Natural emissions of /l/ produced in Argentine Spanish and Canadian French CV syllables were recorded, where V was successively /i, e, a, o, u/ for Spanish and /i, e, $\varepsilon$, a, $\alpha$, o, u, y, \phi$/ for French. For each item, the segment C was maintained and V was replaced by cutting & splicing by each of the remaining vowels without transitions. Results of the identification tests for Spanish show that natural /l/ segments with low Fl and high formants F3, F4 can be clearly identified in the /i, e, u/ vowel contexts without transitions. For French subjects the combination of /l/ with a vowel without transitions reflected correct identifications for its own original vowel context in /e, $\varepsilon$, y, $\phi$/. For both languages, in all these combinations, F1 values remained rather steady along the syllable. In the case of /o, u/ very likely the F2 difference lead to a variety of perceptions of the original /l/. For example in Ilul, French subjects reported some identifications of /l/ as a vowel, mainly /y/. Our observations reinforce the importance of F1 as a relevant cue for /l/, and the incidence of the relative distance between formants frequencies of both components.

  • PDF

Identifying the Difference between Actual Reporting Voices and False Reporting Voices for Development of the False Report Discrimination System (허위 신고 판별 시스템 개발을 위한 실제 신고 음성과 허위 신고 음성의 차이 규명)

  • Lee, Bum Joo;Cho, Dong Uk;Park, Young;Jeong, Yeon Man
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.4
    • /
    • pp.848-854
    • /
    • 2017
  • Recently, false reports to governmental offices such as police stations have not been decreased. As a result, if a violent crime or a fire occurs that needs to be promptly responded to and reacted to these accidents in real time, it may lead to serious results such as loss of life. Also, the waste of police enforcement and administration due to false reporting can cause serious problems. In this paper, we try to clarify the difference between the actual and false reports based on the actual sound sources which were reported to the police stations. In addition, we will intend to develop a false report discrimination system that can identifies false reports and actual reports based on this.