Search | Korea Science

A Study on Korean Digit Recognition Using Syllable Based Neural Network (음절 기반 신경망을 이용한 한국어 숫자음 인식에 관한 연구)

Kum Ji Soo;Lee Hyon Soo
- Proceedings of the Acoustical Society of Korea Conference / spring / pp.78-81 / 1999
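In this paper, we propose a syllable-based neural network method for speech recognition that combines neural networks, which imitate human information processing, with the characteristics of Korean syllable structure. The proposed method defines a threshold ratio to separate the initial, medial, and final sounds that make up a Korean syllable, and uses features from part of each separated segment as the feature patterns for training and recognition, which reduces the overall number of processing steps in the speech recognition system. In a performance evaluation on Korean digit recognition with male and female speakers in their twenties, we obtained a recognition rate of 96.5% in the speaker-dependent case and 93% in the speaker-independent case.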

Trend on the Speech Database of SAMSUNG Advanced Institute of Technology (SAIT) (삼성종합기술원의 음성 DB 구축현황)

김상룡
- Proceedings of the Acoustical Society of Korea Conference / 1995.06a / pp.283-284 / 1995
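This paper describes the current status of speech database construction for speech recognition and synthesis at Samsung Advanced Institute of Technology (SAIT), along with future research directions. SAIT began developing Korean text-to-speech conversion technology in 1989, has since presented male-voice and female-voice synthesis systems, and recently developed a computer for the visually impaired and donated it to 13 schools for the visually impaired nationwide. In speech recognition, it developed a small-vocabulary, speaker-dependent system of about 100 words and commercialized a voice-dialing device for key phones. The speech databases built in-house over roughly five years of research can be broadly summarized as male and female synthesis databases and a recognition database. Building on this experience, SAIT intends to develop commercial-grade text-to-speech conversion technology and a large-vocabulary, speaker-independent speech recognition system through joint research with universities and research institutes in Korea and abroad. Ultimately, it plans to secure the component technologies for a portable interpreter and to contribute to commercializing an automatic interpreter for limited domains.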

Auto-Segmentation of Unsegmented Speech based on HMM and Time-Synchronous Viterbi Algorithm (시간동기형 Viterbi 알고리즘과 HMM에 기반한 음성의 자동 세그멘테이션)

오세진;황철준;김범국;정호열;정현열
- Proceedings of the Korean Information Science Society Conference / 2001.04b / pp.592-594 / 2001
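To improve the accuracy of acoustic models for speech recognition, this study investigates automatic segmentation of unsegmented speech based on HMMs, a statistical method, and the time-synchronous Viterbi algorithm. We first build continuous-density HMM base models from a small amount of segmented speech and use them as reference patterns. For the feature parameters of the unsegmented input speech, the point that is maximal at each frame of the time-synchronous Viterbi algorithm is set as the optimal boundary, and the speech is then segmented using this optimal boundary information together with pronunciation dictionary information, which serves as linguistic knowledge. For comparison, the same procedure was carried out with HTK. Using the resulting segmentation information, we built continuous-density HMM base models and HTK CHMM base models, respectively, and evaluated word recognition performance on word data from the Center for Korean Language Engineering (KLE). In the experiments on the KLE 452 data for male and female speakers, our laboratory's recognition system achieved speaker-independent word recognition rates of 89.4% and 85.1%, while HTK achieved 85.1% and 81.9%, respectively.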

Break Strength Prediction Using Maximum a Posterior Probability (MAP 확률을 이용한 끊어 읽기 강도 예측)

Kim Sanghun;Park Jun;Lee Youngjik
- Proceedings of the Acoustical Society of Korea Conference / spring / pp.75-78 / 2000
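This paper concerns break strength prediction for generating natural synthesized speech: given the part-of-speech sequence of a sentence, the break strengths that maximize the posterior probability are predicted by Viterbi decoding. The training data consist of 2,100 sentences spoken by one female speaker. Break strengths were assigned in two levels according to pause duration in the speech data, and morphological analysis and tagging of the text were performed with 30 part-of-speech tag symbols. With the observation probability defined as the probability of three consecutive part-of-speech tags and the break strength transition probability modeled as a bigram, performance was evaluated by cross-validation. The prediction accuracy was 89.7% on the training data and 84.9% on the test data.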
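
The decoding step described in this abstract can be illustrated with a minimal sketch of Viterbi decoding over two break-strength states. This is not the authors' implementation: the function name `viterbi`, the state coding (0 = no break, 1 = break), and all probability values are hypothetical, and the per-boundary observation scores are simply assumed to be given rather than estimated from part-of-speech trigrams.

```python
import math

def viterbi(obs_p, trans_p, init_p):
    """Return the most probable state sequence.

    obs_p[t][s]   : assumed score of the observation at boundary t under state s
    trans_p[p][s] : bigram transition probability from state p to state s
    init_p[s]     : probability of starting in state s
    All values here are illustrative, not taken from the paper.
    """
    n_states = len(init_p)
    # Work in the log domain to avoid underflow on long sentences.
    score = [math.log(init_p[s]) + math.log(obs_p[0][s]) for s in range(n_states)]
    back = []
    for t in range(1, len(obs_p)):
        new_score, ptr = [], []
        for s in range(n_states):
            best_prev = max(
                range(n_states),
                key=lambda p: score[p] + math.log(trans_p[p][s]),
            )
            ptr.append(best_prev)
            new_score.append(
                score[best_prev]
                + math.log(trans_p[best_prev][s])
                + math.log(obs_p[t][s])
            )
        score = new_score
        back.append(ptr)
    # Backtrack from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    return path[::-1]

# Hypothetical toy model: 0 = no break, 1 = break, four word boundaries.
init_p = [0.8, 0.2]
trans_p = [[0.8, 0.2], [0.9, 0.1]]
obs_p = [[0.6, 0.1], [0.5, 0.2], [0.1, 0.7], [0.4, 0.3]]
breaks = viterbi(obs_p, trans_p, init_p)
print(breaks)  # [0, 0, 1, 0]
```

With these toy numbers the decoder places a single break at the third boundary, where the assumed observation score for a break is highest.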

Study on the realization of pause groups and breath groups (휴지 단위와 호흡 단위의 실현 양상 연구)

Yoo, Doyoung;Shin, Jiyoung
- Phonetics and Speech Sciences / v.12 no.1 / pp.19-31 / 2020
The purpose of this study is to observe how pause groups and breath groups are realized by adult speakers and to examine how gender, generation, and task affect this realization. For this purpose, we analyzed forty-eight male and female speakers, divided into two generations: young and old. Task and gender affected the realization of both pause groups and breath groups. Pause groups were longer in read speech than in spontaneous speech, and longer in female speech. Breath groups, on the other hand, were longer in spontaneous speech and in male speech. In spontaneous speech, which requires planning, speakers produced shorter pause groups. The shorter breath groups in read speech can be attributed to the short sentences of the reading material. The gender difference resulted from differences in pause patterns between genders. In the case of breath groups, male speakers produced longer pauses than female speakers did, which may be due to differences in lung capacity between genders. Generation, in contrast, affected neither the pause groups nor the breath groups. The generation factor influenced only the number of syllables and eojeols, which can be interpreted as a result of the difference in speech rate between generations.
https://doi.org/10.13064/KSSS.2020.12.1.019

Analysis of Error Characteristics and Usabilities for Korean Consonant Perception Test (한국자음지각검사의 오류특성 및 유용성 분석)

Kim, Dong Chang;Kim, Jin Sook;Lee, Kyoung Won
- 재활복지 / v.18 no.4 / pp.295-314 / 2014
The purpose of this study was to provide baseline data for auditory rehabilitation in the field by examining the error types and error rates of phonemes that people with hearing impairment find difficult to discriminate. Thirty participants with sensorineural hearing loss listened to KCPT lists recorded by a male and a female talker, yielding data on error types and KCPT scores according to the talker's gender. In the initial consonant test list, /ㄷ/, /ㅂ/, /ㅃ/, /ㅉ/, and /ㅌ/ showed error rates above 30%, while /ㄱ/ and /ㄷ/ did so in the final consonant test list. The most common error type was initial consonant substitution in the initial consonant test list and final consonant substitution in the final consonant test list. The effect of the talker's gender was not significant: scores obtained with the male voice and the female voice showed no statistical difference. This means that the KCPT can be used in clinics regardless of the talker's gender.
https://doi.org/10.16884/JRR.2014.18.4.295

A Review on the Sexual Organs Appeared in 'Manhoengcheongnyu,' "Cheongguyeongeon" ("청구영언" '연장' 등장 만횡청류 재론)

Lee, Young-Tae
- Sijohaknonchong / v.26 / pp.223-242 / 2007
This thesis reviews the 'Manhoengcheongnyu' of 「Cheongguyeongeon」 in which sexual organs appear. The review shows that a male narrator wants a large organ and a female narrator a small one. Although the male and the female seem to differ with respect to the size of the organs, they share a similar standpoint, Yeohapbujeol (如合符節): they give and take sexual feelings to satisfy each other. Consequently, 「eokgogeomgokuikeungurenarotgeugeotjochagilgoneopda~ (#1993, *569)」 refers to the other's satisfaction with the size of one's own organ rather than to the idea that female sexual life is unilaterally oppressed by the male. Sijo (時調) touches on organs that are 'bawdy and trifling' and includes obscene comments, Eumdampaeseol (淫談悖說). By mentioning Eumdampaeseol (淫談悖說), the participants in a banquet in the singing space can become part of its atmosphere, and, protected by it, they can recite the sexual organs openly or grasp, in a refined and humorous fashion, the inner meaning of a verse-joke that expresses the organs indirectly. Thus, 「aheunahopgommeogeun老丈濁酒geolleo醉kemeokgo~ (#1854, *534)」 is not related to 'remorse about old age' but is merely a kind of Sijo (時調) about a sexual organ.

Aimé Césaire's postcolonial thought as a 'Non-Western resistance discourse': In terms of speaker, language and counter-discourse ('비서구 저항담론'으로서의 세제르(A. Césaire)의 탈식민주의 비평, 그 가능성과 한계: 화자(話者), 언어(言語), 대항담론(對抗談論)의 측면에서)

Choi, Il-Sung
- Cross-Cultural Studies / v.51 / pp.161-191 / 2018
In the beginning of the 20th century, post-colonialism directly raised questions about Western-centered universalism. One of its main achievements is the recognition that the political liberation of a colonial society does not guarantee that society's social, economic, and cultural liberation. Therefore, the discourses of liberation in Western society, in particular Marxism, nationalism, feminism, and postmodernism, cannot be directly applied to non-Western society. As a result, Western and non-Western societies are unfortunately dreaming of different futures and liberations; a 'geopolitical dialogue' is therefore needed between them. However, the theorists' efforts toward postcolonial liberation failed to distinguish themselves from Western-centric traditions. It is also true that they have, in conjunction with these traditions, established their own power. As we know, many postcolonial criticisms were in some way related to the West. This study re-reads the postcolonial thought of Aimé Césaire, the father of the so-called Négritude, as a 'non-Western resistance discourse'. Through this process, we have a chance to reflect on Césaire and his postcolonial thought.

Impact of face masks on spectral and cepstral measures of speech: A case study of two Korean voice actors (한국어 스펙트럼과 캡스트럼 측정시 안면마스크의 영향: 남녀 성우 2인 사례 연구)

Wonyoung Yang;Miji Kwon
- The Journal of the Acoustical Society of Korea / v.43 no.4 / pp.422-435 / 2024
This study intended to verify the effects of face masks on the Korean language in terms of acoustic, aerodynamic, and formant parameters. We chose all types of face masks available in Korea based on filter performance and folding type. Two professional voice actors (a male and a female) with more than 20 years of experience who are native Koreans and speak standard Korean participated in this study as speakers of voice data. Face masks attenuated the high-frequency range, resulting in decreased Vowel Space Area (VSA) and Vowel Articulation Index (VAI) scores and an increased Low-to-High spectral ratio (L/H ratio) in all voice samples. This can result in lower speech intelligibility. However, the degree of increment and decrement was based on the voice characteristics. For female speakers, the Speech Level (SL) and Cepstral Peak Prominence (CPP) increased with increasing face mask thickness. In this study, the presence or filter performance of a face mask was found to affect speech acoustic parameters according to the speech characteristics. Face masks provoked vocal effort when the vocal intensity was not sufficiently strong, or the environment had less reverberance. Further research needs to be conducted on the vocal efforts induced by face masks to overcome acoustic modifications when wearing masks.
https://doi.org/10.7776/ASK.2024.43.4.422

Comparison of voice range profiles of modal and falsetto register in dysphonic and non-dysphonic adult women (음성장애 성인 여성과 정상음성 성인 여성 간 진성구와 가성구의 음성범위프로파일 비교)

Jaeock Kim;Seung Jin Lee
- Phonetics and Speech Sciences / v.14 no.4 / pp.67-75 / 2022
This study compared voice range profiles (VRPs) of modal and falsetto register in 53 dysphonic and 53 non-dysphonic adult women with gliding vowel /a/. The results show that maximum fundamental frequency (F0_MAX), maximum intensity (I_MAX), F0 range (F0_RANGE), and intensity range (I_RANGE) are lower in the dysphonic group than in the non-dysphonic group. F0_MAX and F0_RANGE are significantly higher in falsetto register than in modal register in both groups. I_MAX and I_RANGE are significantly higher in falsetto register in the non-dysphonic group, but they do not differ between the two registers in the dysphonic group. There was no statistically significant difference in minimum F0 (F0_MIN) or minimum intensity (I_MIN) between the two groups. The modal-falsetto register transition occurred at 378.86 Hz (F4#) in the dysphonic group and at 557.79 Hz (C5#) in the non-dysphonic group, which was significantly lower in the dysphonic group. Both modal and falsetto registers are thus reduced in dysphonic adult women compared to non-dysphonic adult women, indicating that the vocal folds of dysphonic adult women do not vibrate easily at high pitches. The results of this study can serve as basic data for understanding the acoustic features of voice disorders.
https://doi.org/10.13064/KSSS.2022.14.4.067
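
The note names quoted beside the transition frequencies in this abstract can be checked with a short sketch. It assumes twelve-tone equal temperament with A4 = 440 Hz and scientific pitch notation, neither of which is stated in the abstract; the function name `hz_to_note` is hypothetical, and the accidental is written before the octave number (F#4 rather than F4#).

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

def hz_to_note(freq_hz, a4_hz=440.0):
    """Map a frequency to the nearest equal-tempered note name.

    Assumes A4 = a4_hz and MIDI numbering, where note 69 is A4.
    """
    midi = round(69 + 12 * math.log2(freq_hz / a4_hz))
    octave = midi // 12 - 1
    return f"{NOTE_NAMES[midi % 12]}{octave}"

print(hz_to_note(378.86))  # F#4
print(hz_to_note(557.79))  # C#5
```

Under these assumptions, both reported transition frequencies map to the note names given in the abstract.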
