Search | Korea Science

Style-Specific Language Model Adaptation using TF*IDF Similarity for Korean Conversational Speech Recognition

Park, Young-Hee;Chung, Min-Hwa
- The Journal of the Acoustical Society of Korea / v.23 no.2E / pp.51-55 / 2004
In this paper, we propose a style-specific language model adaptation scheme using n-gram based tf*idf similarity for Korean spontaneous speech recognition. Korean spontaneous speech shows distinctive style-specific characteristics such as filled pauses, word omission, and contraction, which are related to function words and depend on the preceding or following words. To reflect these style-specific characteristics and to overcome the lack of data for training the language model, we estimate an in-domain dependent n-gram model by relevance weighting of out-of-domain text data according to their n-gram based tf*idf similarity, where the in-domain language model includes a disfluency model. Recognition results show that n-gram based tf*idf similarity weighting effectively reflects the style difference.
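The relevance-weighting idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the smoothed idf formula, the use of cosine similarity, the bigram order, and all function names are assumptions chosen for illustration. Each out-of-domain document is scored by the tf*idf similarity of its n-grams to the in-domain text, and its n-gram counts are scaled by that score before being pooled.

```python
import math
from collections import Counter


def ngrams(tokens, n=2):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tfidf_vectors(docs, n=2):
    """Build one sparse n-gram tf*idf vector (a dict) per tokenized document."""
    counts = [Counter(ngrams(doc, n)) for doc in docs]
    doc_freq = Counter(g for c in counts for g in c)  # documents containing each n-gram
    num_docs = len(docs)
    # Smoothed idf (an assumption): n-grams found in every document keep a nonzero weight.
    return [
        {g: tf * (math.log((1 + num_docs) / (1 + doc_freq[g])) + 1.0) for g, tf in c.items()}
        for c in counts
    ]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(g, 0.0) for g, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def relevance_weights(in_domain, out_docs, n=2):
    """Score each out-of-domain document by its similarity to the in-domain text."""
    vectors = tfidf_vectors([in_domain] + out_docs, n)
    return [cosine(vectors[0], v) for v in vectors[1:]]


def weighted_ngram_counts(out_docs, weights, n=2):
    """Pool out-of-domain n-gram counts, scaling each document by its weight."""
    total = Counter()
    for doc, weight in zip(out_docs, weights):
        for gram, count in Counter(ngrams(doc, n)).items():
            total[gram] += weight * count
    return total


# Hypothetical toy data: one in-domain utterance and two out-of-domain documents.
in_domain = "uh i think so".split()
out_docs = ["uh i think it is".split(), "the report was filed".split()]
weights = relevance_weights(in_domain, out_docs)
counts = weighted_ngram_counts(out_docs, weights)
```

In this sketch the weighted counts would then feed an ordinary n-gram estimator, so documents whose style resembles the in-domain data contribute more to the adapted model.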

Language Model Adaptation for Conversational Speech Recognition (대화체 연속음성 인식을 위한 언어모델 적응)

Park Young-Hee;Chung Minhwa
- Proceedings of the KSPS conference / 2003.05a / pp.83-86 / 2003
This paper presents our style-based language model adaptation for Korean conversational speech recognition. Compared with written text corpora, Korean conversational speech exhibits various characteristics of content and style, such as filled pauses, word omission, and contraction. For style-based language model adaptation, we report two approaches. Our approaches focus on improving the estimation of domain-dependent n-gram models by relevance weighting out-of-domain text data, where style is represented by n-gram based tf*idf similarity. In addition to relevance weighting, we use disfluencies as predictors of the neighboring words. The best result reduces the word error rate by 6.5% absolute and shows that n-gram based relevance weighting reflects style differences well and that disfluencies are good predictors.

Korean EFL Learners' Sensitivity to Stylistic Differences in Their Letter Writing

Lee, Haemoon;Park, Heesoo
- Journal of English Language & Literature / v.56 no.6 / pp.1163-1190 / 2010
Korean EFL learners' stylistic sensitivity was examined through two types of letter writing, professional and personal. The basis of comparison with native English speakers' stylistic sensitivity was the set of linguistic style markers identified statistically by Biber's (1988) multi-dimensional model of variation in English. The main finding was that Korean university students were sensitive to stylistic difference in the correct direction, though their linguistic repertoire was limited to easy and simple linguistic features. Also, unlike the native speakers, the learners were skewed toward the involved style in both types of letters, which was interpreted as reflecting the general developmental direction from informal to formal linguistic style. Unlike the native speakers, the learners were also skewed toward the explicit style in both types of letters, which was interpreted as reflecting the learners' heavy reliance on one particular linguistic feature. As a whole, the learners' stylistic sensitivity relied heavily on the small number of linguistic features that they had already acquired, which happen to be simple and basic ones.

Unpaired Korean Text Style Transfer with Masked Language Model (마스크 언어 모델 기반 비병렬 한국어 텍스트 스타일 변환)

Bae, Jangseong;Lee, Changki;Noh, Hyungjong;Hwang, Jeongin
- Annual Conference on Human and Language Technology / 2021.10a / pp.391-395 / 2021
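Text style transfer is the problem of converting text written in an input style (source style) into text in a target style while preserving its content. Text style transfer can be treated as a sequence-to-sequence problem and solved with existing machine learning models, but the parallel corpora for each style that are needed to train such models are hard to obtain. Recent work has therefore studied methods that perform text style transfer using non-parallel corpora. Because these studies mainly use generative models with an encoder-decoder architecture, content in the input sentence may be omitted, or a sentence with different content may be generated. In this paper, we propose a text style transfer method that uses a masked language model to change the input text to the desired style while preserving its content, and we apply it to Korean positive-negative and chat style-written style transfer.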

Comparative Study on English Proficiency of Children of ESL(English as a Second Language) & EFL(English as Foreign Language) Learning Programs (ESL과 EFL학습프로그램에 의한 아동 영어능력 비교연구)

Yoon, Eu-Gene;Chong, Young-Sook
- Korean Journal of Human Ecology / v.14 no.6 / pp.961-972 / 2005
The purpose of this study is to investigate, through an experimental method, the improvement of children's English proficiency in ESL and EFL learning style classrooms. The results of this research are as follows. First, the scores for listening and speaking and for perception of the alphabet in the ESL program are higher than those in the EFL program. This means that learning in the ESL style classroom is a better way to improve English skills than learning in the EFL style classroom, which is common in Korea. Second, there is no difference in English listening and speaking skills or in perception of the English alphabet between the two gender groups in the ESL and EFL style classrooms. These results suggest that the target language may be used in English classrooms by teachers and students, with materials, books, and equipment that are in English. Teachers are expected to play decisive roles as demonstrators of speech, models and correctors of pronunciation, and providers of materials, including TVs, VCRs, CD players, and cassette recorders.

Investigation for Purification of Japanese Style Terminology Used in the Korean Fishing Vessels (어선에서의 일본식 용어 순화에 관한 연구)

Kim, Young-Un
- Journal of Fisheries and Marine Sciences Education / v.25 no.4 / pp.836-847 / 2013
In contemporary society, the shipping and fishery industries tend to use Japanese or Japanese style terminology excessively. This hinders communication between crew members who have worked for many years and beginners on the ships. Also, crew members cannot easily understand manuals that are written only in Korean. For this reason, foreign employees have to study Japanese style terminology before they start work. I strongly believe that this is a major national disgrace. It is reasonable to remove Japanese style terminology from vessels wherever possible so that crew members can communicate freely with each other. I examined the purification of 125 Japanese style terms that had already been examined on fishing vessels 11 years earlier. I expect this paper to contribute to research on eradicating Japanese style terminology on Korean fishing vessels.
https://doi.org/10.13000/JFMSE.2013.25.4.836

Individual Differences in Regional Gray Matter Volumes According to the Cognitive Style of Young Adults

Hur, Minyoung;Kim, Chobok
- Science of Emotion and Sensibility / v.22 no.4 / pp.65-74 / 2019
Extant research has proposed that the Object-Spatial-Verbal cognitive style can elucidate individual differences in the preference for modality-specific information. However, no studies have yet ascertained whether this type of information processing evinces structural correlations in the brain. Therefore, the current study used voxel-based morphometry (VBM) analyses to investigate individual differences in gray matter volumes based on the Object-Spatial-Verbal cognitive style. For this purpose, ninety healthy young adults were recruited to participate in the study. They were administered the Korean version of the Object-Spatial-Verbal cognitive style questionnaire, and their anatomical brain images were scanned. The VBM results demonstrated that the participants' verbal scores were positively correlated with regional gray matter volumes (rGMVs) in the right superior temporal sulcus/superior temporal gyrus, the bilateral parahippocampal gyrus/fusiform gyrus, and the left inferior temporal gyrus. In addition, the rGMVs in these regions were negatively correlated with the relative spatial preference scores obtained by individual participants. The findings of the investigation provide anatomical evidence that the verbal cognitive style could be decidedly relevant to higher-level language processing, but not to basic language processing.
https://doi.org/10.14695/KJSOS.2019.22.4.65

ETRI small-sized dialog style TTS system (ETRI 소용량 대화체 음성합성시스템)

Kim, Jong-Jin;Kim, Jeong-Se;Kim, Sang-Hun;Park, Jun;Lee, Yun-Keun;Hahn, Min-Soo
- Proceedings of the KSPS conference / 2007.05a / pp.217-220 / 2007
This study outlines a small-sized dialog style ETRI Korean TTS system that applies HMM-based speech synthesis techniques. To build the VoiceFont, 500 dialog-style sentences were used to train the HMMs. Context information about phonemes, syllables, words, phrases, and sentences was extracted fully automatically to build context-dependent HMMs. In training the acoustic model, acoustic features such as Mel-cepstrums, logF0, and its delta and delta-delta were used. The size of the VoiceFont built through the training is 0.93Mb. The developed HMM-based TTS system was installed on an ARM720T processor running at a 60MHz clock. To reduce computation time, the MLSA inverse filtering module is implemented in assembly language. The fully implemented system runs 1.73 times faster than real time.

Designing a large recording script for open-domain English speech synthesis

Kim, Sunhee;Kim, Hojeong;Lee, Yooseop;Kim, Boryoung;Won, Yongkook;Kim, Bongwan
- Phonetics and Speech Sciences / v.13 no.3 / pp.65-70 / 2021
This paper proposes a method for designing a large recording script for open-domain English speech synthesis. For read-aloud style text, 12 domains and 294 sub-domains were designed using text contained in five different news media publications. For conversational style text, 4 domains and 36 sub-domains were designed using movie subtitles. The final script consists of 43,013 sentences (27,085 read-aloud style sentences and 15,928 conversational style sentences), comprising 549,683 tokens and 38,356 types. The completed script is analyzed using four criteria: word coverage (type coverage and token coverage), high-frequency vocabulary coverage, phonetic coverage (diphone coverage and triphone coverage), and readability. The type coverage of our script reaches 36.86% despite its low token coverage of 2.97%. The high-frequency vocabulary coverage of the script is 73.82%, and the diphone coverage and triphone coverage of the whole script are 86.70% and 38.92%, respectively. The average readability of whole sentences is 9.03. The results of the analysis show that the proposed method is effective in producing a large recording script for English speech synthesis, demonstrating good coverage in terms of unique words, high-frequency vocabulary, phonetic units, and readability.
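The abstract does not define type coverage and token coverage, so the following is only a sketch under one assumed reading: the script is selected from a larger source corpus, type coverage is the share of the corpus's distinct words that appear in the script, and token coverage is the script's token count divided by the corpus's token count. Under that reading a small script can cover many types with few tokens, which is the pattern the reported figures show. The function name, the whitespace tokenization, and the lowercasing are hypothetical choices.

```python
def word_coverage(script_sentences, corpus_sentences):
    """Compute type and token coverage of a script against its source corpus.

    Assumed definitions (not taken from the paper):
      type coverage  = corpus types that occur in the script / corpus types
      token coverage = script tokens / corpus tokens
    """
    # Whitespace tokenization and lowercasing are simplifying assumptions.
    script_tokens = [t.lower() for s in script_sentences for t in s.split()]
    corpus_tokens = [t.lower() for s in corpus_sentences for t in s.split()]
    if not corpus_tokens:
        raise ValueError("corpus must contain at least one token")

    corpus_types = set(corpus_tokens)
    covered_types = set(script_tokens) & corpus_types
    return {
        "type_coverage": len(covered_types) / len(corpus_types),
        "token_coverage": len(script_tokens) / len(corpus_tokens),
    }


# Hypothetical toy corpus: the script is one sentence chosen out of three.
corpus = ["the cat sat", "the dog ran", "a bird sang"]
script = ["the cat sat"]
result = word_coverage(script, corpus)
```

The same structure would extend to diphone or triphone coverage by replacing word tokens with phone sequences from a pronunciation lexicon, which this sketch does not attempt.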
https://doi.org/10.13064/KSSS.2021.13.3.065

Defining the Nature of Online Chat in Relation to Speech and Writing

Lee, Hi-Kyoung
- English Language & Literature Teaching / v.12 no.2 / pp.87-105 / 2006
Style is considered a pivotal construct in sociolinguistic variation studies. While previous studies have examined style in traditional forms of language such as speech, very little research has examined new and emerging styles such as computer-mediated discourse. Thus, the present study attempts to investigate style in the online communication mode of chat. In so doing, the study compares text-based online chat with speech and writing. Online chat has been previously described as a hybrid form of language that is close to speech. Here, the exact nature of online chat is elucidated by focusing on contraction use. Differential acquisition of stylistic variation is also examined according to English learning background. The empirical component consists of data from Korean speakers of English. Data is taken from a written summary, an oral interview, and a text-based online chat session. A multivariate analysis was conducted. Results indicate that online chat is indeed a hybrid form that is difficult to delineate from speech and writing. Text-based online chat shows a somewhat similar rate of contraction to speech, which confirms its hybridity. Lastly, some implications of the study are given in terms of the learning and acquisition of style in general and in online contextual modes.
