• Title/Summary/Keyword: 휴지 단위 (pause unit)

Study on the realization of pause groups and breath groups (휴지 단위와 호흡 단위의 실현 양상 연구)

  • Yoo, Doyoung; Shin, Jiyoung
    • Phonetics and Speech Sciences / v.12 no.1 / pp.19-31 / 2020
  • This study observes how adult speakers realize pause groups and breath groups and examines how gender, generation, and task affect that realization. We analyzed forty-eight male and female speakers, divided into two generations: young and old. Task and gender affected the realization of both pause groups and breath groups. Pause groups were longer in read speech than in spontaneous speech, and longer in female speech. Breath groups, on the other hand, were longer in spontaneous speech and in male speech. In spontaneous speech, which requires planning, speakers produced shorter pause groups. The shorter breath groups in read speech are attributed to the short sentences in the reading material. The gender difference resulted from differences in pause patterns between genders. In the case of breath groups, male speakers produced longer durations than female speakers did, which may be due to the difference in lung capacity between genders. Generation did not affect either the pause groups or the breath groups. The generation factor influenced only the number of syllables and eojeols, which can be interpreted as a result of the difference in speech rate between generations.

The Modeling of Pause Duration For Text-To-Speech Synthesis System (TTS 시스템을 위한 휴지기간 모델링)

  • Chung Jihye; Lee Yanhee
    • Proceedings of the Acoustical Society of Korea Conference / spring / pp.83-86 / 2000
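  • This paper proposes a pause detection and pause duration prediction model to improve the naturalness of speech produced by a synthesis system that uses non-uniform units. The proposed pause duration model uses CART (Classification And Regression Trees), a tree-based modeling technique. For this purpose, a sentence speech database was built from 400 sentences uttered by a single male speaker, containing 6,220 eojeol boundaries, and the optimal tree was determined from this database by V-fold cross-validation. In evaluation, pause detection accuracy was 81% overall: 83% for boundaries with a pause and 80% for boundaries without one. The multiple correlation coefficient between actual and predicted pause durations was 0.84, and accuracy within an error margin of 20 ms was 88%. In a listening test, synthesized speech with the predicted pause durations applied was judged broadly similar to natural speech.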

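The two duration metrics quoted in the abstract above (accuracy within a 20 ms error margin, and the correlation between actual and predicted pause durations) can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the function names and the sample durations are hypothetical, and a plain Pearson correlation stands in for the paper's multiple correlation coefficient.

```python
import math


def tolerance_accuracy(actual_ms, predicted_ms, tol_ms=20.0):
    """Fraction of predictions whose absolute error is at most tol_ms."""
    hits = sum(1 for a, p in zip(actual_ms, predicted_ms) if abs(a - p) <= tol_ms)
    return hits / len(actual_ms)


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical pause durations in milliseconds (0 means no pause at that boundary).
actual = [0, 120, 300, 0, 450]
predicted = [10, 100, 340, 0, 445]

print(tolerance_accuracy(actual, predicted))  # share of predictions within 20 ms
print(pearson_r(actual, predicted))           # correlation of actual vs. predicted
```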

Breath and Memory in Speech based on Quantitative Analysis of Breath Groups and Pause Units in Korean (언어 수행에서의 호흡과 기억 -호흡 단위와 휴지 단위의 양적 분석 결과를 바탕으로-)

  • Shin, Jiyoung
    • Korean Linguistics / v.79 / pp.91-116 / 2018
  • This paper raises issues of breath and memory in speech based on a quantitative analysis of breath groups and pause units in Korean. As human beings, we face two kinds of limitations on continuing speech: breath and memory. The prosodic and temporal structures of spontaneous speech data from six speakers were closely examined. One of the main findings of the present study is that the prosodic structure and temporal structure of Korean appear to reflect the breath and memory constraints on speech.

A Study on Implementation of Emotional Speech Synthesis System using Variable Prosody Model (가변 운율 모델링을 이용한 고음질 감정 음성합성기 구현에 관한 연구)

  • Min, So-Yeon; Na, Deok-Su
    • Journal of the Korea Academia-Industrial cooperation Society / v.14 no.8 / pp.3992-3998 / 2013
  • This paper presents a method for adding an emotional speech corpus to a high-quality, large-corpus-based speech synthesizer in order to generate varied synthesized speech. We built the emotional speech corpus in a form usable by a waveform-concatenation speech synthesizer and implemented a synthesizer that can generate varied synthesized speech through the same unit selection process as a normal speech synthesizer. We used a markup language for emotional input text. Emotional speech is generated when the input text matches an intonation phrase in the emotional speech corpus over its full length; otherwise, normal speech is generated. The BIs (Break Indices) of emotional speech are more irregular than those of normal speech, so it is difficult to use the BIs generated in a synthesizer as they are. To solve this problem, we applied Variable Break[3] modeling. We used a Japanese speech synthesizer for the experiment. As a result, we obtained natural emotional synthesized speech using the break prediction module of the normal speech synthesizer.

Manchester coding of compressed binary clusters for reducing IoT healthcare device's digital data transfer time (IoT기반 헬스케어 의료기기의 디지털 데이터 전송시간 감소를 위한 압축 바이너리 클러스터의 맨체스터 코딩 전송)

  • Kim, Jung-Hoon
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.8 no.6 / pp.460-469 / 2015
  • This study aims to reduce the big-data transfer time of IoT healthcare devices. Binary data are compressed in two steps into primary and secondary compressed binary clusters, applied to binary clusters that yield a compression benefit of 1 or 2 bits, and the digital bits are then modulated into Manchester code in which a zero-voltage idle signal marks the boundaries of the secondary compressed binary clusters. The study also proposes that inserting the idle signal into the Manchester code as cluster-boundary information reduces transfer time when a binary cluster is compressed into a secondary compressed binary cluster by 2 bits: although the idle costs 1 clock, the remaining 1-bit benefit saves 1 clock of transfer time. The idle signal is never consecutive, because it only marks the boundary between two adjacent secondary compressed binary clusters. The voltage transitions required by the basic rule of Manchester code are preserved when the idle signal is inserted, so DC balance is guaranteed. The simulation results show that even binary data already compressed by other compression algorithms could be transferred about 12.6 percent faster with this method.
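
The idea of using a zero-voltage idle clock as a cluster delimiter inside a Manchester-coded stream can be illustrated with a small sketch. It is not the paper's scheme: the compression steps are omitted, the function names are hypothetical, and the bit-to-level mapping (0 as high then low, 1 as low then high) is one common convention assumed here.

```python
def manchester_encode(clusters):
    """Encode bit-string clusters as a list of half-clock voltage levels.

    Assumed convention: bit 0 -> (+1, -1), bit 1 -> (-1, +1).
    One zero-voltage idle clock (0, 0) is inserted between adjacent clusters.
    """
    levels = []
    for index, cluster in enumerate(clusters):
        if index > 0:
            levels.extend((0, 0))  # idle clock marks the cluster boundary
        for bit in cluster:
            levels.extend((-1, +1) if bit == "1" else (+1, -1))
    return levels


def manchester_decode(levels):
    """Recover the clusters, splitting wherever an idle clock appears."""
    clusters, current = [], []
    for i in range(0, len(levels), 2):
        pair = (levels[i], levels[i + 1])
        if pair == (0, 0):
            clusters.append("".join(current))
            current = []
        elif pair == (-1, +1):
            current.append("1")
        elif pair == (+1, -1):
            current.append("0")
        else:
            raise ValueError(f"invalid Manchester symbol at clock {i // 2}")
    clusters.append("".join(current))
    return clusters


levels = manchester_encode(["101", "01"])
print(levels)                     # half-clock levels, with one idle clock in the middle
print(sum(levels))                # 0: every data clock is balanced and idle adds nothing
print(manchester_decode(levels))  # the original clusters
```

Because each data clock contributes one positive and one negative half and the idle contributes zero, the summed level stays at zero, which is the DC-balance property the abstract refers to.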

Prosodic Phrase Boundary Estimation for Continuous Speech Recognition (운율구 단위의 음성인식을 위한 운율구 개수 추정)

  • 강지영
    • Proceedings of the Acoustical Society of Korea Conference / 1998.08a / pp.218-221 / 1998
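  • As a way to improve Korean speech recognizers, this paper proposes speech recognition in units of prosodic phrases and presents a method for predicting prosodic phrase boundaries. The reference data for the experiments came from a prosodic-phrase listening test on 100 sentences read at normal speed by a male speaker of the Seoul dialect and 100 sentences read by a female announcer from a school broadcasting station. When points of strong prosodic boundary strength, identified from pitch information and pause boundary information, were predicted as prosodic phrase boundaries, the average prediction rate was about 70%.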


A Study on Detection of Accentual Phrase's Boundaries according to Reading Speeds (낭독속도에 따른 강세구 경계 검출에 관한 연구)

  • Ju Jangkyu; Lee Kiyoung; Song Minsuck
    • Proceedings of the Acoustical Society of Korea Conference / spring / pp.91-94 / 2000
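  • Many recent linguistic studies on prosodic structure, sentence structure, and phonological rules have demonstrated the usefulness of prosodic information for recovering semantic information, sentence structure, discourse structure, and the like in language understanding, yet these results have rarely been applied in recent speech recognition systems. This study proposes a segmentation method, based on a hierarchical approach, for detecting prosodic phrases in Korean continuous speech. First, pauses were used to detect sentence-level boundaries in the input speech; accentual phrase boundaries were then detected with reference to energy, pause duration, and pitch contour. The text of the test speech was "Manmulsang" (만물상), and speech data read at fast and normal speeds by two male and two female speakers of standard Korean were compared.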

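The first stage described in the abstract above, locating pauses from energy and duration, might look like the following sketch. It covers only pause detection on a sequence of per-frame energies and leaves out the pitch-contour step; the function name, threshold, and minimum length are hypothetical choices, not values from the paper.

```python
def find_pauses(frame_energy, threshold, min_frames):
    """Return (start, end) frame index pairs, end exclusive, for every run of
    frames whose energy is below threshold and that lasts at least min_frames."""
    pauses = []
    start = None
    for i, energy in enumerate(frame_energy):
        if energy < threshold:
            if start is None:
                start = i  # a low-energy run begins here
        elif start is not None:
            if i - start >= min_frames:
                pauses.append((start, i))
            start = None
    # Close a run that extends to the end of the signal.
    if start is not None and len(frame_energy) - start >= min_frames:
        pauses.append((start, len(frame_energy)))
    return pauses


# Hypothetical per-frame energies; the single low frame at index 6 is too short to count.
energy = [5, 5, 0, 0, 0, 5, 0, 5, 5, 0, 0, 0, 0]
print(find_pauses(energy, threshold=1, min_frames=3))
```

Requiring a minimum run length keeps brief low-energy frames, such as stop closures, from being treated as boundaries.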

The expression of human Spt16 is associated with cell proliferation (인간 Spt16 단백질 발현과 세포 증식 사이의 연관성에 관한 연구)

  • Gwak, Jung-Sug; Cho, Mun-Ju; Ryu, Min-Jung; Oh, Sang-Taek
    • Journal of Life Science / v.17 no.3 s.83 / pp.381-385 / 2007
  • Facilitates chromatin transcription (FACT) is a chromatin-specific elongation factor required for transcription of chromatin templates in vivo and in vitro. FACT consists of the human homologue of the Saccharomyces cerevisiae Spt16/Cdc68 protein (hSpt16) and the high mobility group-1-like protein structure-specific recognition protein-1 (SSRP-1). Here we show, using both immunofluorescence and western blot analysis, that the protein level of hSpt16 is massively down-regulated in quiescent T98G cells. In contrast, we observe a high level of hSpt16 expression in proliferative T98G cells. Interestingly, the expression of SSRP-1 is not altered in either the quiescent or the proliferative state. Taken together, our findings indicate that the expression of hSpt16 is associated with the proliferative state and can be used as a proliferation marker.

Zeolitization of the Dacitic Tuff in the Miocene Janggi Basin, SE Korea (장기분지 데사이트질 응회암의 불석화작용)

  • Kim, Jinju; Jeong, Jong Ok; Shinn, Young-Jae; Sohn, Young Kwan
    • Economic and Environmental Geology / v.55 no.1 / pp.63-76 / 2022
  • Dacitic tuffs, 97 to 118 m thick, were recovered from the lower part of the subsurface Seongdongri Formation, Janggi Basin, which was drilled to assess the potential for underground storage of carbon dioxide. The tuffs are divided into four depositional units (Units 1 to 4) based on internal structures and particle componentry. Unit 1 and Units 3/4 are ignimbrites that accumulated in subaerial and subaqueous settings, respectively, whereas Unit 2 consists of braided-stream deposits that accumulated during a volcanic quiescence and contains no dacitic tuff. A series of analyses shows that mordenite and clinoptilolite mainly fill the vesicles of glass shards, suggesting their formation by replacement and dissolution of volcanic glass and precipitation from interstitial water during burial and diagenesis. Glass-replaced clinoptilolite has higher Si/Al ratios and Na contents than the vesicle-filling clinoptilolite in Unit 3. However, the composition of clinoptilolite becomes identical in Unit 4, irrespective of its occurrence and location. This suggests that the Si/Al ratio and pH of the interstitial water increased with time because of the replacement and leaching of volcanic glass, and that the composition of the interstitial water differed between the eastern and western parts of the basin during the formation of the clinoptilolite in Units 1 and 3. It is also inferred that the two zeolite minerals formed sequentially according to the depositional units, i.e., the clinoptilolite formed after the growth of mordenite. To summarize, during a volcanic quiescence after the deposition of Unit 1, pH was higher in the western part of the basin because of eastward tilting of the basin floor, and the zeolite ceased to grow because the pore space was closed by the growth of smectite. On the other hand, clinoptilolite could grow in the eastern part of the basin in an open system affected by groundwater, where a braided stream developed. Afterwards, Units 3 and 4 were submerged because of basin subsidence, and the alkali content of the interstitial water increased gradually, eventually becoming identical in the eastern and western parts of the basin. This study thus shows that volcanic deposits of similar composition can have a variable distribution of zeolite minerals depending on the drainage and depositional environment of the basin.

A comparative study of prosodic features according to the syntactic diversities between children with reading disability and nondisabled children (읽기장애아동과 일반아동의 통사적 다양성에 따른 운율 특성 비교)

  • Park, Sungsook; Seong, Cheoljae
    • Phonetics and Speech Sciences / v.13 no.4 / pp.55-66 / 2021
  • Proper prosody in reading allows the reader to convey meaning naturally, and it manifests as changes in pitch, loudness, and speech rate. Children with reading disability have difficulty delivering information because of poor prosody. This study identified differences in prosodic features between children with reading disability and nondisabled children by means of reading tasks. Reading tasks organized by sentence type (short sentences, assumptions/conditions, intentions, relative clauses) were recorded by 15 children in the 3rd to 6th grades of elementary school. In sentence-final and sentence-medial words, children with reading disability showed a significantly wider pitch range, slower speech rate, more frequent pauses, longer total pause duration, and steeper pitch slope than nondisabled children. Children with reading disability therefore read less naturally and expressively than nondisabled children. This study identified the prosodic characteristics of children with reading disability and pointed to the need for an effective intervention approach.