• Title/Summary/Keyword: 음합성

Search Result 333, Processing Time 0.025 seconds

Perception of Japanese word-initial stops by native listeners (모어청자에 의한 일본어 어두 폐쇄음의 지각)

  • Byun, Hi-Gyung
    • Phonetics and Speech Sciences
    • /
    • v.13 no.3
    • /
    • pp.53-64
    • /
    • 2021
  • It is known that the voicing contrast for Japanese word-initial stops is primarily realized as differences in the voice onset time (VOT). However, recent studies have reported that voiced stops are more often produced with a positive VOT than with a negative VOT among the younger generation nationwide. It is also known that post-stop F0 is associated with the stop contrast, but the degree of F0 use differs from region to region. This study explores whether the difference in post-stop F0 functions as a perceptual cue to the stop contrast along with VOT. Fifty-five college students who are native listeners from four different regions participated in two or three perception tests. The results show that VOT is a primary cue to the voiced-voiceless distinction of word-initial stops, but that the effect of post-stop F0 on the stop contrast is marginal. The post-stop F0 is involved in perception only when VOT is ambiguous, such that a sound with high F0 is more often perceived as a voiceless stop, but not vice versa. The results of this study indicate that the acoustic parameters associated with the stop contrast are not the same in production and perception, and suggest that other factors such as context, which is not an acoustic characteristic, may also be involved in the stop contrast.

A Study on Speech Synthesizer Using Distributed System (분산형 시스템을 적용한 음성합성에 관한 연구)

  • Kim, Jin-Woo;Min, So-Yeon;Na, Deok-Su;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.3
    • /
    • pp.209-215
    • /
    • 2010
  • Recently portable terminal is received attention by wireless networks and mass capacity ROM. In this result, TTS(Text to Speech) system is inserted to portable terminal. Nevertheless high quality synthesis is difficult in portable terminal, users need high quality synthesis. In this paper, we proposed Distributed TTS (DTTS) that was composed of server and terminal. The DTTS on corpus based speech synthesis can be high quality synthesis. Synthesis system in server that generate optimized speech concatenation information after database search and transmit terminal. Synthesis system in terminal make high quality speech synthesis as low computation using transmitted speech concatenation information from server. The proposed method that can be reducing complexity, smaller power consumption and efficient maintenance.

Synthesis of Multiplexed MACE Filter for Optical Korean Character Recognition (인쇄체 한글의 광학적 인식을 위한 다중 MACE 필터의 합성)

  • 김정우;김철수;배장근;도양회;김수중
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.12
    • /
    • pp.2364-2375
    • /
    • 1994
  • For the efficient recognition of printed Korean characters, a multiplexed minimum average correlation energy(MMACE) filter is proposed. Proposed method solved the disadvantages of the tree structure algorithm which recognition system is very huge and recognition method is sophisticated. Using only one consonant MMACE filter and one vowel one, we recognized the full Korean character. Each MMACE filter is multiplexed by 4 K-tuple MACE filters which are synthesized by 24 consonants and vowels. Hence the proposed MMACE filter and the correlation distribution plane are divided by 4 subregion. We obtained the binary codes for the Korean character recognition from each correlation distribution subplane. And the obtained codes are compared with the truth table for consonants and vowels in computer. We can recognize the full Korean characters when substitute the corresponded consonant or vowel font of the consistent code to the correlation peak place in the output correlation plane. The computer simulation and optical experiment results show that the proposed compact Korean character recognition system using the MMACE filters has high discrimination capability.

  • PDF

Constraints for the Design of Room Reverberation Filter by Using 5-DOF Reverberation Model (5자유도 잔향 모델을 이용한 실내 잔향 필터 설계를 위한 조건)

  • 김소희;김양한
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.2
    • /
    • pp.58-65
    • /
    • 2001
  • Recently, a 5-degrees-of-freedom (DOF) reverberation model was proposed as a method of representing subjective perception of reverberation as objective measures[1]. This model approximates sound energy decay curve by five objective measures, widely used in which have been concert hall acoustics. However, it is note worthy that there can be infinite number of impulse responses which correspond to a selected 5-DOF reverberation model. There may exist some filters making very unnatural and unrealistic sound. In this paper, the limitation of the 5-DOF reverberation model when it is used as a filter design criteria is investigated. When a 5-DOF reverberation model is given, additional constraints to get natural reverberation are suggested. This is based on the listening tests for several quite different source sounds.

  • PDF

Studies on Properties with Different Filler and Content in Pb-free Sealing Frit for Electronic Devices

  • An, Yong-Tae;Choe, Byeong-Hyeon;Ji, Mi-Jeong;Jang, U-Seok;Lee, Jun-Ho;Hwang, Hae-Jin
    • Proceedings of the Korean Institute of Electrical and Electronic Material Engineers Conference
    • /
    • 2009.11a
    • /
    • pp.181-181
    • /
    • 2009
  • 전자부품용 Pb-free sealing frit의 열팽창계수를 기판에 matching 시키기 위하여 음의 열팽창계수를 가지고 있는 $\beta$-Eucryptite, $\beta$-Spodumene를 합성하여 filler로 첨가하였다. 합성된 filler는 저온소성용 유리프리트의 높은 열팽창계수를 조절하기 용이하고, 유리프리트와 복합화 하여 소성하면 낮은 열팽창계수로 인한 우수한 열충격 저항성을 갖는다. Filler로써 $\beta$-Eucryptite, $\beta$-Spodumene의 결정성을 향상시키기 위해 $1250^{\circ}C$에서 5 시간 동안 유지하는 합성공정을 3회 반복 진행한 후 XRD를 사용하여 결정성을 분석하였고, TMA를 이용하여 filler 첨가량에 따른 유리프리트의 열팽장계수의 변화를 측정하였다. 또한, filler 입도와 함량에 따른 melting 특성을 분석하기 위해 Pill test를 진행하였으며, soda-lime glass 기판과의 접합면을 SEM을 사용하여 관찰하였다.

  • PDF

Speech Modification and Concatenative Speech Synthesis by using Analysis-By-Synthesis/OverLap-Add(ABS/OLA) Sinusoidal Model (Analysis- By-Synthesis/OverLap- Add( ABS/OLA) Sinusoidal Model 을 이용한 음성변환과 연결음성합성)

  • 구자형
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.08a
    • /
    • pp.339-343
    • /
    • 1998
  • Sinusoidal model 은 음성신호처리의 넓은 분야에 적용되고 있는 방법으로 고음질의 합성음을 생성해 낼 수 있고, 조작이 용이하다는 장점을 가지고 있다. 본 논문에서는 Analysis-by-synthesis/Overlap-add Sinusoidal model 이라는 방법을 이용하여 시간축 변환과 dam성 변환을 수행하였다. 특히 본 논문에서는 음질향상을 위하여 시간축 변환시에는 정적인 구간과 변화하는 구간을 구별하여 서로 다른 시간축 변환비를 이용하였고, 기존의 LPC 방법에 비해 스펙트럼 포락선을 보다 잘 추정하는 Improved Cepstrum을 이용하여 음정변환에 적용하였다. 또 서로 다른 문맥에서 얻어진 음성단위들을 결합할 때 생기는 위상차이를 극복하기 위하여, 기본주파수 성분이 일치하도록 시간축을 이동하여 합성하였다. 실험결과 본 논문에서 적용한 방법들을 통해 기존 방식에 비해 개선된 음질을 얻을 수 있었다.

  • PDF

Application of Central Composite Design in Simulation Experiment (시뮬레이션 실험에서 중심합성계획의 응용)

  • 권치명
    • Proceedings of the Korea Society for Simulation Conference
    • /
    • 2004.05a
    • /
    • pp.41-47
    • /
    • 2004
  • 중심합성계획(central composite design: ccd)은 반응 표면이 곡면적인 특성을 나타낼때 반응 공간을 추정하기 위해 사용되는 실험계획이다. 반응공간이 2차 회귀모형으로 나타나는 경우에 반응곡면의 변화량을 알기 위해서는 변수의 수준이 3이상이 되어야하는데 ccd는 적은 횟수의 실험으로 곡면을 효과적으로 추정하기 위해 2$^{k}$ 요인실험에 추가적으로 중심점(central point)과 축점(axial point)을 표본점에 포함시키는 계획이다. 본 연구에서는 시뮬레이션 실험에서 반응변수가 2차 회귀모형으로 근사되는 경우에 cod를 이용하여 관심 성과치의 반응표면을 추정하고자 한다. 일반적인 실험에서와는 달리 시뮬레이션 실험에서는 두개의 표본점(인자 수준의 조합)에서 분석자가 공통 난수계열(common random number series)을 부여하여 시뮬레이션 시스템 요소의 변화과정을 유사하게 통제할 수 있다. 일반적으로 공통난수법(common random number method)에 의해 얻어지는 두 표본점에서의 반응변수는 서로 양의 상관관계를 가지며 대조 난수(antithetic random number)에 의한 두 반응변수는 음의 상관성을 가지는 것으로 알려졌다. 본 연구는 ccd의 표본점에 공통난수와 대조난수 법을 이용하여 회귀모형의 파라미터를 효과적으로 추정하는 방법을 조사하고 이를 (s, S) 재고관리 모형에 적용하여 그 효율성을 평가하고자 한다.

  • PDF

A Study on the Generation of Multi-syllable Nonsense Wordset for the Assessment of Synthetic Speech (합성음성평가를 위한 다음절 무의미단어 생성과 이용에 관한 연구)

  • Jo, Cheol-Woo;Kim, Kyung-Tae;Lee, Yong-Ju
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.5
    • /
    • pp.51-58
    • /
    • 1994
  • These times many kinds of man-machine Interfaces using speech signal, speech recognizers or speech synthesizers, are proposed and utilized in practice. Especially speech synthesis system is widely used in our life. But its assessment method is still in its first stage. In this paper we propose a method to generate multi-syllable nonsense wordset for the purpose of synthetic speech assessment and applies the wordset to one commercial text-to-speech system. Some results about the experiment is suggested and it is verified that the method to generate a nonsense wordset can be used to assess the intelligibility of the synthesizer in phoneme level or in phonemic environmental level.

  • PDF

Physical and Structural Properties of Amorphous Carbon Films Synthesized by Magnetron Sputtering Method (마그네트론 스퍼터링법에 의해 합성되어진 비정질 탄소박막들의 구조적, 물리적 특성)

  • Park, Yong-Seob;Cho, Hyung-Jun;Hong, Byung-You
    • Journal of the Korean Vacuum Society
    • /
    • v.16 no.2
    • /
    • pp.122-127
    • /
    • 2007
  • In this research, amophous carbon films (a-C, a-C:H, a-C:N) were synthesized by closed-field unbalanced magnetron (CFUBM) sputtering using graphite target. We also fabricated amorphous carbon films with applying negative DC bias voltage of 200 V in during the deposition in working pressure. Also, a-C:H and a-C:N films was synthesized by adding acethylene($C_{2}H_{2}$) and nitrogen(N) gases of 4 and 3 sccm into Ar pressure. The a-C:H film synthesized at -200 V exhibited the maxumum hardness of 26.3 GPa, the smooth surface of 0.1 nm and the good adhesion of 30.5 N. And a-C:N film synthesized at -200 V exhibited at -200 V exhibited the best adhesion of 32 N. This paper examined the effect of $C_{2}H_{2}$ gas, $N_{2}$ gas and negative DC bias voltage as the parameter for improving the physical properties and the relation between structral and physical properties of carbon films.

The role of voice onset time (VOT) and post-stop fundamental frequency (F0) in the perception of Tohoku Japanese stops (도호쿠 일본어의 폐쇄음 지각에 있어서 voice onset time(VOT)과 후속모음 fundamental frequency(F0)의 역할)

  • Hi-Gyung Byun
    • Phonetics and Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.35-45
    • /
    • 2023
  • Tohoku Japanese is known to have voiced stops without pre-voicing in word-initial position, whereas traditional or conservative Japanese has voiced stops with pre-voicing in the same position. One problem with this devoicing of voiced stops is that it affects the distinction between voiced and voiceless stops because their voice onset time (VOT) values overlap. Previous studies have confirmed that Tohoku speakers use post-stop fundamental frequency (F0) as an acoustic cue along with VOT to avoid overlap. However, the role of post-stop F0 as a perceptual cue in this region has barely been investigated. Therefore, this study explored the role of post-stop F0 in stop voicing perception along with VOT. Several perception tests were conducted using resynthesized stimuli, which were manipulated along a VOT continuum orthogonal to an F0 continuum. The results showed no significant regional difference (Tohoku vs. Chubu) for nonsense words (/ta-da/). However, for meaningful words (/pari/ 'Paris' vs. /bari/ 'Bali,' /piza/ 'pizza' vs. /biza/ 'visa'), a significant word effect was found, and it was confirmed that some listeners utilized the post-stop F0 more consistently and steadily than others. Based on these results, we discuss innovative listeners who may lead the change in the perception of stop voicing.