Search | Korea Science

Speech syntheis engine for TTS (TTS 적용을 위한 음성합성엔진)

이희만;김지영
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.23 no.6
- /
- pp.1443-1453
- /
- 1998
This paper presents the speech synthesis engine that converts the character strings kept in a computer memory into the synthesized speech sounds with enhancing the intelligibility and the naturalness by adapting the waveform processing method. The speech engine using demisyllable speech segments receives command streams for pitch modification, duration and energy control. The command based engine isolates the high level processing of text normalization, letter-to-sound and the lexical analysis and the low level processing of signal filtering and pitch processing. The TTS(Text-to-Speech) system implemented by using the speech synthesis engine has three independent object modules of the Text-Normalizer, the Commander and the said Speech Synthesis Engine those of which are easily replaced by other compatible modules. The architecture separating the high level and the low level processing has the advantage of the expandibility and the portability because of the mix-and-match nature.
PDF

Improving the Performance of Korean Text Chunking by Machine learning Approaches based on Feature Set Selection (자질집합선택 기반의 기계학습을 통한 한국어 기본구 인식의 성능향상)

Hwang, Young-Sook;Chung, Hoo-jung;Park, So-Young;Kwak, Young-Jae;Rim, Hae-Chang
- Journal of KIISE:Software and Applications
- /
- v.29 no.9
- /
- pp.654-668
- /
- 2002
In this paper, we present an empirical study for improving the Korean text chunking based on machine learning and feature set selection approaches. We focus on two issues: the problem of selecting feature set for Korean chunking, and the problem of alleviating the data sparseness. To select a proper feature set, we use a heuristic method of searching through the space of feature sets using the estimated performance from a machine learning algorithm as a measure of "incremental usefulness" of a particular feature set. Besides, for smoothing the data sparseness, we suggest a method of using a general part-of-speech tag set and selective lexical information under the consideration of Korean language characteristics. Experimental results showed that chunk tags and lexical information within a given context window are important features and spacing unit information is less important than others, which are independent on the machine teaming techniques. Furthermore, using the selective lexical information gives not only a smoothing effect but also the reduction of the feature space than using all of lexical information. Korean text chunking based on the memory-based learning and the decision tree learning with the selected feature space showed the performance of precision/recall of 90.99%/92.52%, and 93.39%/93.41% respectively.
PDF KSCI

"In the Beginning was the Deed": Sigmund Freud's Auditory Imagination

KIM, TaeChul
- English & American cultural studies
- /
- v.9 no.1
- /
- pp.113-139
- /
- 2009
Such is an elective affinity between literary studies and psychoanalysis that the latter sometime serves as a form of literary pedagogy. The affinity mainly consists in their shared concern for language. The signification of language in psychoanalysis is much similar to that of literature. Many of psychoanalytic terms and theoretical tenets bear witness to its dependence clinically on speech phenomena and theoretically on language in general. It is most true of Sigmund Freud, for whom the unconscious is in effect the linguistic unconscious. The Freudian unconscious, compressing and displacing through images and ideas, works as a text for psychoanalysis, which approach has not only paved one of the ways to poststructuralist anti-essentialism but with which literary studies also feel uncanny familiarity. Freudian psychoanalysis, starting empirically from clinical observations, discovers that words exist independent of meanings in the form of things in the unconscious system. Out of the various sensory elements of a word-thing, in psychoanalytic terms, the auditory is central. Now with the auditory imagination cultivated in the clinic, Freud figures out compression and displacement as the chief unconscious works, of which my main argument is that they are based phonetically on heteronym and homonym associations respectively. Compression and displacement work to be masks, which excites Freud's sense of challenge: his is a kind of poststructuralist approach, in the sense that the closed interrelatedness of words without external referents determines the signification in a given situation. But the works of compression and displacement, viewed in auditory terms rather than mapped on to metaphor and metonymy, can provide a new insight for a literary reading of Freud. Pursuing Freud's auditory imagination is not only an attempt to read his writing as literary text rather than for theoretical discussion, but also an experiment with the possibility of literary reading of a theoretical text in the age of after-theory.

Language-Independent Word Acquisition Method Using a State-Transition Model

Xu, Bin;Yamagishi, Naohide;Suzuki, Makoto;Goto, Masayuki
- Industrial Engineering and Management Systems
- /
- v.15 no.3
- /
- pp.224-230
- /
- 2016
The use of new words, numerous spoken languages, and abbreviations on the Internet is extensive. As such, automatically acquiring words for the purpose of analyzing Internet content is very difficult. In a previous study, we proposed a method for Japanese word segmentation using character N-grams. The previously proposed method is based on a simple state-transition model that is established under the assumption that the input document is described based on four states (denoted as A, B, C, and D) specified beforehand: state A represents words (nouns, verbs, etc.); state B represents statement separators (punctuation marks, conjunctions, etc.); state C represents postpositions (namely, words that follow nouns); and state D represents prepositions (namely, words that precede nouns). According to this state-transition model, based on the states applied to each pseudo-word, we search the document from beginning to end for an accessible pattern. In other words, the process of this transition detects some words during the search. In the present paper, we perform experiments based on the proposed word acquisition algorithm using Japanese and Chinese newspaper articles. These articles were obtained from Japan's Kyoto University and the Chinese People's Daily. The proposed method does not depend on the language structure. If text documents are expressed in Unicode the proposed method can, using the same algorithm, obtain words in Japanese and Chinese, which do not contain spaces between words. Hence, we demonstrate that the proposed method is language independent.
https://doi.org/10.7232/iems.2016.15.3.224 인용 PDF KSCI

A Case Study of Line Friends Character TransMedia Branding ('라인 프렌즈' 캐릭터의 트랜스미디어 브랜딩 사례연구)

Chang, Hyo Jin;Kim, Young Jae
- Journal of Korea Society of Digital Industry and Information Management
- /
- v.11 no.2
- /
- pp.153-166
- /
- 2015
This paper proposes a trans-media branding for the trans-media-based cultural content marketing strategy. Trans-media brand analytical framework is proposed with previous studies. And mobile messenger Character 'Line Friends' is analyzed for the text. Trans-media branding is accessible through a multi-platform in the technological environment. Consumer culture, as well as participate include business models to generate revenue also as brand equity. While the character elements that make up the story from the perspective of cultural content storytelling act as an independent cultural goods. Character is segmented elements. Therefore, trans- media branding of the characters are more meaningful. 'Line Friends' trans-media branding can be summarized as follows: First, it takes advantage of the characteristics of the existing Information-Technology-based mobile. Second, it puts consistently found the content of the attributes of Mobile Messenger 'communication' and 'friendship'. And third, while the content of each platform is constantly linked with other platforms, the brand is positioned inside the window effect.
https://doi.org/10.17662/ksdim.2015.11.2.153 인용 PDF KSCI

Text-Independent Speaker Recognition Using Glottal Flow Waveform (성문파형을 이용한 문장독립 화자 인식기)

Yang Ki-Hyuk;Jeon Bumki;Baek SeongJoon;Kang Sang-Ki;Sung Koeng-Mo
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.57-60
- /
- 1999
본 논문에서는 성문파에서 화자특성 계수를 추출하여 화자 인식기에 적용하고자 한다. 공분산 방법으로 음성의 잔류신호를 추정하고 이를 적분하여 성문파를 얻어낸다. 하나의 성문파 구간을 성문닫힘순간 사이가 아닌 잔류신호의 오차가 최대가 되는 순간 사이로 잡았다. 구해진 성문파를 M개의 데이터로 다시 샘플링하여 특성 벡터로 삼고 VQ기반 인식기를 사용하여 인식률을 측정하였다. 4초의 test data와 30차의 특성벡터를 사용한 경우 남성의 경우 평균 $96.08\%$, 여성에 대하여 $93.61\%$의 평균 인식률을 얻었다.
PDF

A Study on the Text-Independent Speaker Recognition Using Frequency Energy (주파수 에너지를 이용한 텍스트 독립 화자인식에 관한 연구)

조연아
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06c
- /
- pp.235-240
- /
- 1994
모음 검출을 통하여 미리 등록한 단어가 아닌 경우에도 화자를 인식할 수 있도록 특징 파라메터를 개발하고, 실용화가 가능하도록 처리 방법을 간략화한 텍스트 독립 화자 인식 연구를 진행하였다. 이를 위해서, 화자가 발성한 음성에서 모음을 검출하여 화자인식에 사용하는 방법을 제안하였으며, 인식은 각 화자가 발성한 음성 신호에서 모음을 검출한 다음, 검출된 모음의 29 채널의 주파수 에너지를 퍼지값으로 효현한 후, 퍼지 추론을 적용하여 수행하였다. 실험을 위해 모음 검출 알고리듬을 개발하였으며, 화자인식의 특징 파라메터로 29 채널 주파수 에너지를 제안하였는데, 별도의 코드북 없이 사용이 가능하고, 기존의 파라메터에 비해 인식율이 높으면서도 구성 및 계산이 간단한 특징이 있다. 실험결과, 미리 작성된 표준패턴과 동일한 단어를 사용한 텍스트 의존 화자 인식 실험은 95.5% 인식율을 보였고, 표준 패턴과 다른 종류의 단어를 사용한 텍스트 독립 화자인식 실험은 94.2% 인식율을 보이고 있다.
PDF

Text Independent Speaker Identification Using Separate Matrix Quantization (분할 매트릭스 부호화를 이용한 문장 독립형 화자인식 시스템)

경연정;이황수
- The Journal of the Acoustical Society of Korea
- /
- v.17 no.5
- /
- pp.69-72
- /
- 1998
본 논문에서는 문장독립형 화자인식 시스템에 MQ(Matrix Quantization) 방법 사용 을 제안한다. 또한 인식율을 높이기 위해 MQ를 수정한 방법인 SMQ(Separated Matrix Quantization)을 제안한다. 기존의 VQ-distortion 방법은 대체로 좋은 성능을 가지나 화자의 동적 특성을 이용하지 못한다는 단점이 있다. MQ와 SMQ는 화자의 동적 특성을 이용할 수 있으므로 시간 변화에 대한 화자의 특징 변화까지 모델링 할 수 있는 장점이 있다. MQ는 여러 프레임을 묶어 Matrix Codebook을 가지며 SMQ는 MQ의 기본 codebook을 다시 켑스 트럼의 차수에 따라 나누어 codebook을 만든다. 즉, 켑스트럼 차수를 저, 중, 고차로 나누어 각 부분별로 Matrix codebook을 만들도록 한다. 인식실험은 문장독립 음성 데이터에 대해 실행했으며 MQ모델의 경우 Matrix의 크기를 짧은 음소크기부터 음절단위까지 변화시켜 실 험하였다. 아울러 SMQ 모델에서의 실험은 차수별 유용도를 보기 위하여 부분 차수를 이용 하여 실험하였다. 실험결과 MQ와 SMQ방법이 VQ에 비해 좋은 성능을 가짐을 확인하였다.
PDF

The bootstrap VQ model for automatic speaker recognition system (VQ 방식의 화자인식 시스템 성능 향상을 위한 부쓰트랩 방식 적용)

Kyung YounJeong;Lee Jin-Ick;Lee Hwang-Soo
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.39-42
- /
- 2000
A bootstrap and aggregating (bagging) vector quantization (VQ) classifier is proposed for speaker recognition. This method obtains multiple training data sets by resampling the original training data set, and then integrates the corresponding multiple classifiers into a single classifier. Experiments involving a closed set, text-independent and speaker identification system are carried out using the TIMIT database. The proposed bagging VQ classifier shows considerably improved performance over the conventional VQ classifier.
PDF

A Study on The Characteristics of The Avant-garde′s Style Expressed in Modern Fashion (현대 복식에 표현된 아방가르드의 유형별 특성 연구)

엄소희;김문숙
- The Research Journal of the Costume Culture
- /
- v.8 no.2
- /
- pp.315-333
- /
- 2000
The purpose of this study is to find out how the aesthetic values and characters of the Avant-garde fashion through semantics analysis of Avant-garde experiments in the early 20th century. Inner expressions of Avant-garde fashion in future dynamism, alien-hostile, and surreal-experimentalism are as followings (1) Reject tradition of existing fashion concept, (2) Dismantle costume material and inter-text characteristics in fashion field, (3) Laugh at material civilization and elite fashion, (4) Pursue primitive and fundamental sensibility on non-civilized world (5) Express human estrangement due to material civilization, (6) Remove the barrier of fashion between luxury and cheap ones, (7) Time, space and purpose is mixed, (8) Open concept as space structure independent of human body, (9) Complicatedness, ambiguity and expression of irregularity as changeableness, (10) Dismantle concept of beauty and ugliness. As you see, fashion design in modern Avan-garde is pursuing newness as beauty of open concept, rejecting all modern tradition and allowing extremity such as experimental, illogic, unreasonable and non-formatted expressions.
PDF

Search Result 237, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)