• Title/Summary/Keyword: speech style

A Study on Verification of Back TranScription(BTS)-based Data Construction (Back TranScription(BTS)기반 데이터 구축 검증 연구)

  • Park, Chanjun;Seo, Jaehyung;Lee, Seolhwa;Moon, Hyeonseok;Eo, Sugyeong;Lim, Heuiseok
    • Journal of the Korea Convergence Society / v.12 no.11 / pp.109-117 / 2021
  • Speech-based interfaces are increasingly used for human-computer interaction (HCI), and interest in post-processors that correct errors in speech recognition results is growing accordingly. However, building a sequence-to-sequence (S2S) speech recognition post-processor requires a great deal of human labor for data construction. To alleviate the limitations of the existing construction methodology, a new data construction method called Back TranScription (BTS) was proposed. BTS combines TTS and STT technology to create a pseudo-parallel corpus. This methodology eliminates the role of the phonetic transcriber and can automatically generate vast amounts of training data, which reduces cost. Extending the existing BTS research, this paper verifies through experiments that data should be constructed with text style and domain in mind rather than without any criteria.
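
The BTS idea described in this abstract (text to speech, then speech back to text, pairing the noisy output with the original sentence) can be sketched as follows. This is a minimal illustration, not the authors' implementation: `build_bts_corpus`, `toy_tts`, and `toy_stt` are hypothetical names, and the toy engines only simulate recognition errors by dropping case and punctuation.

```python
from typing import Callable, Iterable, List, Tuple


def build_bts_corpus(
    sentences: Iterable[str],
    tts: Callable[[str], bytes],
    stt: Callable[[bytes], str],
) -> List[Tuple[str, str]]:
    """Return (noisy_source, clean_target) pairs for post-processor training."""
    pairs = []
    for gold in sentences:
        audio = tts(gold)      # synthesize speech from the clean sentence
        noisy = stt(audio)     # transcribe it back, picking up recognition errors
        pairs.append((noisy, gold))
    return pairs


# Hypothetical stand-ins for real TTS/STT engines (assumption, for illustration only).
def toy_tts(text: str) -> bytes:
    return text.encode("utf-8")


def toy_stt(audio: bytes) -> str:
    text = audio.decode("utf-8").lower()
    # Simulate typical STT output: no casing, no punctuation.
    return "".join(ch for ch in text if ch.isalnum() or ch.isspace())


corpus = build_bts_corpus(["Hello, world."], toy_tts, toy_stt)
```

In a real pipeline the two callables would wrap actual TTS and STT systems, and the input sentences would be chosen by text style and domain, which is the point the abstract argues for.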

An Analysis of Short and Long Syllables of Sino-Korean Words Produced by College Students with Kyungsang Dialect (경상방언 대학생들이 발음한 국어 한자어 장단음 분석)

  • Yang, Byunggon
    • Phonetics and Speech Sciences / v.7 no.4 / pp.131-138 / 2015
  • The initial syllables of a pair of Sino-Korean words are generally differentiated in meaning by either short or long duration. They are realized differently depending on the dialect and generation of the speakers. Recent research has reported that this temporal distinction has gradually faded away. The aim of this study is to examine whether college students with the Kyungsang dialect make the distinction temporally, using mixed-effects models. Thirty students participated in the recording of five pairs of Korean words in clear or casual speaking styles. The author then measured the durations of the initial syllables of the words, made a descriptive analysis of the data, and applied mixed-effects models with gender, length, and style as fixed effects and subject and syllable as random effects to test their effects on initial syllable duration. First, the results showed that college students with the Kyungsang dialect did not produce the long and short syllables distinctively; there was no statistically significant difference between them. Secondly, there was a significant difference in the duration of the initial syllables between male and female students. Thirdly, there was also a significant difference in the duration of the initial syllables produced in the clear and casual styles. The author concluded that college students with the Kyungsang dialect do not produce long and short Sino-Korean syllables distinctively, and that any statistical analysis of the temporal aspect should be made carefully, considering both fixed and random effects. Further studies would be desirable to examine the production and perception of the initial syllables by speakers from various dialect, generation, and age groups.
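
One way to write down the model structure named in this abstract is given below. It is a sketch under assumptions: the abstract lists the fixed and random effects but not the exact specification, so the random-intercept-only form, the absence of interaction terms, and all symbols are illustrative.

```latex
% d_{ijk}: duration of the initial syllable for subject i, syllable j, token k
% Fixed effects: gender, length (long/short), style (clear/casual)
% Random intercepts: subject (u_i) and syllable (w_j)
d_{ijk} = \beta_0
        + \beta_1\,\mathrm{gender}_{i}
        + \beta_2\,\mathrm{length}_{j}
        + \beta_3\,\mathrm{style}_{ijk}
        + u_i + w_j + \varepsilon_{ijk},
\qquad
u_i \sim \mathcal{N}(0, \sigma_u^2),\quad
w_j \sim \mathcal{N}(0, \sigma_w^2),\quad
\varepsilon_{ijk} \sim \mathcal{N}(0, \sigma^2)
```

Under this form, the reported finding that long and short syllables are not distinguished corresponds to a length coefficient $\beta_2$ that is not significantly different from zero.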

Sasang Constitution Classification by Speech Signal Processing (음성 신호 분석에 의한 사상 체질 분류)

  • Cho Dong-Uk
    • The Journal of Korean Institute of Communications and Information Sciences / v.31 no.5C / pp.548-555 / 2006
  • This paper proposes a method for Sasang constitution classification, which is the most important task in Sasang constitutional medicine. Existing classification methods rely on body shape, countenance, morphological aspects, and temperament. Many diagnostic methods have been developed and used, including questionnaires on personal lifestyle and propensities (QSCC, QSCC II) and tonal analysis of a person's voice. Recently, constitutional acupuncture and herbal medicine response analyses have also been developed and used. However, these methods depend on the doctor's intuition. In this article, I propose a methodology for classifying the Sasang constitution: pitch, intensity, and formants are used to classify the constitution by comparing the similarities and differences found in tonal analysis. Finally, the validity of the method is demonstrated through experiments.

Performance Improvement of Connected Digit Recognition by Considering Phonemic Variations in Korean Digit and Speaking Styles (한국어 숫자음의 음운변화 및 화자 발성특성을 고려한 연결숫자 인식의 성능향상)

  • 송명규;김형순
    • The Journal of the Acoustical Society of Korea / v.21 no.4 / pp.401-406 / 2002
  • Each Korean digit consists of a single syllable, so both recognizers and Korean speakers often have difficulty recognizing it. When digit strings are pronounced, the original pronunciation of each digit changes considerably because of co-articulation. In addition, distortion caused by various channels and noise degrades the recognition performance for Korean connected digit strings. This paper addresses techniques for improving that performance, which include defining a set of PLUs that accounts for phonemic variations in Korean digits and constructing a recognizer that handles speakers' various speaking styles. In speaker-independent connected digit recognition experiments using telephone speech, the proposed techniques with 1-Gaussian/state gave a string accuracy of 83.2%, i.e., a 7.2% error rate reduction relative to the baseline system. With 11-Gaussians/state, we achieved the highest string accuracy of 91.8%, i.e., a 4.7% error rate reduction.
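
The abstract reports accuracies and error rate reductions but not the baseline figures. Assuming the reductions are relative (reduction = (baseline error - new error) / baseline error), which the abstract does not state explicitly, the relationship can be sketched as below. The function names are hypothetical.

```python
def relative_error_reduction(baseline_acc: float, new_acc: float) -> float:
    """Relative reduction in string error rate; accuracies given in percent."""
    baseline_err = 100.0 - baseline_acc
    new_err = 100.0 - new_acc
    return (baseline_err - new_err) / baseline_err


def implied_baseline_accuracy(new_acc: float, reduction: float) -> float:
    """Baseline accuracy (percent) implied by a new accuracy and a relative reduction."""
    new_err = 100.0 - new_acc
    return 100.0 - new_err / (1.0 - reduction)


# Reported: 83.2% string accuracy with a 7.2% error rate reduction.
# Under the relative-reduction assumption, this is the baseline that would imply.
baseline_1g = implied_baseline_accuracy(83.2, 0.072)
```

If the authors instead meant an absolute reduction in percentage points, the implied baseline would simply be the new error rate plus the reduction, so the interpretation matters when comparing systems.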

The effect of professor's image-making on college student's class satisfaction and class commitment (대학수업에서 교수의 이미지메이킹이 학습자의 수업만족 및 수업몰입에 미치는 영향)

  • Jung, Hea-Rim;Park, Sun-Ju
    • Journal of the Korea Fashion and Costume Design Association / v.23 no.3 / pp.73-85 / 2021
  • The purpose of this study is to understand how the professor's image making (internal, external, and social image), as perceived by college students, influences instructional outcomes. The influence of the professor's image making on class satisfaction and class commitment was analyzed, and both the relationship between class satisfaction and class commitment and the mediating effect of class satisfaction in the relationship between image making and class commitment were examined. First, the professor's external and social images had a significant effect on class satisfaction. The level of interpersonal relations, such as communication, manners, and intimacy, as well as the management of external expression, including clothing style, makeup, hair, gestures, posture, attitude, voice, speech, and speech speed, brings satisfaction with the class. Second, the professor's internal, external, and social images all had a significant effect on class commitment. To support students' immersion in class, professors need to manage their internal, external, and social images. Third, class satisfaction had a significant effect on class commitment: when class satisfaction is high, class immersion also increases. Fourth, class satisfaction fully mediated the relationship between the professor's social image and class commitment and partially mediated the relationship between the professor's external image and class commitment. The social image of professors perceived by college students improves class satisfaction, and this improved class satisfaction in turn enhances class immersion.

A Study on Expression of NPC Colloquial Speech using Chat-GPT API in Games against Joseon Dynasty Settings (조선시대 배경의 게임에서 Chat-GPT API를 사용한 NPC 대화체 표현 연구)

  • Jin-Seok Lee;In-Chal Choi;Jung-Yi Kim
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.24 no.3 / pp.157-162 / 2024
  • This study was conducted to implement Joseon Dynasty conversational style using the ChatGPT API to enhance the immersion of games set in the Joseon era. The research focuses on interactions between middle-class players and other classes. Two methods were employed: learning the dialogues from historical dramas set in the Joseon Dynasty and learning the sentence endings typical of the period. The method of learning sentence endings was rated higher based on self-evaluation criteria. Reflecting this, prompts were constructed to represent NPC dialogues in the game settings of the Joseon era. Additionally, a method was proposed for creating various NPC prompts using prompt combination techniques. This study can serve as a reference for NPC dialogue creation in games set in the Joseon Dynasty.
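
The prompt combination idea mentioned in this abstract can be sketched as below. This is an illustration under assumptions, not the authors' prompts: the class descriptions, roles, rule text, and the `build_prompts` helper are all hypothetical, and no API call is made.

```python
from itertools import product

# Hypothetical prompt components (assumptions for illustration).
NPC_CLASSES = {
    "yangban": "a noble addressing a middle-class visitor",
    "commoner": "a commoner addressing a middle-class visitor",
}
NPC_ROLES = ["innkeeper", "gate guard"]
ENDING_RULE = (
    "End every sentence with a period-appropriate Joseon-era sentence ending "
    "rather than a modern one."
)


def build_prompts(classes, roles, ending_rule):
    """Combine class, role, and sentence-ending rule into one system prompt per NPC."""
    prompts = {}
    for (cls, desc), role in product(classes.items(), roles):
        prompts[(cls, role)] = (
            f"You play the {role}, {desc}, in a game set in the Joseon Dynasty. "
            f"{ending_rule}"
        )
    return prompts


PROMPTS = build_prompts(NPC_CLASSES, NPC_ROLES, ENDING_RULE)
```

Keeping the sentence-ending rule as a shared component reflects the abstract's finding that learning sentence endings was rated higher than learning drama dialogues; each resulting string would then be passed as the system prompt for one NPC.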

A Study of Changes in Consumption Values Shown in Women's Magazines - Focus on Advertisement Content in Women's Magazines from 1955 to 2008 - (여성잡지광고에 나타난 소비가치의 변화와 광고소구방법 및 문장표현방법 분석연구 - 1955~2008년 여성잡지광고내용 분석을 중심으로 -)

  • Ko, Eun-Ju;Do, Hyun-Ji;Kim, Seon-Sook
    • Journal of the Korean Society of Clothing and Textiles / v.34 no.2 / pp.226-241 / 2010
  • This study details the history and characteristics of consumption values, text styles, and appeal types expressed in magazine commercials from 1955 to 2008, and analyzes the social structure of commercial expression in each period. Consumption values were classified through an analysis of all the commercials, based on the categories of consumption values by Sheth (1991). Closing types of sentences, types of sentences, and rhetorical figures were analyzed with a focus on headline text and text style. Appeal types comprised rational, emotional, and ethical appeals. The crosstab analysis and chi-square test of SPSS were used for the analysis. The results are as follows. Seven values were identified: functional value, social value, emotional value, conditional value, epistemic value, fashionable value, and indistinct value. The ratio of emotional value was the highest, followed by functional value, epistemic value, conditional value, fashionable value, social value, and indistinct value. The emotional, social, conditional, fashionable, and epistemic values, which focus on the emotions of consumers, increased, while the functional value decreased. Sentences using narrative styles, hyperbole, and metaphors that raise the interest of readers were dominant in the headline texts. For sentence expression, the most frequent forms were declarative sentences among sentence types, curiosity-arousing expressions among expression methods, and hyperbole and figures of speech among rhetorical expressions. Across all the commercials, emotional appeal was used almost twice as often as rational appeal. The sub-category of rational appeal was information about product function. Interest and expression (such as pleasure and achievement) were used most often for emotional appeal. These results show that emotional value in consumption is the most important issue in understanding the consumer. Marketing managers should be aware of functional value as well as emotional value.

Improvement of a Korean Speller with Collocation of Parts of Speech (연어 정보를 이용한 한국어 철자 검사기의 기능 개선)

  • Sim, Chul-Min;Kim, Hyun-Jin;Kim, Young-Jin;Kwon, Hyuk-Chul
    • Annual Conference on Human and Language Technology / 1995.10a / pp.86-90 / 1995
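  • This paper presents an improved spelling checker that extends its scope from a single word (eojeol) to multiple words. The improved checker consists of 1) a single-word spelling check and correction module, 2) a collocation rule processing module, and 3) a punctuation rule processing module. The single-word module performs the same function as existing spelling checkers. The collocation rule processing module uses collocation relations between morphemes to handle inter-word errors, which are classified into seven types. The punctuation rule processing module checks for errors in the punctuation marks themselves and, by referring to the punctuation marks, for errors in the words to their left and right. At present, 256 collocation rules and 51 punctuation rules have been built. The improved spelling checker presented in this paper is significant as a Korean Style Checker, and the collocation information of morphemes is expected to serve as important data for sentence analysis, such as parsing, and for semantic analysis in the future.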
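
The three-stage structure in this abstract (single-word check, collocation rules across adjacent words, punctuation rules) can be sketched as below. This is a toy illustration under assumptions: the rules operate on English surface words rather than Korean morphemes, and `check_sentence` and both example rules are hypothetical, not taken from the paper's 256 collocation or 51 punctuation rules.

```python
def check_sentence(words, lexicon, collocation_rules, punctuation_rules):
    """Return a list of (word_index, error_kind) found by the three stages."""
    errors = []
    # Stage 1: single-word spelling check against a lexicon.
    for i, word in enumerate(words):
        if word.strip(".,?!") not in lexicon:
            errors.append((i, "spelling"))
    # Stage 2: collocation rules applied to each adjacent word pair.
    for i in range(len(words) - 1):
        if any(rule(words[i], words[i + 1]) for rule in collocation_rules):
            errors.append((i, "collocation"))
    # Stage 3: punctuation rules applied to each word with its attached marks.
    for i, word in enumerate(words):
        if any(rule(word) for rule in punctuation_rules):
            errors.append((i, "punctuation"))
    return errors


# Toy rules (assumptions): "a" before a vowel-initial word, and a doubled comma.
def a_before_vowel(left, right):
    return left == "a" and right[:1] in "aeiou"


def doubled_comma(word):
    return ",," in word


LEXICON = {"a", "apple", "fell", "quickly"}
found = check_sentence(
    "a apple fell,, quickly".split(), LEXICON, [a_before_vowel], [doubled_comma]
)
```

Every word here passes the single-word check, so the two errors are caught only by the later stages, which is the motivation the abstract gives for looking beyond one word at a time.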

A Novel Model, Recurrent Fuzzy Associative Memory, for Recognizing Time-Series Patterns Contained Ambiguity and Its Application (모호성을 포함하고 있는 시계열 패턴인식을 위한 새로운 모델 RFAM과 그 응용)

  • Kim, Won;Lee, Joong-Jae;Kim, Gye-Young;Choi, Hyung-Il
    • The KIPS Transactions:PartB / v.11B no.4 / pp.449-456 / 2004
  • This paper proposes a novel recognition model, a recurrent fuzzy associative memory (RFAM), for recognizing time-series patterns that contain ambiguity. RFAM extends the fuzzy associative memory (FAM) by adding a recurrent layer that can handle sequential input patterns and characterize their temporal relations. RFAM provides a Hebbian-style learning method that establishes the degree of association between input and output. The error back-propagation algorithm is also adopted to train the weights of the recurrent layer of RFAM. To evaluate the performance of the proposed model, we applied it to a word boundary detection problem in speech signals.
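
For background on the FAM that RFAM extends, the sketch below shows a common textbook form of fuzzy Hebbian learning: correlation-minimum encoding with max-min recall. It is an assumption that the paper's Hebbian-style rule resembles this, and the recurrent layer and back-propagation training are not shown; `encode` and `recall` are illustrative names.

```python
def encode(a, b):
    """Correlation-minimum encoding: m[i][j] = min(a[i], b[j])."""
    return [[min(ai, bj) for bj in b] for ai in a]


def recall(a, m):
    """Max-min composition: b[j] = max over i of min(a[i], m[i][j])."""
    return [
        max(min(ai, row[j]) for ai, row in zip(a, m))
        for j in range(len(m[0]))
    ]


# Fuzzy membership vectors for one input/output association (illustrative values).
A = [0.2, 1.0, 0.5]
B = [0.3, 0.8]

M = encode(A, B)
B_recalled = recall(A, M)
```

With these values the stored output is recovered exactly because the largest membership in `A` is at least as large as every membership in `B`; when that condition fails, recall is clipped at the height of `A`.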

A Study on the Domestic Architecture of Vincenzo Scamozzi (빈첸초 스카모치(Vincenzo Scamozzi)의 주거건축에 관한 연구)

  • Lee, Eun-Jung;Hong, Seok-Joo
    • Journal of The Korean Digital Architecture Interior Association / v.11 no.4 / pp.13-20 / 2011
  • Vincenzo Scamozzi, the successor of Palladio, is credited with the major accomplishment of putting classicism in order in the 16th century. In addition, unlike the broader Palladian trend, his work shows both continuity and change in the Renaissance villa. "L'Idea della Architettura Universale; The Idea of a Universal Architecture" (1615) is Scamozzi's representative book, and it presents his ideas on residential buildings. This study analyzed his concepts for residential buildings through his book and his works. Scamozzi thought that domestic architecture should be designed according to the owner's social status and reputation. This concept is known as decorum, and it is divided into three categories. In this threefold order, the first and highest category encompassed reigning princes and their families, who were more or less regarded as God's representatives on earth. The second category comprised noblemen and high office holders, whose houses were to be, in all respects, a degree less grand, costly, and dignified than the prince's residence. The third category was made up of prominent citizens and wealthy merchants, whose houses were to be only moderate in these respects. Commoners did not come into this classification at all. The threefold order corresponds to the magnifiche, honorevoli, and commune styles of speech.