• Title/Summary/Keyword: 음소

Search Result 529, Processing Time 0.029 seconds

(<한국어 립씽크를 위한 3D 디자인 시스템 연구>)

  • Shin, Dong-Sun;Chung, Jin-Oh
    • 한국HCI학회:학술대회논문집
    • /
    • 2006.02b
    • /
    • pp.362-369
    • /
    • 2006
  • 3 차원 그래픽스에 적용하는 한국어 립씽크 합성 체계를 연구하여, 말소리에 대응하는 자연스러운 립씽크를 자동적으로 생성하도록 하는 디자인 시스템을 연구 개발하였다. 페이셜애니메이션은 크게 나누어 감정 표현, 즉 표정의 애니메이션과 대화 시 입술 모양의 변화를 중심으로 하는 대화 애니메이션 부분으로 구분할 수 있다. 표정 애니메이션의 경우 약간의 문화적 차이를 제외한다면 거의 세계 공통의 보편적인 요소들로 이루어지는 반면 대화 애니메이션의 경우는 언어에 따른 차이를 고려해야 한다. 이와 같은 문제로 인해 영어권 및 일본어 권에서 제안되는 음성에 따른 립싱크 합성방법을 한국어에 그대로 적용하면 청각 정보와 시각 정보의 부조화로 인해 지각의 왜곡을 일으킬 수 있다. 본 연구에서는 이와 같은 문제점을 해결하기 위해 표기된 텍스트를 한국어 발음열로 변환, HMM 알고리듬을 이용한 입력 음성의 시분할, 한국어 음소에 따른 얼굴특징점의 3 차원 움직임을 정의하는 과정을 거쳐 텍스트와 음성를 통해 3 차원 대화 애니메이션을 생성하는 한국어 립싱크합성 시스템을 개발 실제 캐릭터 디자인과정에 적용하도록 하였다. 또한 본 연구는 즉시 적용이 가능한 3 차원 캐릭터 애니메이션뿐만 아니라 아바타를 활용한 동적 인터페이스의 요소기술로서 사용될 수 있는 선행연구이기도 하다. 즉 3 차원 그래픽스 기술을 활용하는 영상디자인 분야와 HCI 에 적용할 수 있는 양면적 특성을 지니고 있다. 휴먼 커뮤니케이션은 언어적 대화 커뮤니케이션과 시각적 표정 커뮤니케이션으로 이루어진다. 즉 페이셜애니메이션의 적용은 보다 인간적인 휴먼 커뮤니케이션의 양상을 지니고 있다. 결국 인간적인 상호작용성이 강조되고, 보다 편한 인간적 대화 방식의 휴먼 인터페이스로 그 미래적 양상이 변화할 것으로 예측되는 아바타를 활용한 인터페이스 디자인과 가상현실 분야에 보다 폭넓게 활용될 수 있다.

  • PDF

On the Development of a Continuous Speech Recognition System Using Continuous Hidden Markov Model for Korean Language (연속분포 HMM을 이용한 한국어 연속 음성 인식 시스템 개발)

  • Kim, Do-Yeong;Park, Yong-Kyu;Kwon, Oh-Wook;Un, Chong-Kwan;Park, Seong-Hyun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.13 no.1
    • /
    • pp.24-31
    • /
    • 1994
  • In this paper, we report on the development of a speaker independent continuous speech recognition system using continuous hidden Markov models. The continuous hidden Markov model consists of mean and covariance matrices and directly models speech signal parameters, therefore does not have quantization error. Filter bank coefficients with their 1st and 2nd-order derivatives are used as feature vectors to represent the dynamic features of speech signal. We use the segmental K-means algorithm as a training algorithm and triphone as a recognition unit to alleviate performance degradation due to coarticulation problems critical in continuous speech recognition. Also, we use the one-pass search algorithm that Is advantageous in speeding-up the recognition time. Experimental results show that the system attains the recognition accuracy of $83\%$ without grammar and $94\%$ with finite state networks in speaker-indepdent speech recognition.

  • PDF

Efficient context dependent process modeling using state tying and decision tree-based method (상태 공유와 결정트리 방법을 이용한 효율적인 문맥 종속 프로세스 모델링)

  • Ahn, Chan-Shik;Oh, Sang-Yeob
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.3
    • /
    • pp.369-377
    • /
    • 2010
  • In vocabulary recognition systems based on HMM(Hidden Markov Model)s, training process unseen model bring on show a low recognition rate. If recognition vocabulary modify and make an addition then recreated modeling of executed database collected and training sequence on account of bring on additional expenses and take more time. This study suggest efficient context dependent process modeling method using decision tree-based state tying. On study suggest method is reduce recreated of model and it's offered that robustness and accuracy of context dependent acoustic modeling. Also reduce amount of model and offered training process unseen model as concerns context dependent a likely phoneme model has been used unseen model solve the matter. System performance as a result of represent vocabulary dependence recognition rate of 98.01%, vocabulary independence recognition rate of 97.38%.

Similar Question Search System for online Q&A for the Korean Language Based on Topic Classification (온라인가나다를 위한 주제 분류 기반 유사 질문 검색 시스템)

  • Mun, Jung-Min;Song, Yeong-Ho;Jin, Ji-Hwan;Lee, Hyun-Seob;Lee, Hyun Ah
    • Korean Journal of Cognitive Science
    • /
    • v.26 no.3
    • /
    • pp.263-278
    • /
    • 2015
  • Online Q&A for the National Institute of the Korean Language provides expert's answers for questions about the Korean language, in which many similar questions are repeatedly posted like other Q&A boards. So, if a system automatically finds questions that are similar to a user's question, it can immediately provide users with recommendable answers to their question and prevent experts from wasting time to answer to similar questions repeatedly. In this paper, we set 5 classes of questions based on its topic which are frequently asked, and propose to classify questions to those classes. Our system searches similar questions by combining topic similarity, vector similarity and sequence similarity. Experiment shows that our method improves search correctness with topic classification. In experiment, Mean Reciprocal Rank(MRR) of our system is 0.756, and precision for the first result is 68.31% and precision for top five results is 87.32%.

Effects of auditory and visual presentation on phonemic awareness in 5- to 6- year-old children (청각적 말소리 자극과 시각적 글자 자극 제시방법에 따른 5, 6세 일반아동의 음소인식 수행력 비교)

  • Kim, Myung-Heon;Ha, Ji-Wan
    • Phonetics and Speech Sciences
    • /
    • v.8 no.1
    • /
    • pp.71-80
    • /
    • 2016
  • The phonemic awareness tasks (phonemic synthesis, phonemic elision, phonemic segmentation) by auditory presentation and visual presentation were conducted to 40 children who are 5 and 6 years old. The scores and error types in the sub-tasks by two presentations were compared to each other. Also, the correlation between the performances of phonemic awareness sub-tasks in two presentation conditions were examined. As a result, 6-year-old group showed significantly higher phonemic awareness scores than 5-year-old group. Both group showed significantly higher scores in visual presentation than auditory presentation. While the performance under the visual presentation was significantly lower especially in the segmentation than the other two tasks, there was no significant difference among sub-tasks under the auditory presentation. 5-year-old group showed significantly more 'no response' errors than 6-year-old group and 6-year-old group showed significantly more 'phoneme substitution' and 'phoneme omission' errors than 5-year-old group. Significantly more 'phoneme omission' errors were observed in the segmentation than the elision task, and significantly more 'phoneme addition' errors were observed in elision than the synthesis task. Lastly, there are positive correlations in auditory and visual synthesis tasks, auditory and visual elision tasks, and auditory and visual segmentation tasks. Summarizing the results, children tend to depend on orthographic knowledge when acquiring the initial phonemic awareness. Therefore, the result of this research would support the position that the orthographic knowledge affects the improvement of phonemic awareness.

A Study on the Usability of University Remote Lecture -Focusing on Zoom and Webex Meetings- (대학 원격강의 프로그램의 사용성 연구 -Zoom과 Webex Meetings를 중심으로-)

  • Shin, Jun;Kim, Seung-In
    • Journal of Digital Convergence
    • /
    • v.18 no.10
    • /
    • pp.403-408
    • /
    • 2020
  • This paper is to evaluate the usability of two representative video meeting services currently used by university for research to improve the quality of university remote lecture. questionnaires based on Kano Model were designed and in-depth interviews were conducted to provide qualitative approaches. Screen-sharing functions, the one-dimensional functions was the most important function. and attractive functions had relatively diverse directions. For essential functions, there was a wide gap in quality due to user-specific equipment. The function in which other platforms exist or business-related was not important. Webex reacted negatively to the aging UI, while Zoom responded negatively to the unilateral mute function. In addition, the development direction was presented in five ways as a result of analysis of these results. under Corona-19 situation, I hope this study will lead to continuous research to make stepping stone for remoted educational development.

Microanatomical Structure of the Digestive Diverticulum of Mytilus galloprovincialis (Bivalvia: Mytilidae) (지중해담치, Mytilus galloprovincialis 소화맹낭의 미세해부학적 구조)

  • Ju, Sun-Mi;Lee, Jung-Sick
    • Applied Microscopy
    • /
    • v.41 no.4
    • /
    • pp.257-263
    • /
    • 2011
  • The microanatomy and ultrastructure of the digestive diverticulum of Mytilus galloprovincialis were described using light and electron microscopy. The digestive diverticulum of tawny color was surrounded the stomach and connected to stomach by a primary duct. Digestive diverticulum is composed of numerous digestive tubules. The epithelial layer of a simple digestive tubule, which is simple, is composed of basophilic cells and digestive cells. Basophilic cells are columnar in shape, and has a well-developed endoplasmic reticula, tubular mitochondria, Golgi complex and membrane-bounded granules of high electron density in the cytoplasm. Whereas digestive cells are columnar in shape, with development of microvilli and cilia on the free surface. Pinocytic vasicles, active lysosomes and numerous mitochondria were observed in the apical cytoplasm of digestive cells. The results of this study suggest that basophilic cell and digestive cell of the digestive tubule are specialized in the extracellular and intracellular digestion, respectively.

Variation of the Incident Sound Level at the Underwater Target`s Position due to Roll Motion of the Ship (선체의 횡요로 인한 수중물표입사음압의 변동에 관하여)

  • Park, Jung-Hui;Lee, Dae-Jae
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.19 no.2
    • /
    • pp.106-110
    • /
    • 1983
  • As the first step to investigate the effect of ship's motion when detecting target with an echo sounder, variations in the incident sound level at the optional position within the sound beam due to roll motion of the transmitter have been measured and calculated. In this experiment, the transmitter (75 KHz) was mounted to the bottom of a FRP model of the 2,275 G. T. stern trawler and the receiver (75 KHz) was installed at each measuring point within the transmitter's beam. Then, the incident sound level was measured for the roll angles from the free roll test on the model ship. For a range of roll angle of $\pm$20$^{\circ}$from the vertical, the measuring values of the incident sound level at each measuring point were rapidly fluctuated from 12.9% to 78.1 depending on the roll angle, and agreed well with the caculated ones. Consquently, we concluded that the effect of ship's motion when detecting target with an echo sounder should be sufficiently considered.

  • PDF

Articulation Scores and Confusion Patterns of the 100 Monosyllable Korean Speech Sounds (우리말 100단음절의 명료도와 오청상에 관한 연구)

  • 유방환;김홍기;노관택
    • Proceedings of the KOR-BRONCHOESO Conference
    • /
    • 1972.03a
    • /
    • pp.1.1-1
    • /
    • 1972
  • It is well known that speech signals are the most riliable materials for the hearing test and there are various difficult problems in the selection of these materials. Because of these difficulties, there is not a confirmed test material of Korean speech sound up to date. For the basis of the test materials, author had studied articulation scores and confusion patterns of 100 monosyllable korean speech sounds in normal listners, in normal listners under various noisy (white noise and speech noise) circumstances, and in patients with hearing loss, The results reveal as follows. 1. Except for perceptive deafness with poor articulation score, Confusion was occured among initial consonants, vowels and final consonants respectively according to their distinctive features under above various test conditions. 2. There is remarkable differences in articulation scores between different kindes of noise under some intensity levels.

  • PDF

A High-Speed Korean Morphological Analysis Method based on Pre-Analyzed Partial Words (부분 어절의 기분석에 기반한 고속 한국어 형태소 분석 방법)

  • Yang, Seung-Hyun;Kim, Young-Sum
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.3
    • /
    • pp.290-301
    • /
    • 2000
  • Most morphological analysis methods require repetitive procedures of input character code conversion, segmentation and lemmatization of constituent morphemes, filtering of candidate results through looking up lexicons, which causes run-time inefficiency. To alleviate such problem of run-time inefficiency, many systems have introduced the notion of 'pre-analysis' of words. However, this method based on pre-analysis dictionary of surface also has a critical drawback in its practical application because the size of the dictionaries increases indefinite to cover all words. This paper hybridizes both extreme approaches methodologically to overcome the problems of the two, and presents a method of morphological analysis based on pre-analysis of partial words. Under such hybridized scheme, most computational overheads, such as segmentation and lemmatization of morphemes, are shifted to building-up processes of the pre-analysis dictionaries and the run-time dictionary look-ups are greatly reduced, so as to enhance the run-time performance of the system. Moreover, additional computing overheads such as input character code conversion can also be avoided because this method relies upon no graphemic processing.

  • PDF