• Title/Summary/Keyword: Segmental features


Continuous Speech Recognition based on Parametric Trajectory Segmental HMM (모수적 궤적 기반의 분절 HMM을 이용한 연속 음성 인식)

  • 윤영선;오영환
    • The Journal of the Acoustical Society of Korea / v.19 no.3 / pp.35-44 / 2000
  • In this paper, we propose a new trajectory model for characterizing segmental features and their interaction, based on the general framework of hidden Markov models. Each segment, a sequence of vectors, is represented by a trajectory of observed sequences. This trajectory is obtained by applying a new design matrix that includes transitional information on contiguous frames, and is characterized as a polynomial regression function. To apply the trajectory to the segmental HMM, the frame features are replaced with the trajectory of a given segment. We also propose the likelihood of a given segment and the estimation of trajectory parameters. The observation probability of a given segment is represented as the relation between the segment likelihood and the estimation error of the trajectories. The estimation error of a trajectory is treated as the weight of the likelihood of a given segment in a state. This weight represents the probability of how well the corresponding trajectory characterizes the segment. The proposed model can be regarded as a generalization of a conventional HMM and a parametric trajectory model. Experimental results are reported on the TIMIT corpus, and performance is shown to improve significantly over that of the conventional HMM.

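The abstract above describes a segment's trajectory as a polynomial regression function obtained through a design matrix. The sketch below illustrates only that general idea with an ordinary least-squares fit over normalized time. The abstract does not specify the paper's new design matrix with transitional information, so the plain polynomial design matrix, the function names, and the one-dimensional feature track used here are all assumptions made for illustration.

```python
def design_matrix(n_frames, degree):
    """Rows [1, t, t^2, ...] with time t normalized to [0, 1] over the segment.

    Assumption: a plain polynomial design matrix, not the paper's new one.
    """
    if n_frames == 1:
        times = [0.0]
    else:
        times = [i / (n_frames - 1) for i in range(n_frames)]
    return [[t ** d for d in range(degree + 1)] for t in times]


def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def fit_trajectory(frames, degree=2):
    """Least-squares polynomial fit; returns (coefficients, squared estimation error)."""
    z = design_matrix(len(frames), degree)
    k = degree + 1
    # Normal equations: (Z^T Z) b = Z^T y
    ztz = [[sum(row[i] * row[j] for row in z) for j in range(k)] for i in range(k)]
    zty = [sum(row[i] * y for row, y in zip(z, frames)) for i in range(k)]
    coeffs = solve(ztz, zty)
    fitted = [sum(c * v for c, v in zip(coeffs, row)) for row in z]
    error = sum((y - f) ** 2 for y, f in zip(frames, fitted))
    return coeffs, error


# Hypothetical one-dimensional feature track for a five-frame segment,
# generated from 1 + 2t - t^2 so that a quadratic fits it exactly.
segment = [1.0, 1.4375, 1.75, 1.9375, 2.0]
coeffs, error = fit_trajectory(segment, degree=2)
```

In a sketch like this, the residual `error` plays the role of the estimation error that the abstract says is used to weight the segment likelihood; how the paper actually converts that error into a weight is not stated in the abstract.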

A Case of Segmental Necrotizing Jejunitis (분절성 괴사성 공장염(Segmental necrotizing jejunitis) 1례)

  • Yoo, Jee-Hyung;Kim, Hyo-Shin;Kim, Je-Woo;Chung, Ki-Sup;Han, Suk-Joo;Kim, Ho-Kun
    • Pediatric Gastroenterology, Hepatology & Nutrition / v.2 no.2 / pp.222-226 / 1999
  • Segmental necrotizing jejunitis is characterized by severe abdominal pain of acute onset, bilious vomiting, and foul-smelling loose stools containing blood. Pathologic features include circumferential intestinal wall inflammation ranging from edema with minimal congestion to severe congestion, hemorrhage with necrosis, ulceration, and gangrene with perforation. Early diagnosis and suitable supportive measures prevent unnecessary laparotomy and complications. This disease entity has not previously been reported in children in Korea. We experienced a case of segmental necrotizing jejunitis with fever, abdominal pain, and bloody stools, which was diagnosed by exploration and treated successfully with antibiotics and supportive measures.


Acoustic Measurement of English read speech by native and nonnative speakers

  • Choi, Han-Sook
    • Phonetics and Speech Sciences / v.3 no.3 / pp.77-88 / 2011
  • Foreign accent in second language production depends heavily on the transfer of features from the first language. This study examines acoustic variation in segments and suprasegmentals produced by native and nonnative speakers of English, searching for patterns of transfer and plausible indexes of foreign accent in English. The acoustic variation is analyzed in recorded read speech from 20 native English speakers and 50 Korean learners of English, in terms of vowel formants, vowel duration, and syllabic variation induced by stress. The results show that the acoustic measurements of vowel formants and of vowel and syllable durations differ between native and nonnative speakers. The difference is robust in the production of lax vowels, diphthongs, and stressed syllables, namely the English-specific features. L1 transfer to L2 specification is found at both the segmental and the suprasegmental levels. The transfer levels measured for groups and for individuals further show a continuum of divergence from the native-like target. Overall, the oldest group, graduate students, shows more native-like patterns, suggesting a weaker foreign accent in English, whereas the high school students tend to show larger deviation from the native speakers' patterns. Individual results show interdependence between segmental transfer and prosodic transfer, and correlation with self-reported proficiency levels. Additionally, experience factors such as length of English study and length of residence in English-speaking countries are discussed as factors explaining the acoustic variation.


Analysis of the Timing of Spoken Korean Using a Classification and Regression Tree (CART) Model

  • Chung, Hyun-Song;Huckvale, Mark
    • Speech Sciences / v.8 no.1 / pp.77-91 / 2001
  • This paper investigates the timing of Korean spoken in a news-reading speech style in order to improve the naturalness of durations used in Korean speech synthesis. Each segment in a corpus of 671 read sentences was annotated with 69 segmental and prosodic features so that the measured duration could be correlated with the context in which it occurred. A CART model based on the features showed a correlation coefficient of 0.79 with an RMSE (root mean squared prediction error) of 23 ms between actual and predicted durations in reserved test data. These results are comparable with recently published results for Korean and similar to results found for other languages. An analysis of the classification tree shows that phrasal structure has the greatest effect on segment duration, followed by syllable structure and the manner features of surrounding segments. The place features of surrounding segments have only small effects. The model has application in Korean speech synthesis systems.

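The abstract above evaluates a CART duration model by its RMSE on reserved test data. As a rough illustration of the two ingredients involved, the sketch below grows a single regression-tree split (the variance-reducing step that CART repeats recursively) and scores it with RMSE on held-out items. The single feature, its values, and the durations are invented for illustration and are not taken from the paper, whose model used 69 features.

```python
from math import sqrt


def best_split(xs, ys):
    """Find the threshold on one numeric feature that minimizes the summed
    squared error of predicting each side by its mean (one CART split)."""
    pairs = sorted(zip(xs, ys))
    best = None
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot split between identical feature values
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((y - left_mean) ** 2 for y in left)
               + sum((y - right_mean) ** 2 for y in right))
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    if best is None:
        raise ValueError("feature has a single value; no split is possible")
    return best[1], best[2], best[3]


def predict(x, threshold, left_mean, right_mean):
    """Predict the mean duration of the side of the split that x falls on."""
    return left_mean if x < threshold else right_mean


def rmse(actual, predicted):
    """Root mean squared prediction error."""
    return sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


# Hypothetical training data: syllables remaining before the phrase end,
# and segment durations in milliseconds.
train_x = [5, 4, 3, 2, 1, 0]
train_y = [62, 60, 65, 70, 95, 120]
threshold, near_end_mean, elsewhere_mean = best_split(train_x, train_y)

# Hypothetical held-out items standing in for the reserved test data.
test_x = [0, 3]
test_y = [115, 66]
test_pred = [predict(x, threshold, near_end_mean, elsewhere_mean) for x in test_x]
test_rmse = rmse(test_y, test_pred)
```

A full CART model would apply `best_split` over every candidate feature and recurse into each side until a stopping rule is met; the single split here only shows the criterion being optimized.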

PROSODY IN SPEECH TECHNOLOGY - National project and some of our related works -

  • Hirose Keikichi
    • Proceedings of the Acoustical Society of Korea Conference / spring / pp.15-18 / 2002
  • Prosodic features of speech are known to play an important role in the transmission of linguistic information in human conversation. Their role in the transmission of para- and non-linguistic information is even greater. Despite their importance in human conversation, engineering research has focused mainly on segmental features and much less on prosodic features. With the aim of promoting research on prosody, a research project, 'Prosody and Speech Processing', is now under way. The paper first gives a rough sketch of the project. It then introduces several prosody-related research works in progress in our laboratory. They include corpus-based fundamental frequency contour generation, speech rate control for dialogue-like speech synthesis, analysis of prosodic features of emotional speech, reply speech generation in spoken dialogue systems, and language modeling with prosodic boundaries.


Applicability Discrimination for Line-clustering Segmental Approach to Steel-tube X-ray Image (선군집분할방식의 강판튜브 엑스선 영상에의 적용성 판별)

  • Hwang, Jung-Won;Hwang, Jae-Ho
    • Proceedings of the IEEK Conference / 2007.07a / pp.397-398 / 2007
  • In this paper, we verify the applicability of the line-clustering segmentation method to steel-tube X-ray images. Image data are partitioned into three regions on the basis of vertical line edge detection. The parameters for the necessary conditions proposed in that method, such as neighborlity, similarity, and directional neighbor correlation coefficients, are calculated and applied to the selected regions separately. Segmental features in each region are extracted statistically, and functional classification is clustered by the point or space process. The analyzed data and experimental results show that the line-clustering segmentation method is highly applicable to X-ray images.


Podocytopathy and Morphologic Changes in Focal Segmental Glomerulosclerosis (초점분절사구체경화증에서 발세포병증과 형태 변화)

  • Jeong, Hyeon Joo
    • Childhood Kidney Diseases / v.17 no.1 / pp.13-18 / 2013
  • Podocytopathy refers to glomerular lesions characterized by podocyte injury. It is observed in various glomerular diseases, but minimal change disease and focal segmental glomerulosclerosis (FSGS) are the prototypes. This review briefly covers the morphologic features of podocyte injury and the subtypes of FSGS. Effacement of podocyte foot processes is the most common feature of podocyte injury. As podocyte injury progresses, intracytoplasmic vacuoles, subpodocytic cysts, detachment of podocytes from the glomerular basement membrane, and apoptosis develop. Glomerular capillary loops in epithelium-denuded areas undergo capillary collapse. Synechia and hyalinosis may accompany this lesion. For segmental sclerosis to manifest, podocyte loss above a threshold level may be required. Injured podocytes can injure neighboring intact podocytes and thereby spread injury within the same lobule. FSGS can be categorized into five subtypes by morphologic characteristics: not otherwise specified (NOS), perihilar, cellular, tip, and collapsing types. Each subtype has been reported to show different clinical courses and associated conditions, but its significance remains controversial. With recent progress in the discovery of genetic abnormalities causing FSGS and of plasma permeability factors, we expect to unravel the pathophysiology of FSGS and to understand the histological sequences leading to FSGS in the near future.

A Study on the Rhythm of Korean EFL Learners' English Pronunciation (한국인 영어학습자의 영어리듬구현 연구)

  • Chung, Hyun-Song
    • Phonetics and Speech Sciences / v.1 no.2 / pp.141-149 / 2009
  • An emphasis on teaching suprasegmental features of English, specifically English rhythm, is essential in order to improve the 'intelligibility' of the pronunciation of Korean EFL learners among interlocutors who use English as a Lingua Franca (ELF). By redefining the ELF suggested by Jenkins (2000, 2002), this paper argues that the Lingua Franca Core (LFC) must include suprasegmental features such as 'stress-based rhythm' and word stress. However, because 'isochrony' is difficult to measure in a foot, the rhythm unit must be expanded to an intonational phrase that contains prominence, and the rhythm of the unit can be measured by calculating the duration of each segment in context. The rhythmic pattern of Korean learners of English and that of native speakers or other non-native English speakers can then be calculated and compared by using correlation coefficients of the segmental duration. In terms of sociolinguistic factors, improving the 'comprehensibility' and 'accentedness' of Korean EFL learners' pronunciation is also important in international communication, which calls for more emphasis on suprasegmental features.

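The abstract above proposes comparing rhythm by correlating segmental durations within an intonational phrase. A minimal sketch of that comparison follows, assuming the two speakers' segments have already been aligned one-to-one. The duration values and speaker labels are hypothetical, and the abstract does not say which correlation coefficient the paper uses; Pearson's r is assumed here.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        raise ValueError("correlation is undefined for constant durations")
    return cov / sqrt(var_a * var_b)


# Hypothetical segment durations (ms) for one intonational phrase,
# aligned segment by segment across speakers.
native = [80, 120, 60, 150]
learner_a = [88, 132, 66, 165]   # uniformly 10% slower, same relative timing
learner_b = [100, 95, 105, 100]  # nearly equal durations, different pattern

r_a = pearson(native, learner_a)
r_b = pearson(native, learner_b)
```

Because the coefficient is unaffected by a uniform change in speaking rate, a learner who speaks more slowly but keeps the same relative timing still correlates highly with the native pattern, which is one reason a correlation measure suits rhythm comparison.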

Annotation of a Non-native English Speech Database by Korean Speakers

  • Kim, Jong-Mi
    • Speech Sciences / v.9 no.1 / pp.111-135 / 2002
  • An annotation model for a non-native speech database has been devised, in which English is the target language and Korean is the native language. The proposed annotation model features overt transcription of linguistic information that is predictable in native speech from the dictionary entry, and several predefined types of error specification found in native language transfer. In that sense, the proposed model differs from previously explored annotation models in the literature, most of which are based on native speech. The validity of the newly proposed model is shown by its consistent annotation of 1) salient linguistic features of English, 2) contrastive linguistic features of English and Korean, 3) actual errors reported in the literature, and 4) the newly collected data in this study. The annotation method in this model adopts the widely accepted conventions Speech Assessment Methods Phonetic Alphabet (SAMPA) and TOnes and Break Indices (ToBI). In the proposed annotation model, SAMPA is employed exclusively for segmental transcription and ToBI for prosodic transcription. The annotation of non-native speech is used to assess the speaking ability of English as a Foreign Language (EFL) learners.


Cervical Stand-Alone Polyetheretherketone Cage versus Zero-Profile Anchored Spacer in Single-Level Anterior Cervical Discectomy and Fusion: Minimum 2-Year Assessment of Radiographic and Clinical Outcome

  • Cho, Hyun-Jun;Hur, Junseok W.;Lee, Jang-Bo;Han, Jin-Sol;Cho, Tai-Hyoung;Park, Jung-Yul
    • Journal of Korean Neurosurgical Society / v.58 no.2 / pp.119-124 / 2015
  • Objective: We compared the clinical and radiographic outcomes of the stand-alone polyetheretherketone (PEEK) cage and the Zero-Profile anchored spacer (Zero-P) for single-level anterior cervical discectomy and fusion (ACDF). Methods: We retrospectively reviewed 121 patients who underwent single-level ACDF within 2 years (Jan 2011-Jan 2013) at a single institute. A total of 50 patients with more than 2 years of follow-up were included in the analysis. Twenty-nine patients were allocated to the cage group (m:f = 19:10) and 21 to the Zero-P group (m:f = 12:9). Clinical (neck disability index, visual analogue scale for arm and neck) and radiographic (segmental and global cervical Cobb angle, disc height, vertebral height) assessments were made pre-operatively, immediately post-operatively, and at 3, 6, 12, and 24 months post-operatively. Results: Demographic features and clinical outcome showed no difference between the two groups. The changes between immediate post-operative and final follow-up (24 months) values of segmental Cobb angle (p=0.027), disc height (p=0.002), and vertebral body height (p=0.033) were statistically better in the Zero-P group than in the cage group. Conclusion: The Zero-Profile anchored spacer has some advantage over the cage in maintaining segmental lordosis and lowering the subsidence rate after single-level anterior cervical discectomy and fusion.