• Title/Abstract/Keywords: Corpus Frequency

166 search results; processing time 0.028 seconds

음악검색을 위한 가변임계치 기반의 음성 질의 변환 기법 (A Threshold Adaptation based Voice Query Transcription Scheme for Music Retrieval)

  • 한병준;노승민;황인준
    • 전기학회논문지 / Vol. 59, No. 2 / pp.445-451 / 2010
  • This paper presents a threshold adaptation based voice query transcription scheme for music information retrieval. The proposed scheme analyzes a monophonic voice signal and generates its transcription for diverse music retrieval applications. For accurate transcription, we propose several advanced features: (i) the Energetic Feature eXtractor (EFX) for onset, peak, and transient area detection; (ii) the Modified Windowed Average Energy (MWAE), which defines multiple small but coherent windows with local threshold values and serves as an offset detector; and (iii) the Circular Average Magnitude Difference Function (CAMDF) for accurate acquisition of the fundamental frequency (F0) of each frame. To evaluate the performance of the proposed scheme, we implemented a prototype music transcription system called AMT2 (Automatic Music Transcriber version 2) and carried out various experiments. In the experiments, we used the QBSH corpus [1], which was adopted in the MIREX 2006 contest data set. The experimental results show that the proposed scheme can improve transcription performance.
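
The abstract names CAMDF but does not give its formula. The sketch below assumes the commonly cited definition, D(τ) = Σ |x((n+τ) mod N) − x(n)| over one frame of N samples, and takes the lag with the smallest value inside a plausible pitch range as the period. The function names, the parameters, and the default pitch range are illustrative assumptions, not taken from the paper or from AMT2.

```python
import math


def camdf(frame, lag):
    """Circular AMDF of one frame at a given lag (assumed textbook definition)."""
    n = len(frame)
    # The index wraps around the frame, so every lag sums over all n samples.
    return sum(abs(frame[(i + lag) % n] - frame[i]) for i in range(n))


def estimate_f0(frame, sample_rate, f_min=80.0, f_max=500.0):
    """Estimate F0 in Hz as sample_rate / (lag that minimizes the CAMDF).

    f_min and f_max bound the search; they are hypothetical defaults.
    """
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(len(frame) - 1, int(sample_rate / f_min))
    best_lag = min(range(lag_min, lag_max + 1), key=lambda lag: camdf(frame, lag))
    return sample_rate / best_lag


# Usage: a synthetic 200 Hz tone sampled at 8 kHz, one 400-sample frame.
tone = [math.sin(2 * math.pi * 200 * i / 8000) for i in range(400)]
f0 = estimate_f0(tone, 8000, f_min=120.0, f_max=500.0)
```

The search range matters: a lag of two periods also matches a periodic signal, so a range wide enough to include it can return an octave-low estimate. The usage example narrows `f_min` for that reason.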

PROSODY IN SPEECH TECHNOLOGY - National project and some of our related works -

  • Hirose Keikichi
    • 한국음향학회:학술대회논문집 / Proceedings of the 2002 Summer Conference of the Acoustical Society of Korea, Vol. 21, No. 1 / pp.15-18 / 2002
  • Prosodic features of speech are known to play an important role in the transmission of linguistic information in human conversation. Their role in the transmission of para- and non-linguistic information is even greater. Despite their importance in human conversation, research from an engineering viewpoint has focused mainly on segmental features rather than on prosodic features. With the aim of promoting research on prosody, a research project, 'Prosody and Speech Processing', is now under way. The paper first gives a rough sketch of the project. It then introduces several prosody-related research works in progress in our laboratory. They include corpus-based fundamental frequency contour generation, speech rate control for dialogue-like speech synthesis, analysis of prosodic features of emotional speech, reply speech generation in spoken dialogue systems, and language modeling with prosodic boundaries.

대화체 억양구말 형태소의 경계성조 연구 (Boundary Tones of Intonational Phrase-Final Morphemes in Dialogues)

  • 한선희
    • 음성과학 / Vol. 7, No. 4 / pp.219-234 / 2000
  • The study of boundary tones in connected speech or dialogues is one of the most underdeveloped areas of Korean prosody. This paper concerns the boundary tones of intonational phrase-final morphemes found in a speech corpus of dialogues. The results of phonetic analysis show that different kinds of boundary tones are realized, depending on the positions of the intonational phrase-final morphemes in the sentences. The study also shows that boundary tone patterning is somewhat related to sentence structure, and, for better speech recognition and speech synthesis, it presents a simple model of boundary tones based on the fundamental frequency contour. The results of this study will contribute to our understanding of the prosodic patterns of Korean connected speech and dialogues.

A Text Similarity Measurement Method Based on Singular Value Decomposition and Semantic Relevance

  • Li, Xu;Yao, Chunlong;Fan, Fenglong;Yu, Xiaoqiang
    • Journal of Information Processing Systems / Vol. 13, No. 4 / pp.863-875 / 2017
  • Traditional text similarity measurement methods based on word frequency vectors ignore the semantic relationships between words, which, together with the high dimensionality and sparsity of document vectors, has become an obstacle to text similarity calculation. To address these problems, an improved singular value decomposition is used to reduce the dimensionality of the text representation model and remove its noise. The optimal number of singular values is analyzed, and the semantic relevance between words is calculated in the constructed semantic space. An inverted index construction algorithm and similarity definitions between vectors are proposed to calculate the similarity between two documents at the semantic level. Experimental results on a benchmark corpus demonstrate that the proposed method improves the F-measure.
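
The abstract does not spell out the "improved" decomposition or the similarity definitions. For orientation only, the standard truncated-SVD (latent semantic analysis) formulation it builds on can be written as follows. The symbols A, k, e_j, and the hatted document vectors are notation introduced here, not the paper's.

```latex
% Term-document matrix A (m terms x n documents), rank-k truncation:
A \approx A_k = U_k \Sigma_k V_k^{\top}, \qquad
U_k \in \mathbb{R}^{m \times k},\quad
\Sigma_k = \operatorname{diag}(\sigma_1, \dots, \sigma_k),\quad
V_k \in \mathbb{R}^{n \times k}

% Document j (column j of A, selected by the unit vector e_j)
% expressed in the k-dimensional semantic space:
\hat{d}_j = \Sigma_k V_k^{\top} e_j = U_k^{\top} A \, e_j

% Cosine similarity between documents i and j in that space:
\operatorname{sim}(d_i, d_j) =
  \frac{\hat{d}_i^{\top} \hat{d}_j}{\lVert \hat{d}_i \rVert \, \lVert \hat{d}_j \rVert}
```

Keeping only the k largest singular values is what reduces dimensionality and suppresses noise. The choice of k corresponds to the "optimal number of singular values" the abstract mentions.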

사건명사의 네트워크 분석 (A Network Analysis of Event Nouns)

  • 김혜영;강범모;이도길
    • 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리) / Proceedings of the 22nd Conference on Hangul and Korean Information Processing (KIISE Language Engineering Research Group), 2010 / pp.94-99 / 2010
  • This paper presents how a network between words is formed. We not only examine the distribution, frequency, and strength of connections between related words, but also suggest what this network means for linguistic and social studies. The source data is the morphologically analyzed component of the Trends 21 corpus, which covers all newspaper articles from four major newspapers (Chosun, Joongang, Donga, and Hankyoreh) issued between 2000 and 2008. Based on nodes, links, and their connectivity indexes (density, degree, and centralization), we retrieved and clustered the related words that form the networks of 20 event nouns. To reduce noise, we considered only words whose t-score is above 1.64. By conducting both network and statistical analyses, we present the network of each event noun.
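
The abstract gives the cutoff (t-score above 1.64) but not the formula. A minimal sketch, assuming the t-score commonly used for collocations in corpus linguistics, t = (O − E) / √O, where O is the observed co-occurrence count and E = f(x)·f(y)/N is the count expected by chance. The function names and the toy counts are hypothetical.

```python
import math


def t_score(pair_freq, freq_x, freq_y, corpus_size):
    """Collocation t-score (assumed formula): (O - E) / sqrt(O)."""
    if pair_freq <= 0:
        raise ValueError("pair_freq must be positive")
    expected = freq_x * freq_y / corpus_size  # E under independence
    return (pair_freq - expected) / math.sqrt(pair_freq)


def significant_neighbors(target_freq, neighbors, corpus_size, threshold=1.64):
    """Keep neighbor words whose t-score with the target exceeds the threshold.

    neighbors maps word -> (co-occurrence count with target, word frequency).
    """
    kept = {}
    for word, (pair_freq, word_freq) in neighbors.items():
        score = t_score(pair_freq, target_freq, word_freq, corpus_size)
        if score > threshold:
            kept[word] = score
    return kept


# Usage with made-up counts: "a" co-occurs far more often than chance, "b" less.
toy_neighbors = {"a": (30, 500), "b": (2, 4000)}
kept = significant_neighbors(1000, toy_neighbors, 1_000_000)
```

A cutoff of 1.64 is close to the one-tailed 5% critical value of the standard normal distribution, which may be why it was chosen. The abstract itself does not say.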

La Variación de /ɾ/ en Posición Posnuclear en el Español Andino del Perú (Variation of /ɾ/ in Postnuclear Position in the Andean Spanish of Peru)

  • Kim, Kyoung-Lai
    • 이베로아메리카 / Vol. 21, No. 1 / pp.127-158 / 2019
  • In this paper, the variation of coda /ɾ/ is analyzed in the Spanish of the Tupe district in Peru. The work was carried out on a corpus of 24 semi-structured interviews. Four variants of /-ɾ/ were distinguished and 1920 tokens were analyzed. Praat was used to recognize and describe the variants, and two statistical analyses were carried out: a descriptive analysis and a probabilistic analysis using the statistical program Goldvarb X. The results show that the assibilated variant is favored in prepausal position and before homorganic consonants; its frequency of occurrence was very low before other consonants. Regarding the social factors that contribute to the assibilated variant, it is favored by young and middle-aged speakers (aged 20 to 60), by those who have not lived on the Peruvian coast for more than a year, and by male speakers.

장기간 양성자펌프억제제의 사용과 위암 (Long Term Proton Pump Inhibitor Use and Gastric Cancer)

  • 서승인
    • Journal of Digestive Cancer Research / Vol. 10, No. 1 / pp.9-15 / 2022
  • Proton pump inhibitors (PPIs), which are potent gastric acid inhibitors, are widely used in gastric acid-related diseases such as gastroesophageal reflux disease and peptic ulcer, and are known as the most frequently used drugs worldwide. However, as their use increases, so does the number of cases of long-term PPI therapy without clear indications. Recently, there have been concerns about the risk of gastric cancer in long-term PPI users. Potential mechanisms for the association between PPIs and gastric cancer include enterochromaffin-like cell proliferation due to hypergastrinemia caused by gastric acid suppression, progression of atrophic gastritis, and the corpus-predominant type through interaction with Helicobacter pylori (H. pylori) infection. Epidemiologic studies have reported conflicting results on this issue, and it is difficult to prove a causal relationship between PPIs and gastric cancer. Nevertheless, long-term PPI therapy should be administered cautiously, based on the individual risk-benefit profile, particularly in patients with a history of H. pylori infection and in regions at high risk of gastric cancer.

The effect of word frequency on the reduction of English CVCC syllables in spontaneous speech

  • Kim, Jungsun
    • 말소리와 음성과학 / Vol. 7, No. 3 / pp.45-53 / 2015
  • The current study investigated CVCC syllables in spontaneous American English speech to find out whether such syllables are produced as phonological units consisting of a string of segments with a hierarchical structure. Transcribed data from the Buckeye Speech Corpus were used for the analysis. The results showed that the constituents within a CVCC syllable as a phonological unit may have phonetic variations (namely, the final coda may undergo deletion). First, voiceless alveolar stops were the segments most frequently deleted when they occurred as the second, final coda consonant of a CVCC syllable; this deletion may be an intermediate process on the way from the abstract form CVCC (with the rime VCC) to the actual pronunciation CVC (with the rime VC), a production strategy employed by some individual speakers. Second, in the internal structure of the rime, the proportion of deletion of the final coda consonant depended on the frequency of the word rather than on the position of the postvocalic consonants on the sonority hierarchy. Finally, the segment following the consonant cluster proved to have an effect on the reduction of that cluster; more precisely, a contrast was observed between obstruents and non-obstruents, reflecting the effect of sonority: when the segment following the consonant cluster was an obstruent, the proportion of deletion of the final coda consonant increased. Among these results, word frequency played a critical role in promoting the deletion of the second coda consonant of clusters in CVCC syllables in spontaneous speech. The current study implies that the structure of syllables as phonological units can vary depending on individual speakers' lexical representations.

뇌신경교세포(腦神經膠細胞) 집단(集團)의 발생(發生)과 이동(移動)에 대한 방사선(放射線) 자기법적(自記法的) 관찰 I, 설치류 뇌(腦)에 외배엽성(外胚葉性) 신경교세포(神經膠細胞) 집단(集團)의 출현(出現)에 대하여 (Radioautographical observations of development and appearance of glia cells in brain I. Appearance of ectodermal glial cell aggregates in rodent brain)

  • 곽수동
    • 대한수의학회지 / Vol. 32, No. 4 / pp.481-487 / 1992
  • The present study was designed to investigate the appearance of congenital aggregates of ectodermal glial cells in the brains of normal rodents. Brain samples were taken from mouse fetuses, juvenile mice, rats, and rabbits. The regions where glial cell aggregates (GCA) appeared were investigated, and the cells in the GCA were identified by electron microscopy. 1. GCA in the mouse fetus tended to be higher in cell density, larger in size, and less frequent than in the juvenile mouse. In juvenile mice, rats, and rabbits, the regions where GCA appeared most frequently were, in order, the subependymal layer in the collateral trigone of the lateral ventricles, the molecular layer of the neocortex, the inner layers of the neocortex other than the molecular layer, the cerebral medulla, the corpus callosum, and the hippocampus. The frequency of GCA in neonatal mice tended to be higher until day 5 after birth and decreased markedly on days 10 and 15 after birth. 2. GCA tended to lie close to one side of blood vessels or neurons rather than to appear perivascularly or perineuronally. 3. Under electron microscopy, GCA in the subependymal layer were composed of immature oligodendrocytes and astrocytes, whereas GCA in the neocortex tended to be more mature and looser and to include some microglial cells with age. The cells in the GCA of older mice tended to be more mature than those in young mice.

여성(女性) 불임(不姙)의 원인(原因)에 관(關)한 문헌적(文獻的) 고찰(考察) (Literatural Study on the Causes of Infertility in Women)

  • 김은섭;유동열
    • 혜화의학회지 / Vol. 9, No. 1 / pp.267-285 / 2000
  • A study of the literature on the causes of infertility in women gave the following results. 1. Scholarly theories on the causes of infertility in women were established from the Huang-Di-Nei-Jing(黃帝內經) through the Jin-Yuan era(金 元 時代); the literature from the Ming-Qing era(明 淸 時代) onward adopted these preceding theories, then subdivided them and added the authors' own. 2. In modern medicine, the causes of infertility in women are divided into impaired oocyte production; impaired union of sperm and oocyte due to abnormalities of the vagina, cervix, corpus, fallopian tubes, pelvis, and peritoneum; endocrine factors; immunologic factors; and emotional factors. 3. Oriental medicine emphasizes functional causes of infertility in women, such as 'asthenia-cool of uterus'(子宮虛寒), 'deficiency of vital energy and blood'(氣血虛), 'deficiency of yin'(陰虛), 'impairment of seven emotions'(七情傷), 'disease of extra meridians'(寄經病), and so forth, whereas modern medicine emphasizes organic causes such as abnormalities of the uterus and ovaries. 4. In the successive literature, 'asthenia-cool of uterus'(子宮虛寒) was the most frequently cited cause of infertility in women, followed by obesity(體肥), 'deficiency of vital energy and blood'(氣血虛), menstrual irregularity(月經不調), 'deficiency of yin'(陰虛), 'impairment of seven emotions'(七情傷), emaciation(體瘦), 'disease of extra meridians'(寄經病), and so forth. 5. With regard to bodily form, obesity(體肥) and emaciation(體瘦) were cited comparatively often.
