• Title/Summary/Keyword: Speech discrimination

A Proposition of the Fuzzy Correlation Dimension for Speaker Recognition (화자인식을 위한 퍼지상관차원 제안)

  • Yoo, Byong-Wook;Kim, Chang-Seok;Park, Hyun-Sook
    • Journal of the Korean Institute of Telematics and Electronics S / v.36S no.1 / pp.115-122 / 1999
  • In this paper, we confirm that the speech signal is a chaotic signal and analyze its chaos dimension for use as a speaker recognition parameter. To improve speaker identification and pattern recognition, we construct a strange attractor that captures an individual's vocal tract characteristics well and apply a fuzzy membership function to the correlation dimension, which yields the proposed fuzzy correlation dimension. By estimating the correlation of the points making up an attractor, limited according to the space dimension value, the fuzzy correlation dimension absorbs the variation between the reference pattern attractor and the test pattern attractor. We investigate the validity of the fuzzy correlation dimension as a speaker recognition parameter by estimating the distance according to the average discrimination error for each speaker and reference pattern.

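The correlation dimension named in this abstract is conventionally estimated from the correlation sum C(r), the fraction of point pairs on a reconstructed attractor that lie closer than a radius r. The sketch below shows that standard (crisp) estimate over a delay embedding, plus an illustrative soft variant in which the hard threshold is replaced by a smooth membership function. The function names, the sigmoid shape, and the `softness` parameter are assumptions made for illustration; the abstract does not give the paper's actual fuzzy formulation.

```python
import math

def delay_embed(signal, dim, lag):
    """Build delay vectors (x[i], x[i+lag], ..., x[i+(dim-1)*lag])."""
    n = len(signal) - (dim - 1) * lag
    return [tuple(signal[i + k * lag] for k in range(dim)) for i in range(n)]

def crisp_membership(d, r):
    """Standard hard threshold: a pair counts fully if closer than r."""
    return 1.0 if d < r else 0.0

def sigmoid_membership(d, r, softness=0.1):
    """Hypothetical soft threshold (an assumption, not the paper's function)."""
    z = (d - r) / (softness * r)
    if z > 700.0:  # avoid overflow in exp for very distant pairs
        return 0.0
    return 1.0 / (1.0 + math.exp(z))

def correlation_sum(points, r, membership=crisp_membership):
    """Average membership over all distinct point pairs at radius r."""
    n = len(points)
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            total += membership(math.dist(points[i], points[j]), r)
    return 2.0 * total / (n * (n - 1))

def correlation_dimension(points, r1, r2, membership=crisp_membership):
    """Slope of log C(r) against log r between two radii.

    Both radii must be large enough that C(r) > 0.
    """
    c1 = correlation_sum(points, r1, membership)
    c2 = correlation_sum(points, r2, membership)
    return (math.log(c2) - math.log(c1)) / (math.log(r2) - math.log(r1))
```

For points spread evenly along a straight line the estimated slope comes out close to 1, which is a quick sanity check before applying the functions to embedded speech frames.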

The Recognition of Korean Single vowels by Use of the Diffusion Filter Bank as a Pre-processor (확산필터뱅크를 전처리기로 사용한 한국어 단모음인식)

  • Huh, Man-Tak;Kim, Jae-Chang
    • The Journal of the Acoustical Society of Korea / v.16 no.1 / pp.81-87 / 1997
  • In this paper, a new pre-processing method for the recognition of single vowels using the spectrum envelope is presented. We use a new method of extracting the spectrum envelope with a diffusion filter bank. By dividing the analysis band of the diffusion filter bank into subbands, we decreased the number of diffusion processes, and by increasing the number of differences, we obtained higher selectivity. As a result, we reduced the total processing time and improved discrimination. Computer simulation gave an average recognition rate of 88.3% for single vowels of natural voice, which confirms that the method is useful for speech recognition based on spectrum analysis of voice signals with many frequency components.


Long Term Average Spectral Analysis for Acoustical Discrimination of Korean Nasal Consonants (한국어 비음의 음향학적 구분을 위한 장구간 스펙트럼(LTAS) 분석)

  • Choi, Soon-Ai;Seong, Cheol-Jae
    • MALSORI / no.60 / pp.67-84 / 2006
  • The purpose of this study is to find acoustic parameters in the frequency domain that distinguish the Korean nasals /m, n, ŋ/ from each other. The new parameters are devised on the basis of the LTAS (Long Term Average Spectrum). The maximum peak amplitude and the corresponding formant frequency are measured in the low and high frequency ranges, respectively. The frequency of the spectral valley and its energy level are also obtained in a specific frequency range of the spectrum. Spectral slope, total energy in a specific frequency range, and statistics of the spectral energy distribution such as centroid, skewness, and kurtosis are suggested as new parameters as well. The parameters that show statistically significant differences across nasals are summarized as follows. 1) In syllable-initial position: the total energy from 1,500 to 2,200 Hz (zeroENG). 2) In syllable-final position: the peak amplitude of the first formant (peak1_a), the formant frequency with maximum peak amplitude from 4,000 to 8,000 Hz (peak2_f), the maximum peak amplitude of the formant frequency from 4,000 to 8,000 Hz (peak2_a), and the total energy from 1,500 to 2,200 Hz (zeroENG).

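Several of the parameters listed in this abstract (centroid, skewness, kurtosis, and total energy in a band) are simple statistics of a spectrum treated as a distribution over frequency. The following is a minimal sketch of those statistics, assuming the long-term average spectrum is already available as linear power values per frequency bin. The function names are hypothetical, and the paper may define its measures differently (for example on a dB scale).

```python
import math

def spectral_moments(freqs, power):
    """Return (centroid, spread, skewness, kurtosis) of a power spectrum.

    The spectrum is normalised to sum to 1 and treated as a probability
    distribution over frequency. Kurtosis is the plain fourth standardised
    moment (not excess kurtosis).
    """
    total = sum(power)
    if total <= 0:
        raise ValueError("spectrum has no energy")
    weights = [p / total for p in power]
    centroid = sum(f * w for f, w in zip(freqs, weights))
    variance = sum(w * (f - centroid) ** 2 for f, w in zip(freqs, weights))
    spread = math.sqrt(variance)
    skewness = sum(w * (f - centroid) ** 3 for f, w in zip(freqs, weights)) / spread ** 3
    kurtosis = sum(w * (f - centroid) ** 4 for f, w in zip(freqs, weights)) / spread ** 4
    return centroid, spread, skewness, kurtosis

def band_energy(freqs, power, low_hz, high_hz):
    """Total power in the band [low_hz, high_hz], inclusive."""
    return sum(p for f, p in zip(freqs, power) if low_hz <= f <= high_hz)
```

With these helpers, a measure in the spirit of zeroENG would be `band_energy(freqs, power, 1500, 2200)` on the LTAS of a nasal segment, under the same assumptions.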

The Perception-Based study of a weak syllable in English Words with Weak-Strong pattern by Korean Learners(I) (약강구조 영어 단어에 대한 초급 및 고급 영어학습자의 약음절 지각과 반응시간(I))

  • Kim, Hee-Sung;Shin, Ji-Young;Kim, Kee-Ho
    • Proceedings of the KSPS conference / 2005.11a / pp.73-77 / 2005
  • The purpose of this study is to observe how Korean learners of English perceive a weak syllable in words with a weak-strong (WS) syllable pattern. In an automated discrimination task run in E-Prime, stimuli with same-word pairs (a-a, b-b) yielded a higher proportion of correct answers and faster reaction times than stimuli with different-word pairs (a-b, b-a). Specifically, for a-b or b-a stimuli, the familiarity (word frequency) of the stressed word following the weak syllable and whether the weak syllable had a coda were two important factors in distinguishing a word with a weak syllable from one without. Although Korean learners with high English proficiency had faster reaction times than those with low proficiency, all Korean learners had some difficulty perceiving the weak syllable at the beginning of the word.


Direction of Global Citizenship Education in the Age of Infodemic : A Case Study of the COVID-19 Pandemic in Korea

  • Jisu Park
    • International journal of advanced smart convergence / v.12 no.1 / pp.82-91 / 2023
  • In 2020, when the COVID-19 pandemic began in earnest, the WHO Director-General warned of the dangers of an infodemic. An infodemic is a phenomenon in which false information spreads rapidly like an epidemic and causes chaos. It was noted that the COVID-19 pandemic is not limited to health problems but is also linked to a variety of issues such as human rights, economic inequality, various forms of discrimination, hate speech, fake news, and global governance. In the field of education, it is necessary to think about how to connect this global situation with school classes. Accordingly, this study suggests a direction for global citizenship education by analyzing how the infodemic spread on Korean social media during the COVID-19 pandemic. According to the results, negative emotions were more common than positive ones among the emotions that generate an infodemic, and anxiety and anger were the most prominent negative emotions. In addition, the infodemic tended to spread more widely with feelings of anger than with anxiety, and feelings of anger led to advocating aggressive policies against certain countries and regions. Therefore, global citizenship education needs to focus on a sense of duty and responsibility as a citizen, not on national identity based on an exclusive sense of belonging. Furthermore, global citizenship education needs to present a blueprint for education that can raise awareness of the global community for a joint response to global challenges and realize common prosperity based on sustainability and justice.

Social Perceptions and Attitudes toward the Elderly Shared Online: Focusing on Social Big Data Analysis (온라인상에서 공유되는 노인에 대한 사회적 인식과 태도: 소셜 빅데이터 분석을 중심으로)

  • An, Soontae;Lee, Hannah;Chung, Soondool
    • 한국노년학 / v.41 no.4 / pp.505-525 / 2021
  • Purpose. The purpose of this study is to examine how the phrase "old person" is expressed and used in the online sphere. Based on the theoretical concept of stigma, this study investigates the images of and attitudes toward the elderly in society, and the characteristics of hate speech aimed at the elderly. Method. This study conducted text mining based on social big data using anonymous conversations. Results. The images of the elderly shared online were found to be generally negative. The attitudes expressed toward them also tended to be negative because of the negative images propagated of the elderly. Hate speech relating to the elderly, in usages such as 'Teul-ttag' and 'Kon-dae', was mainly identified in comments that negatively evaluate the elderly, and these expressions demonstrate the depth of hate and discrimination toward the elderly, who are considered burdensome by young people. Interestingly, hateful expressions toward the elderly appeared more often in connection with political and economic issues than in general content about the elderly. Conclusions. This study discusses ways to enhance inter-generational understanding and solidarity.

A Clinical Study on Binaural Hearing Aid (양이 보청효과에 관한 연구)

  • 김기령;김영명;심윤주
    • Proceedings of the KOR-BRONCHOESO Conference / 1978.06a / pp.9.2-9 / 1978
  • Monaural and binaural hearing aid performance under quiet and noisy conditions was compared with regard to (1) the degree of hearing impairment, (2) the symmetry of the pure tone audiogram, (3) the automatic gain control of the hearing aid, and (4) hearing impairment with recruitment and word discrimination ability. Performance using binaural hearing aids was consistently superior to that using monaural hearing aids. The results were as follows. 1. Speech detection thresholds improved by a mean of 4.25 dB when tested with a Danavox 747 PP stereo-type hearing aid and by a mean of 4.12 dB when tested with hearing aids connected separately to the right and left ears. 2. Binaurally tested speech reception thresholds were superior to monaurally tested thresholds by a mean of 3.56 dB in quiet and by a mean of 5.56 dB in noise. 3. Binaurally tested word discrimination scores were also superior by a mean of 17.09% in quiet and by a mean of 19.63% in noise. 4. Both SRT and word discrimination scores were best in subjects with moderately severe impairment. The one mildly impaired subject performed the poorest of all. In order of performance, the groups were: moderately severe loss, severe loss, moderate loss, and mild loss. 5. The data obtained with AGC aids, compared with those from linear amplification, show that results were very poor when AGC aids were worn in both ears but good when an AGC aid was worn in one ear and linear amplification in the other. 6. The advantages of binaural hearing aids were obvious even 1) with great differences in hearing thresholds between the right and left ears, 2) when the subject was unable to discriminate words without vision, and 3) when the subject had an extreme recruitment phenomenon.


Perceptive evaluation of Korean native speakers on the polysemic sentence final ending produced by Chinese Korean learners (KFL중국인학습자들의 한국어 동형다의 종결어미 발화문에 대한 원어민화자의 지각 평가 양상)

  • Yune, Youngsook
    • Phonetics and Speech Sciences / v.12 no.4 / pp.27-36 / 2020
  • The aim of this study is to investigate the perceptive aspects of the polysemic sentence-final ending "-(eu)lgeol" produced by Chinese learners of Korean. "-(Eu)lgeol" has two different meanings, a guess and a regret, and these meanings are expressed by different prosodic features of the last syllable of "-(eu)lgeol". To examine how Korean native speakers perceive "-(eu)lgeol" sentences produced by Chinese learners of Korean, and to identify the most salient prosodic variable for the semantic discrimination of "-(eu)lgeol" at the perceptive level, we performed a perceptual experiment. The analysed material consisted of four Korean sentences containing "-(eu)lgeol", of which two expressed guesses and the other two expressed regret. Twenty-five Korean native speakers participated in the perceptual experiment. Participants were asked to mark whether the "-(eu)lgeol" sentences they listened to were (1) definitely regrets, (2) probably regrets, (3) ambiguous, (4) probably guesses, or (5) definitely guesses, based on the prosodic features of the last syllable of "-(eu)lgeol". The analysed prosodic variables were sentence boundary tones, slopes of boundary tones, pitch difference between the sentence-final and penultimate syllables, and pitch levels of boundary tones. The results show that all the analysed prosodic variables are significantly correlated with the semantic discrimination of "-(eu)lgeol" and that, among these variables, the pitch difference between the sentence-final and penultimate syllables plays the most salient role.

Financial Fraud Detection using Text Mining Analysis against Municipal Cybercriminality (지자체 사이버 공간 안전을 위한 금융사기 탐지 텍스트 마이닝 방법)

  • Choi, Sukjae;Lee, Jungwon;Kwon, Ohbyung
    • Journal of Intelligence and Information Systems / v.23 no.3 / pp.119-138 / 2017
  • Recently, social networking services (SNS) have become an important channel for marketing as well as personal communication. However, cybercrime has also evolved with the development of information and communication technology, and illegal advertising is distributed on SNS in large quantities. As a result, loss of personal information and even monetary damage occur more frequently. In this study, we propose a method to analyze which sentences and documents posted to SNS are related to financial fraud. First, as a conceptual framework, we developed a matrix of conceptual characteristics of cybercriminality on SNS and emergency management. We also suggested an emergency management process consisting of Pre-Cybercriminality (e.g. risk identification) and Post-Cybercriminality steps. Among these, this paper focuses on risk identification. The main process consists of data collection, preprocessing, and analysis. First, we selected two seed words, 'daechul' (loan) and 'sachae' (private loan), and collected data containing these words from SNS such as Twitter. The collected data were given to two researchers, who decided whether each item was related to cybercriminality, particularly financial fraud. We then selected some of the vocabulary as keywords when it was related to nominals and symbols. With the selected keywords, we searched web materials such as Twitter, news, and blogs and collected more than 820,000 articles. The collected articles were refined through preprocessing and made into learning data. Preprocessing is divided into three steps: performing morphological analysis, removing stop words, and selecting valid parts of speech. In the morphological analysis step, a complex sentence is transformed into morpheme units to enable mechanical analysis. In the stop word removal step, non-lexical elements such as numbers, punctuation marks, and double spaces are removed from the text. In the part-of-speech selection step, only two kinds, nouns and symbols, are considered. Since nouns refer to things, they express the intent of a message better than other parts of speech. Moreover, the more illegal the text is, the more frequently symbols are used. To turn the selected data into learning data, each item must be classified as legitimate or not, so each is labeled 'legal' or 'illegal'. The processed data are then converted into a corpus and a document-term matrix. Finally, the 'legal' and 'illegal' files were mixed and randomly divided into a learning data set and a test data set. In this study, we set the learning data to 70% and the test data to 30%. SVM was used as the discrimination algorithm. Since SVM requires gamma and cost values as its main parameters, we set gamma to 0.5 and cost to 10, based on the optimal value function. The cost is set higher than in general cases. To show the feasibility of the proposed idea, we compared the proposed method with MLE (Maximum Likelihood Estimation), Term Frequency, and Collective Intelligence methods. Overall accuracy was used as the metric. The overall accuracy of the proposed method was 92.41% for illegal loan advertisements and 77.75% for illegal visit sales, clearly superior to that of Term Frequency, MLE, and the other methods. Hence, the result suggests that the proposed method is valid and practically usable. In this paper, we propose a framework for managing crises caused by abnormalities in unstructured data sources such as SNS. We hope this study will contribute to academia by identifying what to consider when applying SVM-like discrimination algorithms to text analysis. Moreover, the study will also contribute to practitioners in the fields of brand management and opinion mining.
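The data preparation described in this abstract (filtering tokens, building a document-term matrix, and a random 70/30 split) can be sketched with the standard library alone. This is a minimal illustration under stated assumptions: whitespace splitting stands in for the morphological analysis and part-of-speech selection, and the function names, stop-word handling, and seed are all hypothetical rather than taken from the paper. The classifier itself is not shown.

```python
import random
from collections import Counter

def tokenize(text, stop_words=frozenset()):
    """Whitespace split standing in for morphological analysis (assumption)."""
    return [t for t in text.lower().split() if t not in stop_words]

def document_term_matrix(docs, stop_words=frozenset()):
    """Return (vocabulary, matrix) where matrix[i][j] counts term j in doc i."""
    token_lists = [tokenize(d, stop_words) for d in docs]
    vocab = sorted({t for tokens in token_lists for t in tokens})
    index = {term: j for j, term in enumerate(vocab)}
    matrix = []
    for tokens in token_lists:
        row = [0] * len(vocab)
        for term, count in Counter(tokens).items():
            row[index[term]] = count
        matrix.append(row)
    return vocab, matrix

def train_test_split(rows, labels, train_fraction=0.7, seed=0):
    """Shuffle rows with their labels and split, 70/30 by default."""
    order = list(range(len(rows)))
    random.Random(seed).shuffle(order)
    cut = round(train_fraction * len(order))

    def pick(indices):
        return [rows[i] for i in indices], [labels[i] for i in indices]

    return pick(order[:cut]), pick(order[cut:])
```

The resulting training rows and 'legal'/'illegal' labels would then be passed to an SVM configured with the reported gamma of 0.5 and cost of 10, using whichever SVM implementation is at hand.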

Pivot Discrimination Approach for Paraphrase Extraction from Bilingual Corpus (이중 언어 기반 패러프레이즈 추출을 위한 피봇 차별화 방법)

  • Park, Esther;Lee, Hyoung-Gyu;Kim, Min-Jeong;Rim, Hae-Chang
    • Korean Journal of Cognitive Science / v.22 no.1 / pp.57-78 / 2011
  • Paraphrasing is the act of rewriting a text in other words without altering its meaning. Paraphrases can be used in many fields of natural language processing. In particular, paraphrases can be incorporated into machine translation to improve the coverage and quality of translation. Recent approaches to paraphrase extraction use bilingual parallel corpora, which consist of aligned sentence pairs. In these approaches, paraphrases are identified from the word alignment result through pivot phrases, which are phrases in one language to which two or more phrases in the other language are connected. However, word alignment is itself a very difficult task, so there can be many alignment errors, and these errors can lead to the selection of incorrect pivot phrases. In this study, we propose a paraphrase extraction method that discriminates good pivot phrases from bad ones. Each pivot phrase is weighted according to its reliability, which is scored using lexical and part-of-speech information. The experimental results show that the proposed method achieves higher precision and recall in paraphrase extraction than the baseline. We also show that the extracted paraphrases can increase the coverage of Korean-English machine translation.

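The pivot idea in this abstract can be illustrated with the usual pivot scoring scheme, in which a candidate pair (e1, e2) is scored by summing p(f | e1) · p(e2 | f) over every pivot phrase f aligned to both. The sketch below adds a per-pivot reliability weight to that sum. How the paper computes the weight from lexical and part-of-speech information is not stated in the abstract, so the weights here are plain hypothetical inputs, as are the function and parameter names.

```python
from collections import defaultdict

def paraphrase_scores(phrase_to_pivot, pivot_to_phrase, pivot_weight=None):
    """Score paraphrase pairs (e1, e2) by summing over shared pivot phrases.

    phrase_to_pivot[e1][f] is p(f | e1) and pivot_to_phrase[f][e2] is
    p(e2 | f). pivot_weight[f] is a reliability weight in [0, 1]; pivots
    without an entry default to 1.0, which gives the unweighted score.
    """
    pivot_weight = pivot_weight or {}
    scores = defaultdict(float)
    for e1, pivots in phrase_to_pivot.items():
        for f, p_f_given_e1 in pivots.items():
            weight = pivot_weight.get(f, 1.0)
            for e2, p_e2_given_f in pivot_to_phrase.get(f, {}).items():
                if e2 != e1:  # a phrase is not its own paraphrase
                    scores[(e1, e2)] += weight * p_f_given_e1 * p_e2_given_f
    return dict(scores)
```

Down-weighting a pivot that came from a doubtful alignment lowers the score of every paraphrase pair that depends on it, which is the discrimination effect the abstract describes.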