Search | Korea Science

Phonetic Tied-Mixture Syllable Model for Efficient Decoding in Korean ASR (효율적 한국어 음성 인식을 위한 PTM 음절 모델)

Kim Bong-Wan;Lee Yong-Jn
- MALSORI
- /
- no.50
- /
- pp.139-150
- /
- 2004
A Phonetic Tied-Mixture (PTM) model has been proposed as a way of efficient decoding in large vocabulary continuous speech recognition systems (LVCSR). It has been reported that PTM model shows better performance in decoding than triphones by sharing a set of mixture components among states of the same topological location[5]. In this paper we propose a Phonetic Tied-Mixture Syllable (PTMS) model which extends PTM technique up to syllables. The proposed PTMS model shows 13% enhancement in decoding speed than PTM. In spite of difference in context dependent modeling (PTM : cross-word context dependent modeling, PTMS : word-internal left-phone dependent modeling), the proposed model shows just less than 1% degradation in word accuracy than PTM with the same beam width. With a different beam width, it shows better word accuracy than in PTM at the same or higher speed.
PDF

An analysis of illocutionary force types in a dialogue, based on the context and modal information in the ending of a word (문맥 및 종결어미의 서법정보를 이용한 대화문의 화수력 분석)

김영길;최병욱
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.33B no.10
- /
- pp.98-106
- /
- 1996
This paper proposes an algorithm for analyzing illocutionary force type (IfT)s in a dialogue, based on the context and modal information in the ending of a word. In korean, the variation of an illocutionary force type that represents a speaker's intention frequently occurs at the ending of a word, according to the type of modal information. And in an analysis of speech acts, the modal information illocutionary force types. In this paper, we analyze real dialogue dta, classify the types of illocutionary forces, perform the manual tagging of IFTs and show the freqency of each IFT's occurence. And we also propose an algorithm to extract IFTs, based on the relationship between the analyzed IFTs and the endings of a word. And we use this proposed algorithm to make an experiment on dialogue data and show its efficiency.
PDF

The Interlanguage Speech Intelligibility Benefit for Listeners (ISIB-L): The Case of English Liquids

Lee, Joo-Kyeong;Xue, Xiaojiao
- Phonetics and Speech Sciences
- /
- v.3 no.1
- /
- pp.51-65
- /
- 2011
This study attempts to investigate the interlanguage speech intelligibility benefit for listeners (ISIB-L), examining Chinese talkers' production of English liquids and its perception of native listeners and non-native Chinese and Korean listeners. An Accent Judgment Task was conducted to measure non-native talkers' and listeners' phonological proficiency, and two levels of proficiency groups (high and low) participated in the experiment. The English liquids /l/ and /r/ produced by Chinese talkers were considered in terms of positions (syllable initial and final), contexts (segment, word and sentence) and lexical density (minimal vs. nonminimal pair) to see if these factors play a role in ISIIB-L. Results showed that both matched and mismatched interlanguage speech intelligibility benefit for listeners occurred except for the initial /l/. Non-native Chinese and Korean listeners, though only with high proficiency, were more accurate at identifying initial /r/, final /l/ and final /r/, but initial /l/ was significantly more intelligible to native listeners than non-native listeners. There was evidence of contextual and lexical density effects on ISIB-L. No ISIB-L was demonstrated in sentence context, but both matched and mismatched ISIB-L was observed in word context; this finding held true for only high proficiency listeners. Listeners recognized the targets better in the non-minimal pair (sparse density) environment than the minimal pair (higher density) environment. These findings suggest that ISIB-L for English liquids is influenced by talkers' and listeners' proficiency, syllable position in association with L1 and L2 phonological structure, context, and word neighborhood density.
PDF

An Iterative Approach to Graph-based Word Sense Disambiguation Using Word2Vec (Word2Vec을 이용한 반복적 접근 방식의 그래프 기반 단어 중의성 해소)

O, Dongsuk;Kang, Sangwoo;Seo, Jungyun
- Korean Journal of Cognitive Science
- /
- v.27 no.1
- /
- pp.43-60
- /
- 2016
Recently, Unsupervised Word Sense Disambiguation research has focused on Graph based disambiguation. Graph-based disambiguation has built a semantic graph based on words collocated in context or sentence. However, building such a graph over all ambiguous word lead to unnecessary addition of edges and nodes (and hence increasing the error). In contrast, our work uses Word2Vec to consider the most similar words to an ambiguous word in the context or sentences, to rebuild a graph of the matched words. As a result, we show a higher F1-Measure value than the previous methods by using Word2Vec.
PDF

The Influence of YouTube "Mukbang" Content Characteristics on Viewers' Satisfaction and Word-of-Mouth Intentions

Jeong Sun LEE;Seunghyeon LEE;Seong Soo CHA
- The Journal of Industrial Distribution & Business
- /
- v.15 no.9
- /
- pp.1-9
- /
- 2024
Purpose: This study examines the impact of YouTube mukbang content characteristics on viewer satisfaction and word-of-mouth behavior. Drawing from theories in media psychology, consumer behavior, and communication studies, we investigate five key content characteristics: credibility, entertainment value, informativeness, visual appeal, and auditory quality. Research design, data and methodology: Using structural equation modeling with data from 206 mukbang viewers, we test hypothesized relationships between these characteristics, viewer satisfaction, and word-of-mouth behavior. Results: Research reveal that credibility and informativeness significantly and positively influence viewer satisfaction, while entertainment value, visual appeal, and auditory quality show no significant effect. Viewer satisfaction positively impacts word-of-mouth behavior. These findings challenge conventional assumptions about video content consumption and highlight the unique nature of mukbang viewing. Conclusions: The study contributes to digital content consumption literature by providing empirical evidence of factors influencing viewer engagement in the mukbang context. It offers practical insights for content creators, marketers, and platform developers, emphasizing the importance of informative and credible content in driving viewer satisfaction and promoting positive word-of-mouth. By extending established media theories to this emerging form of digital entertainment, our research paves the way for future studies. The study's limitations, including its cross-sectional nature and specific cultural context, suggest directions for future research.
https://doi.org/10.13106/jidb.2024.vol15.no9.1 인용 PDF

Word Sense Disambiguation Using Embedded Word Space

Kang, Myung Yun;Kim, Bogyum;Lee, Jae Sung
- Journal of Computing Science and Engineering
- /
- v.11 no.1
- /
- pp.32-38
- /
- 2017
Determining the correct word sense among ambiguous senses is essential for semantic analysis. One of the models for word sense disambiguation is the word space model which is very simple in the structure and effective. However, when the context word vectors in the word space model are merged into sense vectors in a sense inventory, they become typically very large but still suffer from the lexical scarcity. In this paper, we propose a word sense disambiguation method using word embedding that makes the sense inventory vectors compact and efficient due to its additive compositionality. Results of experiments with a Korean sense-tagged corpus show that our method is very effective.
https://doi.org/10.5626/JCSE.2017.11.1.32 인용 PDF KSCI

Context-Weighted Metrics for Example Matching (문맥가중치가 반영된 문장 유사 척도)

Kim, Dong-Joo;Kim, Han-Woo
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.43 no.6 s.312
- /
- pp.43-51
- /
- 2006
This paper proposes a metrics for example matching under the example-based machine translation for English-Korean machine translation. Our metrics served as similarity measure is based on edit-distance algorithm, and it is employed to retrieve the most similar example sentences to a given query. Basically it makes use of simple information such as lemma and part-of-speech information of typographically mismatched words. Edit-distance algorithm cannot fully reflect the context of matched word units. In other words, only if matched word units are ordered, it is considered that the contribution of full matching context to similarity is identical to that of partial matching context for the sequence of words in which mismatching word units are intervened. To overcome this drawback, we propose the context-weighting scheme that uses the contiguity information of matched word units to catch the full context. To change the edit-distance metrics representing dissimilarity to similarity metrics, to apply this context-weighted metrics to the example matching problem and also to rank by similarity, we normalize it. In addition, we generalize previous methods using some linguistic information to one representative system. In order to verify the correctness of the proposed context-weighted metrics, we carry out the experiment to compare it with generalized previous methods.
PDF KSCI

Modified multi-sense skip-gram using weighted context and x-means (가중 문맥벡터와 X-means 방법을 이용한 변형 다의어스킵그램)

Jeong, Hyunwoo;Lee, Eun Ryung
- The Korean Journal of Applied Statistics
- /
- v.34 no.3
- /
- pp.389-399
- /
- 2021
In recent years, word embedding has been a popular field of natural language processing research and a skip-gram has become one successful word embedding method. It assigns a word embedding vector to each word using contexts, which provides an effective way to analyze text data. However, due to the limitation of vector space model, primary word embedding methods assume that every word only have a single meaning. As one faces multi-sense words, that is, words with more than one meaning, in reality, Neelakantan (2014) proposed a multi-sense skip-gram (MSSG) to find embedding vectors corresponding to the each senses of a multi-sense word using a clustering method. In this paper, we propose a modified method of the MSSG to improve statistical accuracy. Moreover, we propose a data-adaptive choice of the number of clusters, that is, the number of meanings for a multi-sense word. Some numerical evidence is given by conducting real data-based simulations.
https://doi.org/10.5351/KJAS.2021.34.3.389 인용 PDF KSCI

Categorization of POIs Using Word and Context information (관심 지점 명칭의 단어와 문맥 정보를 활용한 관심 지점의 분류)

Choi, Su Jeong;Park, Seong-Bae
- Journal of the Korean Institute of Intelligent Systems
- /
- v.24 no.5
- /
- pp.470-476
- /
- 2014
A point of interest is a specific point location such as a cafe, a gallery, a shop, or a park. It consists of a name, a category, a location, and so on. Its information is necessary for location-based application, above all category is basic information. However, category information should be automatically gathered because it costs high to gather it manually. In this paper, we propose a novel method to estimate category of POIs automatically using an inner word and local context. An inner word is a word that contains POI's name. Their name sometimes expose category information. Thus, their name is used as inner word information in estimating category of POIs. Local context information means words around a POI's name in a document that mentioned the name. The context include information to estimate category. The evaluation of the proposed method is performed on two data sets. According to the experimental results, proposed model using combination inner word and local context show higher accuracy than that of model using each.
https://doi.org/10.5391/JKIIS.2014.24.5.470 인용 PDF KSCI

Multicriteria-Based Computer-Aided Pronunciation Quality Evaluation of Sentences

Yoma, Nestor Becerra;Berrios, Leopoldo Benavides;Sepulveda, Jorge Wuth;Torres, Hiram Vivanco
- ETRI Journal
- /
- v.35 no.1
- /
- pp.89-99
- /
- 2013
The problem of the sentence-based pronunciation evaluation task is defined in the context of subjective criteria. Three subjective criteria (that is, the minimum subjective word score, the mean subjective word score, and first impression) are proposed and modeled with the combination of word-based assessment. Then, the subjective criteria are approximated with objective sentence pronunciation scores obtained with the combination of word-based metrics. No a priori studies of common mistakes are required, and class-based language models are used to incorporate incorrect and correct pronunciations. Incorrect pronunciations are automatically incorporated by making use of a competitive lexicon and the phonetic rules of students' mother and target languages. This procedure is applicable to any second language learning context, and subjective-objective sentence score correlations greater than or equal to 0.5 can be achieved when the proposed sentence-based pronunciation criteria are approximated with combinations of word-based scores. Finally, the subjective-objective sentence score correlations reported here are very comparable with those published elsewhere resulting from methods that require a priori studies of pronunciation errors.
https://doi.org/10.4218/etrij.13.0112.0016 인용 PDF KSCI

Search Result 356, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)