• Title/Summary/Keyword: New Words

Search Result 1,475, Processing Time 0.029 seconds

The Effect of Strong Syllables on Lexical Segmentation in English Continuous Speech by Korean Speakers (강음절이 한국어 화자의 영어 연속 음성의 어휘 분절에 미치는 영향)

  • Kim, Sunmi;Nam, Kichun
    • Phonetics and Speech Sciences
    • /
    • v.5 no.2
    • /
    • pp.43-51
    • /
    • 2013
  • English native listeners have a tendency to treat strong syllables in a speech stream as the potential initial syllables of new words, since the majority of lexical words in English have a word-initial stress. The current study investigates whether Korean (L1) - English (L2) late bilinguals perceive strong syllables in English continuous speech as word onsets, as English native listeners do. In Experiment 1, word-spotting was slower when the word-initial syllable was strong, indicating that Korean listeners do not perceive strong syllables as word onsets. Experiment 2 was conducted in order to avoid any possibilities that the results of Experiment 1 may be due to the strong-initial targets themselves used in Experiment 1 being slower to recognize than the weak-initial targets. We employed the gating paradigm in Experiment 2, and measured the Isolation Point (IP, the point at which participants correctly identify a word without subsequently changing their minds) and the Recognition Point (RP, the point at which participants correctly identify the target with 85% or greater confidence) for the targets excised from the non-words in the two conditions of Experiment 1. Both the mean IPs and the mean RPs were significantly earlier for the strong-initial targets, which means that the results of Experiment 1 reflect the difficulty of segmentation when the initial syllable of words was strong. These results are consistent with Kim & Nam (2011), indicating that strong syllables are not perceived as word onsets for Korean listeners and interfere with lexical segmentation in English running speech.

Research on Subword Tokenization of Korean Neural Machine Translation and Proposal for Tokenization Method to Separate Jongsung from Syllables (한국어 인공신경망 기계번역의 서브 워드 분절 연구 및 음절 기반 종성 분리 토큰화 제안)

  • Eo, Sugyeong;Park, Chanjun;Moon, Hyeonseok;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.3
    • /
    • pp.1-7
    • /
    • 2021
  • Since Neural Machine Translation (NMT) uses only a limited number of words, there is a possibility that words that are not registered in the dictionary will be entered as input. The proposed method to alleviate this Out of Vocabulary (OOV) problem is Subword Tokenization, which is a methodology for constructing words by dividing sentences into subword units smaller than words. In this paper, we deal with general subword tokenization algorithms. Furthermore, in order to create a vocabulary that can handle the infinite conjugation of Korean adjectives and verbs, we propose a new methodology for subword tokenization training by separating the Jongsung(coda) from Korean syllables (consisting of Chosung-onset, Jungsung-neucleus and Jongsung-coda). As a result of the experiment, the methodology proposed in this paper outperforms the existing subword tokenization methodology.

A Study on the Perception of Fashion Platforms and Fashion Smart Factories using Big Data Analysis (빅데이터 분석을 이용한 패션 플랫폼과 패션 스마트 팩토리에 대한 인식 연구)

  • Song, Eun-young
    • Fashion & Textile Research Journal
    • /
    • v.23 no.6
    • /
    • pp.799-809
    • /
    • 2021
  • This study aimed to grasp the perceptions and trends in fashion platforms and fashion smart factories using big data analysis. As a research method, big data analysis, fashion platform, and smart factory were identified through literature and prior studies, and text mining analysis and network analysis were performed after collecting text from the web environment between April 2019 and April 2021. After data purification with Textom, the words of fashion platform (1,0591 pieces) and fashion smart factory (9750 pieces) were used for analysis. Key words were derived, the frequency of appearance was calculated, and the results were visualized in word cloud and N-gram. The top 70 words by frequency of appearance were used to generate a matrix, structural equivalence analysis was performed, and the results were displayed using network visualization and dendrograms. The collected data revealed that smart factory had high social issues, but consumer interest and academic research were insufficient, and the amount and frequency of related words on the fashion platform were both high. As a result of structural equalization analysis, it was found that fashion platforms with strong connectivity between clusters are creating new competitiveness with service platforms that add sharing, manufacturing, and curation functions, and fashion smart factories can expect future value to grow together, according to digital technology innovation and platforms. This study can serve as a foundation for future research topics related to fashion platforms and smart factories.

Deep-Learning-based smartphone application for automatic recognition of ingredients on curved containers (곡면 용기에 표시된 성분표 자동 인식을 위한 인공지능 기반 스마트폰 애플리케이션)

  • Hieyong Jeong;Choonsung Shin
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.28 no.6
    • /
    • pp.29-43
    • /
    • 2023
  • Consumers should look at the ingredients of cosmetics or food for their health and purchase them after checking whether they contain allergy-causing ingredients. Therefore, this paper aimed to develop an artificial intelligence-based smartphone application for automatically recognizing the ingredients displayed on a curved container and delivering it to consumers in an easy-to-understand manner. The app needs to allow consumers to immediately comprehend the restricted ingredients by recognizing the ingredients' words in the cropped image. Two major issues should be solved during the development process: First, although there were flat containers for cosmetics or food, most were curved containers. Thus, it was necessary to recognize the ingredient table displayed on the curved containers. Second, since the ingredients' words were displayed on the curved surface, the transformed or line-changed words also needed to be recognized. The proposed new methods were enough to solve the above two problems. The application developed through various tests verified that there was no problem recognizing the ingredients' words contained in a cylindrical curved container.

Two-Path Language Modeling Considering Word Order Structure of Korean (한국어의 어순 구조를 고려한 Two-Path 언어모델링)

  • Shin, Joong-Hwi;Park, Jae-Hyun;Lee, Jung-Tae;Rim, Hae-Chang
    • The Journal of the Acoustical Society of Korea
    • /
    • v.27 no.8
    • /
    • pp.435-442
    • /
    • 2008
  • The n-gram model is appropriate for languages, such as English, in which the word-order is grammatically rigid. However, it is not suitable for Korean in which the word-order is relatively free. Previous work proposed a twoply HMM that reflected the characteristics of Korean but failed to reflect word-order structures among words. In this paper, we define a new segment unit which combines two words in order to reflect the characteristic of word-order among adjacent words that appear in verbal morphemes. Moreover, we propose a two-path language model that estimates probabilities depending on the context based on the proposed segment unit. Experimental results show that the proposed two-path language model yields 25.68% perplexity improvement compared to the previous Korean language models and reduces 94.03% perplexity for the prediction of verbal morphemes where words are combined.

Research on the Automatic Software Keyboard Based on Database (데이터베이스에 근거한 자동 키보드의 입력 방법)

  • Lee Kye Suk;Yong Hwan Seung
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.1
    • /
    • pp.101-110
    • /
    • 2005
  • Recently software keyboard is widely used in mobile devices where restrictive hardware keyboard is available. In this paper, new software-driven keyboard input method is proposed, which use minimum number of keyboard input with small keyboard space generated after analyzing of database. In this software keyboard is generated dynamically at each input step by analyzing all possible input words. Software keyboard, only possible key buttons are displayed for minimizing keyboard space and preventing mistyping. And it also provide input word completion function when the number of the candidate words is within threshold scope.

  • PDF

A Study on Characteristics of Hybrid System on Affordance-based Future Housing using Convergence Technology (컨버전스 기술을 이용한 어포던스 기반 미래 주거공간의 하이브리드 특성에 관한 연구)

  • Kang, Min-Soo;Choo, Seung-Yeon;Park, Yong-Seo
    • Journal of the Korean housing association
    • /
    • v.20 no.5
    • /
    • pp.85-92
    • /
    • 2009
  • In the coming 21st centuries, words of development of information communication technology among the key words being emerged as an important concern have been talked about frequently and ubiquitous environment that helps human living being networked with humans, objects and environments has been rapidly progressed, influencing significantly over the various fields as well as architectural area. And eventually in this architectural area, the space that is desired to be shown to and experienced by the people could be found in the creation of a space in a new form that has not been existed in this world by utilizing the information communication technology. The future housing delicately add using technology and AR system which is an essential element. The purpose of this study is to production and using each element and develop one-step advanced the hybrid system space. We have to select the best way of the construction future housing.

Frame synchronization Confirmation Technique Using Pilot Pattern

  • Song, Young-Joon
    • Journal of Communications and Networks
    • /
    • v.2 no.1
    • /
    • pp.69-75
    • /
    • 2000
  • A new frame synchronization confirmation technique using a pilot pattern of both uplink and downlink channels is proposed for W-CDMA (Wideband Code Division Multiple Access) system. It is shown that by using this technique, we can cancel the side lobe for autocorrelation functions of the frame synchronization words of pilot pattern have the maximum to-of-phase autocorrelation value "4" with two peak values equal in magnitude and opposite in polarity at zero and middle shifts. Due to this side lobe cancellation effect, therefore, the autocorrelation function of the frame synchronization words becomes ideal for the frame synchronization confirmation since double maximum correlation values equal in magnitude and opposite polarity at zero ad middle shifts can be achieved. This property can be used to double check frame synchronization timing and thus. improve the frame synchronization confirmation performance.

  • PDF

Development of Spatio-Temporal Neural Network for Connected Korean Digits Recognition (한국어 연결 숫자음 인식을 위한 시공간 신경회로망의 개발)

  • 이종식
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1995.06a
    • /
    • pp.69-72
    • /
    • 1995
  • In this paper, a new approach for Korean connected digits recognition using the spatio-temporal neural network is reported. The data of seven digits phone numbers are used in the recognition of connected words, and in the initial experiment, digit recognition rate of 28% was achieved. In this paper, to increase recognition rate, two different approaches are analyzed. In the first system, to compensate the STNN's own defect and to emphasize the Korean word's phonic characters, the starting point of phone is pointed by comparing the average magnitude and zero-crossing rate and the ending point is pointed by comparing only zero-crossing rate. The digit recoginiton rate increased to 61%. Also, in the second system, to consider fact that same word's phone is varied severally, the number of STNN's of each word is increased from one to five, and then the varied same word's phones can be included to the increased STNN's. The digit recogniton rate of connected words increased to 89%.

  • PDF

Design of Big Data Preference Analysis System (빅데이터 선호도 분석 시스템 설계)

  • Son, Sung Il;Park, Chan Khon
    • Journal of Korea Multimedia Society
    • /
    • v.17 no.11
    • /
    • pp.1286-1295
    • /
    • 2014
  • This paper suggests the way that it could improve the reliability about preference of user's feedback by adding weighting factor on sentiment analysis, and efficiently make a sentiment analysis of users' emotional perspective on the big data massively generated on twitter. To solve errors on earlier studies, this paper has improved recall and precision of sensibility determination by using sensibility dictionary subdivided sentiment polarity based on the level of sensibility and given impotance to sensibility determination by populating slang, new words, emoticons and idiomatic expressions not in the system dictionary. It has considered the context through conjunctive adverbs fixed in korean characteristics which are free to the word order. It also recognize sensibility words such as TF(Term Frequency), RT(Retweet), Follower which are weighting factors of preference and has increased reliability of preference analysis considering weight on 'a very emotional tweet', 'a recognised tweet from users' and 'a tweeter influencer'