• Title/Summary/Keyword: 단어 인식 (word recognition)

An automatic pronunciation evaluation system using non-native teacher's speech model (비원어민 교수자 음성모델을 이용한 자동발음평가 시스템)

  • Park, Hye-bin;Kim, Dong Heon;Joung, Jinoo
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.16 no.2 / pp.131-136 / 2016
  • Appropriate evaluation of a learner's pronunciation is an important part of foreign language education. Learners should be evaluated and receive proper feedback to improve their pronunciation. Because human evaluation is costly and inconsistent, automatic pronunciation evaluation systems have been studied. Most current automatic evaluation systems rely on underlying Automatic Speech Recognition (ASR) technology. In this work, we propose evaluating a learner's pronunciation accuracy and fluency at the word level using ASR and a non-native teacher's speech model. A performance evaluation of our system confirms that the overall evaluation of pronunciation accuracy and fluency reflects the learner's English skill level quite accurately.

High-Speed Korean Address Searching System for Efficient Delivery Point Code Generation (효율적인 순로코드 발생을 위한 고속 한글 주소검색 시스템 개발)

  • Kim, Gyeong-Hwan;Lee, Seok-Goo;Shin, Mi-Young;Nam, Yun-Seok
    • The KIPS Transactions:PartD / v.8D no.3 / pp.273-284 / 2001
  • This paper presents a systematic approach for interpreting Korean addresses based on the postal code. The implementation focuses on producing the final delivery point code from the various types of recognized addresses. Address interpretation has two stages: 1) agreement verification between the recognized postal code and the upper part of the address, and 2) analysis of the lower part of the address. In the agreement verification procedure, the recognized postal code is used as the key to the address dictionary, and each retrieved address is compared with the words in the recognized address. As a result, the boundary between the upper part and the lower part is located. A confusion matrix, introduced to correct possibly misrecognized characters, is applied to improve the performance of the process. In the procedure for interpreting the lower part of the address, a delivery code is assigned using the house number and/or the building name. Several interpretation rules have been developed based on real addresses that were collected. Experiments were performed to evaluate the proposed approach using addresses collected from the Kwangju and Pusan areas.
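
The agreement verification stage described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the dictionary contents, the confusion scores, the romanized address words, the 0.8 threshold, and every function name are hypothetical.

```python
# Hypothetical address dictionary: postal code -> candidate upper-address word lists.
ADDRESS_DICT = {
    "500100": [
        ["Kwangju", "Bukgu", "Yongbongdong"],
        ["Kwangju", "Bukgu", "Ochidong"],
    ],
}

# Hypothetical confusion table: (recognized char, true char) -> match score in [0, 1].
CONFUSION = {("0", "O"): 0.8, ("l", "I"): 0.7}


def char_score(recognized, true):
    """Score 1.0 for an exact match, else the confusion-table score (0.0 if absent)."""
    if recognized == true:
        return 1.0
    return CONFUSION.get((recognized, true), 0.0)


def word_score(recognized, true):
    """Average per-character score; words of different length score 0.0."""
    if len(recognized) != len(true):
        return 0.0
    return sum(char_score(r, t) for r, t in zip(recognized, true)) / len(true)


def verify_upper_address(postal_code, recognized_words, threshold=0.8):
    """Return (matched upper address, remaining lower-part words), or None."""
    best_score, best_candidate = 0.0, None
    for candidate in ADDRESS_DICT.get(postal_code, []):
        if len(candidate) > len(recognized_words):
            continue
        score = sum(
            word_score(r, c) for r, c in zip(recognized_words, candidate)
        ) / len(candidate)
        if score > best_score:
            best_score, best_candidate = score, candidate
    if best_candidate is None or best_score < threshold:
        return None
    boundary = len(best_candidate)  # upper/lower boundary in the recognized words
    return best_candidate, recognized_words[boundary:]
```

In this sketch, a recognized `"0chidong"` still matches the dictionary word `"Ochidong"` because the confusion table gives the pair `("0", "O")` a high score, and the words after the boundary are left for the lower-part analysis.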

Image Analysis and Management Strategy for The National Science Museum Utilizing SNS Big Data Analysis (SNS 빅데이터 분석을 활용한 국립과학관에 대한 이미지 분석과 경영전략 제안)

  • Shin, Seongyeon
    • Journal of the Korea Academia-Industrial cooperation Society / v.21 no.1 / pp.81-89 / 2020
  • The purpose of this study is to investigate science consumers' perceptions of the National Science Museum and suggest effective management strategies for the museum. Research questions were established and analyses were conducted to achieve the research goals. The data were collected and analyzed through a new approach to image analysis that combines qualitative and quantitative methods. First, the image of the concept of science was derived from science consumers (adults, undergraduate and graduate students) through a qualitative research method (group interviews), and then text analysis was conducted. Second, quantitative research was conducted through LDA (Latent Dirichlet Allocation)-based topic modeling of 63,987 words extracted from 12,920 blog post titles from one of the most heavily trafficked portal sites in Korea. The results indicate that the perception of science differs according to the characteristics of the respondents. Further, topic modeling extracted 20 topics from the blog post titles, and the topics were condensed into seven factors. Detailed discussions and managerial implications are provided in the conclusion section.

How to Express Emotion: Role of Prosody and Voice Quality Parameters (감정 표현 방법: 운율과 음질의 역할)

  • Lee, Sang-Min;Lee, Ho-Joon
    • Journal of the Korea Society of Computer and Information / v.19 no.11 / pp.159-166 / 2014
  • In this paper, we examine the role of emotional acoustic cues, including both prosody and voice quality parameters, in modifying a word sense. To extract prosody and voice quality parameters, we used 60 pieces of speech data spoken by six speakers in five different emotional states. We analyzed eight different emotional acoustic cues and used a discriminant analysis technique to find the dominant sequence of acoustic cues. As a result, we found that anger has a close relation with intensity level and 2nd formant bandwidth range; joy has a relative relation with the position of 2nd and 3rd formant values and intensity level; sadness has a strong relation only with prosody cues such as intensity level and pitch level; and fear has a relation with pitch level and 2nd formant value with its bandwidth range. These findings can be used as a guideline for fine-tuning an emotional spoken language generation system, because these distinct sequences of acoustic cues reveal the subtle characteristics of each emotional state.

Development of personalized clothing recommendation service based on artificial intelligence (인공지능 기반 개인 맞춤형 의류 추천 서비스 개발)

  • Kim, Hyoung Suk;Lee, Jong Hyuck;Lee, Hyun Dong
    • Smart Media Journal / v.10 no.1 / pp.116-123 / 2021
  • With the rapid growth of the online fashion market and the resulting expansion of online choices, sellers cannot respond individually to a large number of consumers, even though consumers increasingly demand more personalized recommendation services. Images are tagged as one way to meet consumers' personalization needs, but human tagging is highly subjective, and artificial intelligence tagging uses a very limited vocabulary that does not meet users' needs. To solve this problem, we designed an algorithm that uses AI to recognize the shape, attribute, and emotional information of the product in an image and encodes this information so that everything the image conveys is represented by a combination of codes. This algorithm makes it possible to acquire, in real time, a variety of information contained in an image, such as the sensibility of a fashion image and the TPO information it expresses, which was not possible before. Based on this information, it is possible to go beyond analyzing consumers' tastes and make hyper-personalized clothing recommendations that combine those tastes with information about trends and TPOs.

Emotional Expression Technique using Facial Recognition in User Review (사용자 리뷰에서 표정 인식을 이용한 감정 표현 기법)

  • Choi, Wongwan;Hwang, Mansoo;Kim, Neunghoe
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.22 no.5 / pp.23-28 / 2022
  • Today, the online market has grown rapidly due to the development of digital platforms and the pandemic. Unlike in the existing offline market, the distinct nature of the online market prompts users to check online reviews, and several previous studies have established that reviews play a significant part in influencing users' purchase intentions. However, with the current review-writing method, writers can express emotion only through elements such as tone and word choice, which makes it difficult for other users to understand the writer's emotions. Moreover, a writer who wants to emphasize something must bold the text or change its color manually to reflect their emotions, which is cumbersome. Therefore, in this paper, we propose a technique that identifies the user's emotions through camera-based facial expression recognition, automatically sets a color for each emotion based on existing research on emotions and colors, and applies the colors according to the user's intention.

Development of Dog Name Recommendation System for the Image Abstraction (이미지 추상화 기법을 이용한 반려견 이름 추천 시스템 개발)

  • Jae-Heon Lee;Ye-Rin Jeong;Mi-Kyeong Moon;Seung-Min Park
    • The Journal of the Korea institute of electronic communication sciences / v.18 no.2 / pp.313-320 / 2023
  • The cumulative number of registered dogs rose from 1.07 million in 2016 to 2.32 million in 2020. Animal registration is increasing by more than 10% every year, and a name must be decided when a dog is registered. Owners want to give a name that fits the dog's appearance, but naming is often difficult. This paper describes the development of a system that recognizes dog images and recommends dog names based on similar objects or foods. The system measures the similarity of a dog's image to objects and foods using models trained on images of various objects and foods, and recommends names based on those similarities. In addition, by recommending related words based on the image data of the result, it provides users with more options and increases convenience, interest, and fun. The system is expected to ease users' concerns about naming their dogs, let them conveniently check names that suit their dogs, and increase satisfaction by offering a variety of recommended names.

A Study on the Spectrum Variation of Korean Speech (한국어 음성의 스펙트럼 변화에 관한 연구)

  • Lee Sou-Kil;Song Jeong-Young
    • Journal of Internet Computing and Services / v.6 no.6 / pp.179-186 / 2005
  • Using the frequency characteristics of speech, we can extract and analyze its spectrum. In the speech spectrum, monophthongs are considered stable, but when consonants meet vowels in a syllable or a word, the spectrum changes considerably. This is the biggest obstacle to phoneme-level speech recognition. In this study, using the mel cepstrum and mel bands, which take frequency bands and auditory information into account, we analyze the spectrum of each consonant and vowel and the spectral changes in speech that reflect auditory features, and systematize the results. Finally, we present a basis for segmenting speech into phoneme units.
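
As a rough illustration of how mel bands relate to frequency bands, the sketch below uses one widely used mel-scale formula, mel = 2595 log10(1 + f/700). The abstract does not say which mel formula or band layout the authors used, so the formula choice, the function names, and the band layout here are all assumptions.

```python
import math


def hz_to_mel(freq_hz):
    """Convert frequency in Hz to mels (assumed formula: 2595 * log10(1 + f / 700))."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_edges(low_hz, high_hz, num_bands):
    """Return num_bands + 2 edge frequencies (Hz) equally spaced on the mel scale.

    Band i would span edges[i] .. edges[i + 2], centered on edges[i + 1]
    (a hypothetical overlapping-band layout).
    """
    low_mel, high_mel = hz_to_mel(low_hz), hz_to_mel(high_hz)
    step = (high_mel - low_mel) / (num_bands + 1)
    return [mel_to_hz(low_mel + i * step) for i in range(num_bands + 2)]
```

Because the edges are equally spaced in mels, the bands get wider in Hz as frequency rises, which approximates the ear's coarser resolution at high frequencies.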

A Study on the Real Time Recognition of Korean Isolated Words with Filter Bank Output (필터뱅크 출력을 이용한 실시간 격리 단어 인식에 관한 연구)

  • Kim, Kye-Kook;Lee, Jong-Arc;Kahng, Seong-Jin
    • The Journal of the Acoustical Society of Korea / v.10 no.3 / pp.5-12 / 1991
  • In this paper, 10 Korean city names were recognized. Each name was articulated 5 times by each of 10 male speakers. Filter bank outputs for a total of 500 words were extracted and used as feature parameters. The filter bank consisted of 15 channels with 1/3-octave spacing starting from 200 Hz, built with RC active circuits. Reference templates were created by a clustering algorithm. The DTW algorithm was used to measure the similarity between the reference templates and input words. The Euclidean and Chebyshev distances were used to compare recognition results across distance calculation methods; the error rates were 16.4% and 15.0%, respectively.
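
The template matching described above can be sketched as textbook dynamic time warping (DTW) with a pluggable frame distance. This is a minimal sketch, not the paper's implementation: the step pattern, the absence of path constraints, the assumption that 200 Hz is the lowest channel's center frequency, and all function names are illustrative.

```python
import math


def third_octave_centers(start_hz=200.0, channels=15):
    """Center frequencies with 1/3-octave spacing (assumes start_hz is the lowest center)."""
    return [start_hz * 2.0 ** (k / 3.0) for k in range(channels)]


def euclidean(a, b):
    """Euclidean distance between two filter bank frames."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def chebyshev(a, b):
    """Chebyshev distance: the largest per-channel absolute difference."""
    return max(abs(x - y) for x, y in zip(a, b))


def dtw_distance(seq_a, seq_b, frame_dist):
    """Accumulated DTW cost between two frame sequences (basic symmetric step pattern)."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_dist(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def recognize(input_frames, templates, frame_dist):
    """Return the label of the reference template with the smallest DTW cost."""
    return min(
        templates,
        key=lambda label: dtw_distance(input_frames, templates[label], frame_dist),
    )
```

Passing `euclidean` or `chebyshev` as `frame_dist` switches the distance measure without touching the alignment code, which mirrors the comparison the abstract reports.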

Concept-based Automatic Scoring System for Korean Free-text or Constructed Answers (개념 기반 한국어 서답형 답안의 자동채점 시스템)

  • Park, Il-Nam;Noh, Eun-Hee;Sim, Jae-Ho;Kim, Myung-Hwa;Kang, Seung-Shik
    • Annual Conference on Human and Language Technology / 2012.10a / pp.69-72 / 2012
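  • This paper analyzes the item types of Korean short-answer (word- and phrase-level) questions and presents an automatic scoring method specialized for Korean short answers, designing answer templates and defining concepts so that a computer can recognize how human raters score against a scoring rubric. Using this system on six science items from the 2011 National Assessment of Educational Achievement, each with 1,000 student answer sheets and no more than 500 distinct answer types, we wrote the rubric contents as answer templates, used 250 student answers as training data to update the answer templates, and automatically scored the remaining 750 student answers. The result was an average kappa coefficient of 0.84, indicating almost perfect agreement with actual human scoring.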
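
For readers unfamiliar with the agreement statistic, the sketch below computes Cohen's kappa between human and automatic scores. It assumes the reported coefficient is the unweighted Cohen's kappa, which the abstract does not state explicitly; the function name and the sample scores are illustrative.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa: (p_o - p_e) / (1 - p_e)."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("score lists must be non-empty and of equal length")
    n = len(rater_a)
    # Observed agreement: fraction of answers given the same score by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: from each rater's marginal score distribution.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1.0 - expected)


# Illustrative scores only, not data from the paper.
human_scores = [1, 1, 0, 0]
auto_scores = [1, 1, 0, 1]
kappa = cohens_kappa(human_scores, auto_scores)
```

On the commonly cited Landis and Koch scale, values from 0.81 to 1.00 are described as almost perfect agreement, which is consistent with how the abstract characterizes 0.84.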
