• Title/Summary/Keyword: Language convergence

Development of Korean dataset for joint intent classification and slot filling (발화 의도 예측 및 슬롯 채우기 복합 처리를 위한 한국어 데이터셋 개발)

  • Han, Seunggyu;Lim, Heuiseok
    • Journal of the Korea Convergence Society / v.12 no.1 / pp.57-63 / 2021
  • Research on spoken language understanding, which aims to understand utterances as naturally as a human would, has mostly focused on English. In this paper, we construct a Korean dataset for spoken language understanding based on a conversational corpus between a reservation system and its users. The conversation domain is limited to restaurant reservation. The dataset contains 6,857 sentences annotated with 7 types of slot tags and 5 types of intent tags. When a model proposed in English-based research is trained on our dataset, intent classification accuracy decreases slightly, while the slot filling F1 score decreases significantly.
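As an illustration of what a jointly annotated utterance can look like, here is a minimal sketch assuming BIO-style slot tags. The tag names (`date`, `time`, `party_size`), the intent label, and the sample sentence are hypothetical and are not taken from the dataset described above.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Utterance:
    tokens: List[str]      # whitespace-split tokens of the sentence
    slot_tags: List[str]   # one BIO tag per token (hypothetical tag set)
    intent: str            # one intent label per sentence (hypothetical)


def extract_slots(utt: Utterance) -> List[Tuple[str, str]]:
    """Collect (slot_name, text) pairs from BIO tags."""
    slots = []
    current = None  # [slot_name, [tokens]] of the span being built
    for token, tag in zip(utt.tokens, utt.slot_tags):
        if tag.startswith("B-"):
            current = [tag[2:], [token]]
            slots.append(current)
        elif tag.startswith("I-") and current and current[0] == tag[2:]:
            current[1].append(token)
        else:
            current = None  # "O" tag or an inconsistent I- tag ends the span
    return [(name, " ".join(words)) for name, words in slots]


# Hypothetical example: "Please book for four people at 7 tomorrow evening."
sample = Utterance(
    tokens=["내일", "저녁", "7시에", "네", "명", "예약해", "주세요"],
    slot_tags=["B-date", "B-time", "I-time",
               "B-party_size", "I-party_size", "O", "O"],
    intent="make_reservation",
)
slots = extract_slots(sample)
```

A joint model predicts `intent` once per sentence and one slot tag per token, which is why the two tasks are usually reported with different metrics (accuracy and F1, respectively).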

CNN-based Online Sign Language Translation Counseling System (CNN기반의 온라인 수어통역 상담 시스템에 관한 연구)

  • Park, Won-Cheol;Park, Koo-Rack
    • Journal of Convergence for Information Technology / v.11 no.5 / pp.17-22 / 2021
  • It is difficult for hearing-impaired people to use counseling services without sign language interpretation. Because of a shortage of sign language interpreters, connecting to an interpreter takes a long time, and in many cases no connection is available. In this paper, we therefore propose a system that captures sign language as images using OpenCV and a CNN (Convolutional Neural Network), recognizes sign language motion, converts its meaning into text, and provides the text to users. The counselor can conduct counseling by reading the stored sign language translation. Consultation becomes possible without a professional sign language interpreter, which reduces the burden of waiting for one. If the proposed system is applied to counseling services for hearing-impaired people, it is expected to improve the effectiveness of counseling and to promote academic research on counseling for hearing-impaired people.

Improving Elasticsearch for Chinese, Japanese, and Korean Text Search through Language Detector

  • Kim, Ki-Ju;Cho, Young-Bok
    • Journal of information and communication convergence engineering / v.18 no.1 / pp.33-38 / 2020
  • Elasticsearch is an open-source search and analytics engine that can search petabytes of data in near real time. It is designed as a distributed system that is horizontally scalable and highly available. It provides RESTful APIs, which makes it programming-language agnostic. Full-text search of multilingual text requires language-specific analyzers and field mappings appropriate for indexing and searching such text. Additionally, a language detector can be used in conjunction with the analyzers to improve multilingual text search. Elasticsearch provides more than 40 language analysis plugins that can process text and extract language-specific tokens, as well as language detector plugins that can determine the language of a given text. This study investigates three approaches to indexing and searching Chinese, Japanese, and Korean (CJK) text (single analyzer, multi-fields, and language detector-based) and identifies the advantages of the language detector-based approach over the other two.
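The multi-fields and detector-based approaches can be sketched roughly as follows. This is a minimal illustration, not the paper's configuration: it assumes the `analysis-smartcn`, `analysis-kuromoji`, and `analysis-nori` plugins are installed (they provide the `smartcn`, `kuromoji`, and `nori` analyzers), the field name `content` is hypothetical, and `guess_language` is a crude Unicode-range heuristic standing in for a real language detector plugin.

```python
# Assumed analyzer per language; each requires the matching analysis plugin.
ANALYZERS = {"zh": "smartcn", "ja": "kuromoji", "ko": "nori"}


def build_mapping(field="content"):
    """Index body with one sub-field per language (multi-fields approach)."""
    return {
        "mappings": {
            "properties": {
                field: {
                    "type": "text",
                    "analyzer": "cjk",  # built-in bigram fallback analyzer
                    "fields": {
                        lang: {"type": "text", "analyzer": analyzer}
                        for lang, analyzer in ANALYZERS.items()
                    },
                }
            }
        }
    }


def guess_language(text):
    """Hypothetical stand-in for a language detector (script ranges only)."""
    has_han = False
    for ch in text:
        cp = ord(ch)
        if 0xAC00 <= cp <= 0xD7A3:      # Hangul syllables
            return "ko"
        if 0x3040 <= cp <= 0x30FF:      # Hiragana and Katakana
            return "ja"
        if 0x4E00 <= cp <= 0x9FFF:      # CJK unified ideographs
            has_han = True
    return "zh" if has_han else None


def build_query(text, field="content"):
    """Route the query to the sub-field of the detected language."""
    lang = guess_language(text)
    target = f"{field}.{lang}" if lang else field
    return {"query": {"match": {target: text}}}
```

Both dictionaries are plain request bodies, so they can be sent with any HTTP client or Elasticsearch client library. Querying a single detected sub-field avoids scoring the same text against every language analyzer.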

Research on Form-focused Instruction in Korean Language Education: A Critical Review (한국어교육에서의 형태초점교수법 연구: 비판적 검토)

  • Choi, Sunhee;Kim, Dae-hee
    • Journal of the Korea Convergence Society / v.8 no.2 / pp.269-276 / 2017
  • The purpose of this study is to critically review empirical research on form-focused instruction (FFI) in Korean language education in order to better understand the effectiveness of FFI. To this end, several databases were searched to locate relevant experimental and quasi-experimental studies published in peer-reviewed journals. Of the 66 studies collected, 12 met the inclusion criteria. These studies were then analyzed in terms of subjects, target grammar, treatment, measurement, and the learning outcomes of different techniques. In general, several types of FFI techniques had positive effects on helping learners acquire Korean as a second or foreign language. The results of the study provide a conceptual framework that identifies the major factors affecting the effectiveness of FFI. The results can also inform future meta-analytical research on existing studies.

Bi-directional Maximal Matching Algorithm to Segment Khmer Words in Sentence

  • Mao, Makara;Peng, Sony;Yang, Yixuan;Park, Doo-Soon
    • Journal of Information Processing Systems / v.18 no.4 / pp.549-561 / 2022
  • Khmer script, the official script of Cambodia, is written from left to right without space separators; it is complex and requires further analysis. In the absence of clear standard guidelines, spaces are used inconsistently and informally to separate words in Khmer sentences. A segmentation method should therefore be considered together with future Khmer natural language processing (NLP) to define appropriate rules for Khmer sentences. NLP techniques capable of large-scale language data analysis need to be applied in this scenario. One of the essential tasks in Khmer language processing is splitting a sentence into a series of words and counting the words used in it. Currently, Microsoft Word cannot count Khmer words correctly. This study therefore presents a systematic library that segments Khmer phrases using the bi-directional maximal matching (BiMM) method to address these problems. The BiMM algorithm combines forward maximal matching (FMM) and backward maximal matching (BMM) to improve word segmentation accuracy. A digital tree or prefix tree, also known as a trie, supports the segmentation procedure by finding the children of each word's parent node. The accuracy of BiMM is higher than that of FMM or BMM used independently; moreover, the proposed approach improves dictionary structures and reduces the number of errors. With 94,807 Khmer words, the proposed method reduces the error by 8.57% compared to the FMM and BMM algorithms.
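The general idea of bi-directional maximal matching can be sketched as follows. This is a simplified illustration under stated assumptions, not the paper's library: it uses a plain Python `set` for dictionary lookup instead of a trie, and the tie-breaking rule in `bimm` (fewer tokens, then fewer single-character tokens, otherwise the backward result) is a commonly used heuristic that the abstract does not confirm. A Latin-letter toy dictionary is used so the behavior is easy to follow.

```python
def forward_max_match(text, vocab, max_len):
    """Greedy longest-match segmentation scanning left to right."""
    tokens, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single character is always accepted so the scan cannot stall.
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


def backward_max_match(text, vocab, max_len):
    """Greedy longest-match segmentation scanning right to left."""
    tokens, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                j -= size
                break
    tokens.reverse()
    return tokens


def bimm(text, vocab):
    """Run both directions and keep the segmentation with the lower cost."""
    max_len = max((len(w) for w in vocab), default=1)
    fwd = forward_max_match(text, vocab, max_len)
    bwd = backward_max_match(text, vocab, max_len)

    def cost(tokens):
        # Assumed heuristic: fewer tokens, then fewer single-character tokens.
        return (len(tokens), sum(1 for t in tokens if len(t) == 1))

    return fwd if cost(fwd) < cost(bwd) else bwd


toy_vocab = {"ab", "abc", "cd"}
result = bimm("abcd", toy_vocab)
```

In this toy case the forward pass greedily takes `abc` and leaves a stray `d`, while the backward pass finds `ab` + `cd`; comparing the two is what lets the bi-directional method recover from errors that either direction makes alone.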

A Survey on Open Source based Large Language Models (오픈 소스 기반의 거대 언어 모델 연구 동향: 서베이)

  • Ha-Young Joo;Hyeontaek Oh;Jinhong Yang
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.16 no.4 / pp.193-202 / 2023
  • In recent years, the outstanding performance of large language models (LLMs) trained on extensive datasets has become a hot topic. Because LLM research is being released through open-source approaches, the ecosystem is expanding rapidly. Task-specific, lightweight, and high-performing models are being actively disseminated through additional training techniques that use pre-trained LLMs as foundation models. On the other hand, the performance of LLMs for Korean is subpar because English makes up a significant proportion of the training data of existing LLMs. Research is therefore being carried out on Korean-specific LLMs that allow further training with Korean language data. This paper identifies trends in open-source-based LLMs and introduces research on Korean-specific LLMs; it also describes the applications and limitations of LLMs.

Development of Sign Language Translation System using Motion Recognition of Kinect (키넥트의 모션 인식 기능을 이용한 수화번역 시스템 개발)

  • Lee, Hyun-Suk;Kim, Seung-Pil;Chung, Wan-Young
    • Journal of the Institute of Convergence Signal Processing / v.14 no.4 / pp.235-242 / 2013
  • In this paper, we develop a system that translates sign language through the motion recognition of the Kinect camera system, to support communication between people with hearing or language impairments and people without them. The proposed translation algorithm is built on the core functions of Kinect, and two methods, length normalization and elbow normalization, are introduced to improve the accuracy of sign language translation across different sign language users. The sign language data are then compared in charts to show how effective these normalization methods are. The accuracy of the program is demonstrated by entering 10 databases and translating signs ranging from simple to complex. In addition, the reliability of the translation is improved by applying the program to people with various body shapes and correcting measurement errors related to body shape.

A Study on The Korean Sign Language platform base on DirectX (DirectX 기반의 KSL 실행 플랫폼의 개발과 구현)

  • Ku, Ja-Hyo;Ryoo, Yun-Kyoo
    • Journal of the Korea society of information convergence / v.1 no.1 / pp.25-32 / 2008
  • Advances in digital and multimedia technology have increased the demand for realistic, intuitive information and diverse forms of expression, and the use of animated characters in mass media continues to grow. With the development of graphics techniques, animated characters can now be rendered realistically and smoothly. Although even fine movements such as a character's hair can generally be expressed using diverse data input devices, studies on multimedia technologies for people with disabilities remain insufficient. In this paper, we extract sign language motion data using DirectX and propose a method for creating a Korean Sign Language platform.

PRAAT Software: A Speech Interaction Tool to Analyze Teacher Voices (PRAAT 소프트웨어: 교사 목소리 분석을 위한 맞춤법 상호작용 도구)

  • Kidd, Ella Jane
    • Journal of Convergence for Information Technology / v.9 no.9 / pp.158-165 / 2019
  • Through the use of speech software technology, this paper examines the effects of voice interactions within the inner circle of English. The fundamental frequency (F0) was obtained by analyzing the speech of native speakers (aged 30-55) by nationality, age, and gender. The findings reveal that the Caucasian British female (age 33) and the Caucasian American male (age 55) produced the most interactive speech. The contributing factor is the students' experience with various language styles throughout their language acquisition studies. The results of this study are consistent with Traunmüller & Eriksson (1995) and previous studies, which agree that continuous speech above average is paramount to student engagement and interaction.

Research on Korean Language Textbooks to Activate Media Literacy for the Era of Cultural Convergence (문화융합시대의 미디어 리터러시 활성화를 위한 국어교재 연구)

  • Lim, Ji-Won
    • Journal of Korea Entertainment Industry Association / v.14 no.7 / pp.389-395 / 2020
  • This study proposes a strategy for positively interpreting the narrative meaning of media language, based on a generalized cognitive environment, in order to activate sound media literacy in the era of cultural convergence. In particular, advertising content, in which creativity related to cultural interpretation is most strongly reinforced, was used to lead learners to grasp how information was expressed in a given era and to reproduce meaning interpretation with relevance. In addition, the study attempted to use an argumentative writing strategy when Korean language learners reproduced, in writing, what they were able to interpret cognitively. Developers of public service advertisement content always expect a positive social and cultural effect, and through that effect learners reflect and envision a sound future. Research on activating media literacy in the era of cultural convergence has not yet been widely discussed. We hope the proposed discussion will be actively used in mass media language education within textbooks for Korean language learners. We regret that this study does not include quantitative analysis and expect to present such results in a subsequent paper.