• Title/Summary/Keyword: Language-based

검색결과 6,226건 처리시간 0.039초

Improving Elasticsearch for Chinese, Japanese, and Korean Text Search through Language Detector

  • Kim, Ki-Ju;Cho, Young-Bok
    • Journal of information and communication convergence engineering
    • /
    • 제18권1호
    • /
    • pp.33-38
    • /
    • 2020
  • Elasticsearch is an open source search and analytics engine that can search petabytes of data in near real time. It is designed as a distributed system horizontally scalable and highly available. It provides RESTful APIs, thereby making it programming-language agnostic. Full text search of multilingual text requires language-specific analyzers and field mappings appropriate for indexing and searching multilingual text. Additionally, a language detector can be used in conjunction with the analyzers to improve the multilingual text search. Elasticsearch provides more than 40 language analysis plugins that can process text and extract language-specific tokens and language detector plugins that can determine the language of the given text. This study investigates three different approaches to index and search Chinese, Japanese, and Korean (CJK) text (single analyzer, multi-fields, and language detector-based), and identifies the advantages of the language detector-based approach compared to the other two.

Towards a small language model powered chain-of-reasoning for open-domain question answering

  • Jihyeon Roh;Minho Kim;Kyoungman Bae
    • ETRI Journal
    • /
    • 제46권1호
    • /
    • pp.11-21
    • /
    • 2024
  • We focus on open-domain question-answering tasks that involve a chain-of-reasoning, which are primarily implemented using large language models. With an emphasis on cost-effectiveness, we designed EffiChainQA, an architecture centered on the use of small language models. We employed a retrieval-based language model to address the limitations of large language models, such as the hallucination issue and the lack of updated knowledge. To enhance reasoning capabilities, we introduced a question decomposer that leverages a generative language model and serves as a key component in the chain-of-reasoning process. To generate training data for our question decomposer, we leveraged ChatGPT, which is known for its data augmentation ability. Comprehensive experiments were conducted using the HotpotQA dataset. Our method outperformed several established approaches, including the Chain-of-Thoughts approach, which is based on large language models. Moreover, our results are on par with those of state-of-the-art Retrieve-then-Read methods that utilize large language models.

A Structure and Framework for Sign Language Interaction

  • Kim, Soyoung;Pan, Younghwan
    • 대한인간공학회지
    • /
    • 제34권5호
    • /
    • pp.411-426
    • /
    • 2015
  • Objective: The goal of this thesis is to design the interaction structure and framework of system to recognize sign language. Background: The sign language of meaningful individual gestures is combined to construct a sentence, so it is difficult to interpret and recognize the meaning of hand gesture for system, because of the sequence of continuous gestures. This being so, in order to interpret the meaning of individual gesture correctly, the interaction structure and framework are needed so that they can segment the indication of individual gesture. Method: We analyze 700 sign language words to structuralize the sign language gesture interaction. First of all, we analyze the transformational patterns of the hand gesture. Second, we analyze the movement of the transformational patterns of the hand gesture. Third, we analyze the type of other gestures except hands. Based on this, we design a framework for sign language interaction. Results: We elicited 8 patterns of hand gesture on the basis of the fact on whether the gesture has a change from starting point to ending point. And then, we analyzed the hand movement based on 3 elements: patterns of movement, direction, and whether hand movement is repeating or not. Moreover, we defined 11 movements of other gestures except hands and classified 8 types of interaction. The framework for sign language interaction, which was designed based on this mentioned above, applies to more than 700 individual gestures of the sign language, and can be classified as an individual gesture in spite of situation which has continuous gestures. Conclusion: This study has structuralized in 3 aspects defined to analyze the transformational patterns of the starting point and the ending point of hand shape, hand movement, and other gestures except hands for sign language interaction. Based on this, we designed the framework that can recognize the individual gestures and interpret the meaning more accurately, when meaningful individual gesture is input sequence of continuous gestures. Application: When we develop the system of sign language recognition, we can apply interaction framework to it. Structuralized gesture can be used for using database of sign language, inventing an automatic recognition system, and studying on the action gestures in other areas.

흐름 제어 언어의 통합 처리 (Integrate Processing Scheme of Flow Control Language)

  • 김태완;장천현
    • 정보처리학회논문지D
    • /
    • 제11D권2호
    • /
    • pp.415-422
    • /
    • 2004
  • 산업분야에서 자동화 시스템은 제품의 설계, 생산 공정의 제어, 장애 처리, 품질검사 등과 관련된 처리 과정을 자동으로 수행할 수 있도록 하여 생산성을 향상시킨다. 이러한 자동화 시스템에서 감시 및 제어에 대한 처리 과정을 기술하는 언어를 흐름 제어 언어라 한다. 현재 사용되고 있는 흐름 제어 언어는 문자 기반의 IL, ST와 그래픽 기반의 FBD, SFC, LD가 있다. 일반적으로 감시 제어 시스템에서 사용되는 소프트웨어는 사용할 수 있는 흐름 제어 언어를 2종류 이하로 제한하고 있고, 동일한 시스템 환경에서는 언어의 혼용을 통한 통합 시뮬레이션이 불가능하다. 본 논문에서는 흐름 제어 언어의 특성을 분석하고 기존 시스템 환경에서 언어 작성 및 처리 과정에 대하여 분석하고, 언어의 통합 처리를 위하여 고급언어 형태의 ST를 확장한 EST 언어를 제안하였다. 이러한 연구를 기초로 그래픽 언어인 FBD, LD, SFC를 통합 처리하여 EST로 변환하는 그래픽 언어 편집기와 EST를 저급언어인 교로 변환하는 EST-IL변환기를 구현하였다. 이러한 편집기 및 변환기를 통한 교 기반의 시스템 구현 및 실험 결과는 흐름 제어 언어의 통합 처리 방안을 제시한 것이다.

A Corpus-Based Study on the Vocabulary Development of Korean Learners

  • Sinhye Nam;Chaerin Jang;Sunyoung Kim
    • Journal of Information Processing Systems
    • /
    • 제20권4호
    • /
    • pp.477-490
    • /
    • 2024
  • This study identifies the vocabulary usage patterns of Korean heritage language learners. We analyzed the interlanguage of the Korean heritage language learners and examined their vocabulary usage patterns, especially the major content keywords being used at their respective proficiency levels. The Korean Learner's Corpus from the National Institute of Korean Language is used for the data analysis. We found that as the heritage language learners' proficiency increases, low-frequency (high-level) vocabulary is often used as the keywords and the semantic vocabulary areas expand from daily to social to specialized fields. It is therefore confirmed that the vocabulary use of Korean heritage language learners develops as their proficiency increases. This study confirms the development of Korean vocabulary in Korean heritage language learners and exemplifies how corpus-based applied linguistic research and computer science can be integrated using a keyword extraction algorithm.

성경에 기초한 유아 언어 교육 활동 개발을 통한 기독 예비 유아 교사의 변화 (The Change of Christian Pre-Service Early Childhood Teachers through Development of Bible-Based Early Childhood Language Education Activities)

  • 김민정
    • 기독교교육논총
    • /
    • 제61권
    • /
    • pp.165-201
    • /
    • 2020
  • 본 연구는 성경에 기초한 유아 언어 교육 활동 개발을 통한 기독 예비 유아교사의 변화를 탐구하여 기독유아교육의 언어교육 개발 방향을 모색하는데 목적이 있다. 유아 언어 교육의 세부 주제인 '성경에 기초한 유아 언어 교육 활동 개발'에 참여한 기독교교육과 학생 19명을 대상으로 2018년 9월 3일 ~ 12월 28일 동안 면담, 설문조사, 활동계획안, 성찰이 담긴 포트폴리오 등을 통해 자료를 수집하였다. 수집된 자료를 분석하여 핵심 범주를 도출하고 이를 범주화 하였다. 자료 분석 및 해석의 객관화를 위해 신학 및 유아교육 전문가 2인의 동료 확인을 거쳤다. 연구 결과, 성경에 기초한 유아 언어 교육 활동 개발에 대한 기독 예비 유아교사 경험은 인지적 변화, 인성적 변화, 실천적 변화로 범주화하였다. 첫째, 성경에 기초한 유아 언어 교육 활동을 개발하면서 기독 예비 유아교사는 유아 언어교육의 발달적 성취와 결과보다는 '언어교육활동의 과정'의 인지적 변화를 경험하였다. 또한, 유아 언어 교육 영역의 분리가 아닌 '듣기-말하기-읽기-쓰기의 통합'의 필요성을 인식하였다. 기독 예비 유아교사는 교사중심의 형식적 언어교육과 더불어 '유치원 생활 속의 비형식 언어교육'의 중요성을 인식하였고, 유아 언어 교육의 효과성 검증보다는 '유아 중심의 의미 있는 언어교육 경험'이 중요하다는 인지적 변화가 이루어졌다. 둘째, 성경에 기초한 유아 언어 교육 활동을 개발하면서 기독 예비 유아 교사는 '자신감 있는 교사', '전문성 있는 교사', '반성적 사고와 태도를 가진 교사'로서 인성적 변화를 나타났다. 마지막으로 성경에 기초한 유아 언어 교육 활동을 개발하면서 기독 예비 유아 교사는 '긍정 언어의 힘'을 인식하였고, '바른 언어 사용 습관' 형성과 '기독교 교육과 유아교육 연계'를 위해 노력하는 실천적 변화가 있었다. 성경에 기초한 유아 언어교육 활동 개발을 통해 기독 예비 유아교사는 예측할 수 없는 교육 상황과 빠르게 변화하는 교육 현실 속에서 유아를 위한 진정한 유아 교사가 되기 위한 마음 자세와 교사로서 요구되는 열정의 자질을 함양하게 되어 교사 효능감이 증진되었다. 향후, 기독교교육과 유아교육이 연계된 다양한 교사 교육 프로그램이 지속적이고 체계적으로 이루어지길 기대한다.

한국어교육학에서의 담화 연구 분석 (Issues of Discourse Studies in Korean Language Education)

  • 강현화
    • 한국어교육
    • /
    • 제23권1호
    • /
    • pp.219-256
    • /
    • 2012
  • The aim of this study is to observe the trend of discourse study in language education and analyze the main issues by investigating the literatures related to discourse in Korean language education in the last ten years. This study observed the discourse study conducted in Korean language education from the perspectives of study subject, study method and study data. Moreover, based on the results, it estimated the achievements and effectiveness of the discourse study conducted in Korean language education. The subject of discourse study was mainly dealt with discourse function, discourse pattern, discourse marker, discourse structure. In the study methods, analysis of corpus and survey were mainly used as the study methods, and spoken corpus, written corpus and semi-spoken corpus were used as study materials. In particular, the semi-spoken corpus was used at a very high rate among them. This showed that discourse study in Korean language education was mainly focused on spoken corpus study. This study divided the detailed field of Korean language education into four fields of linguistic knowledge, communication function, teaching activities and learning activities, and observed the trends of discourse study in each field. Overall, it was recognized that relatively many studies were focused on linguistic knowledge, particularly in pragmatic perspective. It can be said that the study based on discourse has a language educational effectiveness in that it is based on actual data and improves practical communication skills in the environment of various languages.

On the Analysis of Natural Language Processing Morphology for the Specialized Corpus in the Railway Domain

  • Won, Jong Un;Jeon, Hong Kyu;Kim, Min Joong;Kim, Beak Hyun;Kim, Young Min
    • International Journal of Internet, Broadcasting and Communication
    • /
    • 제14권4호
    • /
    • pp.189-197
    • /
    • 2022
  • Today, we are exposed to various text-based media such as newspapers, Internet articles, and SNS, and the amount of text data we encounter has increased exponentially due to the recent availability of Internet access using mobile devices such as smartphones. Collecting useful information from a lot of text information is called text analysis, and in order to extract information, it is performed using technologies such as Natural Language Processing (NLP) for processing natural language with the recent development of artificial intelligence. For this purpose, a morpheme analyzer based on everyday language has been disclosed and is being used. Pre-learning language models, which can acquire natural language knowledge through unsupervised learning based on large numbers of corpus, are a very common factor in natural language processing recently, but conventional morpheme analysts are limited in their use in specialized fields. In this paper, as a preliminary work to develop a natural language analysis language model specialized in the railway field, the procedure for construction a corpus specialized in the railway field is presented.

Project-based CALL Class: Linking the Theory and Practice

  • Yang, Eun-Mi
    • 영어어문교육
    • /
    • 제10권1호
    • /
    • pp.53-76
    • /
    • 2004
  • This paper introduces a class model based on a course, Internet English, offered by an English department at a university. The course has dual purposes of developing students I English skills and Internet using skills at the same time. In support of using the Internet for language learning, the advantages of project-based language learning and constructivist learning in relation to CALL are explored. The activities in this course, which are basically project-based under the paradigm of constructivist learning perspective, are explained in detail to show the relationship between second language learning theory and teaching application. The way how the four language skills - speaking, listening, reading, and writing - are integrated in this class is described as well. Finally, judgmental evaluation of the course by the students is noted. The results show that a project-based CALL class could be a promising class model to realize an integrative, constructivist, and authentic learning.

  • PDF

CMC기반의 영어학습 환경에서 상호작용 촉진을 위한 교수설계가 영어학습에 미치는 효과 : 교양 영작문 과목을 중심으로 (A Study on the Effectiveness of the Instructional Design for Further Interaction on English Learning in a CMC Based Language Learning Environment: Focusing on University General English Education)

  • 정양수
    • 한국영어학회지:영어학
    • /
    • 제3권2호
    • /
    • pp.281-308
    • /
    • 2003
  • The purpose of this study is to determine the effects of CMC-based English learning. In this study, CMC components were found to provide circumstances of facilitating interactions between student-student and student-student-teacher, which enabled students to accomplish language learning tasks. Findings of this study are as follows: First, CMC based language learning experience helps students have positive attitudes toward their English language learning. Second, student-student-instructor interaction group outperformed other groups in academic achievement and class activity participation. Third, cooperative learning groups more actively participated in the class activity than the individual learning group resulting in better academic performances. These findings supported the fact that cooperative learning with CMC components are useful in bringing more class participation and positive attitude that were believed to foster language learning than other groups in traditional language learning environments. This study suggests that the instructor needs to use instructional design strategies helpful to facilitate active interactions between instructors and students in order to achieve better effectiveness of English learning in a CMC based learning environment.

  • PDF