Transformer-based Language Recognition Technique for Big Data

Hwang, Chi-Gon; Yoon, Chang-Pyo; Lee, Soo-Wook

Proceedings of the Korean Institute of Information and Communication Sciences Conference (한국정보통신학회:학술대회논문집)

2022.10a / Pages 267-268 / 2022

The Korea Institute of Information and Communication Engineering (한국정보통신학회)

Hwang, Chi-Gon (Kwangwoon University) ;
Yoon, Chang-Pyo (GyeongGi University of Science and Technology) ;
Lee, Soo-Wook (Kwangwoon University)

Published : 2022.10.03

Abstract

Advances in machine learning have made a variety of techniques available for big data analysis. Big data collected in practice, however, lacks an automated technique for refining identical or similar terms based on semantic analysis of the relationships between words. Such data usually takes the form of sentences, so morphological analysis or sentence-level understanding is required. Natural language processing (NLP), a family of techniques for analyzing natural language, can capture the relationships between words and the meaning of sentences. In this paper, we study the advantages and disadvantages of the Transformer and the Reformer, two techniques that address the shortcomings of the recurrent neural network (RNN), a time-series approach to big data.
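
As an illustration of how a Transformer differs from an RNN, the following is a minimal sketch of scaled dot-product self-attention, the core operation of the Transformer. It is not taken from the paper: the function names (`matmul`, `softmax`, `self_attention`), the toy dimensions, and the randomly initialized projection matrices are all assumptions made for this example. The point it tries to show is that every token attends to every other token in a single step rather than being processed one position at a time.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(x, w_q, w_k, w_v):
    """Scaled dot-product self-attention over a whole sequence at once.

    x: seq_len x d_model token embeddings
    w_q, w_k, w_v: d_model x d_k projection matrices (hypothetical values)
    """
    q, k, v = matmul(x, w_q), matmul(x, w_k), matmul(x, w_v)
    d_k = len(k[0])
    k_t = [list(col) for col in zip(*k)]  # transpose of k
    scores = matmul(q, k_t)  # seq_len x seq_len pairwise scores
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    return matmul(weights, v), weights


# Toy input: 5 tokens with 8-dimensional embeddings, projected to 4 dimensions.
rng = random.Random(0)
seq_len, d_model, d_k = 5, 8, 4


def rand_matrix(rows, cols):
    return [[rng.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]


x = rand_matrix(seq_len, d_model)
w_q, w_k, w_v = (rand_matrix(d_model, d_k) for _ in range(3))
out, weights = self_attention(x, w_q, w_k, w_v)
print(len(out), len(out[0]), len(weights), len(weights[0]))
```

The `weights` matrix has one row and one column per token, so its size grows quadratically with sequence length. That cost is what the Reformer is generally understood to reduce by approximating full attention with locality-sensitive hashing, which this sketch does not implement.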
