Search | Korea Science

Improving Contextual Understanding Using Sparse Attention Models (Sparse Attention 모델을 활용한 효율적인 문맥 이해)

Tae-Hoon Her
- Annual Conference on Human and Language Technology / 2023.10a / pp.694-697 / 2023
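This paper applies a Sparse Attention model to mitigate problems that can arise in contextual understanding. Experiments confirmed that the method considerably reduces the context loss rate and is useful for natural language processing. The study suggests a new direction for better contextual understanding in machine learning and natural language processing, and the author plans to explore various models and methodologies to improve contextual understanding further.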

A Lossless Coding Scheme for Progressive Transmission of Medical Images (의료 영상의 순차전송을 위한 무손실 부호화 기법)

김효준;송준석;이승준;김종효;이충웅
- Journal of Biomedical Engineering Research / v.18 no.4 / pp.349-356 / 1997
In this paper, we propose a lossless coding scheme for progressive transmission of medical images. The input image is decomposed by the proposed fast adaptive subband decomposition method, which is suited to lossless coding. The decomposed images are coded by an arithmetic coder with two conditioning pixels, and the conditioning pixels are selected differently according to the properties of the subbands. The conditioning contexts are usually quantized to reduce the conditional state, and an optimization method for this quantization is proposed to improve the compression ratio. The proposed lossless coding scheme provides an asymmetric codec structure and achieves better compression than JPEG lossless coding [1].

Merging Context Information and Recognition Result for Robust Speech Recognition in Noisy Environments (잡음 환경에서의 강인한 음성인식을 위한 문맥 정보와 음성인식 결과의 융합)

Song, Won-Moon;Kim, Eun-Ju;Kim, Myung-Won
- Proceedings of the Korean Information Science Society Conference / 2005.11b / pp.733-735 / 2005
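Recent speech recognition research has studied methods that fuse several kinds of information at the stage where the recognition result is derived, or that post-process the recognition result to obtain a new one, in order to get more reliable results in noisy environments. This paper proposes a method for speech recognition on personal mobile devices that uses context information capturing the user's utterance patterns to compensate for the drop in recognition rate caused by the loss of speech information in noise. First, context information is built by mining sequential utterance patterns of specific commands from the user's device usage logs or utterance logs. Then, when the confidence of the recognizer's final result is judged to be low at utterance time, the confidence of the previously obtained context information is fused with the recognition rates of the recognizer's candidate words to derive a new recognition result. The criterion for judging the reliability of the recognizer's result was determined experimentally, and the paper proposes a way to fuse candidate-word recognition rates and context information appropriately when reliability falls below that criterion.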
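
The two steps in the abstract above (mining sequential command patterns, then fusing them with low-confidence recognizer output) can be sketched as follows. This is a minimal illustration and not the authors' method: the bigram counting, the weighted-sum fusion rule, the function names, and the `threshold` and `weight` values are all assumptions made for the example.

```python
from collections import Counter, defaultdict


def mine_bigrams(log):
    """Count which command follows which in a usage log (assumed bigram model)."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(log, log[1:]):
        counts[prev][nxt] += 1
    return counts


def context_confidence(counts, prev):
    """Relative frequency of each command following `prev`."""
    total = sum(counts[prev].values())
    if total == 0:
        return {}
    return {word: c / total for word, c in counts[prev].items()}


def fuse(candidates, context, threshold=0.6, weight=0.5):
    """Keep the recognizer's top word when it is confident enough;
    otherwise rescore candidates with a weighted sum (hypothetical rule)."""
    best = max(candidates, key=candidates.get)
    if candidates[best] >= threshold:
        return best
    fused = {
        word: (1 - weight) * score + weight * context.get(word, 0.0)
        for word, score in candidates.items()
    }
    return max(fused, key=fused.get)


log = ["open", "mail", "open", "mail", "open", "call"]
context = context_confidence(mine_bigrams(log), "open")
noisy = {"call": 0.45, "mail": 0.40}   # low-confidence recognizer output
clean = {"call": 0.90, "mail": 0.05}   # confident recognizer output
result_noisy = fuse(noisy, context)
result_clean = fuse(clean, context)
```

With this toy log, the context favors "mail" after "open", so the low-confidence case is rescored while the confident case is left alone.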

Lossless Satellite Image Compression using Improved CAVLC of H.264/AVC (H.264/AVC의 향상된 CAVLC 부호화 방법을 이용한 무손실 위성 영상 압축)

Heo, Jin;Ho, Yo-Sung
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2009.11a / pp.107-110 / 2009
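This paper proposes a new context-based adaptive variable length coding (CAVLC) method, built on the lossless intra coding of H.264/AVC, to improve the efficiency of lossless satellite image compression. Because the conventional CAVLC of H.264/AVC was designed for lossy compression, it does not provide optimal performance in lossless compression, where transform and quantization are not performed. The paper confirms that the statistics of the residual signal differ between lossy and lossless compression, and proposes an improved CAVLC-based lossless satellite image compression method that takes the statistical characteristics of the residual signal in the lossless setting into account. Compressing satellite images with the proposed method reduced the number of bits by about 12% compared with JPEG-LS and CALIC, two representative existing lossless image compression methods.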

Motion Adaptive Lossless Image Compression Algorithm (움직임 적응적인 무손실 영상 압축 알고리즘)

Kim, Young-Ro;Park, Hyun-Sang
- Journal of the Korea Academia-Industrial cooperation Society / v.10 no.4 / pp.736-739 / 2009
In this paper, an efficient lossless compression algorithm using motion adaptation is proposed. It is divided into two parts: a nonlinear predictor based on motion adaptation and a residual data coder. The proposed nonlinear predictor reduces prediction error by learning from its past prediction errors using motion adaptation. The predictor selects between the intra and inter prediction values according to the past prediction error. The reduced error is coded by an existing context-adaptive coding method. Experimental results show that the proposed algorithm achieves a higher compression ratio than context modeling methods such as FELICS, CALIC, and JPEG-LS.
https://doi.org/10.5762/KAIS.2009.10.4.736
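
As a rough illustration of choosing between intra and inter prediction from past prediction errors, the sketch below switches to whichever predictor has the smaller decayed running error. It is not the paper's predictor: the decayed absolute-error score, the `decay` parameter, and the function name are assumptions, and the intra and inter prediction values are simply taken as given inputs.

```python
def adaptive_residuals(actual, intra, inter, decay=0.5):
    """Return prediction residuals, picking intra or inter per sample.

    The choice uses only errors from earlier samples, so a decoder that
    has reconstructed those samples could repeat the same decisions.
    """
    err_intra = 0.0
    err_inter = 0.0
    residuals = []
    for x, p_intra, p_inter in zip(actual, intra, inter):
        # Prefer the predictor with the smaller running error (ties go to intra).
        pred = p_intra if err_intra <= err_inter else p_inter
        residuals.append(x - pred)
        # Update both running errors after the sample is known.
        err_intra = decay * err_intra + abs(x - p_intra)
        err_inter = decay * err_inter + abs(x - p_inter)
    return residuals


# Toy data: the inter prediction is exact, the intra prediction is poor.
residuals = adaptive_residuals(
    actual=[10, 10, 10, 10],
    intra=[0, 0, 0, 0],
    inter=[10, 10, 10, 10],
)
```

After one poorly predicted sample the selector moves to the inter predictor, and the remaining residuals are zero; these residuals would then go to the entropy coder.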

Multi Domain Dialog State Tracking using Domain State (도메인 상태를 이용한 다중 도메인 대화 상태 추적)

Jeon, Hyunmin;Lee, Geunbae
- Annual Conference on Human and Language Technology / 2020.10a / pp.421-426 / 2020
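In multi-domain task-oriented dialogue, existing deep learning approaches to dialog state tracking have studied models that take the user-system dialogue accumulated over several turns as input and extract slot values. However, the computational cost of these models grows as the dialogue gets longer. This paper therefore proposes a method that extracts slot values in multi-domain dialogue without the accumulated dialogue history. Simply removing the history and taking only the current turn's utterance as input, however, leads to a loss of context information. The paper therefore introduces a domain state and proposes a model that tracks it together with the dialog state at every turn. Tracking the domain state identifies which domain the dialogue is currently about. In addition, the previous turn's dialog state and domain state, which carry condensed context information, are taken as input together with the current turn's utterance, which reduces the loss of information. Experiments on the representative datasets MultiWOZ 2.0 and MultiWOZ 2.1 confirmed good dialog state tracking performance without using the dialogue history. Removing the dependence on system responses and past utterances is also expected to make extension to an end-to-end dialogue system somewhat easier.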

Speech Recognition by Integrating Audio, Visual and Contextual Features Based on Neural Networks (신경망 기반 음성, 영상 및 문맥 통합 음성인식)

김명원;한문성;이순신;류정우
- Journal of the Institute of Electronics Engineers of Korea CI / v.41 no.3 / pp.67-77 / 2004
Recent research has focused on the fusion of audio and visual features for reliable speech recognition in noisy environments. In this paper, we propose a neural network based model of robust speech recognition that integrates audio, visual, and contextual information. The Bimodal Neural Network (BMNN) is a multi-layer perceptron of 4 layers, each of which performs a certain level of abstraction of the input features. In the BMNN, the third layer combines audio and visual features of speech to compensate for the loss of audio information caused by noise. To improve the accuracy of speech recognition in noisy environments, we also propose a post-processing step based on contextual information, which consists of sequential patterns of words spoken by a user. Our experimental results show that our model outperforms any single-mode model. In particular, when we use the contextual information, we obtain over 90% recognition accuracy even in noisy environments, which is a significant improvement over the state of the art in speech recognition. Our research demonstrates that diverse sources of information need to be integrated to improve the accuracy of speech recognition, particularly in noisy environments.

Design of Q/A System By Natural Language Processing (자연언어에 의한 질의응답 시스템의 설계)

Kim, Y.K.;Kang, S.H.;Woo, Y.S.;Kim, H.W.;Choi, B.U.
- Annual Conference on Human and Language Technology / 1992.10a / pp.469-477 / 1992
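This paper designs a question-answering system based on natural language processing, with a hotel reservation system as its target. The system consists broadly of a language analysis part, a context processing part, a topic processing part, and a Korean generation part. The language analysis part applies a unification grammar to analyze the query sentence entered by the user, and from the analysis obtains the information used later in question answering: the predicate, the dependency relation values of each noun phrase, and the semantic constraint values. Context processing then extracts information that is missing because of elliptical sentences and similar constructions, compensating for the loss of information. Using this information, topic processing is performed to carry on a natural dialogue between user and system. Finally, the Korean generation part provides the information the user requested, based on the speaker's intention identified by the topic processing part, and generates appropriate Korean to continue the dialogue. The paper shows that, within the fixed domain of hotel reservation, a more efficient question-answering system can be implemented by performing context processing and topic processing with the predicate, noun-phrase dependency relation values, and semantic constraint values extracted by natural language processing.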

Improved CABAC Method for Lossless Image Compression (무손실 영상 압축을 위한 향상된 CABAC 방법)

Heo, Jin;Ho, Yo-Sung
- The Journal of Korean Institute of Communications and Information Sciences / v.36 no.6C / pp.355-360 / 2011
In this paper, we propose a new context-based adaptive binary arithmetic coding (CABAC) method for lossless image compression. Since the conventional CABAC in H.264/AVC was originally designed for lossy coding, it does not yield adequate performance in lossless coding. We therefore propose an improved CABAC method for lossless intra coding that considers the statistical characteristics of residual data in lossless intra coding. Experimental results show that the proposed method reduces the bit rate by 18.2% compared to the conventional CABAC for lossless intra coding.
https://doi.org/10.7840/KICS.2011.36C.6.355

Target Word Selection for English-Korean Machine Translation System using Multiple Knowledge (다양한 지식을 사용한 영한 기계번역에서의 대역어 선택)

Lee, Ki-Young;Kim, Han-Woo
- Journal of the Korea Society of Computer and Information / v.11 no.5 s.43 / pp.75-86 / 2006
Target word selection is one of the most important and difficult tasks in English-Korean machine translation. It affects the translation accuracy of machine translation systems. In this paper, we present a new approach to selecting a Korean target word for an English noun with translation ambiguities, using multiple knowledge sources such as verb frame patterns, sense vectors based on collocations, statistical Korean local context information, and co-occurring POS information. Verb frame patterns constructed from a dictionary and a corpus play an important role in resolving the sparseness problem of collocation data. Sense vectors are sets of collocation data for the case in which an English word with target selection ambiguity is translated to a specific Korean target word. Statistical Korean local context information is N-gram information generated from a Korean corpus. The co-occurring POS information is a statistically significant POS clue that appears with the ambiguous word. The experiment showed promising results for diverse sentences from web documents.
