A study on knowledge distillation to preserve semantic information

Seong-hyun Park; Sangkyun Lee

Proceedings of the Korea Information Processing Society Conference (한국정보처리학회:학술대회논문집)

2024.05a / pp. 772-773 / 2024 / pISSN 2005-0011 / eISSN 2671-7298

Korea Information Processing Society (한국정보처리학회)

의미적 정보를 보존하는 지식 증류에 대한 연구

Seong-hyun Park (박성현), Department of Cyber Defense, Korea University
Sangkyun Lee (이상근), School of Cybersecurity, Korea University

Published: 2024.05.23

Abstract

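Knowledge distillation techniques that also transfer semantic information to the student model have been widely discussed. However, the loss of semantic information that arises when the student model has less capacity than the teacher model has not yet been addressed. In this paper, we treat a layer of the teacher model as the minimal unit of semantic information and present an optimal hidden layer selection algorithm that determines the best distillation target before the student model begins knowledge distillation.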
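
The abstract does not state how the optimal hidden layer is chosen, so the following Python is only an illustrative sketch and not the authors' algorithm. It assumes a hypothetical criterion: each teacher layer is scored by the participation ratio of its per-feature activation variances (a crude proxy for how many dimensions the layer actually uses), and the deepest layer whose score fits within the student's hidden size is picked as the distillation target. The names `effective_dim`, `select_teacher_layer`, and `student_dim` are made up for this example.

```python
def _variance(values):
    """Population variance of a sequence of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def effective_dim(activations):
    """Participation ratio of per-feature variances: (sum v)^2 / sum(v^2).

    `activations` is a list of samples, each a list of feature values.
    Covariances between features are ignored, so this is only a rough proxy.
    """
    variances = [_variance(column) for column in zip(*activations)]
    total = sum(variances)
    squared = sum(v * v for v in variances)
    return 0.0 if squared == 0 else total * total / squared


def select_teacher_layer(teacher_layers, student_dim):
    """Pick a teacher layer index before distillation starts (assumed rule).

    Returns the deepest layer whose effective dimensionality is at most
    `student_dim`; if none fits, returns the layer with the smallest score.
    """
    dims = [effective_dim(layer) for layer in teacher_layers]
    fitting = [i for i, d in enumerate(dims) if d <= student_dim]
    if fitting:
        return max(fitting)
    return min(range(len(dims)), key=dims.__getitem__)


# Toy teacher: 3 layers, 4 samples each, 3 features per sample.
full = [[1, 1, 1], [-1, -1, -1], [1, -1, 1], [-1, 1, -1]]
narrow = [[1, 1, 0], [-1, -1, 0], [1, -1, 0], [-1, 1, 0]]
layers = [full, narrow, full]

chosen = select_teacher_layer(layers, student_dim=2)
```

With this toy data only the middle layer fits a student of width 2, so `chosen` is 1. A real setup would take the activations from a forward pass of the teacher over a calibration set, and the selection rule itself would follow the paper rather than this placeholder.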
