Comparing the performance of Supervised Fine-tuning, Reinforcement Learning, and Chain-of-Hindsight with Llama and OPT models (Llama, OPT 모델을 활용한 Supervised Fine Tuning, Reinforcement Learning, Chain-of-Hindsight 성능 비교)
-
- Annual Conference on Human and Language Technology
- /
- 2023.10a
- /
- pp.217-221
- /
- 2023