Fine-Tuning Strategies for Weather Condition Shifts: A Comparative Analysis of Models Trained on Synthetic and Real Datasets

  • Jungwoo Kim (Graduate School of Artificial Intelligence, POSTECH) ;
  • Min Jung Lee (Graduate School of Artificial Intelligence, POSTECH) ;
  • Suha Kwak (Dept. of Computer Science, Graduate School of Artificial Intelligence, POSTECH)
  • Published : 2024.05.23

Abstract

Despite advancements in deep learning, existing semantic segmentation models exhibit suboptimal performance under adverse weather conditions, such as fog or rain, whereas they perform well in clear weather conditions. To address this issue, much of the research has focused on making image or feature-level representations weather-independent. However, disentangling the style and content of images remains a challenge. In this work, we propose a novel fine-tuning method, 'freeze-n-update.' We identify a subset of model parameters that are weather-independent and demonstrate that by freezing these parameters and fine-tuning others, segmentation performance can be significantly improved. Experiments on a test dataset confirm both the effectiveness and practicality of our approach.

Keywords

References

  1. Sakaridis, Christos, Dengxin Dai, and Luc Van Gool. "Semantic foggy scene understanding with synthetic data" International Journal of Computer Vision 126, pp. 973-992, 2018. 
  2. Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., ... & Schiele, B. "The cityscapes dataset for semantic urban scene understanding" In Proceedings of the IEEE conference on computer vision and pattern recognition, San Diego, 2016, pp. 3213-3223. 
  3. Hu, X., Fu, C. W., Zhu, L., & Heng, P. A. "Depth-attentional features for single-image rain removal." In Proceedings of the IEEE/CVF Conference on computer vision and pattern recognition, California, 2019, pp. 8022-8031. 
  4. Chen, L. C., Zhu, Y., Papandreou, G., Schroff, F., & Adam, H. "Encoder-decoder with atrous separable convolution for semantic image segmentation" In Proceedings of the European conference on computer vision (ECCV), Munich, 2018, pp. 801-818. 
  5. He, K., Zhang, X., Ren, S., & Sun, J. "Deep residual learning for image recognition" In Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, 2016, pp. 770-778. 
  6. Howard, Andrew G., et al. "Mobilenets: Efficient convolutional neural networks for mobile vision applications." arXiv preprint arXiv:1704.04861, 2017. 
  7. Lee, Y., Chen, A.S., Tajwar, F., Kumar, A., Yao, H., Liang, P., Finn, C. "Surgical fine-tuning improves adaptation to distribution shifts", ICLR, 2022.