Research on embedded system porting of SemiVL(Semi-Supervised Semantic segmentation with Vision-Language Guidance) model

Seung-Min Park;Du-Sang Kim;

한국정보처리학회:학술대회논문집 (Annual Conference of KIPS)

한국정보처리학회 2024년도 추계학술발표대회
/
Pages.561-562
/
2024
/
2005-0011(pISSN)
/
2671-7298(eISSN)

한국정보처리학회 (Korea Information Processing Society)

SemiVL(Semi-Supervised Semantic segmentation with Vision-Language Guidance) 모델의 임베디드 시스템 포팅(Porting)에 관한 연구

Research on embedded system porting of SemiVL(Semi-Supervised Semantic segmentation with Vision-Language Guidance) model

박승민 (동서울대학교 컴퓨터소프트웨어학과) ;
김두상 (동서울대학교 컴퓨터소프트웨어학과)

Seung-Min Park (Dept. of Computer software, Dong seoul University) ;
Du-Sang Kim (Dept. of Computer software, Dong seoul University)

발행 : 2024.10.31

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

SemiVL(Semi-Supervised Semantic Segmentation with Vision-Language Guidance) 모델은 자원이 제한된 환경에서도 높은 이미지 분할 성능을 발휘하는 준지도 학습 기반의 시맨틱 세그멘테이션 모델이다. 본 논문은 PyTorch 프레임워크에서 TorchScript 프레임워크로 변환된 SemiVL 모델을 임베디드 시스템 환경(Google Pixel 2)에 적용하여 온디바이스 AI를 구현한 연구이다. 목표는 데스크톱 GPU 환경과 유사한 추론 성능을 달성하는 것이었다. 성능 평가는 Pascal VOC 데이터셋을 사용하였으며, mIoU(mean Intersection over Union)와 추론 시간을 주요 지표로 측정하였다. 실험 결과, TorchScript로 변환된 SemiVL 모델은 데스크톱 PC에서 77.5%의 mIoU와 6438.99ms의 추론 시간을 기록하였고, Google Pixel 2에서는 62.8%의 mIoU와 6658.45ms의 추론 시간을 달성하였다. 이 결과는 임베디드 시스템 환경에서 SemiVL 모델이 온디바이스 AI 솔루션으로 활용될 수 있음을 보여준다.

키워드

참고문헌

Lukas Hoyer, et.al "SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance", ECCV24, 2024
Feng Liang, et.al "Open-vocabulary semantic segmentation with mask-adapted clip". In CVPR, pages 7061-7070, 2023

한국정보처리학회:학술대회논문집 (Annual Conference of KIPS)

SemiVL(Semi-Supervised Semantic segmentation with Vision-Language Guidance) 모델의 임베디드 시스템 포팅(Porting)에 관한 연구

Research on embedded system porting of SemiVL(Semi-Supervised Semantic segmentation with Vision-Language Guidance) model

초록

키워드

참고문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)