Back to Publications

CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration

Authors: Adnan Abdullah, Titon Barua, Reagan Tibbetts, Zijie Chen, Md Jahidul Islam, Ioannis Rekleitis

Abstract: In this paper, we present CaveSeg - the first visual learning pipeline for semantic segmentation and scene parsing for AUV navigation inside underwater caves. We address the problem of scarce annotated training data by preparing a comprehensive dataset for semantic segmentation of underwater cave scenes. It contains pixel annotations for important navigation markers (e.g. caveline, arrows), obstacles (e.g. ground plain and overhead layers), scuba divers, and open areas for servoing. Through comprehensive benchmark analyses on cave systems in USA, Mexico, and Spain locations, we demonstrate that robust deep visual models can be developed based on CaveSeg for fast semantic scene parsing of underwater cave environments. In particular, we formulate a novel transformer-based model that is computationally light and offers near real-time execution in addition to achieving state-of-the-art performance. Finally, we explore the design choices and implications of semantic segmentation for visual servoing by AUVs inside underwater caves. The proposed model and benchmark dataset open up promising opportunities for future research in autonomous underwater cave exploration and mapping.

PDF Video
@inproceedings{AbdullahICRA2024, author = {Adnan Abdullah and Titon Barua and Reagan Tibbetts and Zijie Chen and Md Jahidul Islam and Ioannis Rekleitis}, booktitle = {IEEE International Conference on Robotics and Automation (ICRA)}, title = {CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration}, year = {2024}, volume = {}, number = {}, pages = {3781-3788}, keywords = {}, doi = {} }