Awesome
<div align="center"> <h1>Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter [ECCV2024]</h1>Suqi Song<sup>1†</sup>, Chenxu Zhang<sup>1†</sup>, Peng Zhang<sup>1</sup>, Pengkun Li<sup>2</sup>, Fenglong Song<sup>3</sup>, Lei Zhang<sup>1*</sup>
<sup>1</sup>Chongqing University, <sup>2</sup>Huawei Technologies Co., Ltd., <sup>3</sup>Huawei Noah's Ark Lab
<div> <sup>†</sup> Equal contribution <sup>*</sup> Corresponding author </div> <div> {songsuqi, zhangpeng}@stu.cqu.edu.cn, {zhangchenxu, leizhang}@cqu.edu.cn, {lipengkun3, songfenglong}@huawei.com </div>Abstract
Urban waterlogging poses a major risk to public safety and infrastructure. Conventional methods using water-level sensors need high-maintenance to hardly achieve full coverage. Recent advances employ surveillance camera imagery and deep learning for detection, yet these struggle amidst scarce data and adverse environmental conditions. In this paper, we establish a challenging Urban Waterlogging Benchmark (UW-Bench) under diverse adverse conditions to advance real-world applications. We propose a Large-Small Model co-adapter paradigm (LSM-adapter), which harnesses the substantial generic segmentation potential of large model and the specific task-directed guidance of small model. Specifically, a Triple-S Prompt Adapter module alongside a Dynamic Prompt Combiner are proposed to generate then merge multiple prompts for mask decoder adaptation. Meanwhile, a Histogram Equalization Adap-ter module is designed to infuse the image specific information for image encoder adaptation. Results and analysis show the challenge and superiority of our developed benchmark and algorithm.
<div align="center"> <img src="pictures/fig1_bluemask_0307v2.jpg"> </div>Overview
- We propose an innovative large-small model co-adapter paradigm (LSM-adapter), aiming at achieving win-win regime. In order to learn a robust prompter, a Triple-S prompt adapter (TSP-Adapt) with a dynamic prompt combiner is formulated, enabling a success on adaptation. We pioneer the use of vision foundation model i.e., SAM for urban waterlogging detection, providing new insights for future research.
- Details of the proposed HE-Adapt and Semantic Prompter
- One-stage and Two-stage training strategies
UW-Bench Dataset
<div align="center"> <img src="pictures/dataset.jpg"> </div> <p> Training and testing examples in the developed UW-Bench. For objectively evaluating the capability of the model in real-world applications, we consider both <i>general-sample</i> and <i>hard-sample</i> cases in test set. </p> </div>- Please note that</b> the training set (Baidu Drive) | Google Drive) was collected and labeled by LiVE group of Chongqing University and the test set was provided by Huawei.
- Dataset Password: Sign the Dataset Access Agreement and send it to one of the following e-mail addresses for a password. (songsuqi@stu.cqu.edu.cn/zhangchenxu@cqu.edu.cn/leizhang@cqu.edu.cn)
- Users of this benchmark: Zhejiang University, Nanjing University
Citation
@inproceedings{
song2024lsmadapter,
title={Urban Waterlogging Detection: A Challenging Benchmark and Large-Small Model Co-Adapter},
author={Suqi Song and Chenxu Zhang and Peng Zhang and Pengkun Li and Fenglong Song and Lei Zhang},
journal = {ECCV},
issue_date = {2024}
}