Awesome
<div align="center"> <h1> A Survey on Visual Mamba </h1> </div>Authors: Hanwei Zhang, Ying Zhu, Dan Wang, Lijun Zhang, Tianxiang Chen, Ziyang Wang and Zi Ye.
</p> <img src="num.png" width="100%" height="auto"> <hr />A curated list of awesome Mamba for Computer Vision, inspired by the other awesome-initiatives. We intend to regularly update the relevant latest papers and their open-source implementations on this page.
We strongly encourage the researchers that want to promote their fantastic work to the community to make pull request, remind us on issue, or contact with email to update their paper's information!
Citation
If you find the listing and survey useful for your work, please cite the paper:
- Zhang, Hanwei, et al. "A Survey on Visual Mamba." Applied Sciences 13.14 (2024): 5683.
@article{zhang2024survey,
title={A Survey on Visual Mamba},
author={Zhang, Hanwei and Zhu, Ying and Wang, Dan and Zhang, Lijun and Chen, Tianxiang and Wang, Ziyang and Ye, Zi},
journal={Applied Sciences},
volume={14},
number={13},
pages={5683},
year={2024},
publisher={MDPI}
}
Overview
- Survey Papers
- Mamba Backbone
- Image Classification
- Object Detection
- Image Segmentation
- Video Classification
- Video Understanding
- Multi-Modal Understanding
- Video Prediction
- Image Registration
- Image Super-Resolution
- Image Restoration
- Image Dehazing
- Image Derain
- Image Deblurring
- Visual Generation
- Point Cloud
- Depth Estimation
- 3D Reconstruction
- Video Generation
- Others
Survey Papers
A Survey on Visual Mamba. [24th April., 2024].<br> Zhang, Hanwei, Ying Zhu, Dan Wang, Lijun Zhang, Tianxiang Chen, Ziyang Wang, and Zi Ye.<br> [PDF]
A survey on vision mamba: Models, applications and challenges. [29th April., 2024].<br> Xu, Rui, Shu Yang, Yihui Wang, Bo Du, and Hao Chen.<br> [PDF]
Vision Mamba: A Comprehensive Survey and Taxonomy [7th May., 2024].<br> Liu, Xiao, Chenxu Zhang, and Lei Zhang.<br> [PDF]
Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study. [14th May., 2024].<br> Zhu, Qinfeng, Yuan Fang, Yuanzhi Cai, Cheng Chen, and Lei Fan.<br> [PDF]
Mamba-360: Survey of state space models as transformer alternative for long sequence modelling: Methods, applications, and challenges. [24th April., 2024].<br> Patro, Badri Narayana, and Vijay Srinivas Agneeswaran.<br> [PDF]
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis. [5th Jun., 2024].<br> Moein Heidari, Sina Ghorbani Kolahi, Sanaz Karimijafarbigloo, Bobby Azad, Afshin Bozorgpour, Soheila Hatami, Reza Azad, Ali Diba, Ulas Bagci, Dorit Merhof, Ilker Hacihaliloglu.<br> [PDF]
Mamba Backbone
Vision mamba: Efficient visual representation learning with bidirectional state space mode. [17th Jan., 2024].<br> Zhu, Lianghui, Bencheng Liao, Qian Zhang, Xinlong Wang, Wenyu Liu, and Xinggang Wang.<br> [PDF]
Vmamba: Visual state space model. [18th Jan., 2024].<br> Liu, Yue, Yunjie Tian, Yuzhong Zhao, Hongtian Yu, Lingxi Xie, Yaowei Wang, Qixiang Ye, and Yunfan Liu.<br> [PDF]
Plainmamba: Improving non-hierarchical mamba in visual recognition. [26th Mar., 2024].<br> Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley.<br> [PDF]
Localmamba: Visual state space model with windowed selective scan. [14th Mar., 2024].<br> Tao Huang, Xiaohuan Pei, Shan You, Fei Wang, Chen Qian, Chang Xu.<br> [PDF]
Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data. [8th Feb., 2024].<br> Shufan Li, Harkanwar Singh, Aditya Grover.<br> [PDF]
SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series. [22nd Mar., 2024].<br> Badri N. Patro, Vijay S. Agneeswaran.<br> [PDF]
Scalable Visual State Space Model with Fractal Scanning. [23rd Mar., 2024].<br> Lv Tang, HaoKe Xiao, Peng-Tao Jiang, Hao Zhang, Jinwei Chen, Bo Li.<br> [PDF]
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model. [23rd Mar., 2024].<br> Yuheng Shi, Minjing Dong, Chang Xu.<br> [PDF], [Code]
Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model. [28th May., 2024].<br> Wenbing Li, Hang Zhou, Zikai Song, Wei Yang.<br> [PDF]
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality. [31st May., 2024].<br> Tri Dao, Albert Gu.<br> [PDF]
Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain. [29th May., 2024].<br> Juntao Zhang, Kun Bian, Peng Cheng, Wenbo An, Jianning Liu, Jun Zhou.<br> [PDF], [Code]
Autoregressive Pretraining with Mamba in Vision. [11th Jan., 2024].<br> Sucheng Ren, Xianhang Li, Haoqin Tu, Feng Wang, Fangxun Shu, Lei Zhang, Jieru Mei, Linjie Yang, Peng Wang, Heng Wang, Alan Yuille, Cihang Xie.<br> [PDF], [Code]
Image Classification
Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning. [24th Feb., 2024].<br> Chen, Chi-Sheng, Guan-Ying Chen, Dong Zhou, Di Jiang, and Dai-Shi Chen.<br> [PDF]
Medmamba: Vision mamba for medical image classification. [6th Mar., 2024].<br> Yue, Yubiao, and Zhenzhang Li.<br> [PDF]
CMViM: Contrastive Masked Vim Autoencoder for 3D Multi-modal Representation Learning for AD classification. [25th Mar., 2024].<br> Guangqian Yang, Kangrui Du, Zhihan Yang, Ye Du, Yongping Zheng, Shujun Wang.<br> [PDF]
DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification. [11th Jan., 2024].<br> Jiamu Sheng, Jingyi Zhou, Jiong Wang, Peng Ye, Jiayuan Fan.<br> [PDF]
Object Detection
State Space Models for Event Cameras. [23th Feb., 2024].<br> Nikola Zubić, Mathias Gehrig, Davide Scaramuzza.<br> [PDF]
MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection. [3rd Apr., 2024].<br> Tianxiang Chen, Zhentao Tan, Tao Gong, Qi Chu, Yue Wu, Bin Liu, Jieping Ye, Nenghai Yu.<br> [PDF] [Code]
CDMamba: Remote Sensing Image Change Detection with Mamba. [6th JUn., 2024].<br> Haotian Zhang, Keyan Chen, Chenyang Liu, Hao Chen, Zhengxia Zou, Zhenwei Shi.<br> [PDF] [Code]
Image Segmentation
ReMamber: Referring Image Segmentation with Mamba Twister. [26th Mar., 2024].<br> Yuhuan Yang, Chaofan Ma, Jiangchao Yao, Zhun Zhong, Ya Zhang, Yanfeng Wang.<br> [PDF]
Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation. [7th Feb., 2024].<br> Ziyang Wang, Jian-Qing Zheng, Yichi Zhang, Ge Cui, Lei Li.<br> [PDF]
Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation. [5th Apr., 2024].<br> Zifu Wan, Yuhao Wang, Silong Yong, Pingping Zhang, Simon Stepputtis, Katia Sycara, Yaqi Xie.<br> [PDF]
Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model. [11th Apr., 2024].<br> Qinfeng Zhu, Yuanzhi Cai, Yuan Fang, Yihan Yang, Cheng Chen, Lei Fan, Anh Nguyen.<br> [PDF] [Code]
nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model. [5th Feb., 2024].<br> Haifan Gong, Luoyao Kang, Yitao Wang, Xiang Wan, Haofeng Li.<br> [PDF] [Code]
T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation. [1st Apr., 2024].<br> Jing Hao, Lei He, Kuo Feng Hung.<br> [PDF] [Code]
Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention. [12th Mar., 2024].<br> Jinhong Wang, Jintai Chen, Danny Chen, Jian Wu.<br> [PDF] [Code]
RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation. [3rd Apr., 2024].<br> Xianping Ma, Xiaokang Zhang, Man-On Pun.<br> [PDF] [Code]
VM-UNet: Vision Mamba UNet for Medical Image Segmentation. [4th Feb., 2024].<br> Jiacheng Ruan, Suncheng Xiang.<br> [PDF] [Code]
UU-Mamba: Uncertainty-aware U-Mamba for Cardiac Image Segmentation. [25th May., 2024].<br> Ting Yu Tsai, Li Lin, Shu Hu, Ming-Ching, Hongtu Zhu, Xin Wang.<br> [PDF]
MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba. [10th Jan., 2024].<br> Zhongping Ji.<br> [PDF]
Segmentation in X-ray Fluoroscopy Utilizing Virtual Simulations of Cardiovascular Procedures. [2024].<br> Andersson Rasmus, Ekerstedt Martin.<br> [PDF]
Rotate to scan: Unet-like mamba with triplet ssm module for medical image segmentation. [2024].<br> Hao Tang, Lianglun Cheng, Guoheng Huang, Zhengguang Tan, Junhao Lu, Kaihong Wu.<br> [PDF]
Video Classification
Long Movie Clip Classification with State-Space Video Models. [14th Nov., 2022].<br> Islam, Md Mohaiminul, and Gedas Bertasius.<br> [PDF]
Video Understanding
VideoMamba: State Space Model for Efficient Video Understanding. [11th Mar., 2024].<br> Kunchang Li, Xinhao Li, Yi Wang, Yinan He, Yali Wang, Limin Wang, Yu Qiao.<br> [PDF]
SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding. [1st Apr., 2024].<br> Wenrui Li, Xiaopeng Hong, Xiaopeng Fan.<br> [PDF]
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding. [14th Mar., 2024].<br> Guo Chen, Yifei Huang, Jilan Xu, Baoqi Pei, Zhe Chen, Zhiqi Li, Jiahao Wang, Kunchang Li, Tong Lu, Limin Wang.<br> [PDF]
Image Registration
MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable Registration. [25th Jan., 2024].<br> Tao Guo, Yinuo Wang, Shihao Shu, Diansheng Chen, Zhouping Tang, Cai Meng, Xiangzhi Bai.<br> [PDF] [Code]
VMambaMorph: a Multi-Modality Deformable Image Registration Framework based on Visual State Space Model with Cross-Scan Module. [7th Apr., 2024].<br> Ziyang Wang, Jian-Qing Zheng, Chao Ma, Tao Guo.<br> [PDF]
Multi-Modal Understanding
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference. [21th Mar., 2024].<br> Han Zhao, Min Zhang, Wei Zhao, Pengxiang Ding, Siteng Huang, Donglin Wang.<br> [PDF]
Video Prediction
VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting. [25th Mar., 2024].<br> Yujin Tang, Peijie Dong, Zhenheng Tang, Xiaowen Chu, Junwei Liang.<br> [PDF]
Image Super-Resolution
Activating Wider Areas in Image Super-Resolution. [13th Mar., 2024].<br> Cheng Cheng, Hang Wang, Hongbin Sun.<br> [PDF]
Image Restoration
MambaIR: A Simple Baseline for Image Restoration with State-Space Model. [23th Feb., 2024].<br> Hang Guo, Jinmin Li, Tao Dai, Zhihao Ouyang, Xudong Ren, Shu-Tao Xia.<br> [PDF]
Serpent: Scalable and Efficient Image Restoration via Multi-scale Structured State Space Models. [26th Mar., 2024].<br> Mohammad Shahab Sepehri, Zalan Fabian, Mahdi Soltanolkotabi.<br> [PDF]
MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space. [25th May., 2024].<br> Jiangwei Weng, Zhiqiang Yan, Ying Tai, Jianjun Qian, Jian Yang, Jun Li.<br> [PDF], [Code]
LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network. [3rd Jun., 2024].<br> Xuanqi Zhang, Haijin Zeng, Jinwang Pan, Qiangqiang Shen, Yongyong Chen.<br> [PDF]
Image Dehazing
U-shaped Vision Mamba for Single Image Dehazing. [6th Feb., 2024].<br> Zhuoran Zheng, Chen Wu.<br> [PDF]
Image Derain
FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining. [15th Apr., 2024].<br> Zou Zhen, Yu Hu, Zhao Feng.<br> [PDF]
FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining. [29th May., 2024].<br> Dong Li, Yidi Liu, Xueyang Fu, Senyan Xu, Zheng-Jun Zha.<br> [PDF]
Image Deblurring
Learning Enriched Features via Selective State Spaces Model for Efficient Image Deblurring. [29th Mar., 2024].<br> Hu Gao, Depeng Dang.<br> [PDF]
Efficient Visual State Space Model for Image Deblurring. [23rd May., 2024].<br> Lingshun Kong, Jiangxin Dong, Ming-Hsuan Yang, Jinshan Pan.<br> [PDF]
Visual Generation
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models. [14th Mar., 2024].<br> Zunnan Xu, Yukang Lin, Haonan Han, Sicheng Yang, Ronghui Li, Yachao Zhang, Xiu Li.<br> [PDF]
Scalable Diffusion Models with State Space Backbone. [8th Feb., 2024].<br> Zhengcong Fei, Mingyuan Fan, Changqian Yu, Junshi Huang.<br> [PDF]
ZigMa: A DiT-style Zigzag Mamba Diffusion Model. [20th Mar., 2024].<br> Vincent Tao Hu, Stefan Andreas Baumann, Ming Gui, Olga Grebenkova, Pingchuan Ma, Johannes Fischer, Björn Ommer.<br> [PDF]
Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM. [12th Mar., 2024].<br> Zeyu Zhang, Akide Liu, Ian Reid, Richard Hartley, Bohan Zhuang, Hao Tang.<br> [PDF]
I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling. [22nd May., 2024].<br> Omer F. Atli, Bilal Kabas, Fuat Arslan, Mahmut Yurt, Onat Dalmaz, Tolga Çukur.<br> [PDF]
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis. [23rd May., 2024].<br> Yao Teng, Yue Wu, Han Shi, Xuefei Ning, Guohao Dai, Yu Wang, Zhenguo Li, Xihui Liu.<br> [PDF], [Code]
Soft Masked Mamba Diffusion Model for CT to MRI Conversion. [22nd Jun., 2024].<br> Zhenbin Wang, Lei Zhang, Lituan Wang, Zhenwei Zhang.<br> [PDF], [Code]
Point Cloud
Point Mamba: A Novel Point Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy. [11th Mar., 2024].<br> Jiuming Liu, Ruiji Yu, Yian Wang, Yu Zheng, Tianchen Deng, Weicai Ye, Hesheng Wang.<br> [PDF]
3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion. [10th Apr., 2024].<br> Yixuan Li, Weidong Yang, Ben Fei.<br> [PDF]
3DMambaIPF: A State Space Model for Iterative Point Cloud Filtering via Differentiable Rendering. [8th Apr., 2024].<br> Qingyuan Zhou, Weidong Yang, Ben Fei, Jingyi Xu, Rui Zhang, Keyi Liu, Yeqi Luo, Ying He.<br> [PDF]
Point Cloud Mamba: Point Cloud Learning via State Space Model. [1th Mar., 2024].<br> Tao Zhang, Xiangtai Li, Haobo Yuan, Shunping Ji, Shuicheng Yan.<br> [PDF]
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Model. [23rd May., 2024].<br> Jiuming Liu, Jinru Han, Lihao Liu, Angelica I. Aviles-Rivero, Chaokang Jiang, Zhe Liu, Hesheng Wang.<br> [PDF]
Depth Estimation
MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation. [6th Jun., 2024].<br> Ionuţ Grigore, Călin-Adrian Popa.<br> [PDF]
3D Reconstruction
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction. [27th Mar., 2024].<br> Qiuhong Shen, Xuanyu Yi, Zike Wu, Pan Zhou, Hanwang Zhang, Shuicheng Yan, Xinchao Wang.<br> [PDF]
MMR-Mamba: Multi-Contrast MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion. [27th Jun., 2024].<br> Jing Zou, Lanqing Liu, Qi Chen, Shujun Wang, Xiaohan Xing, Jing Qin.<br> [PDF]
Video Generation
SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces. [12th Mar., 2024].<br> Yuta Oshima, Shohei Taniguchi, Masahiro Suzuki, Yutaka Matsuo.<br> [PDF]
Others
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces. [1st Feb., 2024].<br> Chloe Wang, Oleksii Tsepa, Jun Ma, Bo Wang.<br> [PDF], [Code]
HeteGraph-Mamba: Heterogeneous Graph Learning via Selective State Space Model. [22nd May., 2024].<br> Zhenyu Pan, Yoonsung Jeong, Xiaoda Liu, Han Liu.<br> [PDF]
Mamba-R: Vision Mamba ALSO Needs Registers. [23rd May., 2024].<br> Feng Wang, Jiahao Wang, Sucheng Ren, Guoyizhe Wei, Jieru Mei, Wei Shao, Yuyin Zhou, Alan Yuille, Cihang Xie.<br> [PDF]
MGI: Multimodal Contrastive pre-training of Genomic and Medical Imaging. [2nd Jun., 2024].<br> Jiaying Zhou, Mingzhou Jiang, Junde Wu, Jiayuan Zhu, Ziyue Wang, Yueming Jin.<br> [PDF]
Zamba: A Compact 7B SSM Hybrid Model. [26th May., 2024].<br> Paolo Glorioso, Quentin Anthony, Yury Tokpanov, James Whittington, Jonathan Pilault, Adam Ibrahim, Beren Millidge.<br> [PDF]
MambaLRP: Explaining Selective State Space Sequence Models. [11th Jun., 2024].<br> Farnoush Rezaei Jafari, Grégoire Montavon, Klaus-Robert Müller, Oliver Eberle.<br> [PDF]