Home

Awesome

Cross-Scale MAE (NeurIPS 2023)

<p align="center"> <img src="images/model.png" width="480"> </p>

Authors: Maofeng Tang · Andrei Cozma · Konstantinos Georgiou · Hairong Qi

This is a PyTorch implementation of our NeurIPS 2023 paper: Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing

Pretraining

The run the pretraining on a single node, you can use use the the train.sh. Make sure you modify its contents to match your environment. Alternatively, you can use the main_pretrain.py directly.

For multi-gpu training, use the train_distributed.sh instead.

Finetuning & Linear Probing

To run finetuning on a single node, you can use the finetune.sh. Make sure you modify its contents to match your environment. Alternatively, you can use the main_finetune.py directly.

For linear probing, use the linprobe.sh and main_linprobe.py instead.

Model Weights

Pretrained weights for the models used in the paper can be found here:

<table><tbody> <th valign="bottom"></th> <th valign="bottom">epochs</th> <th valign="bottom">pre-trained checkpoint</th> <th valign="bottom">md5</th> <!-- TABLE BODY --> <tr><td align="left">ViT-Base</td> <td align="center"><tt>400</tt></td> <td align="center"><a href="https://drive.google.com/file/d/17FePR5lAFNL45g7738JMofP2pyC_6hau/view?usp=sharing">download</a></td> <td align="center"><tt>0c33995da85c112d9602f89d5b186bbc</tt></td> </tr> <tr><td align="left">ViT-Large</td> <td align="center"><tt>400</tt></td> <td align="center"><a href="https://drive.google.com/file/d/1FBJiQ8Z6J3P8Le7wFQXOicTDX9jTNGnT/view?usp=drive_link">download</a></td> <td align="center"><tt>e6e4f58c07bbbc4c4dd63fa26c644dd4</tt></td> </tr> </tbody></table>

You would need to download the weights and place them in a folder named weights in the root of the repository.

Acknowledgements

Code from this repository is inspired from the following repositories:

Citation

If you found our project helpful, please cite our paper:

@inproceedings{tang2023cross,
  title={Cross-Scale MAE: A Tale of Multiscale Exploitation in Remote Sensing},
  author={Tang, Maofeng and Cozma, Andrei Liviu and Georgiou, Konstantinos and Qi, Hairong},
  booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
  year={2023}
}

License

This project is under the CC-BY-NC 4.0 license. See LICENSE for details.