Awesome

Network architecture

MobilenetV1 [2017]
MobilenetV2 [2019]
MobilenetV3 [2019]
SqueezeNet [2016]
- https://github.com/forresti/SqueezeNet
SqueezeNext [2018]
- https://github.com/amirgholami/SqueezeNext
Tiny Darknet [?]
CondenseNet
NASNet
- https://github.com/tensorflow/models/tree/master/research/slim/nets/nasnet
ShuffleNet
FD-MobileNet[2018]
ProxylessNAS[2019]
- https://github.com/MIT-HAN-LAB/ProxylessNAS
MnasNet[2018]
ESPNetv2[2018]
- https://github.com/sacmehta/ESPNetv2

Network-acceleration

https://github.com/yihui-he/channel-pruning

Network Compression

Network quantization

Binary networks

Convolution decomposition

Pruning

Pruning Convolutional Neural Networks for Resource Efficient Inference [2017]
- https://jacobgil.github.io/deeplearning/pruning-deep-learning
Pruning in tensorflow
- https://github.com/ex4sperans/pruning_with_tensorflow

Frameworks

Training smaller model based on existing one

Data-Free Knowledge Distillation for Deep Neural Networks (https://arxiv.org/pdf/1710.07535.pdf) [2017]
- https://github.com/iRapha/replayed_distillation
Stealing Machine Learning Models via Prediction APIs (https://arxiv.org/pdf/1609.02943.pdf) [2016]

To look at:

https://github.com/csyhhu/Awesome-Deep-Neural-Network-Compression
https://github.com/wpf535236337/real-time-network
https://github.com/dkozlov/awesome-knowledge-distillation
https://github.com/memoiry/Awesome-model-compression-and-acceleration
https://github.com/ljk628/ML-Systems/blob/master/dl_cnn.md
https://github.com/songhan/SqueezeNet-Deep-Compression
https://github.com/jiaxiang-wu/quantized-cnn
https://github.com/andyhahaha/Convolutional-Neural-Network-Compression-Survey
https://github.com/Zhouaojun/Efficient-Deep-Learning
https://github.com/ZFTurbo/Keras-inference-time-optimizer
https://github.com/becauseofAI/MobileFace
https://github.com/MingSun-Tse/EfficientDNNs
https://github.com/csyhhu/Awesome-Deep-Neural-Network-Compression

Awesome

Network architecture

Network-acceleration

Network Compression

Network quantization

Binary networks

Convolution decomposition

Pruning

Frameworks

Training smaller model based on existing one

Face recognition

Face detection

Blog posts