Home

Awesome

Deep learning networks

Build Status GitHub License Python Version

This repo is used to research convolutional networks primarily for computer vision tasks. For this purpose, the repo contains (re)implementations of various classification, segmentation, detection, and pose estimation models and scripts for training/evaluating/converting.

The following frameworks are used:

For each supported framework, there is a PIP-package containing pure models without auxiliary scripts. List of packages:

Currently, models are mostly implemented on Gluon and then ported to other frameworks. Some models are pretrained on ImageNet-1K, CIFAR-10/100, SVHN, CUB-200-2011, Pascal VOC2012, ADE20K, Cityscapes, and COCO datasets. All pretrained weights are loaded automatically during use. See examples of such automatic loading of weights in the corresponding sections of the documentation dedicated to a particular package:

Installation

To use training/evaluating scripts as well as all models, you need to clone the repository and install dependencies:

git clone git@github.com:osmr/imgclsmob.git
pip install -r requirements.txt

Table of implemented classification models

Some remarks:

ModelGluonPyTorchChainerKerasTFTF2PaperRepoYear
AlexNetAAAAAAlinklink2012
ZFNetAAAAAAlink-2013
VGGAAAAAAlink-2014
BN-VGGAAAAAAlink-2015
BN-InceptionAAA--Alink-2015
ResNetABCDEABCDEABCDEAAABCDElinklink2015
PreResNetABCDABCDABCDAAABCDlinklink2016
ResNeXtABCDABCDABCDAAABCDlinklink2016
SENetAAAAAAlinklink2017
SE-ResNetABCDEABCDEABCDEAAABCDElinklink2017
SE-PreResNetABCDABCDABCDAAABCDlinklink2017
SE-ResNeXtAAAAAAlinklink2017
ResNeSt(A)AAA--Alinklink2020
IBN-ResNetAA---Alinklink2018
IBN-ResNeXtAA---Alinklink2018
IBN-DenseNetAA---Alinklink2018
AirNetAAA--Alinklink2018
AirNeXtAAA--Alinklink2018
BAM-ResNetAAA--Alinklink2018
CBAM-ResNetAAA--Alinklink2018
ResAttNetaaa---linklink2017
SKNetaaa---linklink2019
SCNetAAA--Alinklink2020
RegNetAAA--Alinklink2020
DIA-ResNetaBCDaBCDaBCD---linklink2019
DIA-PreResNetaBCDaBCDaBCD---linklink2019
PyramidNetABCDABCDABCD--ABCDlinklink2016
DiracNetV2AAA--Alinklink2017
ShaResNetaaa---linklink2017
CRU-NetA-----linklink2018
DenseNetABCDABCDABCDAAABCDlinklink2016
CondenseNetAAA---linklink2017
SparseNetaaa---linklink2018
PeleeNetAAA--Alinklink2018
Oct-ResNetabcdaa---link-2019
Res2Neta-----link-2019
WRNABCDABCDABCD--alinklink2016
WRN-1bitBCDBCDBCD---linklink2018
DRN-CAAA--Alinklink2017
DRN-DAAA--Alinklink2017
DPNAAA--Alinklink2017
DarkNet RefAAAAAAlinklink-
DarkNet TinyAAAAAAlinklink-
DarkNet-19aaaaaalinklink-
DarkNet-53AAAAAAlinklink2018
ChannelNetaaa-a-linklink2018
iSQRT-COV-ResNetaa----linklink2017
RevNet-a----linklink2017
i-RevNetAAA---linklink2018
BagNetAAA--Alinklink2019
DLAAAA--Alinklink2017
MSDNetaab----linklink2017
FishNetAAA---linklink2018
ESPNetv2AAA---linklink2018
DiCENetAAA--Alinklink2019
HRNetAAA--Alinklink2019
VoVNetAAA--Alinklink2019
SelecSLSAAA--Alinklink2019
HarDNetAAA--Alinklink2019
X-DenseNetaBCDaBCDaBCD---linklink2017
SqueezeNetAAAAAAlinklink2016
SqueezeResNetAAAAAAlink-2016
SqueezeNextAAAAAAlinklink2018
ShuffleNetAAAAAAlink-2017
ShuffleNetV2AAAAAAlink-2018
MENetAAAAAAlinklink2018
MobileNetAEAEAEAAAElinklink2017
FD-MobileNetAAAAAAlinklink2018
MobileNetV2AAAAAAlinklink2018
MobileNetV3AAAA-Alinklink2019
IGCV3AAAAAAlinklink2018
GhostNetaaa--alinklink2019
MnasNetAAAAAAlink-2018
DARTSAAA---linklink2018
ProxylessNASAEAEAE--AElinklink2018
FBNet-CAAA--Alink-2018
XceptionAAA--Alinklink2016
InceptionV3AAA--Alinklink2015
InceptionV4AAA--Alinklink2016
InceptionResNetV1AAA--Alinklink2016
InceptionResNetV2AAA--Alinklink2016
PolyNetAAA--Alinklink2016
NASNet-LargeAAA--Alinklink2017
NASNet-MobileAAA--Alinklink2017
PNASNet-LargeAAA--Alinklink2017
SPNASNetAAA--Alinklink2019
EfficientNetAAAA-Alinklink2019
MixNetAAA--Alinklink2019
NINBCDBCDBCD---linklink2013
RoR-3BCDBCDBCD---link-2016
RiRBCDBCDBCD---link-2016
ResDrop-ResNetbcdbcdbcd---linklink2016
Shake-Shake-ResNetBCDBCDBCD---linklink2017
ShakeDrop-ResNetbcdbcdbcd---link-2018
FractalNetbcbc----linklink2016
NTS-NetEEE---linklink2018

Table of implemented segmentation models

Some remarks:

ModelGluonPyTorchChainerKerasTFTF2PaperRepoYear
PSPNetABCDABCDABCD--ABCDlink-2016
DeepLabv3ABcDABcDABcD--ABcDlink-2017
FCN-8s(d)ABcDABcDABcD--ABcDlink-2014
ICNetCCC--Clinklink2017
SINetCCC--clinklink2019
BiSeNeteee--elink-2018
DANetCCC--Clinklink2018
Fast-SCNNCCC--Clink-2019
CGNetccc--clinklink2018
DABNetccc--clinklink2019
FPENetccc--clink-2019
ContextNet-c----link-2018
LEDNetccc--clink-2019
ESNet-c----link-2019
EDANet-c----linklink2018
ENet-c----link-2016
ERFNet-c----link-2017
LinkNet-c----link-2017
SegNet-c----link-2015
U-Net-c----link-2015
SQNet-c----link-2016

Table of implemented object detection models

Some remarks:

ModelGluonPyTorchChainerKerasTFTF2PaperRepoYear
CenterNetaaa--alinklink2019

Table of implemented human pose estimation models

Some remarks:

ModelGluonPyTorchChainerKerasTFTF2PaperRepoYear
AlphaPoseAAA--Alinklink2016
SimplePoseAAA--Alinklink2018
SimplePose(Mobile)AAA--Alink-2018
Lightweight OpenPoseAAA--Alinklink2018
IBPPoseAAA--Alinklink2019

Table of implemented automatic speech recognition models

Some remarks:

ModelGluonPyTorchChainerKerasTFTF2PaperRepoYear
Jasper DRABABab--ablinklink2019
QuartzNetABABab--ablinklink2019