Kornia | Library is composed by a subset of packages containing operators that can be inserted within neural networks to train models to perform image transformations, epipolar geometry, depth estimation, and low-level image processing such as filtering and edge detection that operate directly on tensors | <ul><li>Edgar Riba</li> <li>Dmytro Mishkin</li> <li>Daniel Ponsa</li> <li>Ethan Rublee</li> <li>Gary Bradski</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 04.12.2024 |
AutoGen | Framework that enables development of LLM applications using multiple agents that can converse with each other to solve tasks | microsoft | <ul><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 04.12.2024 |
dm_control | DeepMind Infrastructure for Physics-Based Simulation | <ul><li>Saran Tunyasuvunakool</li> <li>Alistair Muldal</li> <li>Yotam Doron</li> <li>Siqi Liu</li><details><summary>others</summary><li>Steven Bohez</li> <li>Josh Merel</li> <li>Tom Erez</li> <li>Timothy Lillicrap</li> <li>Nicolas Heess</li> <li>Yuval Tassa</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 03.12.2024 |
MuJoCo | A general purpose physics engine that aims to facilitate research and development in robotics, biomechanics, graphics and animation, machine learning, and other areas which demand fast and accurate simulation of articulated structures interacting with their environment | <ul><li>Emo Todorov</li> <li>Tom Erez</li> <li>Yuval Tassa</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/deepmind.svg" alt="deepmind" height=20/>, <img src="images/deepmind.svg" alt="deepmind" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 03.12.2024 |
YOLOv8 | State-of-the-art model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and flexibility | Glenn Jocher | <ul><li>COCO</li><li>ImageNet</li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 03.12.2024 |
SAE Lens | Training Sparse Autoencoders on Language Models | <ul><li>Joseph Bloom</li> <li>Curt Tigges</li> <li>David Chanin</li></ul> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li></ul> | | 03.12.2024 |
moondream | Tiny vision language model that kicks ass and runs anywhere | Vik Korrapati | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>website</li></ul> | | 30.11.2024 |
LangGraph | Library for building stateful, multi-actor applications with LLMs, used to create agent and multi-agent workflows | LangChain | <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 28.11.2024 |
LangChain | Framework for developing applications powered by large language models | LangChain | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 26.11.2024 |
ARENA | Provide talented individuals with the skills, tools, and environment necessary for upskilling in ML engineering, for the purpose of contributing directly to AI alignment in technical roles | Callum McDougall | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li>website</li></ul> | | 26.11.2024 |
Feast | An open source feature store for machine learning | <ul><li>Willem Pienaar</li> <li>Danny Chiao</li> <li>Achal Shah</li> <li>Terence Lim</li><details><summary>others</summary><li>Ches Martin</li> <li>Judah Rand</li> <li>Matt Delacour</li> <li>Miguel Trejo Marrufo</li> <li>Francisco Javier Arceo</li></ul></details> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 22.11.2024 |
VC | Client software for performing real-time voice conversion using various Voice Conversion AI | w-okada | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 20.11.2024 |
CatBoost | High-performance open source library for gradient boosting on decision trees | <ul><li>Anna Veronika Dorogush</li> <li>Vasily Ershov</li> <li>Andrey Gulin</li> <li>Liudmila Prokhorenkova</li><details><summary>others</summary><li>Gleb Gusev</li> <li>Aleksandr Vorobev</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 18.11.2024 |
Gemma 2 | New addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters | unsloth | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.11.2024 |
Llama 3.1 | First openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation | unsloth | <ul><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/meta.svg" alt="meta" height=20/>, <img src="images/meta.svg" alt="meta" height=20/>, <img src="images/meta.svg" alt="meta" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.11.2024 |
Mistral Small | Enterprise-grade small model | unsloth | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.11.2024 |
ORPO | Get up and running with large language models | <ul><li>Jiwoo Hong</li> <li>Noah Lee</li> <li>James Thorne</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.11.2024 |
Phi-3.5 | 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5, despite being small enough to be deployed on a phone | unsloth | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/>, <img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.11.2024 |
Simple audio recognition | This tutorial will show you how to build a basic speech recognition network that recognizes ten different words | Google | <ul><li>coursera</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li>tf.js</li></ul> | | 15.11.2024 |
xFormers | Toolbox to Accelerate Research on Transformers | <ul><li>Benjamin Lefaudeux</li> <li>Francisco Massa</li> <li>Diana Liskovich</li> <li>Wenhan Xiong</li><details><summary>others</summary><li>Vittorio Caggiano</li> <li>Sean Naren</li> <li>Min Xu</li> <li>Jieru Hu</li> <li>Marta Tintore</li> <li>Susan Zhang</li> <li>Patrick Labatut</li> <li>Daniel Haziza</li></ul></details> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 13.11.2024 |
Building Your Own Federated Learning Algorithm | We discuss how to implement federated learning algorithms without deferring to the tff.learning API | Zachary Charles | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 01.11.2024 |
Federated Learning for Image Classification | We use the classic MNIST training example to introduce the Federated Learning API layer of TFF, tff.learning - a set of higher-level interfaces that can be used to perform common types of federated learning tasks, such as federated training, against user-supplied models implemented in TensorFlow | Krzysztof Ostrowski | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li></ul> | | 01.11.2024 |
Federated Learning for Text Generation | We start with a RNN that generates ASCII characters, and refine it via federated learning | Krzysztof Ostrowski | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 01.11.2024 |
Custom Federated Algorithms, Part 1: Introduction to the Federated Core | This tutorial is the first part of a two-part series that demonstrates how to implement custom types of federated algorithms in TensorFlow Federated using the Federated Core - a set of lower-level interfaces that serve as a foundation upon which we have implemented the Federated Learning layer | Krzysztof Ostrowski | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 01.11.2024 |
Custom Federated Algorithms, Part 2: Implementing Federated Averaging | This tutorial is the second part of a two-part series that demonstrates how to implement custom types of federated algorithms in TFF using the Federated Core, which serves as a foundation for the Federated Learning layer | Krzysztof Ostrowski | <ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 01.11.2024 |
High-performance simulations with TFF | This tutorial will describe how to setup high-performance simulations with TFF in a variety of common scenarios | Krzysztof Ostrowski | <ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul> | | 01.11.2024 |
Autodistill | Uses big, slower foundation models to train small, faster supervised models | autodistill | <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 01.11.2024 |
LightAutoML | Allows you create machine learning models using just a few lines of code, or build your own custom pipeline using ready blocks | <ul><li>Alexander Ryzhkov</li> <li>Anton Vakhrushev</li> <li>Dmitry Simakov</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 31.10.2024 |
Crawl4AI | LLM Friendly Web Crawler & Scrapper | UncleCode | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 30.10.2024 |
NotebookLlama | Open Source version of NotebookLM | Meta | <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li>meidum</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul> | | 29.10.2024 |
XGBoost | Optimized distributed gradient boosting library designed to be highly efficient, flexible and portable | <ul><li>Tianqi Chen</li> <li>Carlos Guestrin</li></ul> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 22.10.2024 |
YOLOv5 | You Only Look Once | Glenn Jocher | <ul><li>data</li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li></ul> | | 19.10.2024 |
YOLOv3 | You Only Look Once | Glenn Jocher | <ul><li>data</li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li></ul> | | 19.10.2024 |
Swarm | Educational framework exploring ergonomic, lightweight multi-agent orchestration | <ul><li>Ilan Bigio</li> <li>James Hills</li> <li>Shyamal Anadkat</li> <li>Charu Jaiswal</li><details><summary>others</summary><li>Colin Jarvis</li> <li>Katia Guzman</li></ul></details> | <ul><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 15.10.2024 |
LM Evaluation Harness | Framework for few-shot evaluation of language models. | EleutherAI | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul> | | 04.10.2024 |
Multimodal Maestro | Gives you more control over large multimodal models to get the outputs you want | Roboflow | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li>website</li></ul> | | 26.09.2024 |
TRL | Set of tools to train transformer language models with Reinforcement Learning, from the Supervised Fine-tuning step, Reward Modeling step to the Proximal Policy Optimization step | <ul><li>Leandro von Werra</li> <li>Younes Belkada</li> <li>Lewis Tunstall</li> <li>Edward Beeching</li><details><summary>others</summary><li>Tristan Thrush</li> <li>Nathan Lambert</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 24.09.2024 |
The Autodiff Cookbook | You'll go through a whole bunch of neat autodiff ideas that you can cherry pick for your own work, starting with the basics | <ul><li>Alex Wiltschko</li> <li>Matthew Johnson</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>book, book</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>tutorial</li><li><img src="images/wiki.svg" alt="wiki" height=20/>, [<img src="images/wiki.svg" alt="wiki" height=20/>](https://en.wikipedia.org/wiki/Pullback_(differential_geometry), <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 20.09.2024 |
Supervision | Reusable computer vision tools | Roboflow | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/>, <img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 19.09.2024 |
PEFT | Parameter-Efficient Fine-Tuning methods enable efficient adaptation of pre-trained language models to various downstream applications without fine-tuning all the model's parameters | <ul><li>Sourab Mangrulkar</li> <li>Sylvain Gugger</li> <li>Lysandre Debut</li> <li>Younes Belkada</li> <li>Sayak Paul</li></ul> | <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 13.09.2024 |
SAA+ | Framework, Segment Any Anomaly +, for zero-shot anomaly segmentation with hybrid prompt regularization to improve the adaptability of modern foundation models | <ul><li>Yunkang Cao</li> <li>Xiaohao Xu</li> <li>Chen Sun</li> <li>Yuqi Cheng</li><details><summary>others</summary><li>Zongwei Du</li> <li>Liang Gao</li> <li>Weiming Shen</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 13.09.2024 |
TensorRT | SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications | nvidia | <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>forum</li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 12.09.2024 |
DataChain | AI-dataframe to enrich, transform and analyze data from cloud storages for ML training and LLM apps | Iterative | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 09.09.2024 |
TFF for Federated Learning Research: Model and Update Compression | We use the EMNIST dataset to demonstrate how to enable lossy compression algorithms to reduce communication cost in the Federated Averaging algorithm | Weikang Song | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li>tensor encoding</li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 05.09.2024 |
LlamaIndex | Data framework for your LLM application | Jerry Liu | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/meta.svg" alt="meta" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 05.09.2024 |
Deforum Stable Diffusion | Open source project is designed to be free to use and easy to modify for custom needs and pipelines | <ul><li>EnzymeZoo</li> <li>Артем Храпов</li> <li>Forest Star Walz</li> <li>pharmapsychotic</li></ul> | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 30.08.2024 |
ComfyUI | Powerful and modular stable diffusion GUI and backend | comfyanonymous | <ul><li>examples</li><li><img src="images/git.svg" alt="git" height=20/></li><li>pytorch</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 30.08.2024 |
Machine Learning Simplified | A Gentle Introduction to Supervised Learning | Andrew Wolf | <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/>, <img src="images/reddit.svg" alt="reddit" height=20/></li><li>website</li></ul> | | 29.08.2024 |
Anomalib | Deep learning library that aims to collect state-of-the-art anomaly detection algorithms for benchmarking on both public and private datasets | <ul><li>Samet Akcay</li> <li>Dick Ameln</li> <li>Ashwin Vaidya</li> <li>Barath Lakshmanan</li><details><summary>others</summary><li>Nilesh Ahuja</li> <li>Utku Genc</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul> | | 29.08.2024 |
Anthropic courses | Anthropic's educational courses | Anthropic | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul> | | 22.08.2024 |
Nerfstudio | API that allows for a simplified end-to-end process of creating, training, and testing NeRFs | <ul><li>Matthew Tancik</li> <li>Ethan Weber</li> <li>Evonne Ng</li> <li>Ruilong Li</li><details><summary>others</summary><li>Brent Yi</li> <li>Justin Kerr</li> <li>Terrance Wang</li> <li>Alexander Kristoffersen</li> <li>Jake Austin</li> <li>Kamyar Salahi</li> <li>Abhik Ahuja</li> <li>David McAllister</li> <li>Angjoo Kanazawa</li></ul></details> | <ul><li>Viewer</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 19.08.2024 |
mlcourse.ai | Open Machine Learning Course | Yury Kashnitsky | <ul><li>blog post</li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 19.08.2024 |
PyTerrier | A Python framework for performing information retrieval experiments | <ul><li>Craig Macdonald</li> <li>Nicola Tonellotto</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul> | | 16.08.2024 |
highway-env | A collection of environments for autonomous driving and tactical decision-making tasks | Edouard Leurent | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul> | | 09.08.2024 |
GNN | Production-tested library for building GNNs at large scale | <ul><li>Oleksandr Ferludin</li> <li>Arno Eigenwillig</li> <li>Martin Blais</li> <li>Dustin Zelle</li><details><summary>others</summary><li>Jan Pfeifer</li> <li>Alvaro Sanchez-Gonzalez</li> <li>Wai Lok Sibon Li</li> <li>Sami Abu-El-Haija</li> <li>Peter Battaglia</li> <li>Neslihan Bulut</li> <li>Jonathan Halcrow</li> <li>Filipe Miguel Gonçalves de Almeida</li> <li>Pedro Gonnet</li> <li>Liangze Jiang</li> <li>Parth Kothari</li> <li>Silvio Lattanzi</li> <li>André Linhares</li> <li>Brandon Mayer</li> <li>Vahab Mirrokni</li> <li>John Palowitch</li> <li>Mihir Paradkar</li> <li>Jennifer She</li> <li>Anton Tsitsulin</li> <li>Kevin Villela</li> <li>Lisa Wang</li> <li>Bryan Perozzi</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 09.08.2024 |
Pix2Pix | This notebook demonstrates image to image translation using conditional GAN's | <ul><li>Phillip Isola</li> <li>Jun-Yan Zhu</li> <li>Tinghui Zhou</li> <li>Alexei Efros</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 24.07.2024 |
Image classification | This tutorial shows how to classify images of flowers | Billy Lamberta | <ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 24.07.2024 |
TransformerLens | Library for doing mechanistic interpretability of GPT-2 Style language models | <ul><li>Neel Nanda</li> <li>Joseph Bloom</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 23.07.2024 |
Kor | Half-baked prototype that "helps" you extract structured data from text using LLMs | Eugene Yurtsev | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul> | | 20.07.2024 |
Mistral Inference | Minimal code to run Mistral models | mistral | <ul><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 16.07.2024 |
PyTorch3D | Library for deep learning with 3D data | <ul><li>Nikhila Ravi</li> <li>Jeremy Reizenstein</li> <li>David Novotny</li> <li>Taylor Gordon</li><details><summary>others</summary><li>Wan-Yen Lo</li> <li>Justin Johnson</li> <li>Georgia Gkioxari</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post, blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 11.07.2024 |
Stable Diffusion Videos | Create videos with Stable Diffusion by exploring the latent space and morphing between text prompts | Nathan Raw | <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul> | | 11.07.2024 |
Transfer learning and fine-tuning | You will learn how to classify images of cats and dogs by using transfer learning from a pre-trained network | François Chollet | <ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 26.06.2024 |
MARS5 | Speech model for insane prosody | CAMB.AI | <ul><li>demo</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 25.06.2024 |
Deep RL Course | The Hugging Face Deep Reinforcement Learning Course | <ul><li>Thomas Simonini</li> <li>Omar Sanseviero</li> <li>Sayak Paul</li></ul> | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li>syllabus</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 24.06.2024 |
ToonCrafter | Can interpolate two cartoon images by leveraging the pre-trained image-to-video diffusion priors | <ul><li>Jinbo Xing</li> <li>Hanyuan Liu</li> <li>Menghan Xia</li> <li>Yong Zhang</li><details><summary>others</summary><li>Xintao Wang</li> <li>Ying Shan</li> <li>Tien-Tsin Wong</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 20.06.2024 |
Brax | A differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators | <ul><li>Daniel Freeman</li> <li>Erik Frey</li> <li>Anton Raichuk</li> <li>Sertan Girgin</li><details><summary>others</summary><li>Igor Mordatch</li> <li>Olivier Bachem</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li></ul> | | 07.06.2024 |
DiffSynth | Restructured architectures including Text Encoder, UNet, VAE, among others, maintaining compatibility with models from the open-source community while enhancing computational performance | Artiprocher | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 06.06.2024 |
Transformer | This tutorial trains a Transformer model to translate Portuguese to English | <ul><li>Ashish Vaswani</li> <li>Noam Shazeer</li> <li>Niki Parmar</li> <li>Jakob Uszkoreit</li><details><summary>others</summary><li>Llion Jones</li> <li>Aidan Gomez</li> <li>Łukasz Kaiser</li> <li>Illia Polosukhin</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>link</li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 31.05.2024 |
NeMo | A conversational AI toolkit built for researchers working on automatic speech recognition, natural language processing, and text-to-speech synthesis | <ul><li>Oleksii Kuchaiev</li> <li>Jason Li</li> <li>Chip Huyen</li> <li>Oleksii Hrinchuk</li><details><summary>others</summary><li>Ryan Leary</li> <li>Boris Ginsburg</li> <li>Samuel Kriman</li> <li>Stanislav Beliaev</li> <li>Vitaly Lavrukhin</li> <li>Jack Cook</li></ul></details> | <ul><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 25.05.2024 |
SentencePiece | An unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training | <ul><li>Taku Kudo</li> <li>John Richardson</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 21.05.2024 |
Llama3 from scratch | Llama3 from scratch, one tensor and matrix multiplication at a time | Nishant Aklecha | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/>, <img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 19.05.2024 |
Hello, many worlds | This tutorial shows how a classical neural network can learn to correct qubit calibration errors | Michael Broughton | <ul><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.05.2024 |
IC-Light | Manipulate the illumination of images | <ul><li>Lvmin Zhang</li> <li>Anyi Rao</li> <li>Maneesh Agrawala</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 09.05.2024 |
Neural style transfer | This tutorial uses deep learning to compose one image in the style of another image | <ul><li>Leon Gatys</li> <li>Alexander Ecker</li> <li>Matthias Bethge</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul> | | 06.05.2024 |
TorchGeo | PyTorch domain library that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data | <ul><li>Adam Stewart</li> <li>Caleb Robinson</li> <li>Isaac Corley</li> <li>Anthony Ortiz</li><details><summary>others</summary><li>Juan Lavista Ferres</li> <li>Arindam Banerjee</li></ul></details> | <ul><li>NDBI</li><li>NDVI</li><li>NDWI</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/></li></ul> | | 03.05.2024 |
Autoencoders | This tutorial introduces autoencoders with three examples: the basics, image denoising, and anomaly detection | Billy Lamberta | <ul><li>blog post</li><li>book</li><li>data</li><li>examples</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 15.04.2024 |
MagicTime | Metamorphic time-lapse video generation model, which learns real-world physics knowledge from time-lapse videos and implements metamorphic generation | <ul><li>Shenghai Yuan</li> <li>Jinfa Huang</li> <li>Yujun Shi</li> <li>Yongqi Xu</li><details><summary>others</summary><li>Ruijie Zhu</li> <li>Bin Lin</li> <li>Xinhua Cheng</li> <li>Li Yuan</li> <li>Jiebo Luo</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/>, <img src="images/twitter.svg" alt="twitter" height=20/></li></ul> | | 14.04.2024 |
SAGE | Methodology for generative spelling correction, which was tested on English and Russian languages and potentially can be extended to any language with minor changes | <ul><li>Nikita Martynov</li> <li>Mark Baushenko</li> <li>Anastasia Kozlova</li> <li>Katerina Kolomeytseva</li><details><summary>others</summary><li>Aleksandr Abramov</li> <li>Alena Fenogenova</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 11.04.2024 |
Image segmentation | This tutorial focuses on the task of image segmentation, using a modified U-Net | <ul><li>Olaf Ronneberger</li> <li>Philipp Fischer</li> <li>Thomas Brox</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 09.04.2024 |
Open-Sora Plan | Simple and efficient design along with remarkable performance in text-to-video generation | YUAN Lab at PKU | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 07.04.2024 |
Gorilla | Finetuned LLaMA-based model that surpasses the performance of GPT-4 on writing API calls | <ul><li>Shishir Patil</li> <li>Tianjun Zhang</li> <li>Xin Wang</li> <li>Joseph Gonzalez</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 06.04.2024 |
Cleanlab | Helps you clean data and labels by automatically detecting issues in a ML dataset | <ul><li>Curtis Northcutt</li> <li>Lu Jiang</li> <li>Isaac Chuang</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 30.03.2024 |
AniPortrait | Framework for generating high-quality animation driven by audio and a reference portrait image | <ul><li>Zejun Yang</li> <li>Zhisheng Wang</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 27.03.2024 |
OpenVINO | Open-source toolkit for optimizing and deploying AI inference | intel | <ul><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>forum</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 25.03.2024 |
Gazelle | Joint Speech Language Model | Tincans | <ul><li>blog post</li><li>demo</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>[<img src="images/wiki.svg" alt="wiki" height=20/>](https://en.wikipedia.org/wiki/Spike_/(software_development)</li></ul> | | 20.03.2024 |
Intel® Extension for Transformers | Transformer-based Toolkit to Accelerate GenAI/LLM Everywhere | intel | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 19.03.2024 |
Datasets | A Community Library for Natural Language Processing | <ul><li>Quentin Lhoest</li> <li>Albert Villanova</li> <li>Yacine Jernite</li> <li>Abhishek Thakur</li><details><summary>others</summary><li>Patrick von Platen</li> <li>Suraj Patil</li> <li>Julien Chaumond</li> <li>Mariama Dramé</li> <li>Julien Plu</li> <li>Lewis Tunstall</li> <li>Joe Davison</li> <li>Mario Šaško</li> <li>Gunjan Chhablani</li> <li>Bhavitvya Malik</li> <li>Simon Brandeis</li> <li>Teven Le Scao</li> <li>Victor Sanh</li> <li>Canwen Xu</li> <li>Nicolas Patry</li> <li>Angelina McMillan-Major</li> <li>Philipp Schmid</li> <li>Sylvain Gugger</li> <li>Clément Delangue</li> <li>Théo Matussière</li> <li>Lysandre Debut</li> <li>Stas Bekman</li> <li>Pierric Cistac</li> <li>Thibault Goehringer</li> <li>Victor Mustar</li> <li>François Lagunas</li> <li>Alexander Rush</li> <li>Thomas Wolf</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 18.03.2024 |
Evidently | An open-source framework to evaluate, test and monitor ML models in production | <ul><li>Elena Samuylova</li> <li>Emeli Dral</li> <li>Olga Filippova</li></ul> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 15.03.2024 |
Instructor | Library that makes it a breeze to work with structured outputs from large language models | Jason Liu | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 13.03.2024 |
FiftyOne | Open-source tool for building high-quality datasets and computer vision models | <ul><li>Brian Moore</li> <li>Jason Corso</li></ul> | <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 27.02.2024 |
MetaVoice | 1.2B parameter base model trained on 100K hours of speech for TTS | MetaVoice | <ul><li>demo</li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 26.02.2024 |
Generative AI for Beginners - A Course | A 12 Lesson course teaching everything you need to know to start building Generative AI applications | microsoft | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul> | | 22.02.2024 |
OmegaConf | Hierarchical configuration system, with support for merging configurations from multiple sources providing a consistent API regardless of how the configuration was created | Omry Yadan | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>slides</li></ul> | | 15.02.2024 |
Optuna | An automatic hyperparameter optimization software framework, particularly designed for machine learning | <ul><li>Takuya Akiba</li> <li>Shotaro Sano</li> <li>Toshihiko Yanase</li> <li>Takeru Ohta</li> <li>Masanori Koyama</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 15.02.2024 |
Data augmentation | This tutorial demonstrates data augmentation: a technique to increase the diversity of your training set by applying random transformations such as image rotation | Billy Lamberta | <ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 14.02.2024 |
Stable Cascade | Text to image model introduces an interesting three-stage approach, setting new benchmarks for quality, flexibility, fine-tuning, and efficiency with a focus on further eliminating hardware barriers | Stability AI | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 14.02.2024 |
CleanVision | Automatically detects potential issues in image datasets like images that are: blurry, under/over-exposed, (near) duplicates, etc | cleanlab | <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li></ul> | | 13.02.2024 |
DynamiCrafter | Animating Open-domain Images with Video Diffusion Priors | <ul><li>Jinbo Xing</li> <li>Menghan Xia</li> <li>Yong Zhang</li> <li>Haoxin Chen</li><details><summary>others</summary><li>Wangbo Yu</li> <li>Hanyuan Liu</li> <li>Xintao Wang</li> <li>Tien-Tsin Wong</li> <li>Ying Shan</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 12.02.2024 |
Ollama | Get up and running with large language models | Michael Yang | <ul><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 10.02.2024 |
XLA | Accelerated Linear Algebra is an open-source machine learning compiler for GPUs, CPUs, and ML accelerators | OpenXLA | <ul><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 02.02.2024 |
Composer | PyTorch library that enables you to train neural networks faster, at lower cost, and to higher accuracy | The Mosaic ML Team | <ul><li>app</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 01.02.2024 |
CycleGAN | This notebook demonstrates unpaired image to image translation using conditional GAN's | <ul><li>Jun-Yan Zhu</li> <li>Taesung Park</li> <li>Phillip Isola</li> <li>Alexei Efros</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 17.01.2024 |
Integrated gradients | This tutorial demonstrates how to implement Integrated Gradients, an Explainable AI technique | <ul><li>Mukund Sundararajan</li> <li>Ankur Taly</li> <li>Qiqi Yan</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li>visualizing</li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 17.01.2024 |
MAGNeT | Masked generative sequence modeling method that operates directly over several streams of audio tokens | <ul><li>Alon Ziv</li> <li>Itai Gat</li> <li>Gaël Le Lan</li> <li>Tal Remez</li><details><summary>others</summary><li>Felix Kreuk</li> <li>Alexandre Défossez</li> <li>Jade Copet</li> <li>Gabriel Synnaeve</li> <li>Yossi Adi</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul> | | 16.01.2024 |
AutoFaiss | Automatically create Faiss knn indices with the most optimal similarity search parameters | Ctiteo | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul> | | 12.01.2024 |
Retrieval based Voice Conversion WebUI | An easy-to-use Voice Conversion framework based on VITS | RVC-Project | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 11.01.2024 |
Flax | Neural network library and ecosystem for JAX designed for flexibility | <ul><li>Jonathan Heek</li> <li>Anselm Levskaya</li> <li>Avital Oliver</li> <li>Marvin Ritter</li><details><summary>others</summary><li>Bertrand Rondepierre</li> <li>Andreas Steiner</li> <li>Marc van Zee</li></ul></details> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 10.01.2024 |
Big Vision | This codebase is designed for training large-scale vision models using Cloud TPU VMs or GPU machines | <ul><li>Lucas Beyer</li> <li>Xiaohua Zhai</li> <li>Alexander Kolesnikov</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 03.01.2024 |
Open Interpreter | An open-source, locally running implementation of OpenAI's Code Interpreter | Killian Lucas | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 03.01.2024 |
Seamless Communication | Family of AI models that enable more natural and authentic communication across languages | <ul><li>Loïc Barrault</li> <li>Yu-An Chung</li> <li>Mariano Coria</li> <li>David Dale</li><details><summary>others</summary><li>Ning Dong</li> <li>Mark Duppenthaler</li> <li>Paul-Ambroise Duquenne</li> <li>Hady Elsahar</li> <li>Min-Jae Hwang</li> <li>Hirofumi Inaguma</li> <li>Ilia Kulikov</li> <li>Pengwei Li</li> <li>Daniel Licht</li> <li>Jean Maillard</li> <li>Ruslan Mavlyutov</li> <li>Kaushik Ram Sadagopan</li> <li>Abinesh Ramakrishnan</li> <li>Tuan Tran</li> <li>Guillaume Wenzek</li> <li>Yilin Yang</li> <li>Ethan Ye</li> <li>Ivan Evtimov</li> <li>Pierre Fernandez</li> <li>Robin San Roman</li> <li>Bokai Yu</li> <li>Pierre Andrews</li> <li>Can Balioglu</li> <li>Peng-Jen Chen</li> <li>Marta Costa-jussà</li> <li>Maha Elbayad</li> <li>Hongyu Gong</li> <li>Francisco Guzmán</li> <li>Kevin Heffernan</li> <li>Somya Jain</li> <li>Justine Kao</li> <li>Ann Lee</li> <li>Xutai Ma</li> <li>Benjamin Peloquin</li> <li>Juan Pino</li> <li>Sravya Popuri</li> <li>Holger Schwenk</li> <li>Anna Sun</li> <li>Paden Tomasello</li> <li>Changhan Wang</li> <li>Skyler Wang</li> <li>Mary Williamson</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 14.12.2023 |
colab2pdf | Convert your Colab notebook to a PDF | Drengskapur | | | 11.12.2023 |
Sentence Transformers | Multilingual Sentence, Paragraph, and Image Embeddings using BERT & Co | <ul><li>Nils Reimers</li> <li>Iryna Gurevych</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul> | | 07.12.2023 |
CleanRL | Deep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features | <ul><li>Shengyi Huang</li> <li>Rousslan Dossa</li> <li>Chang Ye</li> <li>Jeff Braga</li><details><summary>others</summary><li>Dipam Chakraborty</li> <li>Kinal Mehta</li> <li>João Araújo</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>paper</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 28.11.2023 |
Vocos | Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis | Hubert Siuzdak | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li></ul> | | 21.11.2023 |
X—LLM | Easy LLM Finetuning using the most advanced methods | Boris Zubarev | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul> | | 15.11.2023 |
Distil-Whisper | Maintains the robustness of the Whisper model to difficult acoustic conditions, while being less prone to hallucination errors on long-form audio | <ul><li>Sanchit Gandhi</li> <li>Patrick von Platen</li> <li>Alexander Rush</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 08.11.2023 |
AnimateDiff | Practical framework to animate most of the existing personalized text-to-image models once and for all, saving efforts in model-specific tuning | <ul><li>Yuwei Guo</li> <li>Ceyuan Yang</li> <li>Anyi Rao</li> <li>Yaohui Wang</li><details><summary>others</summary><li>Yu Qiao</li> <li>Dahua Lin</li> <li>Bo Dai</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 30.10.2023 |
Intel® Neural Compressor | Aims to provide popular model compression techniques such as quantization, pruning (sparsity), distillation, and neural architecture search on mainstream frameworks such as TensorFlow, PyTorch, ONNX Runtime, and MXNet, as well as Intel extensions such as Intel Extension for TensorFlow and Intel Extension for PyTorch | intel | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 27.10.2023 |
Bark | Transformer-based text-to-audio model | suno | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li>examples</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 25.10.2023 |
Mistral Transformer | The most powerful language model for its size to date | <ul><li>Albert Jiang</li> <li>Alexandre Sablayrolles</li> <li>Arthur Mensch</li> <li>Chris Bamford</li><details><summary>others</summary><li>Devendra Chaplot</li> <li>Diego Casas</li> <li>Florian Bressand</li> <li>Gianna Lengyel</li> <li>Guillaume Lample</li> <li>Lucile Saulnier</li> <li>Lélio Renard Lavaud</li> <li>Marie-Anne Lachaux</li> <li>Pierre Stock</li> <li>Teven Scao</li> <li>Thibaut Lavril</li> <li>Thomas Wang</li> <li>Timothée Lacroix</li> <li>William Sayed</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 09.10.2023 |
Fooocus | Image generating software | Lvmin Zhang | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 03.10.2023 |
Actor-Critic | This tutorial demonstrates how to implement the Actor-Critic method using TensorFlow to train an agent on the Open AI Gym CartPole-V0 environment | <ul><li>Vijay Konda</li> <li>John Tsitsiklis</li></ul> | <ul><li>gym</li><li><img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 28.09.2023 |
MMagic | AIGC toolbox for professional AI researchers and machine learning engineers to explore image and video processing, editing and generation | OpenMMLab | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 11.09.2023 |
SeqIO | Library for processing sequential data to be fed into downstream sequence models | <ul><li>Adam Roberts</li> <li>Hyung Won Chung</li> <li>Anselm Levskaya</li> <li>Gaurav Mishra</li><details><summary>others</summary><li>James Bradbury</li> <li>Daniel Andor</li> <li>Sharan Narang</li> <li>Brian Lester</li> <li>Colin Gaffney</li> <li>Afroz Mohiuddin</li> <li>Curtis Hawthorne</li> <li>Aitor Lewkowycz</li> <li>Alex Salcianu</li> <li>Marc van Zee</li> <li>Jacob Austin</li> <li>Sebastian Goodman</li> <li>Livio Baldini Soares</li> <li>Haitang Hu</li> <li>Sasha Tsvyashchenko</li> <li>Aakanksha Chowdhery</li> <li>Jasmijn Bastings</li> <li>Jannis Bulian</li> <li>Xavier Garcia</li> <li>Jianmo Ni</li> <li>Andrew Chen</li> <li>Kathleen Kenealy</li> <li>Jonathan Clark</li> <li>Stephan Lee</li> <li>Dan Garrette</li> <li>James Lee-Thorp</li> <li>Colin Raffel</li> <li>Noam Shazeer</li> <li>Marvin Ritter</li> <li>Maarten Bosma</li> <li>Alexandre Passos</li> <li>Jeremy Maitin-Shepard</li> <li>Noah Fiedel</li> <li>Mark Omernick</li> <li>Brennan Saeta</li> <li>Ryan Sepassi</li> <li>Alexander Spiridonov</li> <li>Joshua Newlan</li> <li>Andrea Gesmundo</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 08.09.2023 |
MMAction2 | An open-source toolbox for video understanding based on PyTorch | MMAction2 Contributors | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data, data, data</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul> | | 06.09.2023 |
Ray | Unified framework for scaling AI and Python applications | <ul><li>Philipp Moritz</li> <li>Robert Nishihara</li> <li>Stephanie Wang</li> <li>Alexey Tumanov</li><details><summary>others</summary><li>Richard Liaw</li> <li>Eric Liang</li> <li>Melih Elibol</li> <li>Zongheng Yang</li> <li>William Paul</li> <li>Michael Jordan</li> <li>Ion Stoica</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 06.09.2023 |
Home Robot | Low-level API for controlling various home robots | Chris Paxton | <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul> | | 30.08.2023 |
Neural Tangents | Library designed to enable research into infinite-width neural networks | <ul><li>Roman Novak</li> <li>Lechao Xiao</li> <li>Jiri Hron</li> <li>Jaehoon Lee</li><details><summary>others</summary><li>Alexander Alemi</li> <li>Jascha Sohl-Dickstein</li> <li>Samuel Schoenholz</li></ul></details> | <ul><li>ICLR</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 29.08.2023 |
Stable Diffusion 2 | New stable diffusion model at 768x768 resolution. Same number of parameters in the U-Net as 1.5, but uses OpenCLIP-ViT/H as the text encoder and is trained from scratch | <ul><li>Robin Rombach</li> <li>Andreas Blattmann</li> <li>Dominik Lorenz</li> <li>Patrick Esser</li><details><summary>others</summary><li>Björn Ommer</li> <li>qunash</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 26.08.2023 |
DALL·E Mini | Generate images from a text prompt | <ul><li>Boris Dayma</li> <li>Suraj Patil</li> <li>Pedro Cuenca</li> <li>Khalid Saifullah</li><details><summary>others</summary><li>Tanishq Abraham</li> <li>Phúc H. Lê Khắc</li> <li>Luke Melas</li> <li>Ritobrata Ghosh</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 22.08.2023 |
Classify text with BERT | This tutorial contains complete code to fine-tune BERT to perform sentiment analysis on a dataset of plain-text IMDB movie reviews | Anirudh Dubey | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 08.08.2023 |
Kandinsky 2.1 | As text and image encoder it uses CLIP model and diffusion image prior between latent spaces of CLIP modalities | <ul><li>Arseniy Shakhmatov</li> <li>Anton Razzhigaev</li> <li>Aleksandr Nikolich</li> <li>Vladimir Arkhipkin</li><details><summary>others</summary><li>Igor Pavlov</li> <li>Andrey Kuznetsov</li> <li>Denis Dimitrov</li></ul></details> | <ul><li>blog post</li><li>demo</li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 07.08.2023 |
SoftVC VITS | Singing Voice Conversion | svc develop team | <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 31.07.2023 |
threestudio | Unified framework for 3D content creation from text prompts, single images, and few-shot images, by lifting 2D text-to-image generation models | <ul><li>Yuan-Chen Guo</li> <li>Ying-Tian Liu</li> <li>Ruizhi Shao</li> <li>Christian Laforte</li><details><summary>others</summary><li>Vikram Voleti</li> <li>Guan Luo</li> <li>Chia-Hao Chen</li> <li>Zi-Xin Zou</li> <li>Chen Wang</li> <li>Yanpei Cao</li> <li>Song-Hai Zhang</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 28.07.2023 |
Image captioning | Given an image our goal is to generate a caption | <ul><li>Kelvin Xu</li> <li>Jimmy Ba</li> <li>Ryan Kiros</li> <li>Kyunghyun Cho</li><details><summary>others</summary><li>Aaron Courville</li> <li>Ruslan Salakhutdinov</li> <li>Richard Zemel</li> <li>Yoshua Bengio</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 25.07.2023 |
Word2Vec | Word2Vec is not a singular algorithm, rather, it is a family of model architectures and optimizations that can be used to learn word embeddings from large datasets | Google | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>link</li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>projector</li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 25.07.2023 |
Word embeddings | This tutorial contains an introduction to word embeddings | Billy Lamberta | <ul><li>data</li><li>projector</li></ul> | | 25.07.2023 |
Contextualized Topic Models | Family of topic models that use pre-trained representations of language to support topic modeling | <ul><li>Federico Bianchi</li> <li>Silvia Terragni</li> <li>Dirk Hovy</li> <li>Debora Nozza</li> <li>Elisabetta Fersini</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 22.07.2023 |
Tortoise | A multi-voice TTS system trained with an emphasis on quality | James Betker | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>examples</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 15.07.2023 |
Petals | Run 100B+ language models at home, BitTorrent-style | BigScience | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 05.07.2023 |
Epistemic Neural Networks | A library for neural networks that know what they don't know | <ul><li>Ian Osband</li> <li>Zheng Wen</li> <li>Seyed Mohammad Asghari</li> <li>Vikranth Dwaracherla</li><details><summary>others</summary><li>Morteza Ibrahimi</li> <li>Xiuyuan Lu</li> <li>Benjamin Van Roy</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 27.06.2023 |
DeepFloyd IF | State-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding | <ul><li>Alex Shonenkov</li> <li>Misha Konstantinov</li> <li>Daria Bakshandaeva</li> <li>Christoph Schuhmann</li><details><summary>others</summary><li>Ksenia Ivanova</li> <li>Nadiia Klokova</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 26.06.2023 |
normflows | PyTorch implementation of discrete normalizing flows | <ul><li>Vincent Stimper</li> <li>David Liu</li> <li>Andrew Campbell</li> <li>Vincent Berenz</li><details><summary>others</summary><li>Lukas Ryll</li> <li>Bernhard Schölkopf</li> <li>José Miguel Hernández-Lobato</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 26.06.2023 |
MMPose | Toolbox for pose estimation based on PyTorch | OpenMMLab | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 19.06.2023 |
MyoSuite | A collection of musculoskeletal environments and tasks simulated with the MuJoCo physics engine and wrapped in the OpenAI gym API to enable the application of Machine Learning to bio-mechanic control problems | <ul><li>Vittorio Caggiano</li> <li>Huawei Wang</li> <li>Guillaume Durandau</li> <li>Massimo Sartori</li> <li>Vikash Kumar</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul> | | 16.06.2023 |
Audiocraft | PyTorch library for deep learning research on audio generation | <ul><li>Jade Copet</li> <li>Felix Kreuk</li> <li>Itai Gat</li> <li>Tal Remez</li><details><summary>others</summary><li>David Kant</li> <li>Gabriel Synnaeve</li> <li>Yossi Adi</li> <li>Alexandre Défossez</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 11.06.2023 |
Detectron2 | FAIR's next-generation platform for object detection and segmentation | Yuxin Wu | <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul> | | 26.05.2023 |
Reverb | Efficient and easy-to-use data storage and transport system designed for machine learning research | <ul><li>Albin Cassirer</li> <li>Gabriel Barth-Maron</li> <li>Eugene Brevdo</li> <li>Sabela Ramos</li><details><summary>others</summary><li>Toby Boyd</li> <li>Thibault Sottiaux</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul> | | 23.05.2023 |
MMDetection | Open source object detection toolbox based on PyTorch | OpenMMLab | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.05.2023 |
ChatRWKV | Like ChatGPT but powered by RWKV (100% RNN) language model, which is the only RNN that can match transformers in quality and scaling, while being faster and saves VRAM | <ul><li>Bo Peng</li> <li>Eric Alcaide</li> <li>Quentin Anthony</li> <li>Alon Albalak</li><details><summary>others</summary><li>Samuel Arcadinho</li> <li>Matteo Grella</li> <li>Kranthi Kiran</li> <li>Haowen Hou</li> <li>Przemyslaw Kazienko</li> <li>Jan Kocon</li> <li>Bartlomiej Koptyra</li> <li>Ipsit Mantri</li> <li>Ferdinand Mom</li> <li>Xiangru Tang</li> <li>Johan Wind</li> <li>Stanisław Woźniak</li> <li>Qihang Zhao</li> <li>Peng Zhou</li> <li>Jian Zhu</li> <li>Rui-Jie Zhu</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 08.05.2023 |
Python Data Science Handbook | Jupyter notebook version of the Python Data Science Handbook by Jake VanderPlas | Jake Vanderplas | <ul><li>project</li></ul> | | 06.05.2023 |
PGMax | General factor graphs for discrete probabilistic graphical models, and hardware-accelerated differentiable loopy belief propagation in JAX | <ul><li>Guangyao Zhou</li> <li>Nishanth Kumar</li> <li>Antoine Dedieu</li> <li>Miguel Lázaro-Gredilla</li><details><summary>others</summary><li>Shrinu Kushagra</li> <li>Dileep George</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 05.05.2023 |
StableLM | Stability AI Language Models | Stability AI | <ul><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 27.04.2023 |
TTS | A library for advanced Text-to-Speech generation, built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality | <ul><li>Eren Gölge</li> <li>Aya-AlJafari</li> <li>Edresson Casanova</li> <li>Josh Meyer</li><details><summary>others</summary><li>Kelly Davis</li> <li>Reuben Morais</li></ul></details> | <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>samples</li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 26.04.2023 |
OpenCLIP | An open source implementation of CLIP | <ul><li>Ross Wightman</li> <li>Cade Gordon</li> <li>Vaishaal Shankar</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 16.04.2023 |
Stable Baselines3 | Set of reliable implementations of reinforcement learning algorithms in PyTorch | <ul><li>Antonin Raffin</li> <li>Ashley Hill</li> <li>Adam Gleave</li> <li>Anssi Kanervisto</li><details><summary>others</summary><li>Maximilian Ernestus</li> <li>Noah Dormann</li></ul></details> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>paper</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 14.04.2023 |
RL Baselines3 Zoo | Training Framework for Stable Baselines3 Reinforcement Learning Agents | Antonin Raffin | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 14.04.2023 |
Grounded-SAM | Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything | IDEA-Research | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 12.04.2023 |
TFDS | Collection of ready-to-use datasets for use with TensorFlow, Jax, and other Machine Learning frameworks | Google | <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 11.04.2023 |
Optimum | Extension of Transformers and Diffusers, providing a set of optimization tools enabling maximum efficiency to train and run models on targeted hardware, while keeping things easy to use | Hugging Face | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 06.04.2023 |
MMOCR | Open source toolkit based on PyTorch and MMDetection, supporting numerous OCR-related models, including text detection, text recognition, and key information extraction | <ul><li>Zhanghui Kuang</li> <li>Hongbin Sun</li> <li>Zhizhong Li</li> <li>Xiaoyu Yue</li><details><summary>others</summary><li>Tsui Hin Lin</li> <li>Jianyong Chen</li> <li>Huaqiang Wei</li> <li>Yiqin Zhu</li> <li>Tong Gao</li> <li>Wenwei Zhang</li> <li>Kai Chen</li> <li>Wayne Zhang</li> <li>Dahua Lin</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 06.04.2023 |
MMSegmentation | Open source semantic segmentation toolbox based on PyTorch | OpenMMLab | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 31.03.2023 |
LAVIS | Python deep learning library for LAnguage-and-VISion intelligence research and applications | <ul><li>Dongxu Li</li> <li>Junnan Li</li> <li>Hung Le</li> <li>Guangsen Wang</li><details><summary>others</summary><li>Silvio Savarese</li> <li>Steven Hoi</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 24.03.2023 |
AudioLM | Framework for high-quality audio generation with long-term consistency | <ul><li>Phil Wang</li> <li>Zalán Borsos</li> <li>Raphaël Marinier</li> <li>Damien Vincent</li><details><summary>others</summary><li>Eugene Kharitonov</li> <li>Olivier Pietquin</li> <li>Matt Sharifi</li> <li>Olivier Teboul</li> <li>David Grangier</li> <li>Marco Tagliasacchi</li> <li>Neil Zeghidour</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 23.03.2023 |
pymdp | Package for simulating Active Inference agents in Markov Decision Process environments | <ul><li>Conor Heins</li> <li>Alec Tschantz</li> <li>Beren Millidge</li> <li>Brennan Klein</li><details><summary>others</summary><li>Arun Niranjan</li> <li>Daphne Demekas</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul> | | 19.03.2023 |
Tzer | Coverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation | <ul><li>Jiawei Liu</li> <li>Yuxiang Wei</li> <li>Sen Yang</li> <li>Yinlin Deng</li> <li>Lingming Zhang</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul> | | 09.03.2023 |
ArtLine | A Deep Learning based project for creating line art portraits | Vijish Madhavan | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul> | | 03.03.2023 |
Haiku | A library built on top of JAX designed to provide simple, composable abstractions for machine learning research | <ul><li>Tom Hennigan</li> <li>Trevor Cai</li> <li>Tamara Norman</li> <li>Igor Babuschkin</li></ul> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li></ul> | | 02.03.2023 |
SAHI | A lightweight vision library for performing large scale object detection & instance segmentation | <ul><li>Fatih Cagatay Akyon</li> <li>Sinan Onur ALTINUÇ</li> <li>Alptekin Temizel</li> <li>Cemil Cengiz</li><details><summary>others</summary><li>Devrim Çavuşoğlu</li> <li>Kadir Şahin</li> <li>Oğulcan Eryüksel</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li></ul> | | 23.02.2023 |
AmpliGraph | A suite of neural machine learning models for relational Learning, a branch of machine learning that deals with supervised learning on knowledge graphs | <ul><li>Luca Costabello</li> <li>Adrianna Janik</li> <li>Chan Le Van</li> <li>Nicholas McCarthy</li><details><summary>others</summary><li>Rory McGrath</li> <li>Sumit Pai</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 23.02.2023 |
NMT with attention | This notebook trains a seq2seq model for Spanish to English translation | <ul><li>Minh-Thang Luong</li> <li>Hieu Pham</li> <li>Christopher Manning</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 15.02.2023 |
GLUE using BERT on TPU | This tutorial contains complete end-to-end code to train models on a TPU | Anirudh Dubey | <ul><li>GLUE</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 15.02.2023 |
TensorBoard | Suite of web applications for inspecting and understanding your TensorFlow runs and graphs | Yuan Tang | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 10.02.2023 |
High-performance Simulation with Kubernetes | This tutorial will describe how to set up high-performance simulation using a TFF runtime running on Kubernetes | Jason Roselander | <ul><li>GKE</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li>shell</li></ul> | | 31.01.2023 |
Compel | Text prompt weighting and blending library for transformers-type text embedding systems | Damian Stewart | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 26.01.2023 |
DALL·E Flow | An interactive workflow for generating high-definition images from text prompt | <ul><li>Han Xiao</li> <li>Delgermurun Purevkhuu</li> <li>Alex Cureton-Griffiths</li></ul> | <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 26.01.2023 |
Diffusers | Provides pretrained diffusion models across multiple modalities, such as vision and audio, and serves as a modular toolbox for inference and training of diffusion models | Hugging Face | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.01.2023 |
Sample Factory | One of the fastest RL libraries focused on very efficient synchronous and asynchronous implementations of policy gradients | <ul><li>Aleksei Petrenko</li> <li>Zhehui Huang</li> <li>Tushar Kumar</li> <li>Gaurav Sukhatme</li> <li>Vladlen Koltun</li></ul> | <ul><li>ICML</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.01.2023 |
Open-Assistant | Chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so | <ul><li>Andreas Köpf</li> <li>Yannic Kilcher</li> <li>Huu Nguyen</li> <li>Christoph Schuhmann</li><details><summary>others</summary><li>Keith Stevens</li> <li>Abdullah Barhoum</li> <li>Nguyen Minh Duc</li> <li>Oliver Stanley</li> <li>James Melvin Ebenezer</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 14.01.2023 |
panda-gym | Set of robotic environments based on PyBullet physics engine and gymnasium | <ul><li>Quentin Gallouédec</li> <li>Nicolas Cazin</li> <li>Emmanuel Dellandréa</li> <li>Liming Chen</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 02.01.2023 |
BANMo | Given multiple casual videos capturing a deformable object, BANMo reconstructs an animatable 3D model, including an implicit canonical 3D shape, appearance, skinning weights, and time-varying articulations, without pre-defined shape templates or registered cameras | <ul><li>Gengshan Yang</li> <li>Minh Vo</li> <li>Natalia Neverova</li> <li>Deva Ramanan</li><details><summary>others</summary><li>Andrea Vedaldi</li> <li>Hanbyul Joo</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 30.12.2022 |
tensor_parallel | Run large PyTorch models on multiple GPUs in one line of code with potentially linear speedup | Andrei Panferov | <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul> | | 29.12.2022 |
TPU | Reference models and tools for Cloud TPUs | Google | <ul><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 20.12.2022 |
rliable | Library for reliable evaluation, even with a handful of runs, on reinforcement learning and machine learnings benchmarks | <ul><li>Rishabh Agarwal</li> <li>Max Schwarzer</li> <li>Pablo Castro</li> <li>Aaron Courville</li> <li>Marc Bellemare</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post, blog post</li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>podcast</li><li>poster</li><li>project</li><li>slides</li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 19.12.2022 |
TF-Agents | A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning | <ul><li>Sergio Guadarrama</li> <li>Anoop Korattikara</li> <li>Oscar Ramirez</li> <li>Pablo Castro</li><details><summary>others</summary><li>Ethan Holly</li> <li>Sam Fishman</li> <li>Ke Wang</li> <li>Ekaterina Gonina</li> <li>Neal Wu</li> <li>Efi Kokiopoulou</li> <li>Luciano Sbaiz</li> <li>Jamie Smith</li> <li>Gábor Bartók</li> <li>Jesse Berent</li> <li>Chris Harris</li> <li>Vincent Vanhoucke</li> <li>Eugene Brevdo</li></ul></details> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 15.12.2022 |
PyG | Library built upon PyTorch to easily write and train Graph Neural Networks for a wide range of applications related to structured data | <ul><li>Matthias Fey</li> <li>Jan Eric Lenssen</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 08.12.2022 |
ruGPT3 | Example of inference of RuGPT3XL | Anton Emelyanov | <ul><li>cristofari</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>sparse attention</li></ul> | | 07.12.2022 |
DSP theory | Theory of digital signal processing: signals, filtration (IIR, FIR, CIC, MAF), transforms (FFT, DFT, Hilbert, Z-transform) etc | <ul><li>Alexander Kapitanov</li> <li>Vladimir Fadeev</li> <li>Karina Kvanchiani</li> <li>Elizaveta Petrova</li> <li>Andrei Makhliarchuk</li></ul> | <ul><li>blog post</li></ul> | | 18.10.2022 |
Mubert | Prompt-based music generation via Mubert API | Ilya Belikov | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 18.10.2022 |
RuDOLPH | A fast and light text-image-text transformer designed for a quick and easy fine-tuning setup for the solution of various tasks: from generating images by text description and image classification to visual question answering and more | <ul><li>Alex Shonenkov</li> <li>Misha Konstantinov</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul> | | 06.10.2022 |
Batch RL | Offline RL using the DQN replay dataset comprising the entire replay experience of a DQN agent on 60 Atari 2600 games | <ul><li>Rishabh Agarwal</li> <li>Dale Schuurmans</li> <li>Mohammad Norouzi</li></ul> | <ul><li>DQN</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li>slides</li><li>talk</li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 04.10.2022 |
EfficientDet | New family of object detectors, called EfficientDet, which consistently achieve much better efficiency than prior art across a wide spectrum of resource constraints | <ul><li>Mingxing Tan</li> <li>Ruoming Pang</li> <li>Quoc Le</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li>tutorial</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 27.09.2022 |
RL Games | High performance RL library | <ul><li>Denys Makoviichuk</li> <li>Viktor Makoviychuk</li></ul> | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul> | | 27.09.2022 |
ACME | A library of reinforcement learning components and agents | <ul><li>Matt Hoffman</li> <li>Bobak Shahriari</li> <li>John Aslanides</li> <li>Gabriel Barth-Maron</li><details><summary>others</summary><li>Feryal Behbahani</li> <li>Tamara Norman</li> <li>Abbas Abdolmaleki</li> <li>Albin Cassirer</li> <li>Fan Yang</li> <li>Kate Baumli</li> <li>Sarah Henderson</li> <li>Alex Novikov</li> <li>Sergio Gómez Colmenarejo</li> <li>Serkan Cabi</li> <li>Caglar Gulcehre</li> <li>Tom Le Paine</li> <li>Andrew Cowie</li> <li>Ziyu Wang</li> <li>Bilal Piot</li> <li>Nando de Freitas</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 26.09.2022 |
RWKV | Reinventing RNNs for the Transformer Era | <ul><li>Bo Peng</li> <li>Eric Alcaide</li> <li>Quentin Anthony</li> <li>Alon Albalak</li><details><summary>others</summary><li>Samuel Arcadinho</li> <li>Matteo Grella</li> <li>Kranthi Kiran</li> <li>Haowen Hou</li> <li>Przemyslaw Kazienko</li> <li>Jan Kocon</li> <li>Bartlomiej Koptyra</li> <li>Ipsit Mantri</li> <li>Ferdinand Mom</li> <li>Xiangru Tang</li> <li>Johan Wind</li> <li>Stanisław Woźniak</li> <li>Qihang Zhao</li> <li>Peng Zhou</li> <li>Jian Zhu</li> <li>Rui-Jie Zhu</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li>demo</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/>, <img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 21.09.2022 |
NetKet | Open-source project delivering cutting-edge methods for the study of many-body quantum systems with artificial neural networks and machine learning techniques | <ul><li>Filippo Vicentini</li> <li>Damian Hofmann</li> <li>Attila Szabó</li> <li>Dian Wu</li><details><summary>others</summary><li>Christopher Roth</li> <li>Clemens Giuliani</li> <li>Gabriel Pescia</li> <li>Jannes Nys</li> <li>Vladimir Vargas-Calderón</li> <li>Nikita Astrakhantsev</li> <li>Giuseppe Carleo</li> <li>Kenny Choo</li> <li>James Smith</li> <li>Tom Westerhout</li> <li>Fabien Alet</li> <li>Emily Davis</li> <li>Stavros Efthymiou</li> <li>Ivan Glasser</li> <li>Sheng-Hsuan Lin</li> <li>Marta Mauri</li> <li>Mazzola Guglielmo</li> <li>Christian Mendl</li> <li>Evert Nieuwenburg</li> <li>Ossian O'Reilly</li> <li>Hugo Théveniaut</li> <li>Giacomo Torlai</li> <li>Alexander Wietek</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 15.09.2022 |
Stable Diffusion | A latent text-to-image diffusion model | <ul><li>Robin Rombach</li> <li>Andreas Blattmann</li> <li>Dominik Lorenz</li> <li>Patrick Esser</li> <li>Björn Ommer</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 10.08.2022 |
Deep-MAC | Welcome to the Novel class segmentation demo | Vighnesh Birodkar | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul> | | 10.08.2022 |
NL-Augmenter | A collaborative effort intended to add transformations of datasets dealing with natural language | <ul><li>Aadesh Gupta</li> <li>Timothy Sum Hon Mun</li> <li>Aditya Srivatsa</li> <li>Xudong Shen</li><details><summary>others</summary><li>Juan Diego Rodriguez</li> <li>Ashish Shrivastava</li> <li>Nagender Aneja</li> <li>Zijie Wang</li> <li>Yiwen Shi</li> <li>Afnan Mir</li> <li>William Soto</li> <li>Chandan Singh</li> <li>Claude Roux</li> <li>Abinaya Mahendiran</li> <li>Anna Shvets</li> <li>Kaustubh Dhole</li> <li>Bryan Wilie</li> <li>Jamie Simon</li> <li>Mukund Varma</li> <li>Sang Han</li> <li>Denis Kleyko</li> <li>Samuel Cahyawijaya</li> <li>Filip Cornell</li> <li>Tanay Dixit</li> <li>Connor Boyle</li> <li>Genta Indra Winata</li> <li>Seungjae Ryan Lee</li> <li>Marcin Namysl</li> <li>Roman Sitelew</li> <li>Zhenhao Li</li> <li>Fiona Tan</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>website</li></ul> | | 06.08.2022 |
XManager | Framework for managing machine learning experiment | Andrew Chen | <ul><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li>slides</li></ul> | | 29.07.2022 |
Accelerate | A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision | Hugging Face | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li></ul> | | 27.07.2022 |
YOLOv5 on Custom Objects | This notebook shows training on your own custom objects | Jacob Solawetz | <ul><li>blog post</li><li>data</li></ul> | | 20.07.2022 |
MindsEye | Graphical user interface built to run multimodal ai art models for free from a Google Colab, without needing edit a single line of code or know any programming | <ul><li>multimodal.art</li> <li>João Paulo Apolinário Passos</li></ul> | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul> | | 06.07.2022 |
py-irt | Fitting Item Response Theory models using variational inference | <ul><li>John Lalor</li> <li>Hong Yu</li> <li>Pedro Rodriguez</li> <li>Joe Barrow</li><details><summary>others</summary><li>Alexander Hoyle</li> <li>Robin Jia</li> <li>Jordan Boyd-Graber</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>paper</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 30.06.2022 |
BIG-bench | A collaborative benchmark intended to probe large language models and extrapolate their future capabilities | <ul><li>Jaehoon Lee</li> <li>Jascha Sohl-Dickstein</li> <li>Vinay Ramasesh</li> <li>Sajant Anand</li><details><summary>others</summary><li>Alicia Parrish</li> <li>Ethan Dyer</li> <li>Liam Dugan</li> <li>Dieuwke Hupkes</li> <li>Daniel Freeman</li> <li>Guy Gur-Ari</li> <li>Aitor Lewkowycz</li></ul></details> | <ul><li>API</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul> | | 28.06.2022 |
HuggingArtists | Choose your favorite Artist and train a language model to write new lyrics based on their unique voice | Aleksey Korshuk | <ul><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 25.06.2022 |
Introduction to the TensorFlow Models NLP library | You will learn how to build transformer-based models for common NLP tasks including pretraining, span labelling and classification using the building blocks from NLP modeling library | Chen Chen | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul> | | 22.06.2022 |
Cirq | A python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum circuits | <ul><li>Balint Pato</li> <li>Matthew Harrigan</li> <li>Animesh Sinha</li> <li>Matthew Neeley</li><details><summary>others</summary><li>Dave Bacon</li> <li>Matteo Pompili</li> <li>Michael Broughton</li></ul></details> | <ul><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 21.06.2022 |
CLIP-as-service | A low-latency high-scalability service for embedding images and text | Han Xiao | <ul><li>data</li><li><img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 19.06.2022 |
Jina | MLOps framework that empowers anyone to build cross-modal and multi-modal applications on the cloud | Han Xiao | <ul><li>data</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>hub</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 11.06.2022 |
MMRotate | Toolbox for rotated object detection based on PyTorch | <ul><li>Yue Zhou</li> <li>Xue Yang</li> <li>Gefan Zhang</li> <li>Jiabao Wang</li><details><summary>others</summary><li>Yanyi Liu</li> <li>Liping Hou</li> <li>Xue Jiang</li> <li>Xingzhao Liu</li> <li>Junchi Yan</li> <li>Chengqi Lyu</li> <li>Wenwei Zhang</li> <li>Kai Chen</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 10.06.2022 |
Aesthetics Predictor | A linear estimator on top of clip to predict the aesthetic quality of pictures | LAION AI | <ul><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li></ul> | | 04.06.2022 |
Flashlight | Fast, flexible machine learning library written entirely in C++ | <ul><li>Jacob Kahn</li> <li>Vineel Pratap</li> <li>Tatiana Likhomanenko</li> <li>Qiantong Xu</li><details><summary>others</summary><li>Awni Hannun</li> <li>Jeff Cai</li> <li>Paden Tomasello</li> <li>Ann Lee</li> <li>Edouard Grave</li> <li>Gilad Avidov</li> <li>Benoit Steiner</li> <li>Vitaliy Liptchinsky</li> <li>Gabriel Synnaeve</li> <li>Ronan Collobert</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul> | | 01.06.2022 |
RL Unplugged | Suite of benchmarks for offline reinforcement learning | <ul><li>Caglar Gulcehre</li> <li>Ziyu Wang</li> <li>Alexander Novikov</li> <li>Tom Le Paine</li><details><summary>others</summary><li>Sergio Gómez Colmenarejo</li> <li>Konrad Żołna</li> <li>Rishabh Agarwal</li> <li>Josh Merel</li> <li>Daniel Mankowitz</li> <li>Cosmin Paduraru</li> <li>Gabriel Dulac-Arnold</li> <li>Jerry Li</li> <li>Mohammad Norouzi</li> <li>Matt Hoffman</li> <li>Ofir Nachum</li> <li>George Tucker</li> <li>Nicolas Heess</li> <li>Nando de Freitas</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 26.05.2022 |
Scenic | Codebase with a focus on research around attention-based models for computer vision | <ul><li>Mostafa Dehghani</li> <li>Alexey Gritsenko</li> <li>Anurag Arnab</li> <li>Matthias Minderer</li> <li>Yi Tay</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul> | | 04.05.2022 |
Text generation with RNN | This tutorial demonstrates how to generate text using a character-based RNN | Anirudh Dubey | <ul><li>link</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 03.05.2022 |
CLIPDraw | Synthesize drawings to match a text prompt | <ul><li>Kevin Frans</li> <li>Lisa Soros</li> <li>Olaf Witkowski</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/></li></ul> | | 29.04.2022 |
CodeGen | Family of open-source model for program synthesis | <ul><li>Erik Nijkamp</li> <li>Bo Pang</li> <li>Hiroaki Hayashi</li> <li>Lifu Tu</li><details><summary>others</summary><li>Huan Wang</li> <li>Yingbo Zhou</li> <li>Silvio Savarese</li> <li>Caiming Xiong</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul> | | 23.04.2022 |
Jraph | library for graph neural networks in jax | <ul><li>Jonathan Godwin</li> <li>Thomas Keck</li> <li>Peter Battaglia</li> <li>Victor Bapst</li><details><summary>others</summary><li>Thomas Kipf</li> <li>Yujia Li</li> <li>Kimberly Stachenfeld</li> <li>Petar Veličković</li> <li>Alvaro Sanchez-Gonzalez</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 15.04.2022 |
deep-significance | Easy-to-use package containing different significance tests and utility functions specifically tailored towards research needs and usability | <ul><li>Dennis Ulmer</li> <li>Christian Hardmeier</li> <li>Jes Frellsen</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 12.04.2022 |
Text classification with RNN | This text classification tutorial trains a recurrent neural network on the IMDB large movie review dataset for sentiment analysis | Anirudh Dubey | <ul><li>data</li><li>link</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul> | | 17.03.2022 |
TriMap | Dimensionality reduction technique based on triplet constraints, which preserves the global structure of the data better than the other commonly used methods such as t-SNE, LargeVis, and UMAP | <ul><li>Ehsan Amid</li> <li>Manfred Warmuth</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 17.03.2022 |
RLDS | Reinforcement Learning Datasets and it is an ecosystem of tools to store, retrieve and manipulate episodic data in the context of Sequential Decision Making including RL, Learning for Demonstrations, Offline RL or Imitation Learning | <ul><li>Sabela Ramos</li> <li>Sertan Girgin</li> <li>Léonard Hussenot</li> <li>Damien Vincent</li><details><summary>others</summary><li>Hanna Yakubovich</li> <li>Daniel Toyama</li> <li>Anita Gergely</li> <li>Piotr Stanczyk</li> <li>Raphaël Marinier</li> <li>Jeremiah Harmsen</li> <li>Olivier Pietquin</li> <li>Nikola Momchev</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 16.03.2022 |
Real-Time Voice Cloning | SV2TTS with a vocoder that works in real-time | <ul><li>Corentin Jemine</li> <li>Erdene-Ochir Tuguldur</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 08.03.2022 |
BLIP | VLP framework which transfers flexibly to both vision-language understanding and generation tasks | <ul><li>Junnan Li</li> <li>Dongxu Li</li> <li>Caiming Xiong</li> <li>Steven Hoi</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 03.03.2022 |
VideoGPT | A conceptually simple architecture for scaling likelihood based generative modeling to natural videos | <ul><li>Wilson Yan</li> <li>Yunzhi Zhang</li> <li>Pieter Abbeel</li> <li>Aravind Srinivas</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li></ul> | | 02.03.2022 |
Silero Models | Pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple | Silero team | <ul><li>STT, STT, STT</li><li>TTS, TTS</li><li>Text Enhancement</li><li>VAD, VAD</li><li>website</li></ul> | | 27.02.2022 |
Real-CUGAN | AI super resolution model for anime images, trained in a million scale anime dataset, using the same architecture as Waifu2x-CUNet | bilibili | <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 27.02.2022 |
ArcaneGAN | Process video in the style of the Arcane animated series | Alexander Spirin | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.02.2022 |
textlesslib | A library aimed to facilitate research in Textless NLP | <ul><li>Eugene Kharitonov</li> <li>Jade Copet</li> <li>Kushal Lakhotia</li> <li>Nguyễn Tú Anh</li><details><summary>others</summary><li>Paden Tomasello</li> <li>Ann Lee</li> <li>Ali Elkahky</li> <li>Wei-Ning Hsu</li> <li>Abdelrahman Mohamed</li> <li>Emmanuel Dupoux</li> <li>Yossi Adi</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul> | | 15.02.2022 |
AV-HuBERT | Self-supervised representation learning framework for audio-visual speech | <ul><li>Bowen Shi</li> <li>Wei-Ning Hsu</li> <li>Kushal Lakhotia</li> <li>Abdelrahman Mohamed</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li></ul> | | 12.02.2022 |
Lingvo | Framework for building neural networks in Tensorflow, particularly sequence models | <ul><li>Jonathan Shen</li> <li>Patrick Nguyen</li> <li>Yonghui Wu</li> <li>Zhifeng Chen</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/>, <img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul> | | 28.01.2022 |
DeepDream | This tutorial contains a minimal implementation of DeepDream: an experiment that visualizes the patterns learned by a neural network | <ul><li>Alexander Mordvintsev</li> <li>Billy Lamberta</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 13.01.2022 |
FuseDream | Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization | <ul><li>Xingchao Liu</li> <li>Chengyue Gong</li> <li>Lemeng Wu</li> <li>Hao Su</li> <li>Qiang Liu</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul> | | 02.01.2022 |
MLP | The most basic neural network architectures, a multilayer perceptron, also known as a feedforward network | Ben Trevett | <ul><li>NN and DL</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>optimization</li><li><img src="images/pt.svg" alt="pt" height=20/>, <img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 26.12.2021 |
AlexNet | A neural network model that uses convolutional neural network layers and was designed for the ImageNet challenge | Ben Trevett | <ul><li>ILSVRC</li><li>LR</li><li>PMLR</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>cifar-10</li><li>dropout</li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li>[<img src="images/wiki.svg" alt="wiki" height=20/>](https://en.wikipedia.org/wiki/Regularization_(mathematics), <img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 26.12.2021 |
VGG | Very Deep Convolutional Networks for Large-Scale Image Recognition | Ben Trevett | <ul><li>ILSVRC</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>cifar-10</li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 26.12.2021 |
LeNet | A neural network model that uses convolutional neural network layers and was designed for classifying handwritten characters | Ben Trevett | <ul><li>CNN</li><li>LeNet-5</li><li>guide</li><li>paper</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 26.12.2021 |
Music Composer | Synthesizing symbolic music in MIDI format using the Music Transformer model | bazanovvanya | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul> | | 20.12.2021 |
FLAML | Lightweight Python library that finds accurate machine learning models automatically, efficiently and economically | <ul><li>Chi Wang</li> <li>Qingyun Wu</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>paper</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.12.2021 |
CompilerGym | A reinforcement learning toolkit for compiler optimizations | <ul><li>Chris Cummins</li> <li>Bram Wasti</li> <li>Jiadong Guo</li> <li>Brandon Cui</li><details><summary>others</summary><li>Jason Ansel</li> <li>Sahir Gomez</li> <li>Olivier Teytaud</li> <li>Benoit Steiner</li> <li>Yuandong Tian</li> <li>Hugh Leather</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul> | | 16.11.2021 |
Reformer | Performs on par with Transformer models while being much more memory-efficient and much faster on long sequences | <ul><li>Phil Wang</li> <li>Nikita Kitaev</li> <li>Łukasz Kaiser</li> <li>Anselm Levskaya</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 07.11.2021 |
ruDALL·E | Generate images from texts in Russian | Alex Shonenkov | <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li></ul> | | 03.11.2021 |
DeepStyle | The Neural Style algorithm synthesizes a pastiche by separating and combining the content of one image with the style of another image using convolutional neural networks | <ul><li>Cameron Smith</li> <li>Alexander Spirin</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>cvpr</li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 01.10.2021 |
Text2Animation | Generate images from text phrases with VQGAN and CLIP with animation and keyframes | <ul><li>Katherine Crowson</li> <li>Ryan Murdock</li> <li>Chigozie Nri</li> <li>Denis Malimonov</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 29.09.2021 |
EfficientNetV2 | A family of image classification models, which achieve better parameter efficiency and faster training speed than prior arts | <ul><li>Mingxing Tan</li> <li>Quoc Le</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul> | | 24.09.2021 |
Clip retrieval | Easily compute clip embeddings and build a clip retrieval system with them | Romain Beaumont | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 21.09.2021 |
img2dataset | Easily turn large sets of image urls to an image dataset | Romain Beaumont | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 17.09.2021 |
Droidlet | A modular embodied agent architecture and platform for building embodied agents | <ul><li>Anurag Pratik</li> <li>Soumith Chintala</li> <li>Kavya Srinet</li> <li>Dhiraj Gandhi</li><details><summary>others</summary><li>Rebecca Qian</li> <li>Yuxuan Sun</li> <li>Ryan Drew</li> <li>Sara Elkafrawy</li> <li>Anoushka Tiwari</li> <li>Tucker Hart</li> <li>Mary Williamson</li> <li>Abhinav Gupta</li> <li>Arthur Szlam</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul> | | 15.09.2021 |
GPT-J-6B | A 6 billion parameter, autoregressive text generation model trained on The Pile | <ul><li>Ben Wang</li> <li>Aran Komatsuzaki</li> <li>Janko Prester</li></ul> | <ul><li>The Pile</li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>web demo</li></ul> | | 15.09.2021 |
Machine learning course | This course is broad and shallow, but author will provide additional links so that you can deepen your understanding of the ML method you need | Тимчишин Віталій | <ul><li>blog post</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 02.09.2021 |
Lucid Sonic Dreams | Syncs GAN-generated visuals to music | Mikael Alafriz | <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 24.08.2021 |
textgenrnn | Generate text using a pretrained neural network with a few lines of code, or easily train your own text-generating neural network of any size and complexity | Max Woolf | <ul><li>blog post</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 13.07.2021 |
BasicSR | Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. | <ul><li>Xintao Wang</li> <li>Liangbin Xie</li> <li>Ke Yu</li> <li>Kelvin Chan</li><details><summary>others</summary><li>Chen Change Loy</li> <li>Chao Dong</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 07.06.2021 |
TensorFlowTTS | Real-time state-of-the-art speech synthesis architectures such as Tacotron-2, Melgan, Multiband-Melgan, FastSpeech, FastSpeech2 based-on TensorFlow 2 | <ul><li>Minh Nguyen Quan Anh</li> <li>Eren Gölge</li> <li>Kuan Chen</li> <li>Takuya Ebata</li></ul> | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li>project</li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 01.06.2021 |
Hyperopt | Python library for serial and parallel optimization over awkward search spaces, which may include real-valued, discrete, and conditional dimensions | <ul><li>James Bergstra</li> <li>Dan Yamins</li> <li>David Cox</li></ul> | <ul><li>ICML</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 01.06.2021 |
CNN | This tutorial demonstrates training a simple Convolutional Neural Network to classify CIFAR images | Billy Lamberta | <ul><li>cifar</li><li>link</li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 22.05.2021 |
Custom GPT-2 + Tokenizer | Train a custom GPT-2 model for free on a GPU using aitextgen! | Max Woolf | <ul><li>data</li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul> | | 17.05.2021 |
Train a GPT-2 Text-Generating Model | Retrain an advanced text generating neural network on any text dataset for free on a GPU using Colaboratory using aitextgen! | Max Woolf | <ul><li>data</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul> | | 17.05.2021 |
EasyNMT | Easy to use, state-of-the-art machine translation for more than 100+ languages | Nils Reimers | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul> | | 26.04.2021 |
SkinDeep | Remove Body Tattoo Using Deep Learning | Vijish Madhavan | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul> | | 24.04.2021 |
PaddleHub | Pre-trained models toolkit based on PaddlePaddle: 400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving | <ul><li>Zeyu Chen</li> <li>Zewu Wu</li> <li>Bin Long</li> <li>Xuefei Zhang</li><details><summary>others</summary><li>Jinxuan Qiu</li> <li>Yuhan Shen</li> <li>Yuying Hao</li> <li>Xiaojie Chen</li></ul></details> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 20.04.2021 |
OCTIS | Framework for training, analyzing, and comparing Topic Models, whose optimal hyper-parameters are estimated using a Bayesian Optimization approach | <ul><li>Silvia Terragni</li> <li>Elisabetta Fersini</li> <li>Antonio Candelieri</li> <li>Pietro Tropeano</li><details><summary>others</summary><li>Bruno Galuzzi</li> <li>Lorenzo Famiglini</li> <li>Davide Pietrasanta</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>paper</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 19.04.2021 |
PyTorchVideo | Deeplearning library with a focus on video understanding work | <ul><li>Haoqi Fan</li> <li>Tullie Murrell</li> <li>Heng Wang</li> <li>Kalyan Vasudev Alwala</li><details><summary>others</summary><li>Yanghao Li</li> <li>Yilei Li</li> <li>Bo Xiong</li> <li>Nikhila Ravi</li> <li>Meng Li</li> <li>Haichuan Yang</li> <li>Jitendra Malik</li> <li>Ross Girshick</li> <li>Matt Feiszli</li> <li>Aaron Adcock</li> <li>Wan-Yen Lo</li> <li>Christoph Feichtenhofer</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 13.04.2021 |
NeuSpell | Open-source toolkit for spelling correction in English | <ul><li>Sai Muralidhar Jayanthi</li> <li>Danish Pruthi</li> <li>Graham Neubig</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li></ul> | | 03.04.2021 |
GPT Neo | An implementation of model & data parallel GPT2 & GPT3 -like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library | EleutherAI | <ul><li>GPT-2</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>pretrained</li></ul> | | 28.03.2021 |
CVAE | This notebook demonstrates how train a Variational Autoencoder on the MNIST dataset | <ul><li>Diederik Kingma</li> <li>Max Welling</li> <li>Danilo Rezende</li> <li>Shakir Mohamed</li> <li>Daan Wierstra</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 22.03.2021 |
Big Sleep | Text to image generation, using OpenAI's CLIP and a BigGAN | Phil Wang | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/>, <img src="images/reddit.svg" alt="reddit" height=20/></li></ul> | | 17.03.2021 |
Deep Daze | Text to image generation using OpenAI's CLIP and Siren | Phil Wang | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul> | | 17.03.2021 |
DCGAN | This tutorial demonstrates how to generate images of handwritten digits using a Deep Convolutional Generative Adversarial Network | <ul><li>Alec Radford</li> <li>Luke Metz</li> <li>Soumith Chintala</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 12.03.2021 |
Adversarial FGSM | This tutorial creates an adversarial example using the Fast Gradient Signed Method attack. This was one of the first and most popular attacks to fool a neural network. | <ul><li>Ian Goodfellow</li> <li>Jonathon Shlens</li> <li>Christian Szegedy</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>imagenet</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul> | | 12.03.2021 |
GAN steerability | We will navigate in GAN latent space to simulate various camera transformations | <ul><li>Ali Jahanian</li> <li>Lucy Chai</li> <li>Phillip Isola</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 04.03.2021 |
Trax | End-to-end library for deep learning that focuses on clear code and speed | Google | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>discuss</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 18.02.2021 |
bsuite | A collection of carefully-designed experiments that investigate core capabilities of an RL agent with two main objectives | <ul><li>Ian Osband</li> <li>Yotam Doron</li> <li>Matteo Hessel</li> <li>John Aslanides</li><details><summary>others</summary><li>Eren Sezener</li> <li>Andre Saraiva</li> <li>Katrina McKinney</li> <li>Tor Lattimore</li> <li>Csaba Szepesvari</li> <li>Satinder Singh</li> <li>Benjamin Van Roy</li> <li>Richard Sutton</li> <li>David Silver</li> <li>Hado Van Hasselt</li></ul></details> | <ul><li><img src="images/git.svg" alt="git" height=20/></li><li>paper</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 13.02.2021 |
TF-Ranking | End-to-end walkthrough of training a TensorFlow Ranking neural network model which incorporates sparse textual features | Rama Kumar | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul> | | 04.02.2021 |
Toon-Me | A fun project to toon portrait images | Vijish Madhavan | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul> | | 22.01.2021 |
TensorNetwork | A library for easy and efficient manipulation of tensor networks | Chase Roberts | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 21.01.2021 |
Spleeter | Deezer source separation library including pretrained models | <ul><li>Romain Hennequin</li> <li>Anis Khlif</li> <li>Félix Voituret</li> <li>Manuel Moussallam</li></ul> | <ul><li>blog post</li><li>data</li><li>project</li></ul> | | 10.01.2021 |
Bullet Physics SDK | Real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc | <ul><li>Erwin Coumans</li> <li>Yunfei Bai</li></ul> | <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 14.10.2020 |
Person Remover | Project that combines Pix2Pix and YOLO arhitectures in order to remove people or other objects from photos | <ul><li>Javier Gamazo</li> <li>Daryl Autar</li></ul> | <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 22.08.2020 |
Semantic Segmentation | Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset | <ul><li>Bolei Zhou</li> <li>Hang Zhao</li> <li>Xavier Puig</li> <li>Sanja Fidler</li> <li>Antonio Torralba</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul> | | 21.08.2020 |
Gin Config | Lightweight configuration framework for Python, based on dependency injection | <ul><li>Dan Holtmann-Rice</li> <li>Sergio Guadarrama</li> <li>Nathan Silberman</li></ul> | <ul><li><img src="images/medium.svg" alt="medium" height=20/></li></ul> | | 13.08.2020 |
Dopamine | Research framework for fast prototyping of reinforcement learning algorithms | <ul><li>Pablo Castro</li> <li>Subhodeep Moitra</li> <li>Carles Gelada</li> <li>Saurabh Kumar</li> <li>Marc Bellemare</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>baselines</li><li>blog post</li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 04.08.2020 |
Analyzing Tennis Serve | We'll use the Video Intelligence API to analyze a tennis serve, including the angle of the arms and legs during the serve | Dale Markowitz | <ul><li>blog post</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 14.07.2020 |
YOLOv4 | This tutorial will help you build YOLOv4 easily in the cloud with GPU enabled so that you can run object detections in milliseconds! | Alexey Bochkovskiy | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 25.06.2020 |
TensorFlow Graphics | Differentiable computer graphics in tensorflow | <ul><li>Julien Valentin</li> <li>Cem Keskin</li> <li>Pavel Pidlypenskyi</li> <li>Ameesh Makadia</li><details><summary>others</summary><li>Avneesh Sud</li> <li>Sofien Bouaziz</li></ul></details> | <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 20.05.2020 |
GAN Dissection | Visualizing and Understanding Generative Adversarial Networks | <ul><li>David Bau</li> <li>Jun-Yan Zhu</li> <li>Hendrik Strobelt</li> <li>Bolei Zhou</li><details><summary>others</summary><li>Joshua Tenenbaum</li> <li>William Freeman</li> <li>Antonio Torralba</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 04.05.2020 |
Sonnet | Library built on top of TensorFlow 2 designed to provide simple, composable abstractions for machine learning research | <ul><li>Malcolm Reynolds</li> <li>Jack Rae</li> <li>Andreas Fidjeland</li> <li>Fabio Viola</li><details><summary>others</summary><li>Adrià Puigdomènech</li> <li>Frederic Besse</li> <li>Tim Green</li> <li>Sébastien Racanière</li> <li>Gabriel Barth-Maron</li> <li>Diego Casas</li></ul></details> | <ul><li><img src="images/deepmind.svg" alt="deepmind" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 17.04.2020 |
Classification of chest vs. adominal X-rays | The goal of this tutorial is to build a deep learning classifier to accurately differentiate between chest and abdominal X-rays | tmoneyx01 | <ul><li>annotator</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul> | | 07.03.2020 |
Earth Engine Python API and Folium Interactive Mapping | This notebook demonstrates how to setup the Earth Engine and provides several examples for visualizing Earth Engine processed data interactively using the folium library | Qiusheng Wu | <ul><li>api</li></ul> | | 20.01.2020 |
Tensor2Tensor | Library for deep learning models that is well-suited for neural machine translation and includes the reference implementation of the state-of-the-art Transformer model | <ul><li>Ashish Vaswani</li> <li>Samy Bengio</li> <li>Eugene Brevdo</li> <li>François Chollet</li><details><summary>others</summary><li>Aidan Gomez</li> <li>Stephan Gouws</li> <li>Llion Jones</li> <li>Łukasz Kaiser</li> <li>Nal Kalchbrenner</li> <li>Niki Parmar</li> <li>Ryan Sepassi</li> <li>Noam Shazeer</li> <li>Jakob Uszkoreit</li></ul></details> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 14.01.2020 |
Traffic counting | Making Road Traffic Counting App based on Computer Vision and OpenCV | Andrey Nikishaev | <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 10.01.2020 |
NYU-DLSP20 | This course concerns the latest techniques in deep learning and representation learning, focusing on supervised and unsupervised deep learning, embedding methods, metric learning, convolutional and recurrent nets, with applications to computer vision, natural language understanding, and speech recognition | <ul><li>Yann LeCun</li> <li>Alfredo Canziani</li></ul> | <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul> | | 30.10.2019 |
Imagededup | This package provides functionality to make use of hashing algorithms that are particularly good at finding exact duplicates as well as convolutional neural networks which are also adept at finding near duplicates | <ul><li>Tanuj Jain</li> <li>Christopher Lennan</li> <li>Dat Tran</li></ul> | <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li></ul> | | 03.10.2019 |