Home

Awesome

Hits awesome-colab-notebooks

The page might not be rendered properly. Please open README.md file directly

Awesome colab notebooks collection for ML experiments

Trending

repositoriespaperspackages
<ul><li>datachain </li> <li>IC-Light </li> <li>BiRefNet </li> <li>SAELens </li> <li>PuLID </li> <li>ARENA_3.0 </li> <li>autogen </li> <li>langgraph </li> <li>segment-anything-2 </li> <li>unsloth </li> <li>ComfyUI </li> <li>TransformerLens </li> <li>fab-torch </li> <li>llama-recipes </li> <li>rl_games </li> <li>InstantMesh </li> <li>instructor </li> <li>co-tracker </li> <li>DDColor </li> <li>ultralytics </li> <li>normalizing-flows </li> <li>open-interpreter </li> <li>pymdp </li></ul><ul><li>DifFace </li> <li>UniFormerV2 </li> <li>Panini-Net </li> <li>PyMAF-X </li> <li>GraphCast </li> <li>Gaussian Splatting </li> <li>MMOCR </li> <li>CodeTalker </li> <li>VideoReTalking </li> <li>VRT </li> <li>FILM </li> <li>SadTalker </li> <li>f-BRS </li> <li>HiDT </li> <li>Score Jacobian Chaining </li> <li>RealBasicVSR </li> <li>OWL-ViT </li> <li>LaSAFT </li> <li>Geometry-Free View Synthesis </li> <li>SAM </li> <li>Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes </li> <li>PyTorchVideo </li> <li>Omnivore </li></ul><ul><li>unsloth </li> <li>Crawl4AI </li> <li>langgraph </li> <li>llama-index </li> <li>ollama </li> <li>langchain </li> <li>catboost </li> <li>rl-games </li> <li>img2dataset </li> <li>reformer-pytorch </li> <li>xgboost </li> <li>mmpose </li> <li>sae-lens </li> <li>lightautoml </li> <li>mistral-inference </li> <li>neural-tangents </li> <li>TensorFlowTTS </li> <li>dm-reverb </li> <li>xmanager </li> <li>mmrotate </li> <li>clip-retrieval </li> <li>contextualized_topic_models </li> <li>datachain </li></ul>

Research

namedescriptionauthorslinkscolaboratoryupdate
GraphCastLearning skillful medium-range global weather forecasting<ul><li>Rémi Lam</li> <li>Alvaro Sanchez-Gonzalez</li> <li>Matthew Willson</li> <li>Peter Wirnsberger</li><details><summary>others</summary><li>Meire Fortunato</li> <li>Ferran Alet</li> <li>Suman Ravuri</li> <li>Timo Ewalds</li> <li>Zach Eaton-Rosen</li> <li>Weihua Hu</li> <li>Alexander Merose</li> <li>Stephan Hoyer</li> <li>George Holland</li> <li>Oriol Vinyals</li> <li>Jacklynn Stott</li> <li>Alexander Pritzel</li> <li>Shakir Mohamed</li> <li>Peter Battaglia</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/deepmind.svg" alt="deepmind" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.12.2024
TAPIRTracking Any Point with per-frame Initialization and temporal Refinement<ul><li>Carl Doersch</li> <li>Yi Yang</li> <li>Mel Vecerik</li> <li>Dilara Gokay</li><details><summary>others</summary><li>Ankush Gupta</li> <li>Yusuf Aytar</li> <li>Joao Carreira</li> <li>Andrew Zisserman</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post, blog post</li><li><img src="images/deepmind.svg" alt="deepmind" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.11.2024
T2M-GPTConditional generative framework based on Vector Quantised-Variational AutoEncoder and Generative Pre-trained Transformer for human motion generation from textural descriptions<ul><li>Jianrong Zhang</li> <li>Yangsong Zhang</li> <li>Xiaodong Cun</li> <li>Shaoli Huang</li><details><summary>others</summary><li>Yong Zhang</li> <li>Hongwei Zhao</li> <li>Hongtao Lu</li> <li>Xi Shen</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.11.2024
PuLIDPure and Lightning ID customization, a tuning-free ID customization method for text-to-image generation<ul><li>Zinan Guo</li> <li>Yanze Wu</li> <li>Zhuowei Chen</li> <li>Lang Chen</li> <li>Qian He</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab09.11.2024
CoTrackerArchitecture that jointly tracks multiple points throughout an entire video<ul><li>Nikita Karaev</li> <li>Ignacio Rocco</li> <li>Benjamin Graham</li> <li>Natalia Neverova</li><details><summary>others</summary><li>Andrea Vedaldi</li> <li>Christian Rupprecht</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab16.10.2024
PIFuPixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization<ul><li>Ryota Natsume</li> <li>Shunsuke Saito</li> <li>Zeng Huang</li> <li>Angjoo Kanazawa</li> <li>Hao Li</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab08.10.2024
DifFaceMethod that is capable of coping with unseen and complex degradations more gracefully without complicated loss designs<ul><li>Zongsheng Yue</li> <li>Chen Change Loy</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab05.10.2024
Segment Anything 2Foundation model towards solving promptable visual segmentation in images and videos<ul><li>Nikhila Ravi</li> <li>Valentin Gabeur</li> <li>Yuan-Ting Hu</li> <li>Ronghang Hu</li><details><summary>others</summary><li>Chaitanya Ryali</li> <li>Tengyu Ma</li> <li>Haitham Khedr</li> <li>Roman Rädle</li> <li>Chloé Rolland</li> <li>Laura Gustafson</li> <li>Eric Mintun</li> <li>Junting Pan</li> <li>[Kalyan Vasudev](lwala](https://scholar.google.co.in/citations?user=m34oaWEAAAAJ)</li> <li>Nicolas Carion</li> <li>[Chao-Yuan](u](https://chaoyuan.org/)</li> <li>Ross Girshick</li> <li>Piotr Dollár</li> <li>Christoph Feichtenhofer</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/meta.svg" alt="meta" height=20/>, <img src="images/meta.svg" alt="meta" height=20/>, <img src="images/meta.svg" alt="meta" height=20/></li><li>project</li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.10.2024
Open-UnmixA deep neural network reference implementation for music source separation, applicable for researchers, audio engineers and artists<ul><li>Fabian-Robert Stöter</li> <li>Antoine Liutkus</li></ul> <ul><li>data</li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.09.2024
Deep Painterly HarmonizationAlgorithm produces significantly better results than photo compositing or global stylization techniques and that it enables creative painterly edits that would be otherwise difficult to achieve<ul><li>Fujun Luan</li> <li>Sylvain Paris</li> <li>Eli Shechtman</li> <li>Kavita Bala</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab23.09.2024
audio2photorealFramework for generating full-bodied photorealistic avatars that gesture according to the conversational dynamics of a dyadic interaction<ul><li>Evonne Ng</li> <li>Javier Romero</li> <li>Timur Bagautdinov</li> <li>Shaojie Bai</li><details><summary>others</summary><li>Trevor Darrell</li> <li>Angjoo Kanazawa</li> <li>Alexander Richard</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.09.2024
Fast Segment AnythingCNN Segment Anything Model trained using only 2% of the SA-1B dataset published by SAM authors<ul><li>Xu Zhao</li> <li>Wenchao Ding</li> <li>Yongqi An</li> <li>Yinglong Du</li><details><summary>others</summary><li>Tao Yu</li> <li>Min Li</li> <li>Ming Tang</li> <li>Jinqiao Wang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.09.2024
NeuralangeloFramework for high-fidelity 3D surface reconstruction from RGB video captures<ul><li>Zhaoshuo Li</li> <li>Thomas Müller</li> <li>Alex Evans</li> <li>Russell Taylor</li><details><summary>others</summary><li>Mathias Unberath</li> <li>Ming-Yu Liu</li> <li>Chen-Hsuan Lin</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab02.09.2024
BiRefNetBilateral reference framework for high-resolution dichotomous image segmentation<ul><li>Peng Zheng</li> <li>Dehong Gao</li> <li>Deng-Ping Fan</li> <li>Li Liu</li><details><summary>others</summary><li>Jorma Laaksonen</li> <li>Wanli Ouyang</li> <li>Nicu Sebe</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab23.08.2024
SPINLearning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop<ul><li>Nikos Kolotouros</li> <li>Georgios Pavlakos</li> <li>Michael Black</li> <li>Kostas Daniilidis</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab21.08.2024
YOLOv10Aim to further advance the performance-efficiency boundary of YOLOs from both the post-processing and model architecture<ul><li>Ao Wang</li> <li>Hui Chen</li> <li>Kai Chen</li> <li>Zijia Lin</li><details><summary>others</summary><li>Jungong Han</li> <li>Guiguang Ding</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab20.08.2024
SpecVQGANTaming the visually guided sound generation by shrinking a training dataset to a set of representative vectors<ul><li>Vladimir Iashin</li> <li>Esa Rahtu</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li>[<img src="images/wiki.svg" alt="wiki" height=20/>](https://en.wikipedia.org/wiki/Foley_(filmmaking), <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab12.07.2024
LivePortraitVideo-driven portrait animation framework with a focus on better generalization, controllability, and efficiency for practical usage<ul><li>Jianzhu Guo</li> <li>Dingyun Zhang</li> <li>Xiaoqiang Liu</li> <li>Zhizhou Zhong</li><details><summary>others</summary><li>Yuan Zhang</li> <li>Pengfei Wan</li> <li>Di Zhang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.07.2024
Wav2LipA Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild<ul><li>Prajwal Renukanand</li> <li>Rudrabha Mukhopadhyay</li> <li>Vinay Namboodiri</li> <li>C. V. Jawahar</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li>demo</li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab27.06.2024
DeepLabCutEfficient method for markerless pose estimation based on transfer learning with deep neural networks that achieves excellent results with minimal training data<ul><li>Alexander Mathis</li> <li>Pranav Mamidanna</li> <li>Kevin Cury</li> <li>Taiga Abe</li><details><summary>others</summary><li>Venkatesh Murthy</li> <li>Mackenzie Mathis</li> <li>Matthias Bethge</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li>forum</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab05.06.2024
PoolFormerMetaFormer Is Actually What You Need for Vision<ul><li>Weihao Yu</li> <li>Mi Luo</li> <li>Pan Zhou</li> <li>Chenyang Si</li><details><summary>others</summary><li>Yichen Zhou</li> <li>Xinchao Wang</li> <li>Jiashi Feng</li> <li>Shuicheng Yan</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab01.06.2024
StoryDiffusionWay of self-attention calculation, termed Consistent Self-Attention, that significantly boosts the consistency between the generated images and augments prevalent pretrained diffusion-based text-to-image models in a zero-shot manner<ul><li>Yupeng Zhou</li> <li>Daquan Zhou</li> <li>Ming-Ming Cheng</li> <li>Jiashi Feng</li> <li>Qibin Hou</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.05.2024
FILMA frame interpolation algorithm that synthesizes multiple intermediate frames from two input images with large in-between motion<ul><li>Fitsum Reda</li> <li>Janne Kontkanen</li> <li>Eric Tabellion</li> <li>Deqing Sun</li><details><summary>others</summary><li>Caroline Pantofaru</li> <li>Brian Curless</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data, data</li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab03.05.2024
VoiceCrafttoken infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech on audiobooks, internet videos, and podcasts<ul><li>Puyuan Peng</li> <li>Po-Yao Huang</li> <li>Shang-Wen Li</li> <li>Abdelrahman Mohamed</li> <li>David Harwath</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab21.04.2024
ZeSTMethod for zero-shot material transfer to an object in the input image given a material exemplar image<ul><li>Ta-Ying Cheng</li> <li>Prafull Sharma</li> <li>Andrew Markham</li> <li>Niki Trigoni</li> <li>Varun Jampani</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab16.04.2024
InstantMeshFeed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability<ul><li>Jiale Xu</li> <li>Weihao Cheng</li> <li>Yiming Gao</li> <li>Xintao Wang</li><details><summary>others</summary><li>Shenghua Gao</li> <li>Ying Shan</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab16.04.2024
AlphaFoldHighly accurate protein structure prediction<ul><li>John Jumper</li> <li>Richard Evans</li> <li>Alexander Pritzel</li> <li>Tim Green</li><details><summary>others</summary><li>Michael Figurnov</li> <li>Olaf Ronneberger</li> <li>Kathryn Tunyasuvunakool</li> <li>Russ Bates</li> <li>Augustin Žídek</li> <li>Anna Potapenko</li> <li>Alex Bridgland</li> <li>Clemens Meyer</li> <li>Simon Kohl</li> <li>Andrew Ballard</li> <li>Bernardino Romera-Paredes</li> <li>Stanislav Nikolov</li> <li>Rishub Jain</li></ul></details> <ul><li>blog post, blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>paper</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.04.2024
WürstchenArchitecture for text-to-image synthesis that combines competitive performance with unprecedented cost-effectiveness for large-scale text-to-image diffusion models<ul><li>Pablo Pernias</li> <li>Dominic Rampas</li> <li>Mats Richter</li> <li>Christopher Pal</li> <li>Marc Aubreville</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.04.2024
AudioSepFoundation model for open-domain audio source separation with natural language queries<ul><li>Xubo Liu</li> <li>Qiuqiang Kong</li> <li>Yan Zhao</li> <li>Haohe Liu</li><details><summary>others</summary><li>Yi Yuan</li> <li>Yuzhuo Liu</li> <li>Rui Xia</li> <li>Yuxuan Wang</li> <li>Mark Plumbley</li> <li>Wenwu Wang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li></ul>Open In Colab15.03.2024
AQLMExtreme Compression of Large Language Models via Additive Quantization<ul><li>Vage Egiazarian</li> <li>Andrei Panferov</li> <li>Denis Kuznedelev</li> <li>Elias Frantar</li><details><summary>others</summary><li>Artem Babenko</li> <li>Dan Alistarh</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab08.03.2024
YOLOv9Learning What You Want to Learn Using Programmable Gradient Information<ul><li>Chien-Yao Wang</li> <li>I-Hau Yeh</li> <li>Hong-Yuan Mark Liao</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab05.03.2024
Multi-LoRA CompositionLoRA Switch and LoRA Composite, approaches that aim to surpass traditional techniques in terms of accuracy and image quality, especially in complex compositions<ul><li>Ming Zhong</li> <li>Yelong Shen</li> <li>Shuohang Wang</li> <li>Yadong Lu</li><details><summary>others</summary><li>Yizhu Jiao</li> <li>Siru Ouyang</li> <li>Donghan Yu</li> <li>Jiawei Han</li> <li>Weizhu Chen</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li></ul>Open In Colab03.03.2024
AMARETTOMultiscale and multimodal inference of regulatory networks to identify cell circuits and their drivers shared and distinct within and across biological systems of human disease<ul><li>Nathalie Pochet</li> <li>Olivier Gevaert</li> <li>Mohsen Nabian</li> <li>Jayendra Shinde</li><details><summary>others</summary><li>Celine Everaert</li> <li>Thorin Tabor</li></ul></details> <ul><li>bioconductor</li><li>project</li></ul>Open In Colab28.02.2024
LIDATool for generating grammar-agnostic visualizations and infographicsVictor Dibia <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.02.2024
ViTVision Transformer and MLP-Mixer Architectures<ul><li>Alexey Dosovitskiy</li> <li>Lucas Beyer</li> <li>Alexander Kolesnikov</li> <li>Dirk Weissenborn</li><details><summary>others</summary><li>Xiaohua Zhai</li> <li>Thomas Unterthiner</li> <li>Mostafa Dehghani</li> <li>Matthias Minderer</li> <li>Georg Heigold</li> <li>Sylvain Gelly</li> <li>Jakob Uszkoreit</li> <li>Neil Houlsby</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.02.2024
3D Ken BurnsA reference implementation of 3D Ken Burns Effect from a Single Image using PyTorch - given a single input image, it animates this still image with a virtual camera scan and zoom subject to motion parallaxManuel Romero <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.01.2024
VALL-E XCross-lingual neural codec language model for cross-lingual speech synthesis<ul><li>Ziqiang Zhang</li> <li>Long Zhou</li> <li>Chengyi Wang</li> <li>Sanyuan Chen</li><details><summary>others</summary><li>Yu Wu</li> <li>Shujie Liu</li> <li>Zhuo Chen</li> <li>Yanqing Liu</li> <li>Huaming Wang</li> <li>Jinyu Li</li> <li>Lei He</li> <li>Sheng Zhao</li> <li>Furu Wei</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.01.2024
PhotoMakerEfficient personalized text-to-image generation method, which mainly encodes an arbitrary number of input ID images into a stack ID embedding for preserving ID information<ul><li>Zhen Li</li> <li>Mingdeng Cao</li> <li>Xintao Wang</li> <li>Zhongang Qi</li><details><summary>others</summary><li>Ming-Ming Cheng</li> <li>Ying Shan</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.01.2024
DDColorEnd-to-end method with dual decoders for image colorization<ul><li>Xiaoyang Kang</li> <li>Tao Yang</li> <li>Wenqi Ouyang</li> <li>Peiran Ren</li><details><summary>others</summary><li>Lingzhi Li</li> <li>Xuansong Xie</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab15.01.2024
PASDPixel-aware stable diffusion network to achieve robust Real-ISR as well as personalized stylization<ul><li>Tao Yang</li> <li>Peiran Ren</li> <li>Xuansong Xie</li> <li>Lei Zhang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab12.01.2024
HandRefinerRefining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting<ul><li>Wenquan Lu</li> <li>Yufei Xu</li> <li>Jing Zhang</li> <li>Chaoyue Wang</li> <li>Dacheng Tao</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab08.01.2024
ESMEvolutionary Scale Modeling: Pretrained language models for proteins<ul><li>Zeming Lin</li> <li>Roshan Rao</li> <li>Brian Hie</li> <li>Zhongkai Zhu</li><details><summary>others</summary><li>Allan dos Santos Costa</li> <li>Maryam Fazel-Zarandi</li> <li>Tom Sercu</li> <li>Salvatore Candido</li> <li>Alexander Rives</li> <li>Joshua Meier</li> <li>Robert Verkuil</li> <li>Jason Liu</li> <li>Chloe Hsu</li> <li>Adam Lerer</li></ul></details> <ul><li>ESM Atlas</li><li>FSDP</li><li>ICML</li><li>data</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>paper, paper, paper, paper</li><li>pubmed</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab28.12.2023
LLaVALarge Language and Vision Assistant, an end-to-end trained large multimodal model that connects a vision encoder and LLM for general-purpose visual and language understanding<ul><li>Haotian Liu</li> <li>Chunyuan Li</li> <li>Qingyang Wu</li> <li>Yong Jae Lee</li> <li>Yuheng Li</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab22.12.2023
Background Matting V2Real-time, high-resolution background replacement technique which operates at 30fps in 4K resolution, and 60fps for HD on a modern GPU<ul><li>Shanchuan Lin</li> <li>Andrey Ryabtsev</li> <li>Soumyadip Sengupta</li> <li>Brian Curless</li><details><summary>others</summary><li>Steve Seitz</li> <li>Ira Kemelmacher-Shlizerman</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab22.12.2023
Gaussian SplattingState-of-the-art visual quality while maintaining competitive training times and importantly allow high-quality real-time (≥ 100 fps) novel-view synthesis at 1080p resolution<ul><li>Bernhard Kerbl</li> <li>Georgios Kopanas</li> <li>Thomas Leimkühler</li> <li>George Drettakis</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.12.2023
SMPLer-XScaling up EHPS towards the first generalist foundation model, with up to ViT-Huge as the backbone and training with up to 4.5M instances from diverse data sources<ul><li>Zhongang Cai</li> <li>Wanqi Yin</li> <li>Ailing Zeng</li> <li>Chen Wei</li><details><summary>others</summary><li>Qingping Sun</li> <li>Yanjun Wang</li> <li>Hui En Pang</li> <li>Haiyi Mei</li> <li>Mingyuan Zhang</li> <li>Lei Zhang</li> <li>Chen Change Loy</li> <li>Lei Yang</li> <li>Ziwei Liu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.12.2023
DeepCacheTraining-free paradigm that accelerates diffusion models from the perspective of model architecture<ul><li>Xinyin Ma</li> <li>Gongfan Fang</li> <li>Xinchao Wang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab18.12.2023
MagicAnimateDiffusion-based framework that aims at enhancing temporal consistency, preserving reference image faithfully, and improving animation fidelity<ul><li>Zhongcong Xu</li> <li>Jianfeng Zhang</li> <li>Jun Hao Liew</li> <li>Hanshu Yan</li><details><summary>others</summary><li>Jiawei Liu</li> <li>Chenxu Zhang</li> <li>Jiashi Feng</li> <li>Mike Shou</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.12.2023
DiffBIRTowards Blind Image Restoration with Generative Diffusion Prior<ul><li>Xinqi Lin</li> <li>Jingwen He</li> <li>Ziyan Chen</li> <li>Zhaoyang Lyu</li><details><summary>others</summary><li>Ben Fei</li> <li>Bo Dai</li> <li>Wanli Ouyang</li> <li>Yu Qiao</li> <li>Chao Dong</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.12.2023
AudioLDMText-to-audio system that is built on a latent space to learn the continuous audio representations from contrastive language-audio pretraining latents<ul><li>Haohe Liu</li> <li>Zehua Chen</li> <li>Yi Yuan</li> <li>Xinhao Mei</li><details><summary>others</summary><li>Xubo Liu</li> <li>Danilo Mandic</li> <li>Wenwu Wang</li> <li>Mark Plumbley</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab02.12.2023
TabPFNNeural network that learned to do tabular data prediction<ul><li>Noah Hollmann</li> <li>Samuel Müller</li> <li>Katharina Eggensperger</li> <li>Frank Hutter</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab29.11.2023
Concept SlidersPlug-and-play low rank adaptors applied on top of pretrained models<ul><li>Rohit Gandikota</li> <li>Joanna Materzyńska</li> <li>Tingrui Zhou</li> <li>Antonio Torralba</li> <li>David Bau</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab26.11.2023
Qwen-VLSet of large-scale vision-language models designed to perceive and understand both text and images<ul><li>Jinze Bai</li> <li>Shuai Bai</li> <li>Shusheng Yang</li> <li>Shijie Wang</li><details><summary>others</summary><li>Sinan Tan</li> <li>Peng Wang</li> <li>Junyang Lin</li> <li>Chang Zhou</li> <li>Jingren Zhou</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.11.2023
AnimeGANv3Double-tail generative adversarial network for fast photo animation<ul><li>Gang Liu</li> <li>Xin Chen</li></ul> <ul><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab23.11.2023
IthacaFirst Deep Neural Network for the textual restoration, geographical and chronological attribution of ancient Greek inscriptions<ul><li>Yannis Assael</li> <li>Thea Sommerschield</li> <li>Brendan Shillingford</li> <li>Mahyar Bordbar</li><details><summary>others</summary><li>John Pavlopoulos</li> <li>Marita Chatzipanagiotou</li> <li>Ion Androutsopoulos</li> <li>Jonathan Prag</li> <li>Nando de Freitas</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab21.11.2023
PixArt-ΣWeak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation<ul><li>Junsong Chen</li> <li>Chongjian Ge</li> <li>Enze Xie</li> <li>Yue Wu</li><details><summary>others</summary><li>Lewei Yao</li> <li>Xiaozhe Ren</li> <li>Zhongdao Wang</li> <li>Ping Luo</li> <li>Huchuan Lu</li> <li>Zhenguo Li</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab07.11.2023
Zero123++Image-conditioned diffusion model for generating 3D-consistent multi-view images from a single input view<ul><li>Ruoxi Shi</li> <li>Hansheng Chen</li> <li>Zhuoyang Zhang</li> <li>Minghua Liu</li><details><summary>others</summary><li>Chao Xu</li> <li>Xinyue Wei</li> <li>Linghao Chen</li> <li>Chong Zeng</li> <li>Hao Su</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.10.2023
UniFormerV2Unified Transformer for Efficient Spatiotemporal Representation Learning<ul><li>Kunchang Li</li> <li>Yali Wang</li> <li>Yinan He</li> <li>Yizhuo Li</li><details><summary>others</summary><li>Yi Wang</li> <li>Limin Wang</li> <li>Yu Qiao</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab20.10.2023
Show-1Hybrid model, dubbed as Show-1, which marries pixel-based and latent-based VDMs for text-to-video generation<ul><li>David Junhao Zhang</li> <li>Jay Zhangjie Wu</li> <li>Jiawei Liu</li> <li>Rui Zhao</li><details><summary>others</summary><li>Lingmin Ran</li> <li>Yuchao Gu</li> <li>Difei Gao</li> <li>Mike Zheng Shou</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li></ul>Open In Colab15.10.2023
DA-CLIPDegradation-aware vision-language model to better transfer pretrained vision-language models to low-level vision tasks as a universal framework for image restoration<ul><li>Ziwei Luo</li> <li>Fredrik Gustafsson</li> <li>Zheng Zhao</li> <li>Jens Sjölund</li> <li>Thomas Schön</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li></ul>Open In Colab11.10.2023
SadTalkerGenerates 3D motion coefficients of the 3DMM from audio and implicitly modulates a novel 3D-aware face render for talking head generation<ul><li>Wenxuan Zhang</li> <li>Xiaodong Cun</li> <li>Xuan Wang</li> <li>Yong Zhang</li><details><summary>others</summary><li>Xi Shen</li> <li>Yu Guo</li> <li>Ying Shan</li> <li>Fei Wang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.10.2023
MusikaMusic generation system that can be trained on hundreds of hours of music using a single consumer GPU, and that allows for much faster than real-time generation of music of arbitrary length on a consumer CPU<ul><li>Marco Pasini</li> <li>Jan Schlüter</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab09.10.2023
YOLOv6Single-stage object detection framework dedicated to industrial applications<ul><li>Kaiheng Weng</li> <li>Meng Cheng</li> <li>Yiduo Li</li> <li>Xiangxiang Chu</li> <li>Xiaolin Wei</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab08.10.2023
DreamGaussianAlgorithm to convert 3D Gaussians into textured meshes and apply a fine-tuning stage to refine the details<ul><li>Jiaxiang Tang</li> <li>Jiawei Ren</li> <li>Hang Zhou</li> <li>Ziwei Liu</li> <li>Gang Zeng</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab04.10.2023
ICONGiven a set of images, method estimates a detailed 3D surface from each image and then combines these into an animatable avatar<ul><li>Yuliang Xiu</li> <li>Jinlong Yang</li> <li>Dimitrios Tzionas</li> <li>Michael Black</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab31.08.2023
DINOv2Produce high-performance visual features that can be directly employed with classifiers as simple as linear layers on a variety of computer vision tasks; these visual features are robust and perform well across domains without any requirement for fine-tuning<ul><li>Maxime Oquab</li> <li>Timothée Darcet</li> <li>Théo Moutakanni</li> <li>Huy Vo</li><details><summary>others</summary><li>Marc Szafraniec</li> <li>Vasil Khalidov</li> <li>Pierre Fernandez</li> <li>Daniel Haziza</li> <li>Francisco Massa</li> <li>Alaaeldin El-Nouby</li> <li>Mahmoud Assran</li> <li>Nicolas Ballas</li> <li>Wojciech Galuba</li> <li>Russell Howes</li> <li>Po-Yao Huang</li> <li>Shang-Wen Li</li> <li>Ishan Misra</li> <li>Michael Rabbat</li> <li>Vasu Sharma</li> <li>Gabriel Synnaeve</li> <li>Hu Xu</li> <li>Hervé Jegou</li> <li>Julien Mairal</li> <li>Patrick Labatut</li> <li>Armand Joulin</li> <li>Piotr Bojanowski</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>demo</li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab31.08.2023
OWL-ViTSimple Open-Vocabulary Object Detection with Vision Transformers<ul><li>Matthias Minderer</li> <li>Alexey Gritsenko</li> <li>Austin Stone</li> <li>Maxim Neumann</li><details><summary>others</summary><li>Dirk Weissenborn</li> <li>Alexey Dosovitskiy</li> <li>Aravindh Mahendran</li> <li>Anurag Arnab</li> <li>Mostafa Dehghani</li> <li>Zhuoran Shen</li> <li>Xiao Wang</li> <li>Xiaohua Zhai</li> <li>Thomas Kipf</li> <li>Neil Houlsby</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab21.08.2023
StyleGAN3Alias-Free Generative Adversarial Networks<ul><li>Tero Karras</li> <li>Miika Aittala</li> <li>Samuli Laine</li> <li>Erik Härkönen</li><details><summary>others</summary><li>Janne Hellsten</li> <li>Jaakko Lehtinen</li> <li>Timo Aila</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>project</li></ul>Open In Colab13.08.2023
FateZeroZero-shot text-based editing method on real-world videos without per-prompt training or use-specific mask<ul><li>Chenyang Qi</li> <li>Xiaodong Cun</li> <li>Yong Zhang</li> <li>Chenyang Lei</li><details><summary>others</summary><li>Xintao Wang</li> <li>Ying Shan</li> <li>Qifeng Chen</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li>video</li></ul>Open In Colab13.08.2023
Big GANLarge Scale GAN Training for High Fidelity Natural Image Synthesis<ul><li>Andrew Brock</li> <li>Jeff Donahue</li> <li>Karen Simonyan</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab03.08.2023
LaMaResolution-robust Large Mask Inpainting with Fourier Convolutions<ul><li>Roman Suvorov</li> <li>Elizaveta Logacheva</li> <li>Anton Mashikhin</li> <li>Anastasia Remizova</li><details><summary>others</summary><li>Arsenii Ashukha</li> <li>Aleksei Silvestrov</li> <li>Naejin Kong</li> <li>Harshith Goka</li> <li>Kiwoong Park</li> <li>Victor Lempitsky</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab02.08.2023
MakeItTalkA method that generates expressive talking-head videos from a single facial image with audio as the only input<ul><li>Yang Zhou</li> <li>Xintong Han</li> <li>Eli Shechtman</li> <li>Jose Echevarria</li><details><summary>others</summary><li>Evangelos Kalogerakis</li> <li>Dingzeyu Li</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab27.07.2023
HiDTA generative image-to-image model and a new upsampling scheme that allows to apply image translation at high resolution<ul><li>Denis Korzhenkov</li> <li>Gleb Sterkin</li> <li>Sergey Nikolenko</li> <li>Victor Lempitsky</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.07.2023
CutLERSimple approach for training unsupervised object detection and segmentation models<ul><li>Xudong Wang</li> <li>Rohit Girdhar</li> <li>Stella Yu</li> <li>Ishan Misra</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>project</li></ul>Open In Colab24.07.2023
Recognize Anything & Tag2TextVision language pre-training framework, which introduces image tagging into vision-language models to guide the learning of visual-linguistic features<ul><li>Xinyu Huang</li> <li>Youcai Zhang</li> <li>Jinyu Ma</li> <li>Zhaoyang Li</li><details><summary>others</summary><li>Yanchun Xie</li> <li>Yuzhuo Qin</li> <li>Tong Luo</li> <li>Yaqian Li</li> <li>Yandong Guo</li> <li>Yandong Guo</li> <li>Lei Zhang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project, project</li></ul>Open In Colab09.07.2023
Thin-Plate Spline Motion ModelEnd-to-end unsupervised motion transfer framework<ul><li>Jian Zhao</li> <li>Hui Zhang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>supp</li></ul>Open In Colab07.07.2023
MobileSAMTowards Lightweight SAM for Mobile Applications<ul><li>Chaoning Zhang</li> <li>Dongshen Han</li> <li>Yu Qiao</li> <li>Jung Uk Kim</li><details><summary>others</summary><li>Sung-Ho Bae</li> <li>Seungkyu Lee</li> <li>Choong Seon Hong</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.06.2023
Grounding DINOMarrying DINO with Grounded Pre-Training for Open-Set Object Detection<ul><li>Shilong Liu</li> <li>Zhaoyang Zeng</li> <li>Tianhe Ren</li> <li>Feng Li</li><details><summary>others</summary><li>Hao Zhang</li> <li>Jie Yang</li> <li>Chunyuan Li</li> <li>Jianwei Yang</li> <li>Hang Su</li> <li>Jun Zhu</li> <li>Lei Zhang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab28.06.2023
T5XModular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models at many scales<ul><li>Adam Roberts</li> <li>Hyung Won Chung</li> <li>Anselm Levskaya</li> <li>Gaurav Mishra</li><details><summary>others</summary><li>James Bradbury</li> <li>Daniel Andor</li> <li>Sharan Narang</li> <li>Brian Lester</li> <li>Colin Gaffney</li> <li>Afroz Mohiuddin</li> <li>Curtis Hawthorne</li> <li>Aitor Lewkowycz</li> <li>Alex Salcianu</li> <li>Marc van Zee</li> <li>Jacob Austin</li> <li>Sebastian Goodman</li> <li>Livio Baldini Soares</li> <li>Haitang Hu</li> <li>Sasha Tsvyashchenko</li> <li>Aakanksha Chowdhery</li> <li>Jasmijn Bastings</li> <li>Jannis Bulian</li> <li>Xavier Garcia</li> <li>Jianmo Ni</li> <li>Kathleen Kenealy</li> <li>Jonathan Clark</li> <li>Dan Garrette</li> <li>James Lee-Thorp</li> <li>Colin Raffel</li> <li>Noam Shazeer</li> <li>Marvin Ritter</li> <li>Maarten Bosma</li> <li>Alexandre Passos</li> <li>Jeremy Maitin-Shepard</li> <li>Noah Fiedel</li> <li>Brennan Saeta</li> <li>Ryan Sepassi</li> <li>Alexander Spiridonov</li> <li>Joshua Newlan</li> <li>Andrea Gesmundo</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab27.06.2023
CodeTalkerCast speech-driven facial animation as a code query task in a finite proxy space of the learned codebook, which effectively promotes the vividness of the generated motions by reducing the cross-modal mapping uncertainty<ul><li>Jinbo Xing</li> <li>Menghan Xia</li> <li>Yuechen Zhang</li> <li>Xiaodong Cun</li><details><summary>others</summary><li>Jue Wang</li> <li>Tien-Tsin Wong</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab16.06.2023
First Order Motion Model for Image AnimationTransferring facial movements from video to imageAliaksandr Siarohin <ul><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.06.2023
Parallel WaveGANState-of-the-art non-autoregressive models to build your own great vocoderTomoki Hayashi <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab01.06.2023
ECONdesigned for "Human digitization from a color image", which combines the best properties of implicit and explicit representations, to infer high-fidelity 3D clothed humans from in-the-wild images, even with loose clothing or in challenging poses<ul><li>Yuliang Xiu</li> <li>Jinlong Yang</li> <li>Xu Cao</li> <li>Dimitrios Tzionas</li> <li>Michael Black</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab31.05.2023
MMSThe Massively Multilingual Speech project expands speech technology from about 100 languages to over 1000 by building a single multilingual speech recognition model supporting over 1100 languages, language identification models able to identify over 4000 languages, pretrained models supporting over 1400 languages, and text-to-speech models for over 1100 languages<ul><li>Vineel Pratap</li> <li>Andros Tjandra</li> <li>Bowen Shi</li> <li>Paden Tomasello</li><details><summary>others</summary><li>Arun Babu</li> <li>Sayani Kundu</li> <li>Ali Elkahky</li> <li>Zhaoheng Ni</li> <li>Apoorv Vyas</li> <li>Maryam Fazel-Zarandi</li> <li>Alexei Baevski</li> <li>Yossi Adi</li> <li>Xiaohui Zhang</li> <li>Wei-Ning Hsu</li> <li>Alexis Conneau</li> <li>Michael Auli</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/meta.svg" alt="meta" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.05.2023
FABFlow AIS Bootstrap uses AIS to generate samples in regions where the flow is a poor approximation of the target, facilitating the discovery of new modes<ul><li>Laurence Midgley</li> <li>Vincent Stimper</li> <li>Gregor N. C. Simm</li> <li>Bernhard Schölkopf</li> <li>José Miguel Hernández-Lobato</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab29.04.2023
CodeFormerTransformer-based prediction network to model global composition and context of the low-quality faces for code prediction, enabling the discovery of natural faces that closely approximate the target faces even when the inputs are severely degraded<ul><li>Shangchen Zhou</li> <li>Kelvin Chan</li> <li>Chongyi Li</li> <li>Chen Change Loy</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab21.04.2023
Text2Video-ZeroText-to-Image Diffusion Models are Zero-Shot Video Generators<ul><li>Levon Khachatryan</li> <li>Andranik Movsisyan</li> <li>Vahram Tadevosyan</li> <li>Roberto Henschel</li><details><summary>others</summary><li>Zhangyang Wang</li> <li>Shant Navasardyan</li> <li>Humphrey Shi</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li>video</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.04.2023
Segment AnythingThe Segment Anything Model produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image<ul><li>Alexander Kirillov</li> <li>Eric Mintun</li> <li>Nikhila Ravi</li> <li>Hanzi Mao</li><details><summary>others</summary><li>Chloé Rolland</li> <li>Laura Gustafson</li> <li>Tete Xiao</li> <li>Spencer Whitehead</li> <li>Alex Berg</li> <li>Wan-Yen Lo</li> <li>Piotr Dollár</li> <li>Ross Girshick</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/meta.svg" alt="meta" height=20/>, <img src="images/meta.svg" alt="meta" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.04.2023
FollowYourPoseTwo-stage training scheme that can utilize image pose pair and pose-free video datasets and the pre-trained text-to-image model to obtain the pose-controllable character videos<ul><li>Yue Ma</li> <li>Yingqing He</li> <li>Xiaodong Cun</li> <li>Xintao Wang</li><details><summary>others</summary><li>Siran Chen</li> <li>Ying Shan</li> <li>Xiu Li</li> <li>Qifeng Chen</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>video</li></ul>Open In Colab07.04.2023
EVA3DHigh-quality unconditional 3D human generative model that only requires 2D image collections for training<ul><li>Fangzhou Hong</li> <li>Zhaoxi Chen</li> <li>Yushi Lan</li> <li>Liang Pan</li> <li>Ziwei Liu</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.04.2023
Stable DreamfusionUsing a pretrained 2D text-to-image diffusion model to perform text-to-3D synthesis<ul><li>Jiaxiang Tang</li> <li>Ben Poole</li> <li>Ajay Jain</li> <li>Jon Barron</li> <li>Ben Mildenhall</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.04.2023
PIFuHDMulti-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization<ul><li>Shunsuke Saito</li> <li>Tomas Simon</li> <li>Jason Saragih</li> <li>Hanbyul Joo</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.03.2023
VideoReTalkingSystem to edit the faces of a real-world talking head video according to input audio, producing a high-quality and lip-syncing output video even with a different emotion<ul><li>Kun Cheng</li> <li>Xiaodong Cun</li> <li>Yong Zhang</li> <li>Menghan Xia</li><details><summary>others</summary><li>Fei Yin</li> <li>Mingrui Zhu</li> <li>Xuan Wang</li> <li>Jue Wang</li> <li>Nannan Wang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.03.2023
Visual ChatGPTConnects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting<ul><li>Chenfei Wu</li> <li>Shengming Yin</li> <li>Weizhen Qi</li> <li>Xiaodong Wang</li><details><summary>others</summary><li>Zecheng Tang</li> <li>Nan Duan</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.03.2023
Tune-A-VideoOne-Shot Tuning of Image Diffusion Models for Text-to-Video Generation<ul><li>Jay Zhangjie Wu</li> <li>Yixiao Ge</li> <li>Xintao Wang</li> <li>Stan Weixian Lei</li><details><summary>others</summary><li>Yuchao Gu</li> <li>Yufei Shi</li> <li>Wynne Hsu</li> <li>Ying Shan</li> <li>Xiaohu Qie</li> <li>Mike Zheng Shou</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab23.02.2023
GPENGAN Prior Embedded Network for Blind Face Restoration in the Wild<ul><li>Tao Yang</li> <li>Peiran Ren</li> <li>Xuansong Xie</li> <li>Lei Zhang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab15.02.2023
PyMAF-XКegression-based approach to recovering parametric full-body models from monocular images<ul><li>Hongwen Zhang</li> <li>Yating Tian</li> <li>Yuxiang Zhang</li> <li>Mengcheng Li</li><details><summary>others</summary><li>Liang An</li> <li>Zhenan Sun</li> <li>Yebin Liu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab14.02.2023
Disco DiffusionA frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations<ul><li>Max Ingham</li> <li>Adam Letts</li> <li>Daniel Russell</li> <li>Chigozie Nri</li></ul> <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.02.2023
GrooVAESome applications of machine learning for generating and manipulating beats and drum performances<ul><li>Jon Gillick</li> <li>Adam Roberts</li> <li>Jesse Engel</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data</li><li>web app</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab02.02.2023
Multitrack MusicVAEThe models in this notebook are capable of encoding and decoding single measures of up to 8 tracks, optionally conditioned on an underlying chord<ul><li>Ian Simon</li> <li>Adam Roberts</li> <li>Colin Raffel</li> <li>Jesse Engel</li><details><summary>others</summary><li>Curtis Hawthorne</li> <li>Douglas Eck</li></ul></details><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li></ul>Open In Colab02.02.2023
MusicVAEA Hierarchical Latent Vector Model for Learning Long-Term Structure in Music<ul><li>Adam Roberts</li> <li>Jesse Engel</li> <li>Colin Raffel</li> <li>Curtis Hawthorne</li> <li>Douglas Eck</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab02.02.2023
Learning to PaintLearning to Paint With Model-based Deep Reinforcement LearningManuel Romero <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.02.2023
Instant-NGPInstant Neural Graphics Primitives with a Multiresolution Hash Encoding<ul><li>Thomas Müller</li> <li>Alex Evans</li> <li>Christoph Schied</li> <li>Alexander Keller</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li>tutorial</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.01.2023
Fourier Feature NetworksFourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains<ul><li>Matthew Tancik</li> <li>Pratul Srinivasan</li> <li>Ben Mildenhall</li> <li>Sara Fridovich-Keil</li><details><summary>others</summary><li>Nithin Raghavan</li> <li>Utkarsh Singhal</li> <li>Ravi Ramamoorthi</li> <li>Jon Barron</li> <li>Ren Ng</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.01.2023
AlphaPoseWhole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time<ul><li>Hao-Shu Fang</li> <li>Jiefeng Li</li> <li>Hongyang Tang</li> <li>Chao Xu</li><details><summary>others</summary><li>Haoyi Zhu</li> <li>Yuliang Xiu</li> <li>Yong-Lu Li</li> <li>Cewu Lu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab07.01.2023
HybrIKHybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation<ul><li>Jiefeng Li</li> <li>Chao Xu</li> <li>Zhicun Chen</li> <li>Siyuan Bian</li><details><summary>others</summary><li>Lixin Yang</li> <li>Cewu Lu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li>supp</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.01.2023
Score Jacobian ChainingApply chain rule on the learned gradients, and back-propagate the score of a diffusion model through the Jacobian of a differentiable renderer, which we instantiate to be a voxel radiance field<ul><li>Haochen Wang</li> <li>Xiaodan Du</li> <li>Jiahao Li</li> <li>Raymond Yeh</li> <li>Greg Shakhnarovich</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab05.12.2022
DemucsHybrid Spectrogram and Waveform Source SeparationAlexandre Défossez <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab21.11.2022
StyleCLIPText-Driven Manipulation of StyleGAN Imager<ul><li>Or Patashnik</li> <li>Zongze Wu</li> <li>Eli Shechtman</li> <li>Daniel Cohen-Or</li> <li>Dani Lischinski</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.10.2022
MotionDiffuseThe first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods<ul><li>Mingyuan Zhang</li> <li>Zhongang Cai</li> <li>Liang Pan</li> <li>Fangzhou Hong</li><details><summary>others</summary><li>Xinying Guo</li> <li>Lei Yang</li> <li>Ziwei Liu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.10.2022
VToonifyLeverages the mid- and high-resolution layers of StyleGAN to render high-quality artistic portraits based on the multi-scale content features extracted by an encoder to better preserve the frame details<ul><li>Shuai Yang</li> <li>Liming Jiang</li> <li>Ziwei Liu</li> <li>Chen Change Loy</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab07.10.2022
PyMAFPyramidal Mesh Alignment Feedback loop in regression network for well-aligned body mesh recovery and extend it for the recovery of expressive full-body models<ul><li>Hongwen Zhang</li> <li>Yating Tian</li> <li>Yuxiang Zhang</li> <li>Mengcheng Li</li><details><summary>others</summary><li>Liang An</li> <li>Zhenan Sun</li> <li>Yebin Liu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.10.2022
AlphaTensorDiscovering faster matrix multiplication algorithms with reinforcement learning<ul><li>Alhussein Fawzi</li> <li>Matej Balog</li> <li>Aja Huang</li> <li>Thomas Hubert</li><details><summary>others</summary><li>Bernardino Romera-Paredes</li> <li>Mohammadamin Barekatain</li> <li>Alexander Novikov</li> <li>Francisco Ruiz</li> <li>Julian Schrittwieser</li> <li>Grzegorz Swirszcz</li> <li>David Silver</li> <li>Demis Hassabis</li> <li>Pushmeet Kohli</li></ul></details> <ul><li><img src="images/deepmind.svg" alt="deepmind" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.10.2022
Swin2SRNovel Swin Transformer V2, to improve SwinIR for image super-resolution, and in particular, the compressed input scenario<ul><li>Marcos Conde</li> <li>Ui-Jin Choi</li> <li>Maxime Burchi</li> <li>Radu Timofte</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li></ul>Open In Colab03.10.2022
FunctaFrom data to functa: Your data point is a function and you can treat it like one<ul><li>Emilien Dupont</li> <li>Hyunjik Kim</li> <li>Ali Eslami</li> <li>Danilo Rezende</li> <li>Dan Rosenbaum</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab24.09.2022
WhisperAutomatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web<ul><li>Alec Radford</li> <li>Jong Wook Kim</li> <li>Tao Xu</li> <li>Greg Brockman</li><details><summary>others</summary><li>Christine McLeavey</li> <li>Ilya Sutskever</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab21.09.2022
DeOldify (video)Colorize your own videos!Jason Antic <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>model</li><li><img src="images/reddit.svg" alt="reddit" height=20/>, <img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.09.2022
DeOldify (photo)Colorize your own photos!<ul><li>Jason Antic</li> <li>Matt Robinson</li> <li>María Benavente</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>model</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li></ul>Open In Colab19.09.2022
Real-ESRGANExtend the powerful ESRGAN to a practical restoration application, which is trained with pure synthetic data<ul><li>Xintao Wang</li> <li>Liangbin Xie</li> <li>Chao Dong</li> <li>Ying Shan</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab18.09.2022
IDE-3DInteractive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis<ul><li>Jingxiang Sun</li> <li>Xuan Wang</li> <li>Yichun Shi</li> <li>Lizhen Wang</li><details><summary>others</summary><li>Jue Wang</li> <li>Yebin Liu</li></ul></details> <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab08.09.2022
Decision TransformersAn architecture that casts the problem of RL as conditional sequence modeling<ul><li>Lili Chen</li> <li>Kevin Lu</li> <li>Aravind Rajeswaran</li> <li>Kimin Lee</li><details><summary>others</summary><li>Aditya Grover</li> <li>Michael Laskin</li> <li>Pieter Abbeel</li> <li>Aravind Srinivas</li> <li>Igor Mordatch</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.09.2022
textual-inversionAn Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion<ul><li>Rinon Gal</li> <li>Yuval Alaluf</li> <li>Yuval Atzmon</li> <li>Or Patashnik</li><details><summary>others</summary><li>Amit Bermano</li> <li>Gal Chechik</li> <li>Daniel Cohen-Or</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab21.08.2022
StyleGAN-HumanA Data-Centric Odyssey of Human Generation<ul><li>Jianglin Fu</li> <li>Shikai Li</li> <li>Yuming Jiang</li> <li>Kwan-Yee Lin</li><details><summary>others</summary><li>Chen Qian</li> <li>Chen Change Loy</li> <li>Wayne Wu</li> <li>Ziwei Liu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.08.2022
Make-A-SceneScene-Based Text-to-Image Generation with Human Priors<ul><li>Oran Gafni</li> <li>Adam Polyak</li> <li>Oron Ashual</li> <li>Shelly Sheynin</li><details><summary>others</summary><li>Devi Parikh</li> <li>Yaniv Taigman</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab12.08.2022
StyleGAN-NADAZero-Shot non-adversarial domain adaptation of pre-trained generators<ul><li>Rinon Gal</li> <li>Or Patashnik</li> <li>Haggai Maron</li> <li>Gal Chechik</li> <li>Daniel Cohen-Or</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab09.08.2022
YOLOv7Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors<ul><li>Chien-Yao Wang</li> <li>Alexey Bochkovskiy</li> <li>Mark Liao</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data, data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab09.08.2022
GLIPGrounded language-image pre-training model for learning object-level, language-aware, and semantic-rich visual representations<ul><li>Liunian Harold Li</li> <li>Pengchuan Zhang</li> <li>Haotian Zhang</li> <li>Jianwei Yang</li><details><summary>others</summary><li>Chunyuan Li</li> <li>Yiwu Zhong</li> <li>Lijuan Wang</li> <li>Lu Yuan</li> <li>Lei Zhang</li> <li>Jenq-Neng Hwang</li> <li>Kai-Wei Chang</li> <li>Jianfeng Gao</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.07.2022
Anycost GANInteractive natural image editing<ul><li>Ji Lin</li> <li>Richard Zhang</li> <li>Frieder Ganz</li> <li>Song Han</li> <li>Jun-Yan Zhu</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab20.07.2022
GFPGANTowards Real-World Blind Face Restoration with Generative Facial Prior<ul><li>Xintao Wang</li> <li>Yu Li</li> <li>Honglun Zhang</li> <li>Ying Shan</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab13.07.2022
EPro-PnPGeneralized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation<ul><li>Hansheng Chen</li> <li>Pichao Wang</li> <li>Fan Wang</li> <li>Wei Tian</li><details><summary>others</summary><li>Lu Xiong</li> <li>Hao Li</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>nuScenes</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab12.07.2022
Text2HumanText-driven controllable framework for a high-quality and diverse human generation<ul><li>Yuming Jiang</li> <li>Shuai Yang</li> <li>Haonan Qiu</li> <li>Wayne Wu</li><details><summary>others</summary><li>Chen Change Loy</li> <li>Ziwei Liu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.07.2022
VQ-DiffusionBased on a VQ-VAE whose latent space is modeled by a conditional variant of the recently developed Denoising Diffusion Probabilistic Model<ul><li>Shuyang Gu</li> <li>Dong Chen</li> <li>Jianmin Bao</li> <li>Fang Wen</li><details><summary>others</summary><li>Bo Zhang</li> <li>Dongdong Chen</li> <li>Lu Yuan</li> <li>Baining Guo</li> <li>Shuyang Gu</li> <li>Zhicong Tang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab30.06.2022
OPTOpen Pre-trained Transformers is a family of NLP models trained on billions of tokens of text obtained from the internet<ul><li>Susan Zhang</li> <li>Stephen Roller</li> <li>Naman Goyal</li> <li>Mikel Artetxe</li><details><summary>others</summary><li>Moya Chen</li> <li>Christopher Dewan</li> <li>Mona Diab</li> <li>Xi Victoria Lin</li> <li>Todor Mihaylov</li> <li>Myle Ott</li> <li>Sam Shleifer</li> <li>Kurt Shuster</li> <li>Daniel Simig</li> <li>Punit Singh Koura</li> <li>Anjali Sridhar</li> <li>Tianlu Wang</li> <li>Luke Zettlemoyer</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab29.06.2022
Customizing a Transformer EncoderWe will learn how to customize the encoder to employ new network architecturesChen Chen <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab22.06.2022
MTTREnd-to-End Referring Video Object Segmentation with Multimodal Transformers<ul><li>Adam Botach</li> <li>Evgenii Zheltonozhskii</li> <li>Chaim Baskin</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab20.06.2022
SwinIRImage Restoration Using Swin Transformer<ul><li>Jingyun Liang</li> <li>Jiezhang Cao</li> <li>Guolei Sun</li> <li>Kai Zhang</li><details><summary>others</summary><li>Luc Van Gool</li> <li>Radu Timofte</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab17.06.2022
VRTA Video Restoration Transformer<ul><li>Jingyun Liang</li> <li>Jiezhang Cao</li> <li>Yuchen Fan</li> <li>Kai Zhang</li><details><summary>others</summary><li>Yawei Li</li> <li>Radu Timofte</li> <li>Luc Van Gool</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab15.06.2022
OmnivoreA single model which excels at classifying images, videos, and single-view 3D data using exactly the same model parameters<ul><li>Rohit Girdhar</li> <li>Mannat Singh</li> <li>Nikhila Ravi</li> <li>Laurens Maaten</li><details><summary>others</summary><li>Armand Joulin</li> <li>Ishan Misra</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab14.06.2022
Dream FieldsZero-Shot Text-Guided Object Generation<ul><li>Ajay Jain</li> <li>Ben Mildenhall</li> <li>Jon Barron</li> <li>Pieter Abbeel</li> <li>Ben Poole</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.06.2022
DeticDetecting Twenty-thousand Classes using Image-level Supervision<ul><li>Xingyi Zhou</li> <li>Rohit Girdhar</li> <li>Armand Joulin</li> <li>Philipp Krähenbühl</li> <li>Ishan Misra</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab07.06.2022
T0Multitask Prompted Training Enables Zero-Shot Task Generalization<ul><li>Victor Sanh</li> <li>Albert Webson</li> <li>Colin Raffel</li> <li>Stephen Bach</li><details><summary>others</summary><li>Lintang Sutawika</li> <li>Zaid Alyafeai</li> <li>Antoine Chaffin</li> <li>Arnaud Stiegler</li> <li>Teven Scao</li> <li>Arun Raja</li> <li>Manan Dey</li> <li>M Saiful Bari</li> <li>Canwen Xu</li> <li>Urmish Thakker</li> <li>Shanya Sharma</li> <li>Eliza Szczechla</li> <li>Taewoon Kim</li> <li>Gunjan Chhablani</li> <li>Nihal Nayak</li> <li>Debajyoti Datta</li> <li>Jonathan Chang</li> <li>Mike Tian-Jian Jiang</li> <li>Matteo Manica</li> <li>Sheng Shen</li> <li>Zheng Xin Yong</li> <li>Harshit Pandey</li> <li>Rachel Bawden</li> <li>Trishala Neeraj</li> <li>Jos Rozen</li> <li>Abheesht Sharma</li> <li>Andrea Santilli</li> <li>Thibault Fevry</li> <li>Jason Alan Fries</li> <li>Ryan Teehan</li> <li>Stella Biderman</li> <li>Leo Gao</li> <li>Tali Bers</li> <li>Thomas Wolf</li> <li>Alexander M. Rush</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab29.05.2022
AvatarCLIPA zero-shot text-driven framework for 3D avatar generation and animation<ul><li>Fangzhou Hong</li> <li>Mingyuan Zhang</li> <li>Liang Pan</li> <li>Zhongang Cai</li><details><summary>others</summary><li>Lei Yang</li> <li>Ziwei Liu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.05.2022
Text2MeshText-Driven Neural Stylization for Meshes<ul><li>Oscar Michel</li> <li>Roi Bar-On</li> <li>Richard Liu</li> <li>Sagie Benaim</li> <li>Rana Hanocka</li></ul> <ul><li>CLIP</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li>project</li></ul>Open In Colab14.05.2022
T5Text-To-Text Transfer Transformer<ul><li>Colin Raffel</li> <li>Noam Shazeer</li> <li>Adam Roberts</li> <li>Katherine Lee</li><details><summary>others</summary><li>Sharan Narang</li> <li>Michael Matena</li> <li>Yanqi Zhou</li> <li>Wei Li</li> <li>Peter J. Liu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab11.05.2022
XLS-RSelf-supervised Cross-lingual Speech Representation Learning at Scale<ul><li>Arun Babu</li> <li>Changhan Wang</li> <li>Andros Tjandra</li> <li>Kushal Lakhotia</li><details><summary>others</summary><li>Qiantong Xu</li> <li>Naman Goyal</li> <li>Kritika Singh</li> <li>Patrick von Platen</li> <li>Yatharth Saraf</li> <li>Juan Pino</li> <li>Alexei Baevski</li> <li>Alexis Conneau</li> <li>Michael Auli</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab10.05.2022
MAGICTraining-free framework, iMAge-Guided text generatIon with CLIP, for plugging in visual controls in the generation process and enabling LMs to perform multimodal tasks in a zero-shot manner<ul><li>Yixuan Su</li> <li>Tian Lan</li> <li>Yahui Liu</li> <li>Fangyu Liu</li><details><summary>others</summary><li>Dani Yogatama</li> <li>Yan Wang</li> <li>Lingpeng Kong</li> <li>Nigel Collier</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab02.05.2022
DiffCSEUnsupervised contrastive learning framework for learning sentence embeddings<ul><li>Yung-Sung Chuang</li> <li>Rumen Dangovski</li> <li>Hongyin Luo</li> <li>Yang Zhang</li><details><summary>others</summary><li>Shiyu Chang</li> <li>Marin Soljačić</li> <li>Shang-Wen Li</li> <li>Scott Wen-tau Yih</li> <li>Yoon Kim</li> <li>James Glass</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li></ul>Open In Colab24.04.2022
ViDT+An Extendable, Efficient and Effective Transformer-based Object Detector<ul><li>Hwanjun Song</li> <li>Deqing Sun</li> <li>Sanghyuk Chun</li> <li>Varun Jampani</li><details><summary>others</summary><li>Dongyoon Han</li> <li>Byeongho Heo</li> <li>Wonjae Kim</li> <li>Ming-Hsuan Yang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab20.04.2022
BasicVSR++Redesign BasicVSR by proposing second-order grid propagation and flow-guided deformable alignment<ul><li>Kelvin Chan</li> <li>Shangchen Zhou</li> <li>Xiangyu Xu</li> <li>Chen Change Loy</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.04.2022
NAFNetNonlinear Activation Free Network for Image Restoration<ul><li>Liangyu Chen</li> <li>Xiaojie Chu</li> <li>Xiangyu Zhang</li> <li>Jian Sun</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab15.04.2022
Panini-NetGAN Prior based Degradation-Aware Feature Interpolation for Face Restoration<ul><li>Yinhuai Wang</li> <li>Yujie Hu</li> <li>Jian Zhang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab13.04.2022
E2FGVIAn End-to-End framework for Flow-Guided Video Inpainting through elaborately designed three trainable modules, namely, flow completion, feature propagation, and content hallucination modules<ul><li>Zhen Li</li> <li>Cheng-Ze Lu</li> <li>Jianhua Qin</li> <li>Chun-Le Guo</li> <li>Ming-Ming Cheng</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.04.2022
LDMHigh-Resolution Image Synthesis with Latent Diffusion Models<ul><li>Robin Rombach</li> <li>Andreas Blattmann</li> <li>Dominik Lorenz</li> <li>Patrick Esser</li> <li>Björn Ommer</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab04.04.2022
GP-UNITNovel framework, Generative Prior-guided UNsupervised Image-to-image Translation, to improve the overall quality and applicability of the translation algorithm<ul><li>Shuai Yang</li> <li>Liming Jiang</li> <li>Ziwei Liu</li> <li>Chen Change Loy</li></ul> <ul><li>ImageNet</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab02.04.2022
DualStyleGANMore challenging exemplar-based high-resolution portrait style transfer by introducing a novel DualStyleGAN with flexible control of dual styles of the original face domain and the extended artistic portrait domain<ul><li>Shuai Yang</li> <li>Liming Jiang</li> <li>Ziwei Liu</li> <li>Chen Change Loy</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.03.2022
CLIPassoSemantically-Aware Object Sketching<ul><li>Yael Vinker</li> <li>Ehsan Pajouheshgar</li> <li>Jessica Y. Bo</li> <li>Roman Bachmann</li><details><summary>others</summary><li>Amit Bermano</li> <li>Daniel Cohen-Or</li> <li>Amir Zamir</li> <li>Ariel Shamir</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab21.03.2022
StyleSDFA high resolution, 3D-consistent image and shape generation technique<ul><li>Roy Or-El</li> <li>Xuan Luo</li> <li>Mengyi Shan</li> <li>Eli Shechtman</li><details><summary>others</summary><li>Jeong Joon Park</li> <li>Ira Kemelmacher-Shlizerman</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li></ul>Open In Colab05.03.2022
Disentangled Lifespan Face SynthesisLFS model is proposed to disentangle the key face characteristics including shape, texture and identity so that the unique shape and texture age transformations can be modeled effectively<ul><li>Sen He</li> <li>Wentong Liao</li> <li>Michael Yang</li> <li>Yi-Zhe Song</li><details><summary>others</summary><li>Bodo Rosenhahn</li> <li>Tao Xiang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab22.02.2022
ClipCapCLIP Prefix for Image Captioning<ul><li>Ron Mokady</li> <li>Amir Hertz</li> <li>Amit Bermano</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.02.2022
ROMPMonocular, One-stage, Regression of Multiple 3D People<ul><li>Yu Sun</li> <li>Qian Bao</li> <li>Wu Liu</li> <li>Yili Fu</li><details><summary>others</summary><li>Michael Black</li> <li>Tao Mei</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.02.2022
Mask2FormerMasked-attention Mask Transformer for Universal Image Segmentation<ul><li>Bowen Cheng</li> <li>Ishan Misra</li> <li>Alexander Schwing</li> <li>Alexander Kirillov</li> <li>Rohit Girdhar</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li></ul>Open In Colab09.02.2022
JoJoGANOne Shot Face Stylization<ul><li>Min Jin Chong</li> <li>David Forsyth</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab02.02.2022
Pose with StyleDetail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN<ul><li>Badour AlBahar</li> <li>Jingwan Lu</li> <li>Jimei Yang</li> <li>Zhixin Shu</li><details><summary>others</summary><li>Eli Shechtman</li> <li>Jia-Bin Huang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.01.2022
ConvNeXtA pure ConvNet model constructed entirely from standard ConvNet modules<ul><li>Zhuang Liu</li> <li>Hanzi Mao</li> <li>Chao-Yuan Wu</li> <li>Christoph Feichtenhofer</li><details><summary>others</summary><li>Trevor Darrell</li> <li>Saining Xie</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.01.2022
diffsortDifferentiable Sorting Networks<ul><li>Felix Petersen</li> <li>Christian Borgelt</li> <li>Hilde Kuehne</li> <li>Oliver Deussen</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.01.2022
Taming Transformers for High-Resolution Image SynthesisWe combine the efficiancy of convolutional approaches with the expressivity of transformers by introducing a convolutional VQGAN, which learns a codebook of context-rich visual parts, whose composition is modeled with an autoregressive transformer<ul><li>Patrick Esser</li> <li>Robin Rombach</li> <li>Björn Ommer</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li></ul>Open In Colab13.01.2022
RealBasicVSRInvestigating Tradeoffs in Real-World Video Super-Resolution<ul><li>Kelvin Chan</li> <li>Shangchen Zhou</li> <li>Xiangyu Xu</li> <li>Chen Change Loy</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab25.12.2021
GLIDETowards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models<ul><li>Alex Nichol</li> <li>Prafulla Dhariwal</li> <li>Aditya Ramesh</li> <li>Pranav Shyam</li><details><summary>others</summary><li>Pamela Mishkin</li> <li>Bob McGrew</li> <li>Ilya Sutskever</li> <li>Mark Chen</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab22.12.2021
NerfiesFirst method capable of photorealistically reconstructing deformable scenes using photos/videos captured casually from mobile phones<ul><li>Keunhong Park</li> <li>Utkarsh Sinha</li> <li>Jon Barron</li> <li>Sofien Bouaziz</li><details><summary>others</summary><li>Dan Goldman</li> <li>Steve Seitz</li> <li>Ricardo Martin-Brualla</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.12.2021
HyperStyleA hypernetwork that learns to modulate StyleGAN's weights to faithfully express a given image in editable regions of the latent space<ul><li>Yuval Alaluf</li> <li>Omer Tov</li> <li>Ron Mokady</li> <li>Rinon Gal</li> <li>Amit Bermano</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab03.12.2021
encoder4editingDesigning an Encoder for StyleGAN Image Manipulation<ul><li>Omer Tov</li> <li>Yuval Alaluf</li> <li>Yotam Nitzan</li> <li>Or Patashnik</li> <li>Daniel Cohen-Or</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab02.12.2021
StyleCariGANCaricature Generation via StyleGAN Feature Map Modulation<ul><li>Wonjong Jang</li> <li>Gwangjin Ju</li> <li>Yucheol Jung</li> <li>Jiaolong Yang</li><details><summary>others</summary><li>Xin Tong</li> <li>Seungyong Lee</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.11.2021
CartoonGANThe implementation of the cartoon GAN model with PyTorchTobias Sunderdiek <ul><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li>project</li></ul>Open In Colab24.11.2021
SimSwapAn efficient framework, called Simple Swap, aiming for generalized and high fidelity face swapping<ul><li>Xuanhong Chen</li> <li>Bingbing Ni</li> <li>Yanhao Ge</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab24.11.2021
RVMRobust High-Resolution Video Matting with Temporal Guidance<ul><li>Shanchuan Lin</li> <li>Linjie Yang</li> <li>Imran Saleemi</li> <li>Soumyadip Sengupta</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.11.2021
RVMRobust, real-time, high-resolution human video matting method that achieves new state-of-the-art performance<ul><li>Shanchuan Lin</li> <li>Linjie Yang</li> <li>Imran Saleemi</li> <li>Soumyadip Sengupta</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.11.2021
AnimeGANv2An improved version of AnimeGAN - it prevents the generation of high-frequency artifacts by simply changing the normalization of features in the network<ul><li>Xin Chen</li> <li>Gang Liu</li> <li>bryandlee</li></ul> <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li></ul>Open In Colab17.11.2021
SOATStyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN<ul><li>Min Jin Chong</li> <li>Hsin-Ying Lee</li> <li>David Forsyth</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab13.11.2021
ArnheimGenerative Art Using Neural Visual Grammars and Dual Encoders<ul><li>Chrisantha Fernando</li> <li>Ali Eslami</li> <li>Jean-Baptiste Alayrac</li> <li>Piotr Mirowski</li><details><summary>others</summary><li>Dylan Banarse</li> <li>Simon Osindero</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.11.2021
StyleGAN 2Generation of faces, cars, etc.Mikael Christensen <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab05.11.2021
ByteTrackMulti-Object Tracking by Associating Every Detection Box<ul><li>Yifu Zhang</li> <li>Peize Sun</li> <li>Yi Jiang</li> <li>Dongdong Yu</li><details><summary>others</summary><li>Ping Luo</li> <li>Xinggang Wang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab30.10.2021
GPT-2Retrain an advanced text generating neural network on any text dataset using gpt-2-simple!Max Woolf <ul><li>blog post, blog post</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab18.10.2021
ConvMixerAn extremely simple model that is similar in spirit to the ViT and the even-more-basic MLP-Mixer in that it operates directly on patches as input, separates the mixing of spatial and channel dimensions, and maintains equal size and resolution throughout the network<ul><li>Asher Trockman</li> <li>Zico Kolter</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.10.2021
IC-GANInstance-Conditioned GAN<ul><li>Arantxa Casanova</li> <li>Marlène Careil</li> <li>Jakob Verbeek</li> <li>Michał Drożdżal</li> <li>Adriana Romero-Soriano</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li></ul>Open In Colab01.10.2021
Skillful Precipitation Nowcasting Using Deep Generative Models of RadarOpen-sourced dataset and model snapshot for precipitation nowcasting<ul><li>Suman Ravuri</li> <li>Karel Lenc</li> <li>Matthew Willson</li> <li>Dmitry Kangin</li><details><summary>others</summary><li>Rémi Lam</li> <li>Piotr Mirowski</li> <li>Maria Athanassiadou</li> <li>Sheleem Kashem</li> <li>Rachel Prudden</li> <li>Amol Mandhane</li> <li>Aidan Clark</li> <li>Andrew Brock</li> <li>Karen Simonyan</li> <li>Raia Hadsell</li> <li>Niall Robinson</li> <li>Ellen Clancy</li> <li>Shakir Mohamed</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>local kernel</li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab29.09.2021
Live Speech PortraitsReal-Time Photorealistic Talking-Head Animation<ul><li>Yuanxun Lu</li> <li>Jinxiang Chai</li> <li>Xun Cao</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab26.09.2021
StylExTraining a GAN to explain a classifier in StyleSpace<ul><li>Oran Lang</li> <li>Yossi Gandelsman</li> <li>Michal Yarom</li> <li>Yoav Wald</li><details><summary>others</summary><li>Gal Elidan</li> <li>Avinatan Hassidim</li> <li>William Freeman</li> <li>Phillip Isola</li> <li>Amir Globerso</li> <li>Michal Irani</li> <li>Inbar Mosseri</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>project</li><li>supplementary</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.08.2021
VITSParallel end-to-end TTS method that generates more natural sounding audio than current two-stage models<ul><li>Jaehyeon Kim</li> <li>Jungil Kong</li> <li>Juhee Son</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li></ul>Open In Colab23.08.2021
Bringing Old Photo Back to LifeRestoring old photos that suffer from severe degradation through a deep learning approach<ul><li>Ziyu Wan</li> <li>Bo Zhang</li> <li>Dongdong Chen</li> <li>Pan Zhang</li><details><summary>others</summary><li>Dong Chen</li> <li>Jing Liao</li> <li>Fang Wen</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.07.2021
PTIPivotal Tuning Inversion enables employing off-the-shelf latent based semantic editing techniques on real images using StyleGAN<ul><li>Daniel Roich</li> <li>Ron Mokady</li> <li>Amit Bermano</li> <li>Daniel Cohen-Or</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab01.07.2021
TediGANFramework for multi-modal image generation and manipulation with textual descriptions<ul><li>Weihao Xia</li> <li>Yujiu Yang</li> <li>Jing-Hao Xue</li> <li>Baoyuan Wu</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.06.2021
SCALEModeling Clothed Humans with a Surface Codec of Articulated Local Elements<ul><li>Qianli Ma</li> <li>Shunsuke Saito</li> <li>Jinlong Yang</li> <li>Siyu Tang</li> <li>Michael Black</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>poster</li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.06.2021
CogViewMastering Text-to-Image Generation via Transformers<ul><li>Ming Ding</li> <li>Zhuoyi Yang</li> <li>Wenyi Hong</li> <li>Wendi Zheng</li><details><summary>others</summary><li>Chang Zhou</li> <li>Junyang Lin</li> <li>Xu Zou</li> <li>Zhou Shao</li> <li>Hongxia Yang</li> <li>Jie Tang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab21.06.2021
GANs N' RosesStable, Controllable, Diverse Image to Image Translation<ul><li>Min Jin Chong</li> <li>David Forsyth</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.06.2021
Rethinking Style Transfer: From Pixels to Parameterized BrushstrokesA method to stylize images by optimizing parameterized brushstrokes instead of pixels<ul><li>Dmytro Kotovenko</li> <li>Matthias Wright</li> <li>Arthur Heimbrecht</li> <li>Björn Ommer</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li></ul>Open In Colab02.06.2021
Pixel2Style2PixelEncoding in Style: A StyleGAN Encoder for Image-to-Image Translation<ul><li>Elad Richardson</li> <li>Yuval Alaluf</li> <li>Yotam Nitzan</li> <li>Daniel Cohen-Or</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.06.2021
Fine-tuning a BERTWe will work through fine-tuning a BERT model using the tensorflow-models PIP package<ul><li>Chen Chen</li> <li>Claire Yao</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab25.05.2021
ReStyleA Residual-Based StyleGAN Encoder via Iterative Refinement<ul><li>Yuval Alaluf</li> <li>Or Patashnik</li> <li>Daniel Cohen-Or</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab21.05.2021
Motion Representations for Articulated AnimationNovel motion representations for animating articulated objects consisting of distinct parts<ul><li>Aliaksandr Siarohin</li> <li>Oliver Woodford</li> <li>Jian Ren</li> <li>Menglei Chai</li> <li>Sergey Tulyakov</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab29.04.2021
SAMAge Transformation Using a Style-Based Regression Model<ul><li>Yuval Alaluf</li> <li>Or Patashnik</li> <li>Daniel Cohen-Or</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.04.2021
Geometry-Free View SynthesisIs a geometric model required to synthesize novel views from a single image?<ul><li>Robin Rombach</li> <li>Patrick Esser</li> <li>Björn Ommer</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab22.04.2021
NeRViSAn algorithm for full-frame video stabilization by first estimating dense warp fields<ul><li>Yu-Lun Liu</li> <li>Wei-Sheng Lai</li> <li>Ming-Hsuan Yang</li> <li>Yung-Yu Chuang</li> <li>Jia-Bin Huang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.04.2021
NeXView synthesis based on enhancements of multiplane image that can reproduce NeXt-level view-dependent effects in real time<ul><li>Suttisak Wizadwongsa</li> <li>Pakkapon Phongthawee</li> <li>Jiraphon Yenphraphai</li> <li>Supasorn Suwajanakorn</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li>vistec</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.03.2021
Score SDEScore-Based Generative Modeling through Stochastic Differential Equations<ul><li>Yang Song</li> <li>Jascha Sohl-Dickstein</li> <li>Diederik Kingma</li> <li>Abhishek Kumar</li><details><summary>others</summary><li>Stefano Ermon</li> <li>Ben Poole</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.03.2021
Talking Head Anime from a Single ImageThe network takes as input an image of an anime character's face and a desired pose, and it outputs another image of the same character in the given posePramook Khungurn <ul><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab23.02.2021
NFNetAn adaptive gradient clipping technique, a significantly improved class of Normalizer-Free ResNets<ul><li>Andrew Brock</li> <li>Soham De</li> <li>Samuel L. Smith</li> <li>Karen Simonyan</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.02.2021
RITMSimple feedforward model for click-based interactive segmentation that employs the segmentation masks from previous steps<ul><li>Konstantin Sofiiuk</li> <li>Ilia Petrov</li> <li>Anton Konushin</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab13.02.2021
CLIPA neural network which efficiently learns visual concepts from natural language supervision<ul><li>Jong Wook Kim</li> <li>Alec Radford</li> <li>Ilya Sutskever</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li>paper</li><li>project</li><li>slides</li></ul>Open In Colab29.01.2021
Adversarial PatchA method to create universal, robust, targeted adversarial image patches in the real worldTom Brown<ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab27.01.2021
MSG-NetMulti-style Generative Network with a novel Inspiration Layer, which retains the functionality of optimization-based approaches and has the fast speed of feed-forward networks<ul><li>Hang Zhang</li> <li>Kristin Dana</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.01.2021
f-BRSFeature backpropagating refinement scheme that solves an optimization problem with respect to auxiliary variables instead of the network inputs, and requires running forward and backward pass just for a small part of a network<ul><li>Konstantin Sofiiuk</li> <li>Ilia Petrov</li> <li>Olga Barinova</li> <li>Anton Konushin</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.01.2021
Neural Style TransferImplementation of Neural Style Transfer in Keras 2.0+Somshubra Majumdar <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab22.01.2021
SkyARA vision-based method for video sky replacement and harmonization, which can automatically generate realistic and dramatic sky backgrounds in videos with controllable stylesZhengxia Zou <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.01.2021
MusicXML DocumentationThe goal of this notebook is to explore one of the magenta libraries for music<ul><li>Prakruti Joshi</li> <li>Falak Shah</li> <li>Twisha Naik</li></ul><ul><li>magenta</li><li>music theory</li><li>musicXML</li></ul>Open In Colab08.01.2021
SVG VAEA colab demo for the SVG VAE modelRaphael Gontijo Lopes <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li></ul>Open In Colab08.01.2021
Neural Magic EyeLearning to See and Understand the Scene Behind an Autostereogram<ul><li>Zhengxia Zou</li> <li>Tianyang Shi</li> <li>Yi Yuan</li> <li>Zhenwei Shi</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.01.2021
FGVCMethod first extracts and completes motion edges, and then uses them to guide piecewise-smooth flow completion with sharp edges<ul><li>Chen Gao</li> <li>Ayush Saraf</li> <li>Johannes Kopf</li> <li>Jia-Bin Huang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.12.2020
VIBEVideo Inference for Body Pose and Shape Estimation, which makes use of an existing large-scale motion capture dataset together with unpaired, in-the-wild, 2D keypoint annotations<ul><li>Muhammed Kocabas</li> <li>Nikos Athanasiou</li> <li>Michael Black</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab23.12.2020
SeFaA closed-form approach for unsupervised latent semantic factorization in GANs<ul><li>Yujun Shen</li> <li>Bolei Zhou</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.12.2020
Stylized Neural PaintingAn image-to-painting translation method that generates vivid and realistic painting artworks with controllable styles<ul><li>Zhengxia Zou</li> <li>Tianyang Shi</li> <li>Yi Yuan</li> <li>Zhenwei Shi</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.12.2020
BiTBig Transfer: General Visual Representation Learning<ul><li>Alexander Kolesnikov</li> <li>Lucas Beyer</li> <li>Xiaohua Zhai</li> <li>Joan Puigcerver</li><details><summary>others</summary><li>Jessica Yung</li> <li>Sylvain Gelly</li> <li>Neil Houlsby</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab12.11.2020
LaSAFTLatent Source Attentive Frequency Transformation for Conditioned Source SeparationWoosung Choi <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li>project</li></ul>Open In Colab01.11.2020
Lifespan Age Transformation SynthesisMulti-domain image-to-image generative adversarial network architecture, whose learned latent space models a continuous bi-directional aging process<ul><li>Roy Or-El</li> <li>Soumyadip Sengupta</li> <li>Ohad Fried</li> <li>Eli Shechtman</li> <li>Ira Kemelmacher-Shlizerman</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab31.10.2020
HiGANSemantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis<ul><li>Ceyuan Yang</li> <li>Yujun Shen</li> <li>Bolei Zhou</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab14.10.2020
InterFaceGANInterpreting the Latent Space of GANs for Semantic Face Editing<ul><li>Yujun Shen</li> <li>Jinjin Gu</li> <li>Xiaoou Tang</li> <li>Bolei Zhou</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.10.2020
Instance-aware Image ColorizationNovel deep learning framework to achieve instance-aware colorizationJheng-Wei Su <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.08.2020
MoCoMomentum Contrast for unsupervised visual representation learning<ul><li>Kaiming He</li> <li>Haoqi Fan</li> <li>Yuxin Wu</li> <li>Saining Xie</li> <li>Ross Girshick</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab20.08.2020
CAPELearning to Dress 3D People in Generative Clothing<ul><li>Qianli Ma</li> <li>Jinlong Yang</li> <li>Anurag Ranjan</li> <li>Sergi Pujades</li><details><summary>others</summary><li>Gerard Pons-Moll</li> <li>Siyu Tang</li> <li>Michael Black</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab05.08.2020
Rewriting a Deep Generative ModelWe ask if a deep network can be reprogrammed to follow different rules, by enabling a user to directly change the weights, instead of training with a data set<ul><li>David Bau</li> <li>Steven Liu</li> <li>Tongzhou Wang</li> <li>Jun-Yan Zhu</li> <li>Antonio Torralba</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.08.2020
SIRENImplicit Neural Representations with Periodic Activation Functions<ul><li>Vincent Sitzmann</li> <li>Julien Martel</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.06.2020
3D Photo InpaintingMethod for converting a single RGB-D input image into a 3D photo, i.e., a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view<ul><li>Meng-Li Shih</li> <li>Shih-Yang Su</li> <li>Johannes Kopf</li> <li>Jia-Bin Huang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li></ul>Open In Colab04.05.2020
Motion Supervised co-part SegmentationA self-supervised deep learning method for co-part segmentation<ul><li>Aliaksandr Siarohin</li> <li>Subhankar Roy</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab07.04.2020
Onsets and FramesOnsets and Frames is an automatic music transcription framework with piano and drums models<ul><li>Curtis Hawthorne</li> <li>Erich Elsen</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data, data</li></ul>Open In Colab02.04.2020
FBA MattingLow-cost modification to alpha matting networks to also predict the foreground and background colours<ul><li>Marco Forte</li> <li>François Pitié</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab19.03.2020
BERT scoreAn automatic evaluation metric for text generationTianyi Zhang <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab05.03.2020
Generating Piano Music with TransformerThis Colab notebook lets you play with pretrained Transformer models for piano music generation, based on the Music Transformer<ul><li>Ian Simon</li> <li>Anna Huang</li> <li>Jesse Engel</li> <li>Curtis Hawthorne</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li></ul>Open In Colab16.09.2019
HMREnd-to-end framework for reconstructing a full 3D mesh of a human body from a single RGB image<ul><li>Angjoo Kanazawa</li> <li>Michael Black</li> <li>David Jacobs</li> <li>Jitendra Malik</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.03.2019
GANSynthThis notebook is a demo GANSynth, which generates audio with Generative Adversarial NetworksJesse Engel <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li></ul>Open In Colab25.02.2019
Latent ConstraintsConditional Generation from Unconditional Generative Models<ul><li>Jesse Engel</li> <li>Matthew Hoffman</li> <li>Adam Roberts</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li></ul>Open In Colab27.11.2017
Performance RNNThis notebook shows you how to generate new performed compositions from a trained model<ul><li>Ian Simon</li> <li>Sageev Oore</li> <li>Curtis Hawthorne</li></ul> <ul><li>blog post</li><li>data</li></ul>Open In Colab11.07.2017
NSynthThis colab notebook has everything you need to upload your own sounds and use NSynth models to reconstruct and interpolate between them<ul><li>Jesse Engel</li> <li>Cinjon Resnick</li> <li>Adam Roberts</li> <li>Sander Dieleman</li><details><summary>others</summary><li>Karen Simonyan</li> <li>Mohammad Norouzi</li> <li>Douglas Eck</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data</li><li>tutorial</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.04.2017

Tutorials

namedescriptionauthorslinkscolaboratoryupdate
KorniaLibrary is composed by a subset of packages containing operators that can be inserted within neural networks to train models to perform image transformations, epipolar geometry, depth estimation, and low-level image processing such as filtering and edge detection that operate directly on tensors<ul><li>Edgar Riba</li> <li>Dmytro Mishkin</li> <li>Daniel Ponsa</li> <li>Ethan Rublee</li> <li>Gary Bradski</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.12.2024
AutoGenFramework that enables development of LLM applications using multiple agents that can converse with each other to solve tasksmicrosoft <ul><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.12.2024
dm_controlDeepMind Infrastructure for Physics-Based Simulation<ul><li>Saran Tunyasuvunakool</li> <li>Alistair Muldal</li> <li>Yotam Doron</li> <li>Siqi Liu</li><details><summary>others</summary><li>Steven Bohez</li> <li>Josh Merel</li> <li>Tom Erez</li> <li>Timothy Lillicrap</li> <li>Nicolas Heess</li> <li>Yuval Tassa</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab03.12.2024
MuJoCoA general purpose physics engine that aims to facilitate research and development in robotics, biomechanics, graphics and animation, machine learning, and other areas which demand fast and accurate simulation of articulated structures interacting with their environment<ul><li>Emo Todorov</li> <li>Tom Erez</li> <li>Yuval Tassa</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/deepmind.svg" alt="deepmind" height=20/>, <img src="images/deepmind.svg" alt="deepmind" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab03.12.2024
YOLOv8State-of-the-art model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and flexibilityGlenn Jocher <ul><li>COCO</li><li>ImageNet</li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab03.12.2024
SAE LensTraining Sparse Autoencoders on Language Models<ul><li>Joseph Bloom</li> <li>Curt Tigges</li> <li>David Chanin</li></ul> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li></ul>Open In Colab03.12.2024
moondreamTiny vision language model that kicks ass and runs anywhereVik Korrapati <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>website</li></ul>Open In Colab30.11.2024
LangGraphLibrary for building stateful, multi-actor applications with LLMs, used to create agent and multi-agent workflowsLangChain <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab28.11.2024
LangChainFramework for developing applications powered by large language modelsLangChain <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.11.2024
ARENAProvide talented individuals with the skills, tools, and environment necessary for upskilling in ML engineering, for the purpose of contributing directly to AI alignment in technical rolesCallum McDougall <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li>website</li></ul>Open In Colab26.11.2024
FeastAn open source feature store for machine learning<ul><li>Willem Pienaar</li> <li>Danny Chiao</li> <li>Achal Shah</li> <li>Terence Lim</li><details><summary>others</summary><li>Ches Martin</li> <li>Judah Rand</li> <li>Matt Delacour</li> <li>Miguel Trejo Marrufo</li> <li>Francisco Javier Arceo</li></ul></details> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab22.11.2024
VCClient software for performing real-time voice conversion using various Voice Conversion AIw-okada <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab20.11.2024
CatBoostHigh-performance open source library for gradient boosting on decision trees<ul><li>Anna Veronika Dorogush</li> <li>Vasily Ershov</li> <li>Andrey Gulin</li> <li>Liudmila Prokhorenkova</li><details><summary>others</summary><li>Gleb Gusev</li> <li>Aleksandr Vorobev</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.11.2024
Gemma 2New addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parametersunsloth <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.11.2024
Llama 3.1First openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translationunsloth <ul><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/meta.svg" alt="meta" height=20/>, <img src="images/meta.svg" alt="meta" height=20/>, <img src="images/meta.svg" alt="meta" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.11.2024
Mistral SmallEnterprise-grade small modelunsloth <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.11.2024
ORPOGet up and running with large language models<ul><li>Jiwoo Hong</li> <li>Noah Lee</li> <li>James Thorne</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.11.2024
Phi-3.53.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5, despite being small enough to be deployed on a phoneunsloth <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/>, <img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.11.2024
Simple audio recognitionThis tutorial will show you how to build a basic speech recognition network that recognizes ten different wordsGoogle<ul><li>coursera</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li>tf.js</li></ul>Open In Colab15.11.2024
xFormersToolbox to Accelerate Research on Transformers<ul><li>Benjamin Lefaudeux</li> <li>Francisco Massa</li> <li>Diana Liskovich</li> <li>Wenhan Xiong</li><details><summary>others</summary><li>Vittorio Caggiano</li> <li>Sean Naren</li> <li>Min Xu</li> <li>Jieru Hu</li> <li>Marta Tintore</li> <li>Susan Zhang</li> <li>Patrick Labatut</li> <li>Daniel Haziza</li></ul></details> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.11.2024
Building Your Own Federated Learning AlgorithmWe discuss how to implement federated learning algorithms without deferring to the tff.learning APIZachary Charles<ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab01.11.2024
Federated Learning for Image ClassificationWe use the classic MNIST training example to introduce the Federated Learning API layer of TFF, tff.learning - a set of higher-level interfaces that can be used to perform common types of federated learning tasks, such as federated training, against user-supplied models implemented in TensorFlowKrzysztof Ostrowski<ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab01.11.2024
Federated Learning for Text GenerationWe start with a RNN that generates ASCII characters, and refine it via federated learningKrzysztof Ostrowski<ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab01.11.2024
Custom Federated Algorithms, Part 1: Introduction to the Federated CoreThis tutorial is the first part of a two-part series that demonstrates how to implement custom types of federated algorithms in TensorFlow Federated using the Federated Core - a set of lower-level interfaces that serve as a foundation upon which we have implemented the Federated Learning layerKrzysztof Ostrowski<ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab01.11.2024
Custom Federated Algorithms, Part 2: Implementing Federated AveragingThis tutorial is the second part of a two-part series that demonstrates how to implement custom types of federated algorithms in TFF using the Federated Core, which serves as a foundation for the Federated Learning layerKrzysztof Ostrowski <ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab01.11.2024
High-performance simulations with TFFThis tutorial will describe how to setup high-performance simulations with TFF in a variety of common scenariosKrzysztof Ostrowski<ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab01.11.2024
AutodistillUses big, slower foundation models to train small, faster supervised modelsautodistill <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.11.2024
LightAutoMLAllows you create machine learning models using just a few lines of code, or build your own custom pipeline using ready blocks<ul><li>Alexander Ryzhkov</li> <li>Anton Vakhrushev</li> <li>Dmitry Simakov</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab31.10.2024
Crawl4AILLM Friendly Web Crawler & ScrapperUncleCode <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.10.2024
NotebookLlamaOpen Source version of NotebookLMMeta <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li>meidum</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab29.10.2024
XGBoostOptimized distributed gradient boosting library designed to be highly efficient, flexible and portable<ul><li>Tianqi Chen</li> <li>Carlos Guestrin</li></ul> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab22.10.2024
YOLOv5You Only Look OnceGlenn Jocher <ul><li>data</li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li></ul>Open In Colab19.10.2024
YOLOv3You Only Look OnceGlenn Jocher <ul><li>data</li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li></ul>Open In Colab19.10.2024
SwarmEducational framework exploring ergonomic, lightweight multi-agent orchestration<ul><li>Ilan Bigio</li> <li>James Hills</li> <li>Shyamal Anadkat</li> <li>Charu Jaiswal</li><details><summary>others</summary><li>Colin Jarvis</li> <li>Katia Guzman</li></ul></details> <ul><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.10.2024
LM Evaluation HarnessFramework for few-shot evaluation of language models.EleutherAI <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab04.10.2024
Multimodal MaestroGives you more control over large multimodal models to get the outputs you wantRoboflow <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li>website</li></ul>Open In Colab26.09.2024
TRLSet of tools to train transformer language models with Reinforcement Learning, from the Supervised Fine-tuning step, Reward Modeling step to the Proximal Policy Optimization step<ul><li>Leandro von Werra</li> <li>Younes Belkada</li> <li>Lewis Tunstall</li> <li>Edward Beeching</li><details><summary>others</summary><li>Tristan Thrush</li> <li>Nathan Lambert</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.09.2024
The Autodiff CookbookYou'll go through a whole bunch of neat autodiff ideas that you can cherry pick for your own work, starting with the basics<ul><li>Alex Wiltschko</li> <li>Matthew Johnson</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>book, book</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>tutorial</li><li><img src="images/wiki.svg" alt="wiki" height=20/>, [<img src="images/wiki.svg" alt="wiki" height=20/>](https://en.wikipedia.org/wiki/Pullback_(differential_geometry), <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab20.09.2024
SupervisionReusable computer vision toolsRoboflow <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/>, <img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.09.2024
PEFTParameter-Efficient Fine-Tuning methods enable efficient adaptation of pre-trained language models to various downstream applications without fine-tuning all the model's parameters<ul><li>Sourab Mangrulkar</li> <li>Sylvain Gugger</li> <li>Lysandre Debut</li> <li>Younes Belkada</li> <li>Sayak Paul</li></ul> <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.09.2024
SAA+Framework, Segment Any Anomaly +, for zero-shot anomaly segmentation with hybrid prompt regularization to improve the adaptability of modern foundation models<ul><li>Yunkang Cao</li> <li>Xiaohao Xu</li> <li>Chen Sun</li> <li>Yuqi Cheng</li><details><summary>others</summary><li>Zongwei Du</li> <li>Liang Gao</li> <li>Weiming Shen</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab13.09.2024
TensorRTSDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applicationsnvidia <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>forum</li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab12.09.2024
DataChainAI-dataframe to enrich, transform and analyze data from cloud storages for ML training and LLM appsIterative <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab09.09.2024
TFF for Federated Learning Research: Model and Update CompressionWe use the EMNIST dataset to demonstrate how to enable lossy compression algorithms to reduce communication cost in the Federated Averaging algorithmWeikang Song<ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li>tensor encoding</li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab05.09.2024
LlamaIndexData framework for your LLM applicationJerry Liu <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/meta.svg" alt="meta" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab05.09.2024
Deforum Stable DiffusionOpen source project is designed to be free to use and easy to modify for custom needs and pipelines<ul><li>EnzymeZoo</li> <li>Артем Храпов</li> <li>Forest Star Walz</li> <li>pharmapsychotic</li></ul> <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.08.2024
ComfyUIPowerful and modular stable diffusion GUI and backendcomfyanonymous <ul><li>examples</li><li><img src="images/git.svg" alt="git" height=20/></li><li>pytorch</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.08.2024
Machine Learning SimplifiedA Gentle Introduction to Supervised LearningAndrew Wolf <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/>, <img src="images/reddit.svg" alt="reddit" height=20/></li><li>website</li></ul>Open In Colab29.08.2024
AnomalibDeep learning library that aims to collect state-of-the-art anomaly detection algorithms for benchmarking on both public and private datasets<ul><li>Samet Akcay</li> <li>Dick Ameln</li> <li>Ashwin Vaidya</li> <li>Barath Lakshmanan</li><details><summary>others</summary><li>Nilesh Ahuja</li> <li>Utku Genc</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab29.08.2024
Anthropic coursesAnthropic's educational coursesAnthropic <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab22.08.2024
NerfstudioAPI that allows for a simplified end-to-end process of creating, training, and testing NeRFs<ul><li>Matthew Tancik</li> <li>Ethan Weber</li> <li>Evonne Ng</li> <li>Ruilong Li</li><details><summary>others</summary><li>Brent Yi</li> <li>Justin Kerr</li> <li>Terrance Wang</li> <li>Alexander Kristoffersen</li> <li>Jake Austin</li> <li>Kamyar Salahi</li> <li>Abhik Ahuja</li> <li>David McAllister</li> <li>Angjoo Kanazawa</li></ul></details> <ul><li>Viewer</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.08.2024
mlcourse.aiOpen Machine Learning CourseYury Kashnitsky <ul><li>blog post</li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.08.2024
PyTerrierA Python framework for performing information retrieval experiments<ul><li>Craig Macdonald</li> <li>Nicola Tonellotto</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab16.08.2024
highway-envA collection of environments for autonomous driving and tactical decision-making tasksEdouard Leurent <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab09.08.2024
GNNProduction-tested library for building GNNs at large scale<ul><li>Oleksandr Ferludin</li> <li>Arno Eigenwillig</li> <li>Martin Blais</li> <li>Dustin Zelle</li><details><summary>others</summary><li>Jan Pfeifer</li> <li>Alvaro Sanchez-Gonzalez</li> <li>Wai Lok Sibon Li</li> <li>Sami Abu-El-Haija</li> <li>Peter Battaglia</li> <li>Neslihan Bulut</li> <li>Jonathan Halcrow</li> <li>Filipe Miguel Gonçalves de Almeida</li> <li>Pedro Gonnet</li> <li>Liangze Jiang</li> <li>Parth Kothari</li> <li>Silvio Lattanzi</li> <li>André Linhares</li> <li>Brandon Mayer</li> <li>Vahab Mirrokni</li> <li>John Palowitch</li> <li>Mihir Paradkar</li> <li>Jennifer She</li> <li>Anton Tsitsulin</li> <li>Kevin Villela</li> <li>Lisa Wang</li> <li>Bryan Perozzi</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab09.08.2024
Pix2PixThis notebook demonstrates image to image translation using conditional GAN's<ul><li>Phillip Isola</li> <li>Jun-Yan Zhu</li> <li>Tinghui Zhou</li> <li>Alexei Efros</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab24.07.2024
Image classificationThis tutorial shows how to classify images of flowersBilly Lamberta<ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab24.07.2024
TransformerLensLibrary for doing mechanistic interpretability of GPT-2 Style language models<ul><li>Neel Nanda</li> <li>Joseph Bloom</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab23.07.2024
KorHalf-baked prototype that "helps" you extract structured data from text using LLMsEugene Yurtsev <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul>Open In Colab20.07.2024
Mistral InferenceMinimal code to run Mistral modelsmistral <ul><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab16.07.2024
PyTorch3DLibrary for deep learning with 3D data<ul><li>Nikhila Ravi</li> <li>Jeremy Reizenstein</li> <li>David Novotny</li> <li>Taylor Gordon</li><details><summary>others</summary><li>Wan-Yen Lo</li> <li>Justin Johnson</li> <li>Georgia Gkioxari</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post, blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.07.2024
Stable Diffusion VideosCreate videos with Stable Diffusion by exploring the latent space and morphing between text promptsNathan Raw <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab11.07.2024
Transfer learning and fine-tuningYou will learn how to classify images of cats and dogs by using transfer learning from a pre-trained networkFrançois Chollet<ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab26.06.2024
MARS5Speech model for insane prosodyCAMB.AI <ul><li>demo</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.06.2024
Deep RL CourseThe Hugging Face Deep Reinforcement Learning Course<ul><li>Thomas Simonini</li> <li>Omar Sanseviero</li> <li>Sayak Paul</li></ul> <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li>syllabus</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.06.2024
ToonCrafterCan interpolate two cartoon images by leveraging the pre-trained image-to-video diffusion priors<ul><li>Jinbo Xing</li> <li>Hanyuan Liu</li> <li>Menghan Xia</li> <li>Yong Zhang</li><details><summary>others</summary><li>Xintao Wang</li> <li>Ying Shan</li> <li>Tien-Tsin Wong</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab20.06.2024
BraxA differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators<ul><li>Daniel Freeman</li> <li>Erik Frey</li> <li>Anton Raichuk</li> <li>Sertan Girgin</li><details><summary>others</summary><li>Igor Mordatch</li> <li>Olivier Bachem</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li></ul>Open In Colab07.06.2024
DiffSynthRestructured architectures including Text Encoder, UNet, VAE, among others, maintaining compatibility with models from the open-source community while enhancing computational performanceArtiprocher <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab06.06.2024
TransformerThis tutorial trains a Transformer model to translate Portuguese to English<ul><li>Ashish Vaswani</li> <li>Noam Shazeer</li> <li>Niki Parmar</li> <li>Jakob Uszkoreit</li><details><summary>others</summary><li>Llion Jones</li> <li>Aidan Gomez</li> <li>Łukasz Kaiser</li> <li>Illia Polosukhin</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>link</li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab31.05.2024
NeMoA conversational AI toolkit built for researchers working on automatic speech recognition, natural language processing, and text-to-speech synthesis<ul><li>Oleksii Kuchaiev</li> <li>Jason Li</li> <li>Chip Huyen</li> <li>Oleksii Hrinchuk</li><details><summary>others</summary><li>Ryan Leary</li> <li>Boris Ginsburg</li> <li>Samuel Kriman</li> <li>Stanislav Beliaev</li> <li>Vitaly Lavrukhin</li> <li>Jack Cook</li></ul></details> <ul><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.05.2024
SentencePieceAn unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training<ul><li>Taku Kudo</li> <li>John Richardson</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab21.05.2024
Llama3 from scratchLlama3 from scratch, one tensor and matrix multiplication at a timeNishant Aklecha <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/>, <img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.05.2024
Hello, many worldsThis tutorial shows how a classical neural network can learn to correct qubit calibration errorsMichael Broughton<ul><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.05.2024
IC-LightManipulate the illumination of images<ul><li>Lvmin Zhang</li> <li>Anyi Rao</li> <li>Maneesh Agrawala</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab09.05.2024
Neural style transferThis tutorial uses deep learning to compose one image in the style of another image<ul><li>Leon Gatys</li> <li>Alexander Ecker</li> <li>Matthias Bethge</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab06.05.2024
TorchGeoPyTorch domain library that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data<ul><li>Adam Stewart</li> <li>Caleb Robinson</li> <li>Isaac Corley</li> <li>Anthony Ortiz</li><details><summary>others</summary><li>Juan Lavista Ferres</li> <li>Arindam Banerjee</li></ul></details> <ul><li>NDBI</li><li>NDVI</li><li>NDWI</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab03.05.2024
AutoencodersThis tutorial introduces autoencoders with three examples: the basics, image denoising, and anomaly detectionBilly Lamberta<ul><li>blog post</li><li>book</li><li>data</li><li>examples</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab15.04.2024
MagicTimeMetamorphic time-lapse video generation model, which learns real-world physics knowledge from time-lapse videos and implements metamorphic generation<ul><li>Shenghai Yuan</li> <li>Jinfa Huang</li> <li>Yujun Shi</li> <li>Yongqi Xu</li><details><summary>others</summary><li>Ruijie Zhu</li> <li>Bin Lin</li> <li>Xinhua Cheng</li> <li>Li Yuan</li> <li>Jiebo Luo</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/>, <img src="images/twitter.svg" alt="twitter" height=20/></li></ul>Open In Colab14.04.2024
SAGEMethodology for generative spelling correction, which was tested on English and Russian languages and potentially can be extended to any language with minor changes<ul><li>Nikita Martynov</li> <li>Mark Baushenko</li> <li>Anastasia Kozlova</li> <li>Katerina Kolomeytseva</li><details><summary>others</summary><li>Aleksandr Abramov</li> <li>Alena Fenogenova</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.04.2024
Image segmentationThis tutorial focuses on the task of image segmentation, using a modified U-Net<ul><li>Olaf Ronneberger</li> <li>Philipp Fischer</li> <li>Thomas Brox</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab09.04.2024
Open-Sora PlanSimple and efficient design along with remarkable performance in text-to-video generationYUAN Lab at PKU <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab07.04.2024
GorillaFinetuned LLaMA-based model that surpasses the performance of GPT-4 on writing API calls<ul><li>Shishir Patil</li> <li>Tianjun Zhang</li> <li>Xin Wang</li> <li>Joseph Gonzalez</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.04.2024
CleanlabHelps you clean data and labels by automatically detecting issues in a ML dataset<ul><li>Curtis Northcutt</li> <li>Lu Jiang</li> <li>Isaac Chuang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.03.2024
AniPortraitFramework for generating high-quality animation driven by audio and a reference portrait image<ul><li>Zejun Yang</li> <li>Zhisheng Wang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab27.03.2024
OpenVINOOpen-source toolkit for optimizing and deploying AI inferenceintel <ul><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>forum</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.03.2024
GazelleJoint Speech Language ModelTincans <ul><li>blog post</li><li>demo</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li>[<img src="images/wiki.svg" alt="wiki" height=20/>](https://en.wikipedia.org/wiki/Spike_/(software_development)</li></ul>Open In Colab20.03.2024
Intel® Extension for TransformersTransformer-based Toolkit to Accelerate GenAI/LLM Everywhereintel <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.03.2024
DatasetsA Community Library for Natural Language Processing<ul><li>Quentin Lhoest</li> <li>Albert Villanova</li> <li>Yacine Jernite</li> <li>Abhishek Thakur</li><details><summary>others</summary><li>Patrick von Platen</li> <li>Suraj Patil</li> <li>Julien Chaumond</li> <li>Mariama Dramé</li> <li>Julien Plu</li> <li>Lewis Tunstall</li> <li>Joe Davison</li> <li>Mario Šaško</li> <li>Gunjan Chhablani</li> <li>Bhavitvya Malik</li> <li>Simon Brandeis</li> <li>Teven Le Scao</li> <li>Victor Sanh</li> <li>Canwen Xu</li> <li>Nicolas Patry</li> <li>Angelina McMillan-Major</li> <li>Philipp Schmid</li> <li>Sylvain Gugger</li> <li>Clément Delangue</li> <li>Théo Matussière</li> <li>Lysandre Debut</li> <li>Stas Bekman</li> <li>Pierric Cistac</li> <li>Thibault Goehringer</li> <li>Victor Mustar</li> <li>François Lagunas</li> <li>Alexander Rush</li> <li>Thomas Wolf</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.03.2024
EvidentlyAn open-source framework to evaluate, test and monitor ML models in production<ul><li>Elena Samuylova</li> <li>Emeli Dral</li> <li>Olga Filippova</li></ul> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.03.2024
InstructorLibrary that makes it a breeze to work with structured outputs from large language modelsJason Liu <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.03.2024
FiftyOneOpen-source tool for building high-quality datasets and computer vision models<ul><li>Brian Moore</li> <li>Jason Corso</li></ul> <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab27.02.2024
MetaVoice1.2B parameter base model trained on 100K hours of speech for TTSMetaVoice <ul><li>demo</li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.02.2024
Generative AI for Beginners - A CourseA 12 Lesson course teaching everything you need to know to start building Generative AI applicationsmicrosoft <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab22.02.2024
OmegaConfHierarchical configuration system, with support for merging configurations from multiple sources providing a consistent API regardless of how the configuration was createdOmry Yadan <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>slides</li></ul>Open In Colab15.02.2024
OptunaAn automatic hyperparameter optimization software framework, particularly designed for machine learning<ul><li>Takuya Akiba</li> <li>Shotaro Sano</li> <li>Toshihiko Yanase</li> <li>Takeru Ohta</li> <li>Masanori Koyama</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.02.2024
Data augmentationThis tutorial demonstrates data augmentation: a technique to increase the diversity of your training set by applying random transformations such as image rotationBilly Lamberta<ul><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab14.02.2024
Stable CascadeText to image model introduces an interesting three-stage approach, setting new benchmarks for quality, flexibility, fine-tuning, and efficiency with a focus on further eliminating hardware barriersStability AI <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab14.02.2024
CleanVisionAutomatically detects potential issues in image datasets like images that are: blurry, under/over-exposed, (near) duplicates, etccleanlab <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li></ul>Open In Colab13.02.2024
DynamiCrafterAnimating Open-domain Images with Video Diffusion Priors<ul><li>Jinbo Xing</li> <li>Menghan Xia</li> <li>Yong Zhang</li> <li>Haoxin Chen</li><details><summary>others</summary><li>Wangbo Yu</li> <li>Hanyuan Liu</li> <li>Xintao Wang</li> <li>Tien-Tsin Wong</li> <li>Ying Shan</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab12.02.2024
OllamaGet up and running with large language modelsMichael Yang <ul><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.02.2024
XLAAccelerated Linear Algebra is an open-source machine learning compiler for GPUs, CPUs, and ML acceleratorsOpenXLA <ul><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab02.02.2024
ComposerPyTorch library that enables you to train neural networks faster, at lower cost, and to higher accuracyThe Mosaic ML Team <ul><li>app</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/slack.svg" alt="slack" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.02.2024
CycleGANThis notebook demonstrates unpaired image to image translation using conditional GAN's<ul><li>Jun-Yan Zhu</li> <li>Taesung Park</li> <li>Phillip Isola</li> <li>Alexei Efros</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab17.01.2024
Integrated gradientsThis tutorial demonstrates how to implement Integrated Gradients, an Explainable AI technique<ul><li>Mukund Sundararajan</li> <li>Ankur Taly</li> <li>Qiqi Yan</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li>visualizing</li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab17.01.2024
MAGNeTMasked generative sequence modeling method that operates directly over several streams of audio tokens<ul><li>Alon Ziv</li> <li>Itai Gat</li> <li>Gaël Le Lan</li> <li>Tal Remez</li><details><summary>others</summary><li>Felix Kreuk</li> <li>Alexandre Défossez</li> <li>Jade Copet</li> <li>Gabriel Synnaeve</li> <li>Yossi Adi</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab16.01.2024
AutoFaissAutomatically create Faiss knn indices with the most optimal similarity search parametersCtiteo <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul>Open In Colab12.01.2024
Retrieval based Voice Conversion WebUIAn easy-to-use Voice Conversion framework based on VITSRVC-Project <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.01.2024
FlaxNeural network library and ecosystem for JAX designed for flexibility<ul><li>Jonathan Heek</li> <li>Anselm Levskaya</li> <li>Avital Oliver</li> <li>Marvin Ritter</li><details><summary>others</summary><li>Bertrand Rondepierre</li> <li>Andreas Steiner</li> <li>Marc van Zee</li></ul></details> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.01.2024
Big VisionThis codebase is designed for training large-scale vision models using Cloud TPU VMs or GPU machines<ul><li>Lucas Beyer</li> <li>Xiaohua Zhai</li> <li>Alexander Kolesnikov</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab03.01.2024
Open InterpreterAn open-source, locally running implementation of OpenAI's Code InterpreterKillian Lucas <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab03.01.2024
Seamless CommunicationFamily of AI models that enable more natural and authentic communication across languages<ul><li>Loïc Barrault</li> <li>Yu-An Chung</li> <li>Mariano Coria</li> <li>David Dale</li><details><summary>others</summary><li>Ning Dong</li> <li>Mark Duppenthaler</li> <li>Paul-Ambroise Duquenne</li> <li>Hady Elsahar</li> <li>Min-Jae Hwang</li> <li>Hirofumi Inaguma</li> <li>Ilia Kulikov</li> <li>Pengwei Li</li> <li>Daniel Licht</li> <li>Jean Maillard</li> <li>Ruslan Mavlyutov</li> <li>Kaushik Ram Sadagopan</li> <li>Abinesh Ramakrishnan</li> <li>Tuan Tran</li> <li>Guillaume Wenzek</li> <li>Yilin Yang</li> <li>Ethan Ye</li> <li>Ivan Evtimov</li> <li>Pierre Fernandez</li> <li>Robin San Roman</li> <li>Bokai Yu</li> <li>Pierre Andrews</li> <li>Can Balioglu</li> <li>Peng-Jen Chen</li> <li>Marta Costa-jussà</li> <li>Maha Elbayad</li> <li>Hongyu Gong</li> <li>Francisco Guzmán</li> <li>Kevin Heffernan</li> <li>Somya Jain</li> <li>Justine Kao</li> <li>Ann Lee</li> <li>Xutai Ma</li> <li>Benjamin Peloquin</li> <li>Juan Pino</li> <li>Sravya Popuri</li> <li>Holger Schwenk</li> <li>Anna Sun</li> <li>Paden Tomasello</li> <li>Changhan Wang</li> <li>Skyler Wang</li> <li>Mary Williamson</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab14.12.2023
colab2pdfConvert your Colab notebook to a PDFDrengskapurOpen In Colab11.12.2023
Sentence TransformersMultilingual Sentence, Paragraph, and Image Embeddings using BERT & Co<ul><li>Nils Reimers</li> <li>Iryna Gurevych</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul>Open In Colab07.12.2023
CleanRLDeep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features<ul><li>Shengyi Huang</li> <li>Rousslan Dossa</li> <li>Chang Ye</li> <li>Jeff Braga</li><details><summary>others</summary><li>Dipam Chakraborty</li> <li>Kinal Mehta</li> <li>João Araújo</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>paper</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab28.11.2023
VocosClosing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesisHubert Siuzdak <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li></ul>Open In Colab21.11.2023
X—LLMEasy LLM Finetuning using the most advanced methodsBoris Zubarev <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul>Open In Colab15.11.2023
Distil-WhisperMaintains the robustness of the Whisper model to difficult acoustic conditions, while being less prone to hallucination errors on long-form audio<ul><li>Sanchit Gandhi</li> <li>Patrick von Platen</li> <li>Alexander Rush</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab08.11.2023
AnimateDiffPractical framework to animate most of the existing personalized text-to-image models once and for all, saving efforts in model-specific tuning<ul><li>Yuwei Guo</li> <li>Ceyuan Yang</li> <li>Anyi Rao</li> <li>Yaohui Wang</li><details><summary>others</summary><li>Yu Qiao</li> <li>Dahua Lin</li> <li>Bo Dai</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.10.2023
Intel® Neural CompressorAims to provide popular model compression techniques such as quantization, pruning (sparsity), distillation, and neural architecture search on mainstream frameworks such as TensorFlow, PyTorch, ONNX Runtime, and MXNet, as well as Intel extensions such as Intel Extension for TensorFlow and Intel Extension for PyTorchintel <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab27.10.2023
BarkTransformer-based text-to-audio modelsuno <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li>examples</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.10.2023
Mistral TransformerThe most powerful language model for its size to date<ul><li>Albert Jiang</li> <li>Alexandre Sablayrolles</li> <li>Arthur Mensch</li> <li>Chris Bamford</li><details><summary>others</summary><li>Devendra Chaplot</li> <li>Diego Casas</li> <li>Florian Bressand</li> <li>Gianna Lengyel</li> <li>Guillaume Lample</li> <li>Lucile Saulnier</li> <li>Lélio Renard Lavaud</li> <li>Marie-Anne Lachaux</li> <li>Pierre Stock</li> <li>Teven Scao</li> <li>Thibaut Lavril</li> <li>Thomas Wang</li> <li>Timothée Lacroix</li> <li>William Sayed</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab09.10.2023
FooocusImage generating softwareLvmin Zhang <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab03.10.2023
Actor-CriticThis tutorial demonstrates how to implement the Actor-Critic method using TensorFlow to train an agent on the Open AI Gym CartPole-V0 environment<ul><li>Vijay Konda</li> <li>John Tsitsiklis</li></ul><ul><li>gym</li><li><img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab28.09.2023
MMagicAIGC toolbox for professional AI researchers and machine learning engineers to explore image and video processing, editing and generationOpenMMLab <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.09.2023
SeqIOLibrary for processing sequential data to be fed into downstream sequence models<ul><li>Adam Roberts</li> <li>Hyung Won Chung</li> <li>Anselm Levskaya</li> <li>Gaurav Mishra</li><details><summary>others</summary><li>James Bradbury</li> <li>Daniel Andor</li> <li>Sharan Narang</li> <li>Brian Lester</li> <li>Colin Gaffney</li> <li>Afroz Mohiuddin</li> <li>Curtis Hawthorne</li> <li>Aitor Lewkowycz</li> <li>Alex Salcianu</li> <li>Marc van Zee</li> <li>Jacob Austin</li> <li>Sebastian Goodman</li> <li>Livio Baldini Soares</li> <li>Haitang Hu</li> <li>Sasha Tsvyashchenko</li> <li>Aakanksha Chowdhery</li> <li>Jasmijn Bastings</li> <li>Jannis Bulian</li> <li>Xavier Garcia</li> <li>Jianmo Ni</li> <li>Andrew Chen</li> <li>Kathleen Kenealy</li> <li>Jonathan Clark</li> <li>Stephan Lee</li> <li>Dan Garrette</li> <li>James Lee-Thorp</li> <li>Colin Raffel</li> <li>Noam Shazeer</li> <li>Marvin Ritter</li> <li>Maarten Bosma</li> <li>Alexandre Passos</li> <li>Jeremy Maitin-Shepard</li> <li>Noah Fiedel</li> <li>Mark Omernick</li> <li>Brennan Saeta</li> <li>Ryan Sepassi</li> <li>Alexander Spiridonov</li> <li>Joshua Newlan</li> <li>Andrea Gesmundo</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab08.09.2023
MMAction2An open-source toolbox for video understanding based on PyTorchMMAction2 Contributors <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data, data, data</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab06.09.2023
RayUnified framework for scaling AI and Python applications<ul><li>Philipp Moritz</li> <li>Robert Nishihara</li> <li>Stephanie Wang</li> <li>Alexey Tumanov</li><details><summary>others</summary><li>Richard Liaw</li> <li>Eric Liang</li> <li>Melih Elibol</li> <li>Zongheng Yang</li> <li>William Paul</li> <li>Michael Jordan</li> <li>Ion Stoica</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.09.2023
Home RobotLow-level API for controlling various home robotsChris Paxton <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab30.08.2023
Neural TangentsLibrary designed to enable research into infinite-width neural networks<ul><li>Roman Novak</li> <li>Lechao Xiao</li> <li>Jiri Hron</li> <li>Jaehoon Lee</li><details><summary>others</summary><li>Alexander Alemi</li> <li>Jascha Sohl-Dickstein</li> <li>Samuel Schoenholz</li></ul></details> <ul><li>ICLR</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab29.08.2023
Stable Diffusion 2New stable diffusion model at 768x768 resolution. Same number of parameters in the U-Net as 1.5, but uses OpenCLIP-ViT/H as the text encoder and is trained from scratch<ul><li>Robin Rombach</li> <li>Andreas Blattmann</li> <li>Dominik Lorenz</li> <li>Patrick Esser</li><details><summary>others</summary><li>Björn Ommer</li> <li>qunash</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.08.2023
DALL·E MiniGenerate images from a text prompt<ul><li>Boris Dayma</li> <li>Suraj Patil</li> <li>Pedro Cuenca</li> <li>Khalid Saifullah</li><details><summary>others</summary><li>Tanishq Abraham</li> <li>Phúc H. Lê Khắc</li> <li>Luke Melas</li> <li>Ritobrata Ghosh</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab22.08.2023
Classify text with BERTThis tutorial contains complete code to fine-tune BERT to perform sentiment analysis on a dataset of plain-text IMDB movie reviewsAnirudh Dubey <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab08.08.2023
Kandinsky 2.1As text and image encoder it uses CLIP model and diffusion image prior between latent spaces of CLIP modalities<ul><li>Arseniy Shakhmatov</li> <li>Anton Razzhigaev</li> <li>Aleksandr Nikolich</li> <li>Vladimir Arkhipkin</li><details><summary>others</summary><li>Igor Pavlov</li> <li>Andrey Kuznetsov</li> <li>Denis Dimitrov</li></ul></details> <ul><li>blog post</li><li>demo</li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab07.08.2023
SoftVC VITSSinging Voice Conversionsvc develop team <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab31.07.2023
threestudioUnified framework for 3D content creation from text prompts, single images, and few-shot images, by lifting 2D text-to-image generation models<ul><li>Yuan-Chen Guo</li> <li>Ying-Tian Liu</li> <li>Ruizhi Shao</li> <li>Christian Laforte</li><details><summary>others</summary><li>Vikram Voleti</li> <li>Guan Luo</li> <li>Chia-Hao Chen</li> <li>Zi-Xin Zou</li> <li>Chen Wang</li> <li>Yanpei Cao</li> <li>Song-Hai Zhang</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab28.07.2023
Image captioningGiven an image our goal is to generate a caption<ul><li>Kelvin Xu</li> <li>Jimmy Ba</li> <li>Ryan Kiros</li> <li>Kyunghyun Cho</li><details><summary>others</summary><li>Aaron Courville</li> <li>Ruslan Salakhutdinov</li> <li>Richard Zemel</li> <li>Yoshua Bengio</li></ul></details><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab25.07.2023
Word2VecWord2Vec is not a singular algorithm, rather, it is a family of model architectures and optimizations that can be used to learn word embeddings from large datasetsGoogle<ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>link</li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>projector</li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab25.07.2023
Word embeddingsThis tutorial contains an introduction to word embeddingsBilly Lamberta<ul><li>data</li><li>projector</li></ul>Open In Colab25.07.2023
Contextualized Topic ModelsFamily of topic models that use pre-trained representations of language to support topic modeling<ul><li>Federico Bianchi</li> <li>Silvia Terragni</li> <li>Dirk Hovy</li> <li>Debora Nozza</li> <li>Elisabetta Fersini</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab22.07.2023
TortoiseA multi-voice TTS system trained with an emphasis on qualityJames Betker <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>examples</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.07.2023
PetalsRun 100B+ language models at home, BitTorrent-styleBigScience <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab05.07.2023
Epistemic Neural NetworksA library for neural networks that know what they don't know<ul><li>Ian Osband</li> <li>Zheng Wen</li> <li>Seyed Mohammad Asghari</li> <li>Vikranth Dwaracherla</li><details><summary>others</summary><li>Morteza Ibrahimi</li> <li>Xiuyuan Lu</li> <li>Benjamin Van Roy</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab27.06.2023
DeepFloyd IFState-of-the-art open-source text-to-image model with a high degree of photorealism and language understanding<ul><li>Alex Shonenkov</li> <li>Misha Konstantinov</li> <li>Daria Bakshandaeva</li> <li>Christoph Schuhmann</li><details><summary>others</summary><li>Ksenia Ivanova</li> <li>Nadiia Klokova</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.06.2023
normflowsPyTorch implementation of discrete normalizing flows<ul><li>Vincent Stimper</li> <li>David Liu</li> <li>Andrew Campbell</li> <li>Vincent Berenz</li><details><summary>others</summary><li>Lukas Ryll</li> <li>Bernhard Schölkopf</li> <li>José Miguel Hernández-Lobato</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab26.06.2023
MMPoseToolbox for pose estimation based on PyTorchOpenMMLab <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.06.2023
MyoSuiteA collection of musculoskeletal environments and tasks simulated with the MuJoCo physics engine and wrapped in the OpenAI gym API to enable the application of Machine Learning to bio-mechanic control problems<ul><li>Vittorio Caggiano</li> <li>Huawei Wang</li> <li>Guillaume Durandau</li> <li>Massimo Sartori</li> <li>Vikash Kumar</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul>Open In Colab16.06.2023
AudiocraftPyTorch library for deep learning research on audio generation<ul><li>Jade Copet</li> <li>Felix Kreuk</li> <li>Itai Gat</li> <li>Tal Remez</li><details><summary>others</summary><li>David Kant</li> <li>Gabriel Synnaeve</li> <li>Yossi Adi</li> <li>Alexandre Défossez</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.06.2023
Detectron2FAIR's next-generation platform for object detection and segmentationYuxin Wu <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab26.05.2023
ReverbEfficient and easy-to-use data storage and transport system designed for machine learning research<ul><li>Albin Cassirer</li> <li>Gabriel Barth-Maron</li> <li>Eugene Brevdo</li> <li>Sabela Ramos</li><details><summary>others</summary><li>Toby Boyd</li> <li>Thibault Sottiaux</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab23.05.2023
MMDetectionOpen source object detection toolbox based on PyTorchOpenMMLab <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.05.2023
ChatRWKVLike ChatGPT but powered by RWKV (100% RNN) language model, which is the only RNN that can match transformers in quality and scaling, while being faster and saves VRAM<ul><li>Bo Peng</li> <li>Eric Alcaide</li> <li>Quentin Anthony</li> <li>Alon Albalak</li><details><summary>others</summary><li>Samuel Arcadinho</li> <li>Matteo Grella</li> <li>Kranthi Kiran</li> <li>Haowen Hou</li> <li>Przemyslaw Kazienko</li> <li>Jan Kocon</li> <li>Bartlomiej Koptyra</li> <li>Ipsit Mantri</li> <li>Ferdinand Mom</li> <li>Xiangru Tang</li> <li>Johan Wind</li> <li>Stanisław Woźniak</li> <li>Qihang Zhao</li> <li>Peng Zhou</li> <li>Jian Zhu</li> <li>Rui-Jie Zhu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab08.05.2023
Python Data Science HandbookJupyter notebook version of the Python Data Science Handbook by Jake VanderPlasJake Vanderplas <ul><li>project</li></ul>Open In Colab06.05.2023
PGMaxGeneral factor graphs for discrete probabilistic graphical models, and hardware-accelerated differentiable loopy belief propagation in JAX<ul><li>Guangyao Zhou</li> <li>Nishanth Kumar</li> <li>Antoine Dedieu</li> <li>Miguel Lázaro-Gredilla</li><details><summary>others</summary><li>Shrinu Kushagra</li> <li>Dileep George</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab05.05.2023
StableLMStability AI Language ModelsStability AI <ul><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab27.04.2023
TTSA library for advanced Text-to-Speech generation, built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality<ul><li>Eren Gölge</li> <li>Aya-AlJafari</li> <li>Edresson Casanova</li> <li>Josh Meyer</li><details><summary>others</summary><li>Kelly Davis</li> <li>Reuben Morais</li></ul></details> <ul><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>samples</li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.04.2023
OpenCLIPAn open source implementation of CLIP<ul><li>Ross Wightman</li> <li>Cade Gordon</li> <li>Vaishaal Shankar</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab16.04.2023
Stable Baselines3Set of reliable implementations of reinforcement learning algorithms in PyTorch<ul><li>Antonin Raffin</li> <li>Ashley Hill</li> <li>Adam Gleave</li> <li>Anssi Kanervisto</li><details><summary>others</summary><li>Maximilian Ernestus</li> <li>Noah Dormann</li></ul></details> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>paper</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab14.04.2023
RL Baselines3 ZooTraining Framework for Stable Baselines3 Reinforcement Learning AgentsAntonin Raffin <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab14.04.2023
Grounded-SAMMarrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate AnythingIDEA-Research <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab12.04.2023
TFDSCollection of ready-to-use datasets for use with TensorFlow, Jax, and other Machine Learning frameworksGoogle <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.04.2023
OptimumExtension of Transformers and Diffusers, providing a set of optimization tools enabling maximum efficiency to train and run models on targeted hardware, while keeping things easy to useHugging Face <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.04.2023
MMOCROpen source toolkit based on PyTorch and MMDetection, supporting numerous OCR-related models, including text detection, text recognition, and key information extraction<ul><li>Zhanghui Kuang</li> <li>Hongbin Sun</li> <li>Zhizhong Li</li> <li>Xiaoyu Yue</li><details><summary>others</summary><li>Tsui Hin Lin</li> <li>Jianyong Chen</li> <li>Huaqiang Wei</li> <li>Yiqin Zhu</li> <li>Tong Gao</li> <li>Wenwei Zhang</li> <li>Kai Chen</li> <li>Wayne Zhang</li> <li>Dahua Lin</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab06.04.2023
MMSegmentationOpen source semantic segmentation toolbox based on PyTorchOpenMMLab <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab31.03.2023
LAVISPython deep learning library for LAnguage-and-VISion intelligence research and applications<ul><li>Dongxu Li</li> <li>Junnan Li</li> <li>Hung Le</li> <li>Guangsen Wang</li><details><summary>others</summary><li>Silvio Savarese</li> <li>Steven Hoi</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab24.03.2023
AudioLMFramework for high-quality audio generation with long-term consistency<ul><li>Phil Wang</li> <li>Zalán Borsos</li> <li>Raphaël Marinier</li> <li>Damien Vincent</li><details><summary>others</summary><li>Eugene Kharitonov</li> <li>Olivier Pietquin</li> <li>Matt Sharifi</li> <li>Olivier Teboul</li> <li>David Grangier</li> <li>Marco Tagliasacchi</li> <li>Neil Zeghidour</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab23.03.2023
pymdpPackage for simulating Active Inference agents in Markov Decision Process environments<ul><li>Conor Heins</li> <li>Alec Tschantz</li> <li>Beren Millidge</li> <li>Brennan Klein</li><details><summary>others</summary><li>Arun Niranjan</li> <li>Daphne Demekas</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul>Open In Colab19.03.2023
TzerCoverage-Guided Tensor Compiler Fuzzing with Joint IR-Pass Mutation<ul><li>Jiawei Liu</li> <li>Yuxiang Wei</li> <li>Sen Yang</li> <li>Yinlin Deng</li> <li>Lingming Zhang</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab09.03.2023
ArtLineA Deep Learning based project for creating line art portraitsVijish Madhavan <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab03.03.2023
HaikuA library built on top of JAX designed to provide simple, composable abstractions for machine learning research<ul><li>Tom Hennigan</li> <li>Trevor Cai</li> <li>Tamara Norman</li> <li>Igor Babuschkin</li></ul> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li></ul>Open In Colab02.03.2023
SAHIA lightweight vision library for performing large scale object detection & instance segmentation<ul><li>Fatih Cagatay Akyon</li> <li>Sinan Onur ALTINUÇ</li> <li>Alptekin Temizel</li> <li>Cemil Cengiz</li><details><summary>others</summary><li>Devrim Çavuşoğlu</li> <li>Kadir Şahin</li> <li>Oğulcan Eryüksel</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li></ul>Open In Colab23.02.2023
AmpliGraphA suite of neural machine learning models for relational Learning, a branch of machine learning that deals with supervised learning on knowledge graphs<ul><li>Luca Costabello</li> <li>Adrianna Janik</li> <li>Chan Le Van</li> <li>Nicholas McCarthy</li><details><summary>others</summary><li>Rory McGrath</li> <li>Sumit Pai</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab23.02.2023
NMT with attentionThis notebook trains a seq2seq model for Spanish to English translation<ul><li>Minh-Thang Luong</li> <li>Hieu Pham</li> <li>Christopher Manning</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab15.02.2023
GLUE using BERT on TPUThis tutorial contains complete end-to-end code to train models on a TPUAnirudh Dubey<ul><li>GLUE</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab15.02.2023
TensorBoardSuite of web applications for inspecting and understanding your TensorFlow runs and graphsYuan Tang <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.02.2023
High-performance Simulation with KubernetesThis tutorial will describe how to set up high-performance simulation using a TFF runtime running on KubernetesJason Roselander<ul><li>GKE</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li>shell</li></ul>Open In Colab31.01.2023
CompelText prompt weighting and blending library for transformers-type text embedding systemsDamian Stewart <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab26.01.2023
DALL·E FlowAn interactive workflow for generating high-definition images from text prompt<ul><li>Han Xiao</li> <li>Delgermurun Purevkhuu</li> <li>Alex Cureton-Griffiths</li></ul> <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.01.2023
DiffusersProvides pretrained diffusion models across multiple modalities, such as vision and audio, and serves as a modular toolbox for inference and training of diffusion modelsHugging Face <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.01.2023
Sample FactoryOne of the fastest RL libraries focused on very efficient synchronous and asynchronous implementations of policy gradients<ul><li>Aleksei Petrenko</li> <li>Zhehui Huang</li> <li>Tushar Kumar</li> <li>Gaurav Sukhatme</li> <li>Vladlen Koltun</li></ul> <ul><li>ICML</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.01.2023
Open-AssistantChat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so<ul><li>Andreas Köpf</li> <li>Yannic Kilcher</li> <li>Huu Nguyen</li> <li>Christoph Schuhmann</li><details><summary>others</summary><li>Keith Stevens</li> <li>Abdullah Barhoum</li> <li>Nguyen Minh Duc</li> <li>Oliver Stanley</li> <li>James Melvin Ebenezer</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab14.01.2023
panda-gymSet of robotic environments based on PyBullet physics engine and gymnasium<ul><li>Quentin Gallouédec</li> <li>Nicolas Cazin</li> <li>Emmanuel Dellandréa</li> <li>Liming Chen</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab02.01.2023
BANMoGiven multiple casual videos capturing a deformable object, BANMo reconstructs an animatable 3D model, including an implicit canonical 3D shape, appearance, skinning weights, and time-varying articulations, without pre-defined shape templates or registered cameras<ul><li>Gengshan Yang</li> <li>Minh Vo</li> <li>Natalia Neverova</li> <li>Deva Ramanan</li><details><summary>others</summary><li>Andrea Vedaldi</li> <li>Hanbyul Joo</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.12.2022
tensor_parallelRun large PyTorch models on multiple GPUs in one line of code with potentially linear speedupAndrei Panferov <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul>Open In Colab29.12.2022
TPUReference models and tools for Cloud TPUsGoogle <ul><li>website</li><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab20.12.2022
rliableLibrary for reliable evaluation, even with a handful of runs, on reinforcement learning and machine learnings benchmarks<ul><li>Rishabh Agarwal</li> <li>Max Schwarzer</li> <li>Pablo Castro</li> <li>Aaron Courville</li> <li>Marc Bellemare</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post, blog post</li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>podcast</li><li>poster</li><li>project</li><li>slides</li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.12.2022
TF-AgentsA reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning<ul><li>Sergio Guadarrama</li> <li>Anoop Korattikara</li> <li>Oscar Ramirez</li> <li>Pablo Castro</li><details><summary>others</summary><li>Ethan Holly</li> <li>Sam Fishman</li> <li>Ke Wang</li> <li>Ekaterina Gonina</li> <li>Neal Wu</li> <li>Efi Kokiopoulou</li> <li>Luciano Sbaiz</li> <li>Jamie Smith</li> <li>Gábor Bartók</li> <li>Jesse Berent</li> <li>Chris Harris</li> <li>Vincent Vanhoucke</li> <li>Eugene Brevdo</li></ul></details> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.12.2022
PyGLibrary built upon PyTorch to easily write and train Graph Neural Networks for a wide range of applications related to structured data<ul><li>Matthias Fey</li> <li>Jan Eric Lenssen</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab08.12.2022
ruGPT3Example of inference of RuGPT3XLAnton Emelyanov <ul><li>cristofari</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>sparse attention</li></ul>Open In Colab07.12.2022
DSP theoryTheory of digital signal processing: signals, filtration (IIR, FIR, CIC, MAF), transforms (FFT, DFT, Hilbert, Z-transform) etc<ul><li>Alexander Kapitanov</li> <li>Vladimir Fadeev</li> <li>Karina Kvanchiani</li> <li>Elizaveta Petrova</li> <li>Andrei Makhliarchuk</li></ul> <ul><li>blog post</li></ul>Open In Colab18.10.2022
MubertPrompt-based music generation via Mubert APIIlya Belikov <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.10.2022
RuDOLPHA fast and light text-image-text transformer designed for a quick and easy fine-tuning setup for the solution of various tasks: from generating images by text description and image classification to visual question answering and more<ul><li>Alex Shonenkov</li> <li>Misha Konstantinov</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul>Open In Colab06.10.2022
Batch RLOffline RL using the DQN replay dataset comprising the entire replay experience of a DQN agent on 60 Atari 2600 games<ul><li>Rishabh Agarwal</li> <li>Dale Schuurmans</li> <li>Mohammad Norouzi</li></ul> <ul><li>DQN</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li>slides</li><li>talk</li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab04.10.2022
EfficientDetNew family of object detectors, called EfficientDet, which consistently achieve much better efficiency than prior art across a wide spectrum of resource constraints<ul><li>Mingxing Tan</li> <li>Ruoming Pang</li> <li>Quoc Le</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li>tutorial</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab27.09.2022
RL GamesHigh performance RL library<ul><li>Denys Makoviichuk</li> <li>Viktor Makoviychuk</li></ul> <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul>Open In Colab27.09.2022
ACMEA library of reinforcement learning components and agents<ul><li>Matt Hoffman</li> <li>Bobak Shahriari</li> <li>John Aslanides</li> <li>Gabriel Barth-Maron</li><details><summary>others</summary><li>Feryal Behbahani</li> <li>Tamara Norman</li> <li>Abbas Abdolmaleki</li> <li>Albin Cassirer</li> <li>Fan Yang</li> <li>Kate Baumli</li> <li>Sarah Henderson</li> <li>Alex Novikov</li> <li>Sergio Gómez Colmenarejo</li> <li>Serkan Cabi</li> <li>Caglar Gulcehre</li> <li>Tom Le Paine</li> <li>Andrew Cowie</li> <li>Ziyu Wang</li> <li>Bilal Piot</li> <li>Nando de Freitas</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.09.2022
RWKVReinventing RNNs for the Transformer Era<ul><li>Bo Peng</li> <li>Eric Alcaide</li> <li>Quentin Anthony</li> <li>Alon Albalak</li><details><summary>others</summary><li>Samuel Arcadinho</li> <li>Matteo Grella</li> <li>Kranthi Kiran</li> <li>Haowen Hou</li> <li>Przemyslaw Kazienko</li> <li>Jan Kocon</li> <li>Bartlomiej Koptyra</li> <li>Ipsit Mantri</li> <li>Ferdinand Mom</li> <li>Xiangru Tang</li> <li>Johan Wind</li> <li>Stanisław Woźniak</li> <li>Qihang Zhao</li> <li>Peng Zhou</li> <li>Jian Zhu</li> <li>Rui-Jie Zhu</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li>demo</li><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/>, <img src="images/twitter.svg" alt="twitter" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab21.09.2022
NetKetOpen-source project delivering cutting-edge methods for the study of many-body quantum systems with artificial neural networks and machine learning techniques<ul><li>Filippo Vicentini</li> <li>Damian Hofmann</li> <li>Attila Szabó</li> <li>Dian Wu</li><details><summary>others</summary><li>Christopher Roth</li> <li>Clemens Giuliani</li> <li>Gabriel Pescia</li> <li>Jannes Nys</li> <li>Vladimir Vargas-Calderón</li> <li>Nikita Astrakhantsev</li> <li>Giuseppe Carleo</li> <li>Kenny Choo</li> <li>James Smith</li> <li>Tom Westerhout</li> <li>Fabien Alet</li> <li>Emily Davis</li> <li>Stavros Efthymiou</li> <li>Ivan Glasser</li> <li>Sheng-Hsuan Lin</li> <li>Marta Mauri</li> <li>Mazzola Guglielmo</li> <li>Christian Mendl</li> <li>Evert Nieuwenburg</li> <li>Ossian O'Reilly</li> <li>Hugo Théveniaut</li> <li>Giacomo Torlai</li> <li>Alexander Wietek</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.09.2022
Stable DiffusionA latent text-to-image diffusion model<ul><li>Robin Rombach</li> <li>Andreas Blattmann</li> <li>Dominik Lorenz</li> <li>Patrick Esser</li> <li>Björn Ommer</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab10.08.2022
Deep-MACWelcome to the Novel class segmentation demoVighnesh Birodkar<ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab10.08.2022
NL-AugmenterA collaborative effort intended to add transformations of datasets dealing with natural language<ul><li>Aadesh Gupta</li> <li>Timothy Sum Hon Mun</li> <li>Aditya Srivatsa</li> <li>Xudong Shen</li><details><summary>others</summary><li>Juan Diego Rodriguez</li> <li>Ashish Shrivastava</li> <li>Nagender Aneja</li> <li>Zijie Wang</li> <li>Yiwen Shi</li> <li>Afnan Mir</li> <li>William Soto</li> <li>Chandan Singh</li> <li>Claude Roux</li> <li>Abinaya Mahendiran</li> <li>Anna Shvets</li> <li>Kaustubh Dhole</li> <li>Bryan Wilie</li> <li>Jamie Simon</li> <li>Mukund Varma</li> <li>Sang Han</li> <li>Denis Kleyko</li> <li>Samuel Cahyawijaya</li> <li>Filip Cornell</li> <li>Tanay Dixit</li> <li>Connor Boyle</li> <li>Genta Indra Winata</li> <li>Seungjae Ryan Lee</li> <li>Marcin Namysl</li> <li>Roman Sitelew</li> <li>Zhenhao Li</li> <li>Fiona Tan</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>website</li></ul>Open In Colab06.08.2022
XManagerFramework for managing machine learning experimentAndrew Chen <ul><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li>slides</li></ul>Open In Colab29.07.2022
AccelerateA simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precisionHugging Face <ul><li><img src="images/docs.svg" alt="docs" height=20/></li></ul>Open In Colab27.07.2022
YOLOv5 on Custom ObjectsThis notebook shows training on your own custom objectsJacob Solawetz<ul><li>blog post</li><li>data</li></ul>Open In Colab20.07.2022
MindsEyeGraphical user interface built to run multimodal ai art models for free from a Google Colab, without needing edit a single line of code or know any programming<ul><li>multimodal.art</li> <li>João Paulo Apolinário Passos</li></ul> <ul><li><img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab06.07.2022
py-irtFitting Item Response Theory models using variational inference<ul><li>John Lalor</li> <li>Hong Yu</li> <li>Pedro Rodriguez</li> <li>Joe Barrow</li><details><summary>others</summary><li>Alexander Hoyle</li> <li>Robin Jia</li> <li>Jordan Boyd-Graber</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>paper</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.06.2022
BIG-benchA collaborative benchmark intended to probe large language models and extrapolate their future capabilities<ul><li>Jaehoon Lee</li> <li>Jascha Sohl-Dickstein</li> <li>Vinay Ramasesh</li> <li>Sajant Anand</li><details><summary>others</summary><li>Alicia Parrish</li> <li>Ethan Dyer</li> <li>Liam Dugan</li> <li>Dieuwke Hupkes</li> <li>Daniel Freeman</li> <li>Guy Gur-Ari</li> <li>Aitor Lewkowycz</li></ul></details> <ul><li>API</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab28.06.2022
HuggingArtistsChoose your favorite Artist and train a language model to write new lyrics based on their unique voiceAleksey Korshuk <ul><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab25.06.2022
Introduction to the TensorFlow Models NLP libraryYou will learn how to build transformer-based models for common NLP tasks including pretraining, span labelling and classification using the building blocks from NLP modeling libraryChen Chen <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab22.06.2022
CirqA python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum circuits<ul><li>Balint Pato</li> <li>Matthew Harrigan</li> <li>Animesh Sinha</li> <li>Matthew Neeley</li><details><summary>others</summary><li>Dave Bacon</li> <li>Matteo Pompili</li> <li>Michael Broughton</li></ul></details> <ul><li><img src="images/wiki.svg" alt="wiki" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab21.06.2022
CLIP-as-serviceA low-latency high-scalability service for embedding images and textHan Xiao <ul><li>data</li><li><img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.06.2022
JinaMLOps framework that empowers anyone to build cross-modal and multi-modal applications on the cloudHan Xiao <ul><li>data</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>hub</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab11.06.2022
MMRotateToolbox for rotated object detection based on PyTorch<ul><li>Yue Zhou</li> <li>Xue Yang</li> <li>Gefan Zhang</li> <li>Jiabao Wang</li><details><summary>others</summary><li>Yanyi Liu</li> <li>Liping Hou</li> <li>Xue Jiang</li> <li>Xingzhao Liu</li> <li>Junchi Yan</li> <li>Chengqi Lyu</li> <li>Wenwei Zhang</li> <li>Kai Chen</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/>, <img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.06.2022
Aesthetics PredictorA linear estimator on top of clip to predict the aesthetic quality of picturesLAION AI <ul><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li></ul>Open In Colab04.06.2022
FlashlightFast, flexible machine learning library written entirely in C++<ul><li>Jacob Kahn</li> <li>Vineel Pratap</li> <li>Tatiana Likhomanenko</li> <li>Qiantong Xu</li><details><summary>others</summary><li>Awni Hannun</li> <li>Jeff Cai</li> <li>Paden Tomasello</li> <li>Ann Lee</li> <li>Edouard Grave</li> <li>Gilad Avidov</li> <li>Benoit Steiner</li> <li>Vitaliy Liptchinsky</li> <li>Gabriel Synnaeve</li> <li>Ronan Collobert</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab01.06.2022
RL UnpluggedSuite of benchmarks for offline reinforcement learning<ul><li>Caglar Gulcehre</li> <li>Ziyu Wang</li> <li>Alexander Novikov</li> <li>Tom Le Paine</li><details><summary>others</summary><li>Sergio Gómez Colmenarejo</li> <li>Konrad Żołna</li> <li>Rishabh Agarwal</li> <li>Josh Merel</li> <li>Daniel Mankowitz</li> <li>Cosmin Paduraru</li> <li>Gabriel Dulac-Arnold</li> <li>Jerry Li</li> <li>Mohammad Norouzi</li> <li>Matt Hoffman</li> <li>Ofir Nachum</li> <li>George Tucker</li> <li>Nicolas Heess</li> <li>Nando de Freitas</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.05.2022
ScenicCodebase with a focus on research around attention-based models for computer vision<ul><li>Mostafa Dehghani</li> <li>Alexey Gritsenko</li> <li>Anurag Arnab</li> <li>Matthias Minderer</li> <li>Yi Tay</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab04.05.2022
Text generation with RNNThis tutorial demonstrates how to generate text using a character-based RNNAnirudh Dubey<ul><li>link</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab03.05.2022
CLIPDrawSynthesize drawings to match a text prompt<ul><li>Kevin Frans</li> <li>Lisa Soros</li> <li>Olaf Witkowski</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab29.04.2022
CodeGenFamily of open-source model for program synthesis<ul><li>Erik Nijkamp</li> <li>Bo Pang</li> <li>Hiroaki Hayashi</li> <li>Lifu Tu</li><details><summary>others</summary><li>Huan Wang</li> <li>Yingbo Zhou</li> <li>Silvio Savarese</li> <li>Caiming Xiong</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li></ul>Open In Colab23.04.2022
Jraphlibrary for graph neural networks in jax<ul><li>Jonathan Godwin</li> <li>Thomas Keck</li> <li>Peter Battaglia</li> <li>Victor Bapst</li><details><summary>others</summary><li>Thomas Kipf</li> <li>Yujia Li</li> <li>Kimberly Stachenfeld</li> <li>Petar Veličković</li> <li>Alvaro Sanchez-Gonzalez</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab15.04.2022
deep-significanceEasy-to-use package containing different significance tests and utility functions specifically tailored towards research needs and usability<ul><li>Dennis Ulmer</li> <li>Christian Hardmeier</li> <li>Jes Frellsen</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab12.04.2022
Text classification with RNNThis text classification tutorial trains a recurrent neural network on the IMDB large movie review dataset for sentiment analysisAnirudh Dubey<ul><li>data</li><li>link</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab17.03.2022
TriMapDimensionality reduction technique based on triplet constraints, which preserves the global structure of the data better than the other commonly used methods such as t-SNE, LargeVis, and UMAP<ul><li>Ehsan Amid</li> <li>Manfred Warmuth</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab17.03.2022
RLDSReinforcement Learning Datasets and it is an ecosystem of tools to store, retrieve and manipulate episodic data in the context of Sequential Decision Making including RL, Learning for Demonstrations, Offline RL or Imitation Learning<ul><li>Sabela Ramos</li> <li>Sertan Girgin</li> <li>Léonard Hussenot</li> <li>Damien Vincent</li><details><summary>others</summary><li>Hanna Yakubovich</li> <li>Daniel Toyama</li> <li>Anita Gergely</li> <li>Piotr Stanczyk</li> <li>Raphaël Marinier</li> <li>Jeremiah Harmsen</li> <li>Olivier Pietquin</li> <li>Nikola Momchev</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab16.03.2022
Real-Time Voice CloningSV2TTS with a vocoder that works in real-time<ul><li>Corentin Jemine</li> <li>Erdene-Ochir Tuguldur</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab08.03.2022
BLIPVLP framework which transfers flexibly to both vision-language understanding and generation tasks<ul><li>Junnan Li</li> <li>Dongxu Li</li> <li>Caiming Xiong</li> <li>Steven Hoi</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab03.03.2022
VideoGPTA conceptually simple architecture for scaling likelihood based generative modeling to natural videos<ul><li>Wilson Yan</li> <li>Yunzhi Zhang</li> <li>Pieter Abbeel</li> <li>Aravind Srinivas</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li></ul>Open In Colab02.03.2022
Silero ModelsPre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simpleSilero team <ul><li>STT, STT, STT</li><li>TTS, TTS</li><li>Text Enhancement</li><li>VAD, VAD</li><li>website</li></ul>Open In Colab27.02.2022
Real-CUGANAI super resolution model for anime images, trained in a million scale anime dataset, using the same architecture as Waifu2x-CUNetbilibili <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab27.02.2022
ArcaneGANProcess video in the style of the Arcane animated seriesAlexander Spirin <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.02.2022
textlesslibA library aimed to facilitate research in Textless NLP<ul><li>Eugene Kharitonov</li> <li>Jade Copet</li> <li>Kushal Lakhotia</li> <li>Nguyễn Tú Anh</li><details><summary>others</summary><li>Paden Tomasello</li> <li>Ann Lee</li> <li>Ali Elkahky</li> <li>Wei-Ning Hsu</li> <li>Abdelrahman Mohamed</li> <li>Emmanuel Dupoux</li> <li>Yossi Adi</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab15.02.2022
AV-HuBERTSelf-supervised representation learning framework for audio-visual speech<ul><li>Bowen Shi</li> <li>Wei-Ning Hsu</li> <li>Kushal Lakhotia</li> <li>Abdelrahman Mohamed</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li></ul>Open In Colab12.02.2022
LingvoFramework for building neural networks in Tensorflow, particularly sequence models<ul><li>Jonathan Shen</li> <li>Patrick Nguyen</li> <li>Yonghui Wu</li> <li>Zhifeng Chen</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docker.svg" alt="docker" height=20/>, <img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul>Open In Colab28.01.2022
DeepDreamThis tutorial contains a minimal implementation of DeepDream: an experiment that visualizes the patterns learned by a neural network<ul><li>Alexander Mordvintsev</li> <li>Billy Lamberta</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab13.01.2022
FuseDreamTraining-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization<ul><li>Xingchao Liu</li> <li>Chengyue Gong</li> <li>Lemeng Wu</li> <li>Hao Su</li> <li>Qiang Liu</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab02.01.2022
MLPThe most basic neural network architectures, a multilayer perceptron, also known as a feedforward networkBen Trevett<ul><li>NN and DL</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>optimization</li><li><img src="images/pt.svg" alt="pt" height=20/>, <img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab26.12.2021
AlexNetA neural network model that uses convolutional neural network layers and was designed for the ImageNet challengeBen Trevett <ul><li>ILSVRC</li><li>LR</li><li>PMLR</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>cifar-10</li><li>dropout</li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li>[<img src="images/wiki.svg" alt="wiki" height=20/>](https://en.wikipedia.org/wiki/Regularization_(mathematics), <img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab26.12.2021
VGGVery Deep Convolutional Networks for Large-Scale Image RecognitionBen Trevett <ul><li>ILSVRC</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>cifar-10</li><li><img src="images/pt.svg" alt="pt" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab26.12.2021
LeNetA neural network model that uses convolutional neural network layers and was designed for classifying handwritten charactersBen Trevett<ul><li>CNN</li><li>LeNet-5</li><li>guide</li><li>paper</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab26.12.2021
Music ComposerSynthesizing symbolic music in MIDI format using the Music Transformer modelbazanovvanya <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab20.12.2021
FLAMLLightweight Python library that finds accurate machine learning models automatically, efficiently and economically<ul><li>Chi Wang</li> <li>Qingyun Wu</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>paper</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.12.2021
CompilerGymA reinforcement learning toolkit for compiler optimizations<ul><li>Chris Cummins</li> <li>Bram Wasti</li> <li>Jiadong Guo</li> <li>Brandon Cui</li><details><summary>others</summary><li>Jason Ansel</li> <li>Sahir Gomez</li> <li>Olivier Teytaud</li> <li>Benoit Steiner</li> <li>Yuandong Tian</li> <li>Hugh Leather</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul>Open In Colab16.11.2021
ReformerPerforms on par with Transformer models while being much more memory-efficient and much faster on long sequences<ul><li>Phil Wang</li> <li>Nikita Kitaev</li> <li>Łukasz Kaiser</li> <li>Anselm Levskaya</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/>, <img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab07.11.2021
ruDALL·EGenerate images from texts in RussianAlex Shonenkov <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li>project</li></ul>Open In Colab03.11.2021
DeepStyleThe Neural Style algorithm synthesizes a pastiche by separating and combining the content of one image with the style of another image using convolutional neural networks<ul><li>Cameron Smith</li> <li>Alexander Spirin</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>cvpr</li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab01.10.2021
Text2AnimationGenerate images from text phrases with VQGAN and CLIP with animation and keyframes<ul><li>Katherine Crowson</li> <li>Ryan Murdock</li> <li>Chigozie Nri</li> <li>Denis Malimonov</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab29.09.2021
EfficientNetV2A family of image classification models, which achieve better parameter efficiency and faster training speed than prior arts<ul><li>Mingxing Tan</li> <li>Quoc Le</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab24.09.2021
Clip retrievalEasily compute clip embeddings and build a clip retrieval system with themRomain Beaumont <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab21.09.2021
img2datasetEasily turn large sets of image urls to an image datasetRomain Beaumont <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab17.09.2021
DroidletA modular embodied agent architecture and platform for building embodied agents<ul><li>Anurag Pratik</li> <li>Soumith Chintala</li> <li>Kavya Srinet</li> <li>Dhiraj Gandhi</li><details><summary>others</summary><li>Rebecca Qian</li> <li>Yuxuan Sun</li> <li>Ryan Drew</li> <li>Sara Elkafrawy</li> <li>Anoushka Tiwari</li> <li>Tucker Hart</li> <li>Mary Williamson</li> <li>Abhinav Gupta</li> <li>Arthur Szlam</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul>Open In Colab15.09.2021
GPT-J-6BA 6 billion parameter, autoregressive text generation model trained on The Pile<ul><li>Ben Wang</li> <li>Aran Komatsuzaki</li> <li>Janko Prester</li></ul> <ul><li>The Pile</li><li>blog post</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>web demo</li></ul>Open In Colab15.09.2021
Machine learning courseThis course is broad and shallow, but author will provide additional links so that you can deepen your understanding of the ML method you needТимчишин Віталій <ul><li>blog post</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab02.09.2021
Lucid Sonic DreamsSyncs GAN-generated visuals to musicMikael Alafriz <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab24.08.2021
textgenrnnGenerate text using a pretrained neural network with a few lines of code, or easily train your own text-generating neural network of any size and complexityMax Woolf <ul><li>blog post</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.07.2021
BasicSROpen Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc.<ul><li>Xintao Wang</li> <li>Liangbin Xie</li> <li>Ke Yu</li> <li>Kelvin Chan</li><details><summary>others</summary><li>Chen Change Loy</li> <li>Chao Dong</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab07.06.2021
TensorFlowTTSReal-time state-of-the-art speech synthesis architectures such as Tacotron-2, Melgan, Multiband-Melgan, FastSpeech, FastSpeech2 based-on TensorFlow 2<ul><li>Minh Nguyen Quan Anh</li> <li>Eren Gölge</li> <li>Kuan Chen</li> <li>Takuya Ebata</li></ul> <ul><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/>, <img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li>project</li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab01.06.2021
HyperoptPython library for serial and parallel optimization over awkward search spaces, which may include real-valued, discrete, and conditional dimensions<ul><li>James Bergstra</li> <li>Dan Yamins</li> <li>David Cox</li></ul> <ul><li>ICML</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab01.06.2021
CNNThis tutorial demonstrates training a simple Convolutional Neural Network to classify CIFAR imagesBilly Lamberta<ul><li>cifar</li><li>link</li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab22.05.2021
Custom GPT-2 + TokenizerTrain a custom GPT-2 model for free on a GPU using aitextgen!Max Woolf <ul><li>data</li><li><img src="images/docs.svg" alt="docs" height=20/></li></ul>Open In Colab17.05.2021
Train a GPT-2 Text-Generating ModelRetrain an advanced text generating neural network on any text dataset for free on a GPU using Colaboratory using aitextgen!Max Woolf <ul><li>data</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pwc.svg" alt="pwc" height=20/></li></ul>Open In Colab17.05.2021
EasyNMTEasy to use, state-of-the-art machine translation for more than 100+ languagesNils Reimers <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab26.04.2021
SkinDeepRemove Body Tattoo Using Deep LearningVijish Madhavan <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li></ul>Open In Colab24.04.2021
PaddleHubPre-trained models toolkit based on PaddlePaddle: 400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving<ul><li>Zeyu Chen</li> <li>Zewu Wu</li> <li>Bin Long</li> <li>Xuefei Zhang</li><details><summary>others</summary><li>Jinxuan Qiu</li> <li>Yuhan Shen</li> <li>Yuying Hao</li> <li>Xiaojie Chen</li></ul></details> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab20.04.2021
OCTISFramework for training, analyzing, and comparing Topic Models, whose optimal hyper-parameters are estimated using a Bayesian Optimization approach<ul><li>Silvia Terragni</li> <li>Elisabetta Fersini</li> <li>Antonio Candelieri</li> <li>Pietro Tropeano</li><details><summary>others</summary><li>Bruno Galuzzi</li> <li>Lorenzo Famiglini</li> <li>Davide Pietrasanta</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data, data</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li>paper</li><li><img src="images/pwc.svg" alt="pwc" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab19.04.2021
PyTorchVideoDeeplearning library with a focus on video understanding work<ul><li>Haoqi Fan</li> <li>Tullie Murrell</li> <li>Heng Wang</li> <li>Kalyan Vasudev Alwala</li><details><summary>others</summary><li>Yanghao Li</li> <li>Yilei Li</li> <li>Bo Xiong</li> <li>Nikhila Ravi</li> <li>Meng Li</li> <li>Haichuan Yang</li> <li>Jitendra Malik</li> <li>Ross Girshick</li> <li>Matt Feiszli</li> <li>Aaron Adcock</li> <li>Wan-Yen Lo</li> <li>Christoph Feichtenhofer</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.04.2021
NeuSpellOpen-source toolkit for spelling correction in English<ul><li>Sai Muralidhar Jayanthi</li> <li>Danish Pruthi</li> <li>Graham Neubig</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/hf.svg" alt="hf" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li></ul>Open In Colab03.04.2021
GPT NeoAn implementation of model & data parallel GPT2 & GPT3 -like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow libraryEleutherAI <ul><li>GPT-2</li><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>pretrained</li></ul>Open In Colab28.03.2021
CVAEThis notebook demonstrates how train a Variational Autoencoder on the MNIST dataset<ul><li>Diederik Kingma</li> <li>Max Welling</li> <li>Danilo Rezende</li> <li>Shakir Mohamed</li> <li>Daan Wierstra</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab22.03.2021
Big SleepText to image generation, using OpenAI's CLIP and a BigGANPhil Wang <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/>, <img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab17.03.2021
Deep DazeText to image generation using OpenAI's CLIP and SirenPhil Wang <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li></ul>Open In Colab17.03.2021
DCGANThis tutorial demonstrates how to generate images of handwritten digits using a Deep Convolutional Generative Adversarial Network<ul><li>Alec Radford</li> <li>Luke Metz</li> <li>Soumith Chintala</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab12.03.2021
Adversarial FGSMThis tutorial creates an adversarial example using the Fast Gradient Signed Method attack. This was one of the first and most popular attacks to fool a neural network.<ul><li>Ian Goodfellow</li> <li>Jonathon Shlens</li> <li>Christian Szegedy</li></ul><ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>imagenet</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li></ul>Open In Colab12.03.2021
GAN steerabilityWe will navigate in GAN latent space to simulate various camera transformations<ul><li>Ali Jahanian</li> <li>Lucy Chai</li> <li>Phillip Isola</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.03.2021
TraxEnd-to-end library for deep learning that focuses on clear code and speedGoogle <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>discuss</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/kaggle.svg" alt="kaggle" height=20/>, <img src="images/kaggle.svg" alt="kaggle" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab18.02.2021
bsuiteA collection of carefully-designed experiments that investigate core capabilities of an RL agent with two main objectives<ul><li>Ian Osband</li> <li>Yotam Doron</li> <li>Matteo Hessel</li> <li>John Aslanides</li><details><summary>others</summary><li>Eren Sezener</li> <li>Andre Saraiva</li> <li>Katrina McKinney</li> <li>Tor Lattimore</li> <li>Csaba Szepesvari</li> <li>Satinder Singh</li> <li>Benjamin Van Roy</li> <li>Richard Sutton</li> <li>David Silver</li> <li>Hado Van Hasselt</li></ul></details> <ul><li><img src="images/git.svg" alt="git" height=20/></li><li>paper</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab13.02.2021
TF-RankingEnd-to-end walkthrough of training a TensorFlow Ranking neural network model which incorporates sparse textual featuresRama Kumar <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>data</li><li><img src="images/git.svg" alt="git" height=20/></li><li><img src="images/wiki.svg" alt="wiki" height=20/>, <img src="images/wiki.svg" alt="wiki" height=20/></li></ul>Open In Colab04.02.2021
Toon-MeA fun project to toon portrait imagesVijish Madhavan <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li></ul>Open In Colab22.01.2021
TensorNetworkA library for easy and efficient manipulation of tensor networksChase Roberts <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab21.01.2021
SpleeterDeezer source separation library including pretrained models<ul><li>Romain Hennequin</li> <li>Anis Khlif</li> <li>Félix Voituret</li> <li>Manuel Moussallam</li></ul> <ul><li>blog post</li><li>data</li><li>project</li></ul>Open In Colab10.01.2021
Bullet Physics SDKReal-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc<ul><li>Erwin Coumans</li> <li>Yunfei Bai</li></ul> <ul><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab14.10.2020
Person RemoverProject that combines Pix2Pix and YOLO arhitectures in order to remove people or other objects from photos<ul><li>Javier Gamazo</li> <li>Daryl Autar</li></ul> <ul><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab22.08.2020
Semantic SegmentationPytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset<ul><li>Bolei Zhou</li> <li>Hang Zhao</li> <li>Xavier Puig</li> <li>Sanja Fidler</li> <li>Antonio Torralba</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li></ul>Open In Colab21.08.2020
Gin ConfigLightweight configuration framework for Python, based on dependency injection<ul><li>Dan Holtmann-Rice</li> <li>Sergio Guadarrama</li> <li>Nathan Silberman</li></ul> <ul><li><img src="images/medium.svg" alt="medium" height=20/></li></ul>Open In Colab13.08.2020
DopamineResearch framework for fast prototyping of reinforcement learning algorithms<ul><li>Pablo Castro</li> <li>Subhodeep Moitra</li> <li>Carles Gelada</li> <li>Saurabh Kumar</li> <li>Marc Bellemare</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>baselines</li><li>blog post</li><li><img src="images/docker.svg" alt="docker" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.08.2020
Analyzing Tennis ServeWe'll use the Video Intelligence API to analyze a tennis serve, including the angle of the arms and legs during the serveDale Markowitz <ul><li>blog post</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab14.07.2020
YOLOv4This tutorial will help you build YOLOv4 easily in the cloud with GPU enabled so that you can run object detections in milliseconds!Alexey Bochkovskiy <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/>, <img src="images/medium.svg" alt="medium" height=20/></li><li>project</li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab25.06.2020
TensorFlow GraphicsDifferentiable computer graphics in tensorflow<ul><li>Julien Valentin</li> <li>Cem Keskin</li> <li>Pavel Pidlypenskyi</li> <li>Ameesh Makadia</li><details><summary>others</summary><li>Avneesh Sud</li> <li>Sofien Bouaziz</li></ul></details> <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/twitter.svg" alt="twitter" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab20.05.2020
GAN DissectionVisualizing and Understanding Generative Adversarial Networks<ul><li>David Bau</li> <li>Jun-Yan Zhu</li> <li>Hendrik Strobelt</li> <li>Bolei Zhou</li><details><summary>others</summary><li>Joshua Tenenbaum</li> <li>William Freeman</li> <li>Antonio Torralba</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>demo</li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li>project</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab04.05.2020
SonnetLibrary built on top of TensorFlow 2 designed to provide simple, composable abstractions for machine learning research<ul><li>Malcolm Reynolds</li> <li>Jack Rae</li> <li>Andreas Fidjeland</li> <li>Fabio Viola</li><details><summary>others</summary><li>Adrià Puigdomènech</li> <li>Frederic Besse</li> <li>Tim Green</li> <li>Sébastien Racanière</li> <li>Gabriel Barth-Maron</li> <li>Diego Casas</li></ul></details> <ul><li><img src="images/deepmind.svg" alt="deepmind" height=20/></li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/neurips.svg" alt="neurips" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab17.04.2020
Classification of chest vs. adominal X-raysThe goal of this tutorial is to build a deep learning classifier to accurately differentiate between chest and abdominal X-raystmoneyx01 <ul><li>annotator</li><li><img src="images/docs.svg" alt="docs" height=20/></li><li><img src="images/pypi.svg" alt="pypi" height=20/></li></ul>Open In Colab07.03.2020
Earth Engine Python API and Folium Interactive MappingThis notebook demonstrates how to setup the Earth Engine and provides several examples for visualizing Earth Engine processed data interactively using the folium libraryQiusheng Wu <ul><li>api</li></ul>Open In Colab20.01.2020
Tensor2TensorLibrary for deep learning models that is well-suited for neural machine translation and includes the reference implementation of the state-of-the-art Transformer model<ul><li>Ashish Vaswani</li> <li>Samy Bengio</li> <li>Eugene Brevdo</li> <li>François Chollet</li><details><summary>others</summary><li>Aidan Gomez</li> <li>Stephan Gouws</li> <li>Llion Jones</li> <li>Łukasz Kaiser</li> <li>Nal Kalchbrenner</li> <li>Niki Parmar</li> <li>Ryan Sepassi</li> <li>Noam Shazeer</li> <li>Jakob Uszkoreit</li></ul></details> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/>, <img src="images/arxiv.svg" alt="arxiv" height=20/></li><li>blog post</li><li>data</li><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/tf.svg" alt="tf" height=20/>, <img src="images/tf.svg" alt="tf" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/>, <img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab14.01.2020
Traffic countingMaking Road Traffic Counting App based on Computer Vision and OpenCVAndrey Nikishaev <ul><li><img src="images/medium.svg" alt="medium" height=20/></li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab10.01.2020
NYU-DLSP20This course concerns the latest techniques in deep learning and representation learning, focusing on supervised and unsupervised deep learning, embedding methods, metric learning, convolutional and recurrent nets, with applications to computer vision, natural language understanding, and speech recognition<ul><li>Yann LeCun</li> <li>Alfredo Canziani</li></ul> <ul><li><img src="images/discord.svg" alt="discord" height=20/></li><li><img src="images/git.svg" alt="git" height=20/>, <img src="images/git.svg" alt="git" height=20/></li><li><img src="images/reddit.svg" alt="reddit" height=20/></li><li>website</li><li><img src="images/yt.svg" alt="yt" height=20/></li></ul>Open In Colab30.10.2019
ImagededupThis package provides functionality to make use of hashing algorithms that are particularly good at finding exact duplicates as well as convolutional neural networks which are also adept at finding near duplicates<ul><li>Tanuj Jain</li> <li>Christopher Lennan</li> <li>Dat Tran</li></ul> <ul><li><img src="images/arxiv.svg" alt="arxiv" height=20/></li><li><img src="images/medium.svg" alt="medium" height=20/></li><li>project</li></ul>Open In Colab03.10.2019

Best of the best

authorsrepositoriespaperspackages
<ul><li>Chen Change Loy</li> <li>Ziwei Liu</li> <li>Xintao Wang</li> <li>Ying Shan</li> <li>Daniel Cohen-Or</li> <li>Adam Roberts</li> <li>Curtis Hawthorne</li> <li>Jesse Engel</li> <li>Eli Shechtman</li> <li>Björn Ommer</li> <li>Yuval Alaluf</li> <li>Or Patashnik</li> <li>Michael Black</li> <li>Yong Zhang</li> <li>Billy Lamberta</li> <li>Nikhila Ravi</li> <li>Patrick Esser</li> <li>Robin Rombach</li> <li>Amit Bermano</li> <li>Jun-Yan Zhu</li> <li>Bolei Zhou</li> <li>Xiaodong Cun</li> <li>Krzysztof Ostrowski</li></ul><ul><li>ollama </li> <li>langchain </li> <li>models </li> <li>whisper </li> <li>stable-diffusion </li> <li>ComfyUI </li> <li>open-interpreter </li> <li>Real-Time-Voice-Cloning </li> <li>yolov5 </li> <li>segment-anything </li> <li>PythonDataScienceHandbook </li> <li>Fooocus </li> <li>stablediffusion </li> <li>llama_index </li> <li>Open-Assistant </li> <li>bark </li> <li>GFPGAN </li> <li>TTS </li> <li>autogen </li> <li>visual-chatgpt </li> <li>google-research </li> <li>ray </li> <li>ultralytics </li></ul><ul><li>Image segmentation </li> <li>AlphaFold </li> <li>XGBoost </li> <li>CycleGAN </li> <li>Pix2Pix </li> <li>MoCo </li> <li>LDM </li> <li>EfficientDet </li> <li>DeepLabCut </li> <li>StyleGAN 2 </li> <li>ConvNeXt </li> <li>Classify text with BERT </li> <li>SwinIR </li> <li>Instant-NGP </li> <li>HMR </li> <li>Mask2Former </li> <li>Taming Transformers for High-Resolution Image Synthesis </li> <li>PIFu </li> <li>Neural Style Transfer </li> <li>ByteTrack </li> <li>SPIN </li> <li>Pixel2Style2Pixel </li> <li>Real-ESRGAN </li></ul><ul><li>xgboost </li> <li>langchain </li> <li>catboost </li> <li>llama-index </li> <li>langgraph </li> <li>ollama </li> <li>autofaiss </li> <li>mmdet </li> <li>unsloth </li> <li>mmsegmentation </li> <li>transformer-lens </li> <li>mmpose </li> <li>img2dataset </li> <li>datachain </li> <li>Crawl4AI </li> <li>sae-lens </li> <li>mistral-inference </li> <li>reformer-pytorch </li> <li>dm-reverb </li> <li>clip-retrieval </li> <li>rl-games </li> <li>tensor-parallel </li> <li>mmocr </li></ul>

Stargazers over time

(generated by generate_markdown.py based on research.json and tutorials.json