Awesome

notebooks | inference | autodistill | collect

👋 hello

Over the years we have created dozens of Computer Vision tutorials. This repository contains examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO, SAM, and GPT-4 Vision.

Curious to learn more about GPT-4 Vision? Check out our GPT-4V experiments 🧪 repository.

🚀 model tutorials (46 notebooks)

notebook	open in colab / kaggle / sagemaker studio lab	complementary materials	repository / paper
Fine-Tune PaliGemma2 on Object Detection Dataset
Fine-Tune PaliGemma2 for JSON data extraction
Fine-Tune PaliGemma2 for LaTeX OCR
Fine-Tune Segment Anything 2.1 (SAM-2.1)
Fine-Tune GPT-4o
YOLO11 Object Detection
YOLO11 Instance Segmentation
Segment Images with SAM2
Segment Videos with SAM2
RT-DETR Object Detection
Fine-Tune Florence-2 on Object Detection Dataset
Run Different Vision Tasks with Florence-2
Fine-Tune PaliGemma on Object Detection Dataset
YOLOv10 Object Detection
Zero-Shot Object Detection with YOLO-World
YOLOv9 Object Detection
RTMDet Object Detection
Fast Segment Anything Model (FastSAM)
YOLO-NAS Object Detection
Segment Anything Model (SAM)
Zero-Shot Object Detection with Grounding DINO
DETR Transformer Object Detection
DINOv2 Image Classification
YOLOv8 Object Detection
YOLOv8 Pose Estimation
YOLOv8 Oriented Bounding Boxes
YOLOv8 Instance Segmentation
YOLOv8 Classification
YOLOv7 Object Detection
YOLOv7 Instance Segmentation
YOLOv7 Object Detection OpenVINO + TorchORT
MT-YOLOv6 Object Detection
YOLOv5 Object Detection
YOLOv5 Classification
YOLOv5 Instance Segmentation
Detection2 Instance Segmentation
SegFormer Instance Segmentation
Vision Transformer Classification
Scaled-YOLOv4 Object Detection
YOLOS Object Detection
YOLOR Object Detection
YOLOX Object Detection
Resnet34 fast.ai Classification
OpenAI Clip Classification
YOLOv4-tiny Darknet Object Detection
Train a YOLOv8 Classification Model with No Labeling

📸 computer vision skills (20 notebooks)

notebook	open in colab / kaggle / sagemaker studio lab	complementary materials	repository / paper
Football AI
Automated Dataset Annotation with GroundedSAM 2
How to Estimate Vehicle Speed
Detect and Count Objects in Polygon Zone with YOLOv5 / YOLOv8 / Detectron2 + Supervision
Track and Count Vehicles with YOLOv8 + ByteTRACK + Supervision
Football Players Tracking with YOLOv5 + ByteTRACK
Auto Train YOLOv8 Model with Autodistill
Image Embeddings Analysis - Part 1
Automated Dataset Annotation and Evaluation with Grounding DINO and SAM
Automated Dataset Annotation and Evaluation with Grounding DINO
Roboflow Video Inference with Custom Annotators
DINO-GPT-4V Object Detection
Train a Segmentation Model with No Labeling
DINOv2 Image Retrieval
Vector Analysis with Scikit-learn and Bokeh
RF100 Object Detection Model Benchmarking
Create Segmentation Masks with Roboflow
How to Use PolygonZone and Roboflow Supervision
Train a Package Detector With Two Labeled Images
Image-to-Image Search with CLIP and faiss

🎬 videos

Almost every week we create tutorials showing you the hottest models in Computer Vision. 🔥 Subscribe, and stay up to date with our latest YouTube videos!

<p align="left"> <a href="https://youtu.be/CilXrt3S-ws" title="How to Choose the Best Computer Vision Model for Your Project"><img src="https://github.com/roboflow/notebooks/assets/26109316/73a01d3b-cf70-40c3-a5e4-e4bc5be38d42" alt="How to Choose the Best Computer Vision Model for Your Project" width="300px" align="left" /></a> <a href="https://youtu.be/CilXrt3S-ws" title="How to Choose the Best Computer Vision Model for Your Project"><strong>How to Choose the Best Computer Vision Model for Your Project</strong></a> <div><strong>Created: 26 May 2023</strong> | <strong>Updated: 26 May 2023</strong></div> <br/> In this video, we will dive into the complexity of choosing the right computer vision model for your unique project. From the importance of high-quality datasets to hardware considerations, interoperability, benchmarking, and licensing issues, this video covers it all... </p> <br/> <p align="left"> <a href="https://youtu.be/oEQYStnF2l8" title="Accelerate Image Annotation with SAM and Grounding DINO"><img src="https://github.com/SkalskiP/SkalskiP/assets/26109316/ae1ca38e-40b7-4b35-8582-e8ea5de3806e" alt="Accelerate Image Annotation with SAM and Grounding DINO" width="300px" align="left" /></a> <a href="https://youtu.be/oEQYStnF2l8" title="Accelerate Image Annotation with SAM and Grounding DINO"><strong>Accelerate Image Annotation with SAM and Grounding DINO</strong></a> <div><strong>Created: 20 Apr 2023</strong> | <strong>Updated: 20 Apr 2023</strong></div> <br/> Discover how to speed up your image annotation process using Grounding DINO and Segment Anything Model (SAM). Learn how to convert object detection datasets into instance segmentation datasets, and see the potential of using these models to automatically annotate your datasets for real-time detectors like YOLOv8... </p> <br/> <p align="left"> <a href="https://youtu.be/D-D6ZmadzPE" title="SAM - Segment Anything Model by Meta AI: Complete Guide"><img src="https://github.com/SkalskiP/SkalskiP/assets/26109316/6913ff11-53c6-4341-8d90-eaff3023c3fd" alt="SAM - Segment Anything Model by Meta AI: Complete Guide" width="300px" align="left" /></a> <a href="https://youtu.be/D-D6ZmadzPE" title="SAM - Segment Anything Model by Meta AI: Complete Guide"><strong>SAM - Segment Anything Model by Meta AI: Complete Guide</strong></a> <div><strong>Created: 11 Apr 2023</strong> | <strong>Updated: 11 Apr 2023</strong></div>

<br/> Discover the incredible potential of Meta AI's Segment Anything Model (SAM)! We dive into SAM, an efficient and promptable model for image segmentation, which has revolutionized computer vision tasks. With over 1 billion masks on 11M licensed and privacy-respecting images, SAM's zero-shot performance is often superior to prior fully supervised results... </p>

💻 run locally

We try to make it as easy as possible to run Roboflow Notebooks in Colab and Kaggle, but if you still want to run them locally, below you will find instructions on how to do it. Remember don't install your dependencies globally, use venv.

# clone repository and navigate to root directory
git clone git@github.com:roboflow-ai/notebooks.git
cd notebooks

# setup python environment and activate it
python3 -m venv venv
source venv/bin/activate

# install and run jupyter notebook
pip install notebook
jupyter notebook

☁️ run in sagemaker studio lab

You can now open our tutorial notebooks in Amazon SageMaker Studio Lab - a free machine learning development environment that provides the compute, storage, and security—all at no cost—for anyone to learn and experiment with ML.

Stable Diffusion Image Generation	YOLOv5 Custom Dataset Training	YOLOv7 Custom Dataset Training

🐞 bugs & 🦸 contribution

Computer Vision moves fast! Sometimes our notebooks lag a tad behind the ever-pushing forward libraries. If you notice that any of the notebooks is not working properly, create a bug report and let us know.

If you have an idea for a new tutorial we should do, create a feature request. We are constantly looking for new ideas. If you feel up to the task and want to create a tutorial yourself, please take a peek at our contribution guide. There you can find all the information you need.

We are here for you, so don't hesitate to reach out.