Home

Awesome

LearnOpenCV

This repository contains code for Computer Vision, Deep learning, and AI research articles shared on our blog LearnOpenCV.com.

Want to become an expert in AI? AI Courses by OpenCV is a great place to start.

<a href="https://opencv.org/courses/"> <p align="center"> <img src="https://learnopencv.com/wp-content/uploads/2023/01/AI-Courses-By-OpenCV-Github.png"> </p> </a>

List of Blog Posts

Blog PostCode
Stable Diffusion 3 and 3.5: Paper Explanation and InferenceCode
LightRAG - Legal Document AnalysisCode
NVIDIA AI Summit 2024 – India Overview
Introduction to Speech to Speech: Most Efficient Form of NLPCode
Training 3D U-Net for Brain Tumor Segmentation (BraTS-GLI)Code
DETR: Overview and InferenceCode
YOLO11: Faster Than You Can Imagine!Code
Exploring DINO: Self-Supervised Transformers for Road Segmentation with ResNet50 and U-NetCode
Sapiens: Foundation for Human Vision Models by MetaCode
Multimodal RAG with ColPali and GeminiCode
Building Autonomous Vehicle in Carla: Path Following with PID Control & ROS 2Code
Handwritten Text Recognition using OCRCode
Training CLIP from Sratch for Image RetrievalCode
Introduction to LiDAR SLAM: LOAM and LeGO-LOAM Paper and Code Explanation with ROS 2 ImplementationCode
Recommendation System using Vector SearchCode
Fine Tuning Whisper on Custom DatasetCode
SAM 2 – Promptable Segmentation for Images and VideosCode
Introduction to Feature Matching Using Neural NetworksCode
Introduction to ROS2 (Robot Operating System 2): Tutorial on ROS2 Working, DDS, ROS1 RMW, Topics, Nodes, Publisher, Subscriber in PythonCode
CVPR 2024 Research Papers - Part- 2Code
CVPR 2024: An Overview and Key PapersCode
Object Detection on Edge Device - OAK-D-LiteCode
Fine-Tuning YOLOv10 Models on Custom DatasetCode
ROS2 and Carla Setup Guide for Ubuntu 22.04
Understanding Visual SLAM for Robotics Perception: Building Monocular SLAM from Scratch in PythonCode
Enhancing Image Segmentation using U2-Net: An Approach to Efficient Background RemovalCode
YOLOv10: The Dual-Head OG of YOLO SeriesCode
Fine-tuning Faster R-CNN on Sea Rescue DatasetCode
Mastering Recommendation System: A Complete Guide
Automatic Speech Recognition with Diarization : Speech-to-TextCode
Building MobileViT Image Classification Model from Scratch In Keras 3Code
SDXL Inpainting: Fusing Image Inpainting with Stable DiffusionCode
YOLOv9 Instance Segmentation on Medical DatasetCode
A Comprehensive Guide to Robotics
Integrating Gradio with OpenCV DNNCode
Fine-Tuning YOLOv9 on Custom DatasetCode
Dreambooth using DiffusersCode
Introduction to Hugging Face DiffusersCode
Introduction to Ultralytics Explorer APICode
YOLOv9: Advancing the YOLO LegacyCode
Fine-Tuning LLMs using PEFTCode
Depth Anything: Accelerating Monocular Depth PerceptionCode
Deciphering LLMs: From Transformers to QuantizationCode
YOLO Loss Function Part 2: GFL and VFL LossCode
YOLOv8-Object-Tracking-and-Counting-with-OpenCVCode
Stereo Vision in ADAS: Pioneering Depth Perception Beyond LiDARCode
YOLO Loss Function Part 1: SIoU and Focal LossCode
Moving Object Detection with OpenCVCode
Integrating ADAS with Keypoint Feature Pyramid Network for 3D LiDAR Object DetectionCode
Mastering All YOLO Models from YOLOv1 to YOLO-NAS: Papers Explained (2024)
GradCAM: Enhancing Neural Network Interpretability in the Realm of Explainable AICode
Text Summarization using T5: Fine-Tuning and Building Gradio AppCode
3D LiDAR Visualization using Open3D: A Case Study on 2D KITTI Depth Frames for Autonomous DrivingCode
Fine Tuning T5: Text2Text Transfer Transformer for Building a Stack Overflow Tag GeneratorCode
SegFormer 🤗 : Fine-Tuning for Improved Lane Detection in Autonomous VehiclesCode
Fine-Tuning BERT using Hugging Face TransformersCode
YOLO-NAS PoseCode
BERT: Bidirectional Encoder Representations from TransformersCode
Comparing KerasCV YOLOv8 Models on the Global Wheat Data 2020Code
Top 5 AI papers of September 2023
Empowering Drivers: The Rise and Role of Advanced Driver Assistance Systems
Semantic Segmentation using KerasCV DeepLabv3+Code
Object Detection using KerasCV YOLOv8Code
Fine-tuning YOLOv8 Pose Models for Animal Pose EstimationCode
Top 5 AI papers of August 2023
Fine Tuning TrOCR - Training TrOCR to Recognize Curved TextCode
TrOCR - Getting Started with Transformer Based OCRCode
Facial Emotion RecognitionCode
Object Keypoint Similarity in Keypoint DetectionCode
Real Time Deep SORT with Torchvision DetectorsCode
Top 5 AI papers of July 2023
Medical Image SegmentationCode
Weighted Boxes Fusion in Object Detection: A Comparison with Non-Maximum SuppressionCode
Medical Multi-label Classification with PyTorch & LightningCode
Getting Started with PaddlePaddle: Exploring Object Detection, Segmentation, and KeypointsCode
Drone Programming With Computer Vision A Beginners GuideCode
How to Build a Pip Installable Package & Upload to PyPi
IoU Loss Functions for Faster & More Accurate Object Detection
Exploring Slicing Aided Hyper Inference for Small Object DetectionCode
Advancements in Face Recognition Models, Toolkit and Datasets
Train YOLO NAS on Custom DatasetCode
Train YOLOv8 Instance Segmentation on Custom DataCode
YOLO-NAS: New Object Detection Model Beats YOLOv6 & YOLOv8Code
Segment Anything – A Foundation Model for Image SegmentationCode
Build a Video to Slides Converter Application using the Power of Background Estimation and Frame Differencing in OpenCVCode
A Closer Look at CVAT: Perfecting Your AnnotationsYouTube
ControlNet - Achieving Superior Image Generation ResultsCode
InstructPix2Pix - Edit Images With PromptsCode
NVIDIA Spring GTC 2023 Day 4: Ending on a High Note with Top Moments from the Finale!
NVIDIA Spring GTC 2023 Day 3: Digging deeper into Deep Learning, Semiconductors & more!
NVIDIA Spring GTC 2023 Day 2: Jensen’s keynote & the iPhone moment of AI is here!
NVIDIA Spring GTC 2023 Day 1: Welcome to the future!
NVIDIA GTC Spring 2023 Curtain Raiser
Stable Diffusion - A New Paradigm in Generative AICode
OpenCV Face Recognition – Does Face Recognition Work on AI-Generated Images?
An In-Depth Guide to Denoising Diffusion Probabilistic Models – From Theory to ImplementationCode
From Pixels to Paintings: The Rise of Midjourney AI Art
Mastering DALL·E 2: A Breakthrough in AI Art Generation
Top 10 AI Art Generation Tools using Diffusion Models
The Future of Image Recognition is Here: PyTorch Vision TransformerCode
Understanding Attention Mechanism in Transformer Neural NetworksCode
Deploying a Deep Learning Model using Hugging Face Spaces and GradioCode
Train YOLOv8 on Custom Dataset – A Complete TutorialCode
Introduction to Diffusion Models for Image GenerationCode
Building An Automated Image Annotation Tool: PyOpenAnnotateCode
Ultralytics YOLOv8: State-of-the-Art YOLO ModelsCode
Getting Started with YOLOv5 Instance SegmentationCode
The Ultimate Guide To DeepLabv3 - With PyTorch InferenceCode
AI Fitness Trainer using MediaPipe: Squats AnalysisCode
YoloR - Paper Explanation & Inference -An In-Depth AnalysisCode
Roadmap To an Automated Image Annotation Tool Using PythonCode
Performance Comparison of YOLO Object Detection Models – An Intensive Study
FCOS - Anchor Free Object Detection ExplainedCode
YOLOv6 Custom Dataset Training – Underwater Trash DetectionCode
What is EXIF Data in Images?Code
t-SNE: T-Distributed Stochastic Neighbor Embedding ExplainedCode
CenterNet: Objects as Points – Anchor-free Object Detection ExplainedCode
YOLOv7 Pose vs MediaPipe in Human Pose EstimationCode
YOLOv6 Object Detection – Paper Explanation and InferenceCode
YOLOX Object Detector Paper Explanation and Custom TrainingCode
Driver Drowsiness Detection Using Mediapipe In PythonCode
GTC 2022 Big Bang AI announcements: Everything you need to know
NVIDIA GTC 2022 : The most important AI event this Fall
Object Tracking and Reidentification with FairMOTCode
What is Face Detection? – The Ultimate Guide for 2022Code
Document Scanner: Custom Semantic Segmentation using PyTorch-DeepLabV3Code
Fine Tuning YOLOv7 on Custom DatasetCode
Center Stage for Zoom Calls using MediaPipeCode
Mean Average Precision (mAP) in Object Detection
YOLOv7 Object Detection Paper Explanation and InferenceCode
Pothole Detection using YOLOv4 and DarknetCode
Automatic Document Scanner using OpenCVCode
Demystifying GPU architectures for deep learning: Part 2Code
Demystifying GPU Architectures For Deep LearningCode
Intersection-over-Union(IoU)-in-Object-Detection-and-SegmentationCode
Understanding Multiple Object Tracking using DeepSORTCode
Optical Character Recognition using PaddleOCRCode
Gesture Control in Zoom Call using MediapipeCode
A Deep Dive into Tensorflow Model OptimizationCode
DepthAI Pipeline Overview: Creating a Complex PipelineCode
TensorFlow Lite Model Maker: Create Models for On-Device Machine LearningCode
TensorFlow Lite: Model Optimization for On Device Machine LearningCode
Object detection with depth measurement using pre-trained models with OAK-DCode
Custom Object Detection Training using YOLOv5Code
Object Detection using Yolov5 and OpenCV DNN (C++/Python)Code
Create Snapchat/Instagram filters using MediapipeCode
AUTOSAR C++ compliant deep learning inference with TensorRTCode
NVIDIA GTC 2022 Day 4 Highlights: Meet the new Jetson Orin
NVIDIA GTC 2022 Day 3 Highlights: Deep Dive into Hopper architecture
NVIDIA GTC 2022 Day 2 Highlights: Jensen’s Keynote
NVIDIA GTC 2022 Day 1 Highlights: Brilliant Start
Automatic License Plate Recognition using PythonCode
Building a Poor Body Posture Detection and Alert System using MediaPipeCode
Introduction to MediaPipeCode
Disparity Estimation using Deep LearningCode
How to build Chrome Dino game bot using OpenCV Feature MatchingCode
Top 10 Sources to Find Computer Vision and AI Models
Multi-Attribute and Graph-based Object Detection
Plastic Waste Detection with Deep LearningCode
Ensemble Deep Learning-based Defect Classification and Detection in SEM Images
Building Industrial embedded deep learning inference pipelines with TensorRTCode
Transfer Learning for Medical Images
Stereo Vision and Depth Estimation using OpenCV AI KitCode
Introduction to OpenCV AI Kit and DepthAICode
WeChat QR Code Scanner in OpenCVCode
AI behind the Diwali 2021 ‘Not just a Cadbury ad’
Model Selection and Benchmarking with Modelplace.AIModel Zoo
Real-time style transfer in a zoom meetingCode
Introduction to OpenVino Deep Learning WorkbenchCode
Running OpenVino Models on Intel Integrated GPUCode
Post Training Quantization with OpenVino ToolkitCode
Introduction to Intel OpenVINO Toolkit
Human Action Recognition using Detectron2 and LSTMCode
Pix2Pix:Image-to-Image Translation in PyTorch & TensorFlowCode
Conditional GAN (cGAN) in PyTorch and TensorFlowCode
Deep Convolutional GAN in PyTorch and TensorFlowCode
Introduction to Generative Adversarial Networks (GANs)Code
Human Pose Estimation using Keypoint RCNN in PyTorchCode
Non Maximum Suppression: Theory and Implementation in PyTorchCode
MRNet – The Multi-Task ApproachCode
Generative and Discriminative Models
Playing Chrome's T-Rex Game with Facial GesturesCode
Variational Autoencoder in TensorFlowCode
Autoencoder in TensorFlow 2: Beginner’s GuideCode
Deep Learning with OpenCV DNN Module: A Definitive GuideCode
Depth perception using stereo camera (Python/C++)Code
Contour Detection using OpenCV (Python/C++)Code
Super Resolution in OpenCVCode
Improving Illumination in Night Time ImagesCode
Video Classification and Human Activity RecognitionCode
How to use OpenCV DNN Module with Nvidia GPU on WindowsCode
How to use OpenCV DNN Module with NVIDIA GPUsCode
Code OpenCV in Visual Studio
Install OpenCV on Windows – C++ / PythonCode
Face Recognition with ArcFaceCode
Background Subtraction with OpenCV and BGS LibrariesCode
RAFT: Optical Flow estimation using Deep LearningCode
Making A Low-Cost Stereo Camera Using OpenCVCode
Optical Flow in OpenCV (C++/Python)Code
Introduction to Epipolar Geometry and Stereo VisionCode
Classification With Localization: Convert any keras Classifier to a DetectorCode
Photoshop Filters in OpenCVCode
Tetris Game using OpenCV PythonCode
Image Classification with OpenCV for AndroidCode
Image Classification with OpenCV JavaCode
PyTorch to Tensorflow Model ConversionCode
Snake Game with OpenCV PythonCode
Stanford MRNet Challenge: Classifying Knee MRIsCode
Experiment Logging with TensorBoard and wandbCode
Understanding Lens DistortionCode
Image Matting with state-of-the-art Method “F, B, Alpha Matting”Code
Bag Of Tricks For Image Classification - Let's check if it is working or notCode
Getting Started with OpenCV CUDA ModuleCode
Training a Custom Object Detector with DLIB & Making Gesture Controlled ApplicationsCode
How To Run Inference Using TensorRT C++ APICode
Using Facial Landmarks for Overlaying Faces with Medical MasksCode
Tensorboard with PyTorch LightningCode
Otsu's Thresholding with OpenCVCode
PyTorch-to-CoreML-model-conversionCode
Playing Rock, Paper, Scissors with AICode
CNN Receptive Field Computation Using Backprop with TensorFlowCode
CNN Fully Convolutional Image Classification with TensorFlowCode
How to convert a model from PyTorch to TensorRT and speed up inferenceCode
Efficient image loadingCode
Graph Convolutional Networks: Model Relations In DataCode
Getting Started with Federated Learning with PyTorch and PySyftCode
Creating a Virtual Pen & EraserCode
Getting Started with PyTorch LightningCode
Multi-Label Image Classification with PyTorch: Image TaggingCode
Funny Mirrors Using OpenCVcode
t-SNE for ResNet feature visualizationCode
Multi-Label Image Classification with PytorchCode
CNN Receptive Field Computation Using BackpropCode
CNN Receptive Field Computation Using Backprop with TensorFlowCode
Augmented Reality using AruCo Markers in OpenCV(C++ and Python)Code
Fully Convolutional Image Classification on Arbitrary Sized ImageCode
Camera Calibration using OpenCVCode
Geometry of Image Formation
Ensuring Training Reproducibility in Pytorch
Gaze Tracking
Simple Background Estimation in Videos Using OpenCVCode
Applications of Foreground-Background separation with Semantic SegmentationCode
EfficientNet: Theory + CodeCode
PyTorch for Beginners: Mask R-CNN Instance Segmentation with PyTorchCode
PyTorch for Beginners: Faster R-CNN Object Detection with PyTorchCode
PyTorch for Beginners: Semantic Segmentation using torchvisionCode
PyTorch for Beginners: Comparison of pre-trained models for Image ClassificationCode
PyTorch for Beginners: BasicsCode
PyTorch Model Inference using ONNX and Caffe2Code
Image Classification Using Transfer Learning in PyTorchCode
Hangman: Creating games in OpenCVCode
Image Inpainting with OpenCV (C++/Python)Code
Hough Transform with OpenCV (C++/Python)Code
Xeus-Cling: Run C++ code in Jupyter NotebookCode
Gender & Age Classification using OpenCV Deep Learning ( C++/Python )Code
Invisibility Cloak using Color Detection and Segmentation with OpenCVCode
Fast Image Downloader for Open Images V4 (Python)Code
Deep Learning based Text Detection Using OpenCV (C++/Python)Code
Video Stabilization Using Point Feature Matching in OpenCVCode
Training YOLOv3 : Deep Learning based Custom Object DetectorCode
Using OpenVINO with OpenCVCode
Duplicate Search on Quora DatasetCode
Shape Matching using Hu Moments (C++/Python)Code
Install OpenCV 4 on CentOS (C++ and Python)Code
Install OpenCV 3.4.4 on CentOS (C++ and Python)Code
Install OpenCV 3.4.4 on Red Hat (C++ and Python)Code
Install OpenCV 4 on Red Hat (C++ and Python)Code
Install OpenCV 4 on macOS (C++ and Python)Code
Install OpenCV 3.4.4 on Raspberry PiCode
Install OpenCV 3.4.4 on macOS (C++ and Python)Code
OpenCV QR Code Scanner (C++ and Python)Code
Install OpenCV 3.4.4 on Windows (C++ and Python)Code
Install OpenCV 3.4.4 on Ubuntu 16.04 (C++ and Python)Code
Install OpenCV 3.4.4 on Ubuntu 18.04 (C++ and Python)Code
Universal Sentence EncoderCode
Install OpenCV 4 on Raspberry PiCode
Install OpenCV 4 on Windows (C++ and Python)Code
Face Detection – Dlib, OpenCV, and Deep Learning ( C++ / Python )Code
Hand Keypoint Detection using Deep Learning and OpenCVCode
Deep learning based Object Detection and Instance Segmentation using Mask R-CNN in OpenCV (Python / C++)Code
Install OpenCV 4 on Ubuntu 18.04 (C++ and Python)Code
Install OpenCV 4 on Ubuntu 16.04 (C++ and Python)Code
Multi-Person Pose Estimation in OpenCV using OpenPoseCode
Heatmap for Logo Detection using OpenCV (Python)Code
Deep Learning based Object Detection using YOLOv3 with OpenCV ( Python / C++ )Code
Convex Hull using OpenCV in Python and C++Code
MultiTracker : Multiple Object Tracking using OpenCV (C++/Python)Code
Convolutional Neural Network based Image Colorization using OpenCVCode
SVM using scikit-learnCode
GOTURN: Deep Learning based Object TrackingCode
Find the Center of a Blob (Centroid) using OpenCV (C++/Python)Code
Support Vector Machines (SVM)Code
Batch Normalization in Deep NetworksCode
Deep Learning based Character Classification using Synthetic DatasetCode
Image Quality Assessment : BRISQUECode
Understanding AlexNet
Deep Learning based Text Recognition (OCR) using Tesseract and OpenCVCode
Deep Learning based Human Pose Estimation using OpenCV ( C++ / Python )Code
Number of Parameters and Tensor Sizes in a Convolutional Neural Network (CNN)
How to convert your OpenCV C++ code into a Python moduleCode
CV4Faces : Best Project Award 2018
Facemark : Facial Landmark Detection using OpenCVCode
Image Alignment (Feature Based) using OpenCV (C++/Python)Code
Barcode and QR code Scanner using ZBar and OpenCVCode
Keras Tutorial : Fine-tuning using pre-trained modelsCode
OpenCV Transparent API
Face Reconstruction using EigenFaces (C++/Python)Code
Eigenface using OpenCV (C++/Python)Code
Principal Component Analysis
Keras Tutorial : Transfer Learning using pre-trained modelsCode
Keras Tutorial : Using pre-trained Imagenet modelsCode
Technical Aspects of a Digital SLR
Using Harry Potter interactive wand with OpenCV to create magic
Install OpenCV 3 and Dlib on Windows ( Python only )
Image Classification using Convolutional Neural Networks in KerasCode
Understanding Autoencoders using Tensorflow (Python)Code
Best Project Award : Computer Vision for Faces
Understanding Activation Functions in Deep Learning
Image Classification using Feedforward Neural Network in KerasCode
Exposure Fusion using OpenCV (C++/Python)Code
Understanding Feedforward Neural Networks
High Dynamic Range (HDR) Imaging using OpenCV (C++/Python)Code
Deep learning using Keras – The BasicsCode
Selective Search for Object Detection (C++ / Python)Code
Installing Deep Learning Frameworks on Ubuntu with CUDA support
Parallel Pixel Access in OpenCV using forEachCode
cvui: A GUI lib built on top of OpenCV drawing primitivesCode
Install Dlib on Windows
Install Dlib on Ubuntu
Install OpenCV3 on Ubuntu
Read, Write and Display a video using OpenCV ( C++/ Python )Code
Install Dlib on MacOS
Install OpenCV 3 on MacOS
Install OpenCV 3 on Windows
Get OpenCV Build Information ( getBuildInformation )
Color spaces in OpenCV (C++ / Python)Code
Neural Networks : A 30,000 Feet View for Beginners
Alpha Blending using OpenCV (C++ / Python)Code
User stories : How readers of this blog are applying their knowledge to build applications
How to select a bounding box ( ROI ) in OpenCV (C++/Python) ?
Automatic Red Eye Remover using OpenCV (C++ / Python)Code
Bias-Variance Tradeoff in Machine Learning
Embedded Computer Vision: Which device should you choose?
Object Tracking using OpenCV (C++/Python)Code
Handwritten Digits Classification : An OpenCV ( C++ / Python ) TutorialCode
Training a better Haar and LBP cascade based Eye Detector using OpenCV
Deep Learning Book Gift Recipients
Minified OpenCV Haar and LBP CascadesCode
Deep Learning Book Gift
Histogram of Oriented Gradients
Image Recognition and Object Detection : Part 1
Head Pose Estimation using OpenCV and DlibCode
Live CV : A Computer Vision Coding Application
Approximate Focal Length for Webcams and Cell Phone Cameras
Configuring Qt for OpenCV on OSXCode
Rotation Matrix To Euler AnglesCode
Speeding up Dlib’s Facial Landmark Detector
Warp one triangle to another using OpenCV ( C++ / Python )Code
Average Face : OpenCV ( C++ / Python ) TutorialCode
Face Swap using OpenCV ( C++ / Python )Code
Face Morph Using OpenCV — C++ / PythonCode
Deep Learning Example using NVIDIA DIGITS 3 on EC2
NVIDIA DIGITS 3 on EC2
Homography Examples using OpenCV ( Python / C ++ )Code
Filling holes in an image using OpenCV ( Python / C++ )Code
How to find frame rate or frames per second (fps) in OpenCV ( Python / C++ ) ?Code
Delaunay Triangulation and Voronoi Diagram using OpenCV ( C++ / Python)Code
OpenCV (C++ vs Python) vs MATLAB for Computer Vision
Facial Landmark Detection
Why does OpenCV use BGR color format ?
Computer Vision for Predicting Facial AttractivenessCode
applyColorMap for pseudocoloring in OpenCV ( C++ / Python )Code
Image Alignment (ECC) in OpenCV ( C++ / Python )Code
How to find OpenCV version in Python and C++ ?
Baidu banned from ILSVRC 2015
OpenCV Transparent API
How Computer Vision Solved the Greatest Soccer Mystery of All Time
Embedded Vision Summit 2015
Read an Image in OpenCV ( Python, C++ )Code
Non-Photorealistic Rendering using OpenCV ( Python, C++ )Code
Seamless Cloning using OpenCV ( Python , C++ )Code
OpenCV Threshold ( Python , C++ )Code
Blob Detection Using OpenCV ( Python, C++ )Code
Turn your OpenCV Code into a Web API in under 10 minutes — Part 1
How to compile OpenCV sample Code ?
Install OpenCV 3 on Yosemite ( OSX 10.10.x )