Awesome
<div align="center"> <img src="image/52CV1.png" width="600"/> </div>查看2024年综述文献点这里↘️2024-CV-Surveys
2025 年论文分类汇总戳这里
↘️WACV-2025-Papers ↘️CVPR-2025-Papers
2024 年论文分类汇总戳这里
↘️WACV-2024-Papers ↘️CVPR-2024-Papers ↘️ECCV-2024-Papers
2023 年论文分类汇总戳这里
2022 年论文分类汇总戳这里
2021 年论文分类汇总戳这里
2020 年论文分类汇总戳这里
2024-CV-Surveys
2024 年,计算机视觉相关综述。包括目标检测、跟踪........
:green_book::green_book::green_book:在【我爱计算机视觉】微信公众号后台回复“CV综述”,即可收到本文列出的全部论文的打包下载。至12月20日已公开 465+1 篇。
1月份共计44篇。<br> 2月份共计36篇。<br> 3月份共计25篇。<br> 4月份共计33篇。<br> 5月份共计50篇。<br> 6月份共计40篇。<br> 7月份共计48篇。<br> 8月份共计46篇。<br> 9月份共计36篇。<br> 10月份共计38篇。<br> 11月份共计41篇。<br> 计437篇。
目录
:cat: | :dog: | :tiger: | :wolf: |
---|---|---|---|
1.Unkown(未分) |
Biometrics
- Reversing the Irreversible: A Survey on Inverse Biometrics<br>[2024-01-08]
Data Augmentation
Gaze estimation
- A Survey on Deep Learning-based Gaze Direction Regression: Searching for the State-of-the-art<br>[2024-10-23]
Fish-eye Camera(鱼眼相机)
- A Comprehensive Overview of Fish-Eye Camera Distortion Correction Methods<br>[2024-01-02]
- Surround-View Fisheye Optics in Computer Vision and Simulation: Survey and Challenge<br>[2024-02-20]
Memes Detection
- Toxic Memes: A Survey of Computational Perspectives on the Detection and Explanation of Meme Toxicities<br>[2024-06-12]
Fake News Detection(虚假新闻检测)
- Fact-checking based fake news detection: a review<br>[2024-01-04]
Scene Graph Generation
- A Review and Efficient Implementation of Scene Graph Generation Metrics<br>:star:code<br>[2024-04-16]
Sound
- A Survey of Recent Advances and Challenges in Deep Audio-Visual Correlation Learning<br>[2024-12-03]
- 音频描述
Deepfake
- Deepfake Generation and Detection: A Benchmark and Survey<br>[2024-03-27]<br>:star:code
- A Timely Survey on Vision Transformer for Deepfake Detection<br>[2024-05-15]
- Media Forensics and Deepfake Systematic Survey<br>[2024-06-21]
- The Tug-of-War Between Deepfake Generation and Detection<br>[2024-07-09]
- Understanding Audiovisual Deepfake Detection: Techniques, Challenges, Human Factors and Perceptual Insights<br>[2024-11-13]
- Passive Deepfake Detection Across Multi-modalities: A Comprehensive Survey<br>[2024-11-28]
- Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook<br>[2024-12-02]
Industrial Anomaly Detection(工业缺陷检测)
- A Systematic Review of Available Datasets in Additive Manufacturing<br>[2024-01-30]
- A Comprehensive Survey on Machine Learning Driven Material Defect Detection: Challenges, Solutions, and Future Prospects<br>[2024-06-13]
- A PRISMA Driven Systematic Review of Publicly Available Datasets for Benchmark and Model Developments for Industrial Defect Detection<br>[2024-06-13]
- Transformers and Large Language Models for Efficient Intrusion Detection Systems: A Comprehensive Survey<br>[2024-08-15]
- A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Anomaly Detection<br>:star:code<br>[2024-10-30]
- VAD
- 点云的工业系统 3D 缺陷检测和分类
- OOD
Multi-Label Learning(多标签学习)
- Deep Learning for Multi-Label Learning: A Comprehensive Survey<br>[2024-01-31]
Few/Zero-Shot Learning/DG/A(小/零样本/域泛化/域适应)
- 零样本
Deep learning
- 长尾学习
- A Systematic Review on Long-Tailed Learning<br>[2024-08-02]
Machine Learning(机器学习)
- A Comprehensive Review of Machine Learning Advances on Data Change: A Cross-Field Perspective<br>[2024-02-21]
- Open-world Machine Learning: A Review and New Outlooks<br>[2024-03-06]无PDF
- Inference Attacks in Machine Learning as a Service: A Taxonomy, Review, and Promising Directions<br>[2024-06-05]
- Machine Learning for Methane Detection and Quantification from Space -- A survey<br>[2024-08-28]
- Digital Twins in Additive Manufacturing: A Systematic Review<br>[2024-09-04]
- 持续学习
- Continual Learning with Pre-Trained Models: A Survey<br>:star:code<br>[2024-01-30]
- 迁移学习
- Which Model to Transfer? A Survey on Transferability Estimation<br>:star:code<br>[2024-02-26]
- 联邦学习
- 木马攻击
- A Survey of Trojan Attacks and Defenses to Deep Neural Networks<br>[2024-08-20]
- 对抗攻击
- Proactive Schemes: A Survey of Adversarial Attacks for Social Good<br>[2024-09-26]
- Adversarial Attacks of Vision Tasks in the Past 10 Years: A Survey<br>[2024-11-01]
- Adversarial Attacks Using Differentiable Rendering: A Survey<br>[2024-11-18]
Object Re-Id/Pose Estimation
- 物体重识别
- Transformer for Object Re-Identification: A Survey<br>[2024-01-17]
- 物体姿态估计
- Deep Learning-Based Object Pose Estimation: A Comprehensive Survey<br>:star:code<br>[2024-05-14]
Self-supervised Learning
- 自监督
- Masked Modeling for Self-supervised Representation Learning on Vision and Beyond<br>:star:code<br>[2024-01-03]
- A review on discriminative self-supervised learning methods<br>[2024-05-09]
- Masked Image Modeling: A Survey<br>[2024-08-14]
- A Survey of the Self Supervised Learning Mechanisms for Vision Transformers<br>[2024-09-02]
- 无监督学习
- A Survey on Deep Clustering: From the Prior Perspective<br>[2024-07-01]
Neural Radiance Fields (NeRF)
- Neural Radiance Field-based Visual Rendering: A Comprehensive Review<br>[2024-04-02]
- Dynamic NeRF: A Review<br>[2024-05-15]
Human Object Interaction(人机交互)
- How Can Large Language Models Enable Better Socially Assistive Human-Robot Interaction: A Brief Survey<br>[2024-04-02]
- A Review of Human-Object Interaction Detection<br>[2024-08-21]
Visual Question Answering(视觉问答)
- Visual Question Answering in Ophthalmology: A Progressive and Practical Perspective<br>[2024-10-23]
- A Comprehensive Survey on Visual Question Answering Datasets and Algorithms<br>[2024-11-19]
- Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey<br>[2024-11-27]
Robot/SLAM
- Event-based Sensor Fusion and Application on Odometry: A Survey<br>[2024-10-22]
- SLAM
- How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey<br>[2024-02-21]
- VR
- AI-Enhanced Virtual Reality in Medicine: A Comprehensive Survey<br>[2024-02-06]
- 地理定位
- 机器人
- Survey on Datasets for Perception in Unstructured Outdoor Environments<br>[2024-04-30]
- A Brief Survey on Leveraging Large Scale Vision Models for Enhanced Robot Grasping<br>[2024-06-18]
- A Survey of Embodied Learning for Object-Centric Robotic Manipulation<br>:star:code<br>[2024-08-22]
- Visual Servoing for Robotic On-Orbit Servicing: A Survey<br>[2024-09-05]
- Neural Fields in Robotics: A Survey<br>[2024-10-29]
- PR
- General Place Recognition Survey: Towards Real-World Autonomy<br>:star:code<br>[2024-05-09]
Autonomous Driving(自动驾驶)
- A Survey on Autonomous Driving Datasets: Data Statistic, Annotation, and Outlook<br>[2024-01-04]
- Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies<br>[2024-01-24]
- A Survey for Foundation Models in Autonomous Driving<br>[2024-02-05]
- Review of the Learning-based Camera and Lidar Simulation Methods for Autonomous Driving Systems<br>[2024-02-16]
- Explainable AI for Safe and Trustworthy Autonomous Driving: A Systematic Review<br>[2024-02-16]
- A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions<br>[2024-03-13]
- Monocular 3D lane detection for Autonomous Driving: Recent Achievements, Challenges, and Outlooks<br>[2024-04-11]
- Neural Radiance Field in Autonomous Driving: A Survey<br>[2024-04-23]
- Collaborative Perception Datasets in Autonomous Driving: A Survey<br>[2024-04-23]
- A Survey on Intermediate Fusion Methods for Collaborative Perception Categorized by Real World Challenges<br>[2024-04-26]
- Vision-based 3D occupancy prediction in autonomous driving: a review and outlook<br>:star:code<br>[2024-05-07]
- A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective<br>:star:code<br>[2024-05-09]
- Cooperative Visual-LiDAR Extrinsic Calibration Technology for Intersection Vehicle-Infrastructure: A review<br>[2024-05-17]
- Collective Perception Datasets for Autonomous Driving: A Comprehensive Review<br>[2024-05-28]
- Panoptic Perception for Autonomous Driving: A Survey<br>[2024-08-29]
- Feature Importance in Pedestrian Intention Prediction: A Context-Aware Review<br>[2024-09-13]
- Joint Perception and Prediction for Autonomous Driving: A Survey<br>[2024-12-19]
- 目标检测
- Robustness-Aware 3D Object Detection in Autonomous Driving: A Review and Outlook<br>[2024-01-15]
- Deep Event-based Object Detection in Autonomous Driving: A Survey<br>[2024-05-08]
- A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions<br>:star:code<br>[2024-08-30]
- 车道线检测
- Monocular Lane Detection Based on Deep Learning: A Survey<br>:star:code<br>[2024-11-26]
- 车辆重识别
- 疲劳驾驶检测
- 交通监控
Scene Understanding(场景理解)
- Reducing Label Dependency for Underwater Scene Understanding: A Survey of Datasets, Techniques and Applications<br>[2024-11-19]
Tamper Detection/image forencis detection(图像篡改检测方向)
Neural Rendering(神经渲染)
- Neural Rendering and Its Hardware Acceleration: A Review<br>[2024-02-02]
Neural Radiance Fields
- Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review<br>[2024-02-20]
Visual Question Answering
- Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review<br>[2024-07-02]
Vision language(视觉语言)
- A Survey on Hallucination in Large Vision-Language Models<br>[2024-02-02]
- Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions<br>[2024-04-12]
- A Survey on Visual Mamba<br>[2024-04-25]
- Vision Mamba: A Comprehensive Survey and Taxonomy<br>:star:code<br>[2024-05-08]
- A Survey on Vision-Language-Action Models for Embodied AI<br>[2024-05-24]
- JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models<br>:house:project<br>[2024-07-03]
- Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models<br>:star:code<br>[2024-08-06]
- Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey<br>[2024-09-19]
- One missing piece in Vision and Language: A Survey on Comics Understanding<br>:star:code<br>[2024-09-17]
- A Survey of Low-shot Vision-Language Model Adaptation via Representer Theorem<br>[2024-10-16]
- Autoregressive Models in Vision: A Survey<br>:star:code<br>[2024-11-12]
- Online Knowledge Integration for 3D Semantic Mapping: A Survey<br>[2024-11-28]
- How Vision-Language Tasks Benefit from Large Pre-trained Models: A Survey<br>[2024-12-12]
- 基础模型
- Few-shot Adaptation of Multi-modal Foundation Models: A Survey<br>[2024-01-04]
- Unveiling Hallucination in Text, Image, Video, and Audio Foundation Models: A Comprehensive Review<br>[2024-05-17]
- Towards Vision-Language Geo-Foundation Model: A Survey<br>:star:code<br>[2024-06-14]
- Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective<br>:star:code<br>[2024-10-30]
- MLLM
- The (R)Evolution of Multimodal Large Language Models: A Survey<br>[2024-02-21]
- Efficient Multimodal Large Language Models: A Survey<br>:star:code<br>[2024-05-20]
- A Survey of Multimodal Large Language Model from A Data-centric Perspective<br>[2024-05-28]
- The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective<br>:star:code<br>[2024-07-12]
- A Survey on Benchmarks of Multimodal Large Language Models<br>:star:code<br>[2024-08-19]
- Visual Prompting in Multimodal Large Language Models: A Survey<br>[2024-09-25]
- MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs<br>:star:code<br>[2024-11-26]
- Personalized Multimodal Large Language Models: A Survey<br>[2024-12-04]
- VLN
- LLM
- Large Multimodal Agents: A Survey<br>:star:code<br>[2024-02-26]
- Unbridled Icarus: A Survey of the Potential Perils of Image Inputs in Multimodal Large Language Model Security<br>[2024-04-09]
- Hallucination of Multimodal Large Language Models: A Survey<br>:star:code<br>[2024-04-30]
- Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey<br>:star:code<br>[2024-06-04]
- A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends<br>:star:code<br>[2024-07-11]
- Grounding and Evaluation for Large Language Models: Practical Challenges and Lessons Learned (Survey)<br>[2024-07-19]
- Knowledge Mechanisms in Large Language Models: A Survey and Perspective<br>[2024-07-23]
- Harnessing Large Vision and Language Models in Agriculture: A Review<br>[2024-07-30]
- The Role of Language Models in Modern Healthcare: A Comprehensive Review<br>[2024-09-26]
- FTII-Bench: A Comprehensive Multimodal Benchmark for Flow Text with Image Insertion<br>:star:code<br>[2024-10-17]
- Survey of Cultural Awareness in Language Models: Text and Beyond<br>[2024-11-05]
- 多模态
- A Comprehensive Survey on Deep Multimodal Learning with Missing Modality<br>[2024-09-13]
- Multimodal Alignment and Fusion: A Survey<br>[2024-11-27]
Vision Transformer
- Exploring the Synergies of Hybrid CNNs and ViTs Architectures for Computer Vision: A survey<br>[2024-02-06]
- Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges<br>:star:code<br>[2024-04-26]
- A Comparative Survey of Vision Transformers for Feature Extraction in Texture Analysis<br>[2024-06-11]
- A Review of Transformer-Based Models for Computer Vision Tasks: Capturing Global Context and Spatial Relationships<br>[2024-08-28]
Style Transfer(风格迁移)
- Evaluation in Neural Style Transfer: A Review<br>[2024-01-31]
Image Matching(图像匹配)
- Local Feature Matching Using Deep Learning: A Survey<br>[2024-02-01]
Point Cloud(点云)
- Advancing 3D Point Cloud Understanding through Deep Transfer Learning: A Comprehensive Survey<br>[2024-07-26]
- Deep Learning for 3D Point Cloud Enhancement: A Survey<br>[2024-11-05]
- 点云配准
- A Comprehensive Survey and Taxonomy on Point Cloud Registration Based on Deep Learning<br>:star:code<br>[2024-04-23]
- 3D Registration in 30 Years: A Survey<br>[2024-12-19]
MC/KD/Pruning(模型压缩/知识蒸馏/剪枝)
- Computer Vision Model Compression Techniques for Embedded Systems: A Survey<br>:star:code<br>[2024-08-16]
- Adversarial Pruning: A Survey and Benchmark of Pruning Methods for Adversarial Robustness<br>:star:code<br>[2024-09-04]
- Model Compression Techniques in Biometrics Applications: A Survey<br>[2024-01-19]
- KD
OCR
- Transformers and Language Models in Form Understanding: A Comprehensive Review of Scanned Document Analysis<br>[2024-03-08]
- A short review on graphonometric evaluation tools in children.<br>[2024-06-11]
- A comprehensive survey of oracle character recognition: challenges, benchmarks, and beyond<br>[2024-11-19]
- 文本图像处理
- Visual Text Meets Low-level Vision: A Comprehensive Survey on Visual Text Processing<br>:star:code<br>[2024-02-06]
- 图表理解
- 文档理解
- 文本识别
- Self-Supervised Learning for Text Recognition: A Critical Survey<br>[2024-07-30]
- 手写识别
- 表格理解
Generation
- Video Diffusion Models: A Survey<br>:star:code<br>[2024-05-07]
- Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond<br>:star:code<br>[2024-05-07]
- Survey on Visual Signal Coding and Processing with Generative Models: Technologies, Standards and Optimization<br>[2024-05-24]
- LLMs Meet Multimodal Generation and Editing: A Survey<br>:star:code<br>[2024-05-30]
- Diffusion Models and Representation Learning: A Survey<br>:star:code<br>[2024-07-02]
- Replication in Visual Diffusion Models: A Survey and Outlook<br>:star:code<br>[2024-08-02]
- A Comprehensive Survey on Synthetic Infrared Image synthesis<br>[2024-08-14]
- Diffusion-Based Visual Art Creation: A Survey and New Perspectives<br>[2024-08-23]
- Jailbreak Attacks and Defenses against Multimodal Generative Models: A Survey<br>:star:code<br>[2024-11-15]
- 文本-图像生成
- Text-to-Image Cross-Modal Generation: A Systematic Review<br>[2024-01-23]
- Controllable Generation with Text-to-Image Diffusion Models: A Survey<br>:star:code<br>[2024-03-08]
- Evaluating Text to Image Synthesis: Survey and Taxonomy of Image Quality Metrics<br>[2024-03-19]
- Survey of Bias In Text-to-Image Generation: Definition, Evaluation, and Mitigation<br>[2024-04-02]
- Theoretical research on generative diffusion models: an overview<br>[2024-04-16]
- Exploring Feedback Generation in Automated Skeletal Movement Assessment: A Comprehensive Overview<br>[2024-04-16]
- Adversarial Attacks and Defenses on Text-to-Image Diffusion Models: A Survey<br>:star:code<br>[2024-07-24]
- Text-to-Image Synthesis: A Decade Survey<br>[2024-11-26]
- 内容生成
- A Survey on Personalized Content Synthesis with Diffusion Models<br>[2024-05-10]
- 文本-3D
- A Survey On Text-to-3D Contents Generation In The Wild<br>[2024-05-16]
- 3D 内容生成
- A Comprehensive Survey on 3D Content Generation<br>[2024-02-05]
- AIGC
- Generative Visual Compression: A Review<br>[2024-02-06]
- Generative AI in Vision: A Survey on Models, Metrics and Applications<br>[2024-02-27]
- Retrieval-Augmented Generation for AI-Generated Content: A Survey<br>:star:code<br>[2024-01-01]
- A Survey of Defenses against AI-generated Visual Media: Detection, Disruption, and Authentication<br>[2024-07-16]
- 图像编辑
- Diffusion Model-Based Image Editing: A Survey<br>:star:code<br>[2024-02-28]
- A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models<br>:star:code<br>[2024-06-21]
- Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era<br>:star:code<br>[2024-11-18]
- 文本-视频
- Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models<br>:star:code<br>[2024-02-28]
- Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation<br>[2024-03-11]
- From Sora What We Can See: A Survey of Text-to-Video Generation<br>:star:code<br>[2024-05-20]
- 视频生成
- A Survey on Long Video Generation: Challenges, Methods, and Prospects<br>[2024-03-26]
- A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights<br>[2024-07-12]
- A Survey of AI-Generated Video Evaluation<br>[2024-10-29]
- 视频编辑
- Diffusion Model-Based Video Editing: A Survey<br>:star:code<br>[2024-07-11]
- GAN
- 街景视角合成
- Bird's-Eye View to Street-View: A Survey<br>[2024-05-16]
- 人体情感识别
- Generative Technology for Human Emotion Recognition: A Scope Review<br>[2024-07-08]
- Sensing technologies and machine learning methods for emotion recognition in autism: Systematic review<br>[2024-07-09]
- Survey on Emotion Recognition through Posture Detection and the possibility of its application in Virtual Reality<br>[2024-08-06]
- 艺术字生成
- Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation<br>:star:code<br>[2024-07-23]
- 扩撒
- A Comprehensive Survey on Diffusion Models and Their Applications<br>[2024-08-21]
- Alignment of Diffusion Models: Fundamentals, Challenges, and Future<br>[2024-09-12]
- A Survey on Diffusion Models for Inverse Problems<br>[2024-10-02]
- Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices<br>:star:code<br>[2024-10-16]
Biometrics(生物特征识别)
- Deep Learning Techniques for Hand Vein Biometrics: A Comprehensive Review<br>[2024-09-12]
- Biometrics in Extended Reality: A Review<br>[2024-11-19]
Reid/Pedestrian Detection(行人/重识别检测)
- Reid
- 行人检测
Human Action Recognition(人体动作识别)
- Body-Area Capacitive or Electric Field Sensing for Human Activity Recognition and Human-Computer Interaction: A Comprehensive Survey<br>[2024-01-12]
- A Survey of IMU Based Cross-Modal Transfer Learning in Human Activity Recognition<br>[2024-03-26]
- A Survey on Backbones for Deep Video Action Recognition<br>[2024-05-10]
- From CNNs to Transformers in Multimodal Human Action Recognition: A Survey<br>[2024-05-28]
- Self-Supervised Skeleton Action Representation Learning: A Benchmark and Beyond<br>[2024-06-06]
- RNNs, CNNs and Transformers in Human Action Recognition: A Survey and A Hybrid Model<br>[2024-07-09]
- A Comprehensive Review of Few-shot Action Recognition<br>[2024-07-23]
- A Critical Analysis on Machine Learning Techniques for Video-based Human Activity Recognition of Surveillance Systems: A Review<br>[2024-09-04]
- A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities<br>[2024-09-17]
- Human Action Anticipation: A Survey<br>[2024-10-21]
- Exocentric To Egocentric Transfer For Action Recognition: A Short Survey<br>[2024-10-29]
- 动作质量评估
- A Comprehensive Survey of Action Quality Assessment: Method and Benchmark<br>:star:code<br>[2024-12-17]
- 跌倒检测
Human Pose Estimation(人体姿态估计)
- In-Bed Pose Estimation: A Review<br>[2024-02-02]
- Survey of 3D Human Body Pose and Shape Estimation Methods for Contemporary Dance Applications<br>[2024-01-05]
- Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey<br>:star:code<br>[2024-03-01]
- A Survey on 3D Egocentric Human Pose Estimation<br>[2024-03-27]
- Human Modelling and Pose Estimation Overview<br>[2024-06-28]
- Markerless Multi-view 3D Human Pose Estimation: a survey<br>[2024-07-08]
- 三维人体
- 手势合成
- 手语翻译
- 运动生成
Video
- Deep video representation learning: a survey<br>[2024-05-13]
- Segment Anything for Videos: A Systematic Survey<br>:star:code<br>[2024-08-19]
- About Time: Advances, Challenges, and Outlooks of Action Understanding<br>[2024-11-25]
- AI-Driven Innovations in Volumetric Video Streaming: A Review<br>[2024-12-18]
- 视频摘要
- Video Summarization Techniques: A Comprehensive Review<br>[2024-10-08]
- 视频理解
- Video Understanding with Large Language Models: A Survey<br>:star:code<br>[2024-01-01]
- A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming<br>[2024-04-26]
- Foundation Models for Video Understanding: A Survey<br>:star:code<br>[2024-05-08]
- A Survey of Video Datasets for Grounded Event Understanding<br>[2024-06-17]
- From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding<br>[2024-09-30]
- 视频预测
- 视频制作
- 视频监控
- 视频异常检测
- Networking Systems for Video Anomaly Detection: A Tutorial and Survey<br>:star:code<br>[2024-05-20]
- Video Anomaly Detection in 10 Years: A Survey and Outlook<br>[2024-05-31]
- Deep Learning for Video Anomaly Detection: A Review<br>[2024-09-10]
- Privacy-Preserving Video Anomaly Detection: A Survey<br>[2024-11-25]
Object Tracking(目标跟踪)
- Beyond Traditional Single Object Tracking: A Survey<br>[2024-05-20]
- The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers<br>[2024-06-25]
- Object Tracking in a 360o View: A Novel Perspective on Bridging the Gap to Biomedical Advancements<br>[2024-12-03]
- Visual Object Tracking across Diverse Data Modalities: A Review<br>[2024-12-16]
- 多模态目标跟踪
- Awesome Multi-modal Object Tracking<br>:star:code<br>[2024-05-24]
Object Detection(目标检测)
- Agricultural Object Detection with You Look Only Once (YOLO) Algorithm: A Bibliometric and Systematic Literature Review<br>[2024-01-22]
- YOLOv1 to YOLOv10: A comprehensive review of YOLO variants and their application in the agricultural domain<br>[2024-06-17]
- YOLOv10 to Its Genesis: A Decadal and Comprehensive Review of The You Only Look Once Series<br>[2024-07-01]
- Semi-Supervised Object Detection: A Survey on Progress from CNN to Transformer<br>[2024-07-12]
- A Survey and Evaluation of Adversarial Attacks for Object Detection<br>[2024-08-06]
- Surveying You Only Look Once (YOLO) Multispectral Object Detection Advancements, Applications And Challenges<br>[2024-09-23]
- Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing<br>[2024-09-30]
- Radar and Camera Fusion for Object Detection and Tracking: A Comprehensive Survey<br>[2024-10-29]
- Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and Implementation<br>:star:code<br>[2024-11-27]
- From classical techniques to convolution-based models: A review of object detection algorithms<br>[2024-12-09]
- 真实世界目标检测
- 开发世界目标检测
- Open World Object Detection: A Survey<br>:star:code<br>[2024-10-16]
- 小样本目标检测
- Beyond Few-shot Object Detection: A Detailed Survey<br>[2024-08-27]
- 伪装目标检测
- A Survey of Camouflaged Object Detection and Beyond<br>:star:code<br>[2024-08-28]
- 海洋垃圾检测
- 3D目标识别
- 阴影检测
- 目标发现
UAV/Remote Sensing/Satellite Image(无人机/遥感/卫星图像)
- Image Fusion in Remote Sensing: An Overview and Meta Analysis<br>[2024-01-18]
- UAV-borne Mapping Algorithms for Canopy-Level and High-Speed Drone Applications<br>[2024-01-15]
- Solid Waste Detection in Remote Sensing Images: A Survey<br>[2024-02-15]
- A Comprehensive Review on Computer Vision Analysis of Aerial Data<br>[2024-02-16]
- Deep Learning for Satellite Image Time Series Analysis: A Review<br>[2024-04-08]
- A Review on Machine Learning Algorithms for Dust Aerosol Detection using Satellite Data<br>[2024-04-16]
- Sugarcane Health Monitoring With Satellite Spectroscopy and Machine Learning: A Review<br>[2024-04-29]利用卫星光谱和机器学习监测甘蔗健康
- Wildfire Risk Prediction: A Review<br>[2024-05-06]
- Dehazing Remote Sensing and UAV Imagery: A Review of Deep Learning, Prior-based, and Hybrid Approaches<br>[2024-05-14]
- Visual place recognition for aerial imagery: A survey<br>:star:code<br>[2024-06-04]
- Deep Learning for Slum Mapping in Remote Sensing Images: A Meta-analysis and Review<br>[2024-06-13]
- Hyperspectral Pansharpening: Critical Review, Tools and Future Perspectives<br>:star:code<br>[2024-07-02]
- AI Foundation Models in Remote Sensing: A Survey<br>[2024-08-08]
- Applications of Knowledge Distillation in Remote Sensing: A Survey<br>[2024-09-19]
- Foundation Models for Remote Sensing and Earth Observation: A Survey<br>[2024-10-23]
- Generative Artificial Intelligence Meets Synthetic Aperture Radar: A Survey<br>:star:code<br>[2024-11-11]
- Maritime Search and Rescue Missions with Aerial Images: A Survey<br>[2024-11-13]
- A comprehensive review of datasets and deep learning techniques for vision in Unmanned Surface Vehicles<br>[2024-12-03]
- Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey<br>:star:code<br>[2024-12-04]
- 交叉视角地理定位
- Cross-view geo-localization: a survey<br>[2024-06-17]
- 航空航天
- 船舶轨迹预测
- 野生动物监测
- 变化检测
Medical Image Progress
- Empowering Medical Imaging with Artificial Intelligence: A Review of Machine Learning Approaches for the Detection, and Segmentation of COVID-19 Using Radiographic and Tomographic Images<br>[2024-01-17]
- Advancing Low-Rank and Local Low-Rank Matrix Approximation in Medical Imaging: A Systematic Literature Review and Future Directions<br>[2024-02-23]
- When Eye-Tracking Meets Machine Learning: A Systematic Review on Applications in Medical Image Analysis<br>[2024-03-33]
- Out-of-distribution Detection in Medical Image Analysis: A survey<br>[2024-04-30]
- Development of Skip Connection in Deep Neural Networks for Computer Vision and Medical Image Analysis: A Survey<br>:star:code<br>[2024-05-06]
- Continual Learning in Medical Imaging from Theory to Practice: A Survey and Practical Analysis<br>[2024-05-24]
- Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis<br>:star:code<br>[2024-06-06]
- Solving the Inverse Problem of Electrocardiography for Cardiac Digital Twins: A Survey<br>[2024-06-18]
- A Comprehensive Survey of Foundation Models in Medicine<br>[2024-06-18]
- Review of Zero-Shot and Few-Shot AI Algorithms in The Medical Domain<br>[2024-06-25]
- Applications of interpretable deep learning in neuroimaging: a comprehensive review<br>[2024-06-27]
- Foundational Models for Pathology and Endoscopy Images: Application for Gastric Inflammation<br>[2024-06-27]
- A Review of Image Processing Methods in Prostate Ultrasound<br>[2024-07-02]
- Physics-Inspired Generative Models in Medical Imaging: A Review<br>[2024-07-16]
- Integrating Deep Learning in Cardiology: A Comprehensive Review of Atrial Fibrillation, Left Atrial Scar Segmentation, and the Frontiers of State-of-the-Art Techniques<br>[2024-07-16]
- A Survey on Trustworthiness in Foundation Models for Medical Image Analysis<br>[2024-07-24]
- PINNs for Medical Image Analysis: A Survey<br>[2024-08-05]
- Future-Proofing Medical Imaging with Privacy-Preserving Federated Learning and Uncertainty Quantification: A Review<br>:star:code<br>[2024-09-26]
- Artificial intelligence techniques in inherited retinal diseases: A review<br>[2024-10-15]
- Medical AI for Early Detection of Lung Cancer: A Survey<br>:star:code<br>[2024-10-22]
- Advancing Histopathology with Deep Learning Under Data Scarcity: A Decade in Review<br>[2024-10-29]
- Multiplex Imaging Analysis in Pathology: a Comprehensive Review on Analytical Approaches and Digital Toolkits<br>[2024-11-05]
- Ultrasound-Based AI for COVID-19 Detection: A Comprehensive Review of Public and Private Lung Ultrasound Datasets and Studies<br>[2024-11-11]
- Artificial Intelligence-Informed Handheld Breast Ultrasound for Screening: A Systematic Review of Diagnostic Test Accuracy<br>[2024-11-13]
- A Survey of Medical Vision-and-Language Applications and Their Techniques<br>:star:code<br>[2024-11-20]
- Explainable Artificial Intelligence for Medical Applications: A Review<br>[2024-12-04]
- Privacy-Preserving in Medical Image Analysis: A Review of Methods and Applications<br>[2024-12-06]
- Automatic Prediction of Stroke Treatment Outcomes: Latest Advances and Perspectives<br>[2024-12-09]
- Machine learning algorithms to predict the risk of rupture of intracranial aneurysms: a systematic review<br>[2024-12-09]预测颅内动脉瘤破裂风险的机器学习算法:系统综述
- Computational Methods for Breast Cancer Molecular Profiling through Routine Histopathology: A Review<br>[2024-12-17]
- 息肉分割
- Colorectal Polyp Segmentation in the Deep Learning Era: A Comprehensive Survey<br>[2024-01-23]
- Artificial Intelligence in Gastrointestinal Bleeding Analysis for Video Capsule Endoscopy: Insights, Innovations, and Prospects (2008-2023)<br>[2024-09-04]
- A Short Survey on Set-Based Aggregation Techniques for Single-Vector WSI Representation in Digital Pathology<br>[2024-09-10]
- The Era of Foundation Models in Medical Imaging is Approaching : A Scoping Review of the Clinical Value of Large-Scale Generative AI Applications in Radiology<br>[2024-09-23]
- 生物医学图像分割
- 微创外科视觉
- 牙科 X 射线成像分割
- 胶质瘤组织切片分析
- 手术
- 人工耳蜗
- 医学图像配准
- stroke segmentation
- CT
- 医学图像分类
- 医学图像分割
- Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey<br>[2024-05-06]
- AI-based Automatic Segmentation of Prostate on Multi-modality Images: A Review<br>[2024-07-10]
- Deep Learning for Pancreas Segmentation: a Systematic Review<br>[2024-07-24]
- A Short Review and Evaluation of SAM2's Performance in 3D CT Image Segmentation<br>[2024-08-22]
- Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey<br>:star:code<br>[2024-08-26]
- U-Net in Medical Image Segmentation: A Review of Its Applications Across Modalities<br>[2024-12-04]
- 医学影像分析
- A Comprehensive Survey of Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond<br>[2024-10-04]
- Self-eXplainable AI for Medical Image Analysis: A Survey and New Outlooks<br>[2024-10-04]
- Navigating Distribution Shifts in Medical Image Analysis: A Survey<br>[2024-11-12]
- 医学图像生成
- Exploring Variational Autoencoders for Medical Image Generation: A Comprehensive Study<br>:star:code<br>[2024-11-12]
- 细胞核实例分割
- 神经成像中的异常检测
- 报告生成
- 基于步态的神经退行性疾病诊断中的人工智能调查
- A Survey of Artificial Intelligence in Gait-Based Neurodegenerative Disease Diagnosis<br>:star:code<br>[2024-05-24]
- 目标检测
- 肺炎检测
- 癌症检测
- MRI 重建
Image Classification(图像分类)
- High-energy physics image classification: A Survey of Jet Applications<br>[2024-03-19]
- Noisy Label Processing for Classification: A Survey<br>[2024-04-08]
- Traditional to Transformers: A Survey on Current Trends and Future Prospects for Hyperspectral Image Classification<br>:star:code<br>[2024-04-24]
- Convolutional Neural Networks and Vision Transformers for Fashion MNIST Classification: A Literature Review<br>[2024-06-06]
- A review on vision-based motion estimation<br>[2024-07-22]
- On the Element-Wise Representation and Reasoning in Zero-Shot Image Recognition: A Systematic Survey<br>[2024-08-12]
Image Retrieval
- A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches<br>[2024-09-04]
Image Captioning(图像字幕)
- Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy and Novel Ensemble Method<br>[2024-08-12]
Image Segmentation(图像分割)
- Image Segmentation in Foundation Model Era: A Survey<br>[2024-08-26]
- On Efficient Variants of Segment Anything Model: A Survey<br>[2024-10-08]
- A Review of Bayesian Uncertainty Quantification in Deep Probabilistic Image Segmentation<br>[2024-11-26]
- Review of Fruit Tree Image Segmentation<br>[2024-12-20]
- 语义分割
- Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey<br>[2024-03-06]无PDF
- Deep Learning-Based 3D Instance and Semantic Segmentation: A Review<br>[2024-06-21]
- Deep Learning on 3D Semantic Segmentation: A Detailed Review<br>:star:code<br>[2024-11-05]
- 纹理分割
Image retrieval(图像检索)
- A Survey of Multimodal Composite Editing and Retrieval<br>:star:code<br>[2024-09-10]
Super-Resolution(超分辨率)
- ISR
Image and Video Progress
- 修复
- 恢复
- Taming Diffusion Models for Image Restoration: A Review<br>[2024-09-17]
- A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends<br>:star:code<br>[2024-10-22]
- 着色
- Computer-aided Colorization State-of-the-science: A Survey<br>:star:code<br>[2024-10-04]
- 去噪
- 去模糊
- Application of Deep Learning in Blind Motion Deblurring: Current Status and Future Prospects<br>:star:code<br>[2024-01-11]
- 去阴影
- Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey<br>:star:code<br>[2024-07-15]
- 去大气湍流
- 图像增强
- 水下图像增强
- A Comprehensive Survey on Underwater Image Enhancement Based on Deep Learning<br>:star:code<br>[2024-05-31]
- 图像数据增强
- 水下图像增强
- 视频质量评估
- Video Quality Assessment: A Comprehensive Survey<br>[2024-12-09]
Image Segmentation
- Systematic review of image segmentation using complex networks<br>[2024-01-08]
Image/video compression(图像/视频压缩)
- The evolution of volumetric video: A survey of smart transcoding and compression approaches<br>[2024-11-05]
Face(人脸)
- SoK: Facial Deepfake Detectors<br>[2024-01-10]
- Neuromorphic Face Analysis: a Survey<br>[2024-02-20]
- A Comprehensive Survey of Masked Faces: Recognition, Detection, and Unmasking<br>[2024-05-10]
- Evolving from Single-modal to Multi-modal Facial Deepfake Detection: A Survey<br>:star:code<br>[2024-06-12]
- Artificial Immune System of Secure Face Recognition Against Adversarial Attacks<br>:star:code<br>[2024-06-27]
- Complex Emotion Recognition System using basic emotions via Facial Expression, EEG, and ECG Signals: a review<br>[2024-09-13]
- A Survey on Physical Adversarial Attacks against Face Recognition Systems<br>[2024-10-23]
- 人脸表情
- A Survey on Facial Expression Recognition of Static and Dynamic Emotions<br>:star:code<br>[2024-08-29]
- A survey on Graph Deep Representation Learning for Facial Expression Recognition<br>[2024-11-14]
- 群体情绪识别(Group-level Emotion Recognition ,GReco)
- A Survey of Deep Learning for Group-level Emotion Recognition<br>[2024-08-29]
- 人脸伪造检测
- Deep Learning Technology for Face Forgery Detection: A Survey<br>[2024-09-24]
3D Reconstruction
- 3D Scene Geometry Estimation from 360∘ Imagery: A Survey<br>[2024-01-18]
- Survey on Modeling of Articulated Objects<br>[2024-03-25]
- RGB Guided ToF Imaging System: A Survey of Deep Learning-based Methods<br>[2024-05-20]
- A Survey on Text-guided 3D Visual Grounding: Elements, Recent Advances, and Future Directions<br>:star:code<br>[2024-06-11]
- 3D Representation Methods: A Survey<br>[2024-10-10]
- 网格重建
- 三维视觉
- Diffusion Models in 3D Vision: A Survey<br>[2024-10-08]
- 三维重建
- Recent Trends in 3D Reconstruction of General Non-Rigid Scenes<br>[2024-03-25]
- Gaussian Splatting: 3D Reconstruction and Novel View Synthesis, a Review<br>[2024-05-07]
- Survey on Fundamental Deep Learning 3D Reconstruction Techniques<br>[2024-07-12]
- A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery<br>:star:code<br>[2024-08-09]
- 三维形状
- 3D 生成
- Advances in 3D Generation: A Survey<br>[2024-02-01]
- 3D 密集字幕
- 深度估计
- Geometric Constraints in Deep Learning Frameworks: A Survey<br>[2024-03-20]
- Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey<br>[2024-07-01]
- Event-based Stereo Depth Estimation: A Survey<br>:star:code<br>[2024-09-27]
- 三维场景理解
- Stereo Matching
- A Survey on Deep Stereo Matching in the Twenties<br>:star:code<br>[2024-07-11]
- 3DGS
- 3DGS.zip: A survey on 3D Gaussian Splatting Compression Methods<br>:star:code<br>[2024-07-16]
- 3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities<br>[2024-07-25]
- MVS
- Learning-based Multi-View Stereo: A Survey<br>[2024-08-28]
1.Unkown(未分)
- Comprehensive Exploration of Synthetic Data Generation: A Survey<br>[2024-01-08]
- Image-based Deep Learning for Smart Digital Twins: a Review<br>[2024-01-08]
- A Survey on 3D Gaussian Splatting<br>[2024-01-09]
- A Survey on African Computer Vision Datasets, Topics and Researchers<br>:star:code<br>[2024-01-23]
- Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey<br>:star:code<br>[2024-02-06]
- A Literature Review of Literature Reviews in Pattern Analysis and Machine Intelligence<br>[2024-02-21]
- Asphalt Concrete Characterization Using Digital Image Correlation: A Systematic Review of Best Practices, Applications, and Future Vision<br>[2024-02-28]
- Lightweight Deep Learning for Resource-Constrained Environments: A Survey<br>[2024-04-12]
- A Survey of Neural Network Robustness Assessment in Image Recognition<br>[2024-04-15]
- State Space Model for New-Generation Network Alternative to Transformers: A Survey<br>:star:code<br>[2024-04-16]
- A Survey on Vision Mamba: Models, Applications and Challenges<br>:star:code<br>[2024-04-30]
- Generative Artificial Intelligence: A Systematic Review and Applications<br>[2024-05-21]
- A Review of Pulse-Coupled Neural Network Applications in Computer Vision and Image Processing<br>[2024-06-04]
- Exploring the Potential of Polynomial Basis Functions in Kolmogorov-Arnold Networks: A Comparative Study of Different Groups of Polynomials<br>[2024-06-06]
- Deep learning for precipitation nowcasting: A survey from the perspective of time series forecasting<br>[2024-06-11]
- Diffusion Models in Low-Level Vision: A Survey<br>:star:code<br>[2024-06-18]
- Public Computer Vision Datasets for Precision Livestock Farming: A Systematic Survey<br>[2024-06-18]
- Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI<br>:star:code<br>[2024-07-10]
- Event-based vision on FPGAs -- a survey<br>[2024-07-12]
- Fairness and Bias Mitigation in Computer Vision: A Survey<br>[2024-08-06]
- A Review of Pseudo-Labeling for Computer Vision<br>[2024-08-15]
- Generative AI in Industrial Machine Vision -- A Review<br>[2024-08-21]
- Recent Event Camera Innovations: A Survey<br>:star:code<br>[2024-08-27]
- How Could Generative AI Support Compliance with the EU AI Act? A Review for Safe Automated Driving Perception<br>[2024-09-02]
- Local map Construction Methods with SD map: A Novel Survey<br>[2024-09-05]
- A Survey on Mixup Augmentations and Beyond<br>:star:code<br>[2024-09-10]
- Transfer Learning Applied to Computer Vision Problems: Survey on Current Progress, Limitations, and Opportunities<br>[2024-09-13]
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications<br>:star:code<br>[2024-10-07]
- AI-Driven Approaches for Glaucoma Detection -- A Comprehensive Review<br>[2024-10-22]
- Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey<br>[2024-11-07]
- Trends, Challenges, and Future Directions in Deep Learning for Glaucoma: A Systematic Review<br>[2024-11-12]
- A Survey on Vision Autoregressive Model<br>[2024-11-14]
- Towards Fairness in AI for Melanoma Detection: Systemic Review and Recommendations<br>[2024-11-21]
- Artificial Intelligence for Geometry-Based Feature Extraction, Analysis and Synthesis in Artistic Images: A Survey<br>[2024-12-03]
- A Review of Intelligent Device Fault Diagnosis Technologies Based on Machine Vision<br>[2024-12-12]
- Predictive Pattern Recognition Techniques Towards Spatiotemporal Representation of Plant Growth in Simulated and Controlled Environments: A Comprehensive Review<br>[2024-12-17]
- A Review of Multimodal Explainable Artificial Intelligence: Past, Present and Future<br>[2024-12-19]
2023 年论文分类汇总戳这里
↘️CVPR-2023-Papers ↘️WACV-2023-Papers ↘️ICCV-2023-Papers ↘️2023-CV-Surveys
<a name="0000"/>2022 年论文分类汇总戳这里
↘️CVPR-2022-Papers ↘️WACV-2022-Papers ↘️ECCV-2022-Papers
<a name="000"/>2021 年论文分类汇总戳这里
↘️ICCV-2021-Papers ↘️CVPR-2021-Papers
<a name="00"/>2020 年论文分类汇总戳这里
↘️CVPR-2020-Papers ↘️ECCV-2020-Papers