Awesome
52CV-WACV-Papers
历年综述论文分类汇总戳这里↘️CV-Surveys施工中~~~~~~~~~~
2022 年论文分类汇总戳这里
↘️CVPR-2022-Papers ↘️WACV-2022-Papers
2021年论文分类汇总戳这里
↘️ICCV-2021-Papers ↘️CVPR-2021-Papers
2020 年论文分类汇总戳这里
↘️CVPR-2020-Papers ↘️ECCV-2020-Papers
:exclamation::exclamation::exclamation::star2::star2::star2:📗📗📗WACV 2022收录论文已全部公布,下载可在【我爱计算机视觉】后台回复“paper”,即可收到。共计 406 篇。
:exclamation::exclamation::exclamation::star2::star2::star2:分类完成
目录
55.Object Counting(物体计数)
<a name="54"/>54.Optical Flow(光流)
<a name="53"/>53.Gaze Estimation(视线估计)
<a name="52"/>52.Eye Tracking(眼动跟踪)
<a name="51"/>51.Semantic Scene Completion(语义场景完成SSC)
<a name="50"/>50.Sign Language Translation(手语翻译)
<a name="49"/>49.Debiasing(去偏见)
<a name="48"/>48.Light Fields(光场)
- Fast and Efficient Restoration of Extremely Dark Light Fields
- 相机校准
- Camera Pose Estimation(相机姿势估计)
47.Data Augmentation(数据增强)
- Meta Approach to Data Augmentation Optimization
- Improving Model Generalization by Agreement of Learned Representations From Data Augmentation<br>:star:code
46.Metric Learning(度量学习)
- Multi-Head Deep Metric Learning Using Global and Local Representations
- Hierarchical Proxy-Based Loss for Deep Metric Learning
45.Class-Incremental Learning(类增量学习)
<a name="44"/>44.Multi-Task Learning(多任务学习)
- Joint Classification and Trajectory Regression of Online Handwriting Using a Multi-Task Learning Approach
- Semi-Supervised Multi-Task Learning for Semantics and Depth
43.Active Learning(主动学习)
<a name="42"/>42.Landmark Detection(关键点检测)
<a name="41"/>41.Action Generation(动作生成)
- MUGL: Large Scale Multi Person Conditional Action Generation with Locomotion<br>:star:code:house:project
- 基于姿势引导的动作合成
40.Anomaly Detection(异常检测)
- CFLOW-AD: Real-Time Unsupervised Anomaly Detection With Localization via Conditional Normalizing Flows<br>:star:code
- A Semi-Supervised Generalized VAE Framework for Abnormality Detection Using One-Class Classification
- novelty detection(奇异值检测)
39.Style Transfer(风格迁移)
<a name="38"/>38.Sound(音频处理)
- Beyond Mono to Binaural: Generating Binaural Audio From Mono Audio With Depth and Cross Modal Attention<br>:house:project
- 声源定位
- 声源分离
37.Object Tracking(目标跟踪)
<a name="36"/>36.Soft Biometrics(软生物技术)
- Periocular(眼周) 识别
35.VQA(视觉问答)
- InfographicVQA<br>:star:code
- Efficient Counterfactual Debiasing for Visual Question Answering<br>:star:code
- Audio video scene-aware dialog(视听场景感知对话)
34.SLAM\Robots
- SLAM
- Try-On
- Robots
33.View Synthesis(视图合成)
- Revealing Disocclusions in Temporal View Synthesis Through Infilling Vector Prediction<br>:star:code:house:project:tv:video
- Fast and Explicit Neural View Synthesis
- Novel-View Synthesis of Human Tourist Photos
32.Continual Learning(持续学习)
<a name="31"/>31.Deepfake Detection(假象检测)
<a name="30"/>30.Reinforcement Learning(强化学习)
<a name="29"/>29.Image Classification(图像分类)
- Evaluating and Mitigating Bias in Image Classifiers: A Causal Perspective Using Counterfactuals
- Class-Balanced Active Learning for Image Classification<br>:star:code
- Learnable Adaptive Cosine Estimator (LACE) for Image Classification<br>:star:code
- Enhancing Few-Shot Image Classification With Unlabelled Examples<br>:star:code
- 零样本分类
- 小样本分类
- 细粒度识别
28.Pose Estimation(姿态估计)
- 物品姿势估计
- Object Pose Refinement
- 动物姿势
27.Defect Detection(缺陷检测)
- Fully Convolutional Cross-Scale-Flows for Image-Based Defect Detection<br>:star:code
- Automated Defect Inspection in Reverse Engineering of Integrated Circuits
- 下水道缺陷分类
26.Dataset\Benchmark(数据集\基准)
- MovingFashion: A Benchmark for the Video-To-Shop Challenge<br>:sunflower:dataset
- Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark
- 用于检测跟踪海域人类
- 图像识别
- 自动驾驶
- 用于从高空鱼眼相机中检测和跟踪行人
25.Image Captioning(图像字幕)
- Is an Image Worth Five Sentences? A New Look Into Semantics for Image-Text Matching<br>:star:code:star:code
- Let There Be a Clock on the Beach: Reducing Object Hallucination in Image Captioning<br>:star:code
- Improve Image Captioning by Estimating the Gazing Patterns From the Caption
24.Image Retrieval(图像检索)
- All the Attention You Need: Global-Local, Spatial-Channel Attention for Image Retrieval
- Learning With Label Noise for Image Retrieval by Selecting Interactions
- SAC: Semantic Attention Composition for Text-Conditioned Image Retrieval
- Image-Text retrieval
- 图像搜索
- 视频文本匹配
- 绘图检索
- 视频检索
23.Autonomous Driving(智能驾驶)
- 自动驾驶
- 车辆定位
- Vehicle Detection(交通检测)
- Lane Detection(车道线检测)
22.Human Action Recognition(人体动作识别与检测)
- NUTA: Non-Uniform Temporal Aggregation for Action Recognition
- MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition
- Dual-Head Contrastive Domain Adaptation for Video Action Recognition<br>:star:code
- Skeleton-DML: Deep Metric Learning for Skeleton-Based One-Shot Action Recognition<br>:star:code
- SWAG-V: Explanations for Video Using Superpixels Weighted by Average Gradients
- Pose and Joint-Aware Action Recognition<br>:star:code
- Domain Generalization Through Audio-Visual Relative Norm Alignment in First Person Action Recognition
- 3D动作识别
- 动作定位
- 时序动作分割
21.Point Cloud(点云)
- Surrogate Model-Based Explainability Methods for Point Cloud NNs<br>:star:code
- StickyLocalization: Robust End-to-End Relocalization on Point Clouds Using Graph Neural Networks
- 3D 点云
- 分类与分割
20.Transformer
- Billion-Scale Pretraining with Vision Transformers for Multi-Task Visual Representations
- Visualizing Paired Image Similarity in Transformer Networks<br>:star:code
- S2-MLP: Spatial-Shift MLP Architecture for Vision
- 图像分类
- 图像超级补全
19.Model Compression\Knowledge Distillation\Pruning(模型压缩\知识蒸馏\剪枝)
- 模型压缩
- 知识蒸馏
- 剪枝
18.NAS(神经架构搜索)
- Approximate Neural Architecture Search via Operation Distribution Learning
- Neural Architecture Search for Efficient Uncalibrated Deep Photometric Stereo
- Towards a Robust Differentiable Architecture Search Under Label Noise
- Lightweight Monocular Depth With a Novel Neural Architecture Search Method
- Progressive Automatic Design of Search Space for One-Shot Neural Architecture Search
17.OCR(文本检测)
- Post-OCR Paragraph Recognition by Graph Convolutional Networks
- 不规则场景文本识别
- LOGO识别
- 手写文本识别
- 表格结构识别
16.Super-Resolution(超分辨率)
- Normalizing Flow as a Flexible Fidelity Objective for Photo-Realistic Super-Resolution<br>:star:code
- Multi-Dimensional Dynamic Model Compression for Efficient Image Super-Resolution
- edge-SR: Super-Resolution for the Masses
- DAQ: Channel-Wise Distribution-Aware Quantization for Deep Image Super-Resolution Networks
- Hyperspectral Image Super-Resolution With RGB Image Super-Resolution as an Auxiliary Task<br>:star:code
- VSR
- BSR
15.Image Synthesis(图像合成)
- 图像生成
- sketch-to-photo
- Image-to-Image Translation
14.Un\Self\Semi-Supervised Learning(无\自\半监督学习)
- 半监督
- 自监督
- 无监督
13.Image Segmentation(图像分割)
- Semantically Stealthy Adversarial Attacks Against Segmentation Models
- 视频分割
- VOS(视频目标分割)
- 动作分割
- 语义分割
- Plugging Self-Supervised Monocular Depth Into Unsupervised Domain Adaptation for Semantic Segmentation<br>:star:code
- Adversarial Semantic Hallucination for Domain Generalized Semantic Segmentation<br>:star:code
- Shallow Features Guide Unsupervised Domain Adaptation for Semantic Segmentation at Class Boundaries<br>:star:code
- Evaluating the Robustness of Semantic Segmentation for Autonomous Driving Against Real-World Adversarial Patch Attacks
- Multi-Domain Incremental Learning for Semantic Segmentation<br>:star:code
- Active Learning for Improved Semi-Supervised Semantic Segmentation in Satellite Images<br>:star:code
- Multi-Domain Semantic Segmentation With Overlapping Labels
- Mixed-Dual-Head Meets Box Priors: A Robust Framework for Semi-Supervised Segmentation
- 视频语义分割
- 弱监督语义分割
- 无监督语义分割
- 半监督语义分割
- 小样本语义分割
- 实例分割
- 全景分割
- Foreground-Background 分割
- 超像素分割
- 道路分割
- 抠图
- 视频抠图
- Robust High-Resolution Video Matting with Temporal Guidance<br>:star:code:house:project:tv:video
- 视频抠图
12.One\Few-Shot Learning or Domain Adaptation\Generalization\Shift(单\小样本学习 or 域适应\泛化\偏移)
- 域适应
- Unsupervised Robust Domain Adaptation Without Source Data
- 半监督域适应
- 无监督域适应
- 开集域适应
- 多源域适应
- 多目标域适应
- 域泛化
- 小样本学习
- Contextual Gradient Scaling for Few-Shot Learning<br>:star:code
- Calibrating CNNs for Few-Shot Meta Learning
- SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning<br>:star:code
- Tensor Feature Hallucination for Few-Shot Learning<br>:star:code
- Ortho-Shot: Low Displacement Rank Regularization With Data Augmentation for Few-Shot Learning
- Domain Shift
- 单样本学习
11.Face(人脸)
- 3D Facial
- 基于皱纹的人体识别
- 人脸活体检测
- 人脸表情
- 人脸检测
- PAD人脸呈现攻击检测
- 年龄预测
- Face verification(人脸验证)
- 人脸去模糊
- facial forgery detection
- 人脸图像质量苹果
- 人脸补全
- 妆容迁移
- 人脸恢复
- 人脸识别
10.Adversarial Learning(对抗学习)
- 黑盒攻击
- 对抗样本
- 对抗攻击
9.Remote Sensing\Satellite Image(遥感\卫星图像)
- Lane-Level Street Map Extraction From Aerial Imagery
- An Experimental Comparison of Multi-View Stereo Approaches on Satellite Images
- 小样本开放集识别
- 检测
- 跟踪
8.Image Processing(图像处理)
- Extracting Vignetting and Grain Filter Effects From Photos
- 去噪
- 去雨
- 去模糊
- 去马赛克
- 图像着色
- 图像裁剪
- 图像恢复
- 图像修复
- 图像降质
- 图像增强
- 图像质量评估
- Image reenactment(图像重演)
- Image decomposition(图像分解)
- HDR
- Auto white balance(自动白平衡)
7.Human Pose(人体姿态)
- 人体动作合成
- 3D人体
- 人体姿态估计
- 3D人体姿态估计
- 3D手部姿势估计
- 头部姿势估计
- 三维人体模型
- 人体形状
6.Video(视频相关)
- 无监督视频域适应
- Partial Video Copy Detection(局部视频拷贝检测)
- 异常检测
- Discrete Neural Representations for Explainable Anomaly Detection<br>:house:project:tv:video
- Rethinking Video Anomaly Detection - A Continual Learning Approach
- A Modular and Unified Framework for Detecting and Localizing Video Anomalies
- FastAno: Fast Anomaly Detection via Spatio-Temporal Patch Transformation
- Multi-Branch Neural Networks for Video Anomaly Detection in Adverse Lighting and Weather Conditions
- sarcasm and humor detection(讽刺与幽默检测)
- 视频表征学习
- 视频字幕
- 视频人物定位
- 视频稳定
- Deep Online Fused Video Stabilization<br>:star:code:house:project:tv:video
- 视频理解
- 视频分类
- 视频摘要
- 有声视频合成
- Strumming to the Beat: Audio-Conditioned Contrastive Video Textures<br>:star:code:house:project:tv:video
- 视频帧插值
- 视频时刻定位
- Temporal Video Segmentation(时序视频分割)
5.Object Detection(目标检测)
- Meta-UDA: Unsupervised Domain Adaptive Thermal Object Detection Using Meta-Learning
- ADC: Adversarial Attacks Against Object Detection That Evade Context Consistency Checks
- TricubeNet: 2D Kernel-Based Object Representation for Weakly-Occluded Oriented Object Detection<br>:star:code
- Detecting Tear Gas Canisters With Limited Training Data
- Learned Event-Based Visual Perception for Improved Space Object Detection
- Densely-Packed Object Detection via Hard Negative-Aware Anchor Attention
- PICA: Point-Wise Instance and Centroid Alignment Based Few-Shot Domain Adaptive Object Detection With Loose Annotations
- Improving Object Detection by Label Assignment Distillation<br>:star:code
- Fusion Point Pruning for Optimized 2D Object Detection With Radar-Camera Fusion
- YOLO-ReT: Towards High Accuracy Real-Time Object Detection on Edge GPUs<br>:star:code
- SC-UDA: Style and Content Gaps Aware Unsupervised Domain Adaptation for Object Detection
- To Miss-Attend Is to Misalign! Residual Self-Attentive Feature Alignment for Adapting Object Detectors<br>:star:code
- 目标定位
- MOD(移动目标检测)
- 路标检测
- 零样本检测
- 小样本目标检测
- 图像异常检测
- 弱监督目标检测
- Few-Shot Weakly-Supervised Object Detection via Directional Statistics
- 海上障碍物检测
- 人造卫星识别
- Object Anti-Spoofing
- 3D目标检测
- ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection<br>:star:code
- Fast-CLOCs: Fast Camera-LiDAR Object Candidates Fusion for 3D Object Detection
- M3DETR: Multi-Representation, Multi-Scale, Mutual-Relation 3D Object Detection With Transformers<br>:star:code
- 显著目标检测
- 伪装目标检测
- 球员检测
- Wireframe Detection(线框检测)
4.GAN(生成对抗网络)
- GraN-GAN: Piecewise Gradient Normalization for Generative Adversarial Networks
- Latent to Latent: A Learned Mapper for Identity Preserving Editing of Multiple Face Attributes in StyleGAN-Generated Images
- AE-StyleGAN: Improved Training of Style-Based Auto-Encoders<br>:star:code
- GANs Spatial Control via Inference-Time Adaptive Normalization
- Latent Reweighting, an Almost Free Improvement for GANs
- PPCD-GAN: Progressive Pruning and Class-Aware Distillation for Large-Scale Conditional GANs Compression
- Controlled GAN-Based Creature Synthesis via a Challenging Game Art Dataset - Addressing the Noise-Latent Trade-Off
- Data InStance Prior (DISP) in Generative Adversarial Networks
- Sketch-To-Face草图到人脸图像翻译
- 基于关键点重新合成新姿势
- MRI重建
3.3D(三维视觉)
- 深度估计
- stereo images
- 三维重建
- Single-Shot Dense Active Stereo With Pixel-Wise Phase Estimation Based on Grid-Structure Using CNN and Correspondence Estimation Using GCN
- Style Agnostic 3D Reconstruction via Adversarial Style Transfer<br>:star:code
- 3D Modeling Beneath Ground: Plant Root Detection and Reconstruction Based on Ground-Penetrating Radar
- Mending Neural Implicit Modeling for 3D Vehicle Reconstruction in the Wild
- Learning to Reconstruct 3D Non-Cuboid Room Layout from a Single RGB Image<br>:star:code
- Tensor-Based Non-Rigid Structure From Motion
- stereo vision(立体视觉)
- 网格重建
2.Medical Image(医学影像)
- 分割
- UNETR: Transformers for 3D Medical Image Segmentation<br>:star:code
- Self-Supervised Generative Style Transfer for One-Shot Medical Image Segmentation<br>:star:code
- AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
- Co-Net: A Collaborative Region-Contour-Driven Network for Fine-to-Finer Medical Image Segmentation
- T-Net: A Resource-Constrained Tiny Convolutional Neural Network for Medical Image Segmentation
- Hyper-Convolution Networks for Biomedical Image Segmentation<br>:star:code
- 血管分割
- 腺体分割
- 检索
- 配准
- 分类
- 自动生成医学报告
- 手术器械定位
- 胸部X光片的异常分类和定位
1.其它
- Does Data Repair Lead to Fair Models? Curating Contextually Fair Data To Reduce Model Bias<br>:star:code
- The Untapped Potential of Off-the-Shelf Convolutional Neural Networks
- Unveiling Real-Life Effects of Online Photo Sharing
- Shadow Art Revisited: A Differentiable Rendering Based Approach
- Towards Class-Oriented Poisoning Attacks Against Neural Networks
- Predicting Levels of Household Electricity Consumption in Low-Access Settings
- Neural Radiance Fields Approach to Deep Multi-View Photometric Stereo
- PRECODE - A Generic Model Extension To Prevent Deep Gradient Leakage
- Discovering Underground Maps From Fashion
- On the Maximum Radius of Polynomial Lens Distortion<br>:star:code
- The Hitchhiker's Guide to Prior-Shift Adaptation<br>:star:code
- FalCon: Fine-Grained Feature Map Sparsity Computing With Decomposed Convolutions for Inference Optimization
- METGAN: Generative Tumour Inpainting and Modality Synthesis in Light Sheet Microscopy
- Agree To Disagree: When Deep Learning Models With Identical Architectures Produce Distinct Explanations<br>:star:code
- REFICS: A Step Towards Linking Vision With Hardware Assurance
- Deep Optimization Prior for THz Model Parameter Estimation
- Sharing Decoders: Network Fission for Multi-Task Pixel Prediction
- Fair Visual Recognition in Limited Data Regime using Self-Supervision and Self-Distillation
- Low-Cost Multispectral Scene Analysis With Modality Distillation
- Self-Supervised Pretraining Improves Self-Supervised Pretraining
- PROVES: Establishing Image Provenance Using Semantic Signatures
- Addressing Out-of-Distribution Label Noise in Webly-Labelled Data<br>:star:code
- Towards Durability Estimation of Bioprosthetic Heart Valves via Motion Symmetry Analysis
- Network Generalization Prediction for Safety Critical Tasks in Novel Operating Domains
- Generalized Clustering and Multi-Manifold Learning With Geometric Structure Preservation<br>:star:code
- Batch Normalization Tells You Which Filter Is Important
- Sandwich Batch Normalization: A Drop-In Replacement for Feature Distribution Heterogeneity<br>:star:code
- Parsing Line Chart Images Using Linear Programming
- CrossLocate: Cross-Modal Large-Scale Visual Geo-Localization in Natural Environments Using Rendered Modalities<br>:house:project
- Symmetric-Light Photometric Stereo
- REGroup: Rank-Aggregating Ensemble of Generative Classifiers for Robust Predictions<br>:house:project:star:code
- Leveraging Test-Time Consensus Prediction for Robustness Against Unseen Noise
- Supervised Compression for Resource-Constrained Edge Computing Systems<br>:star:code
- Action Anticipation Using Latent Goal Learning<br>:star:code
- Non-Semantic Evaluation of Image Forensics Tools: Methodology and Database
- Inpaint2Learn: A Self-Supervised Framework for Affordance Learning
- RGL-NET: A Recurrent Graph Learning Framework for Progressive Part Assembly
- Self-Supervised Knowledge Transfer via Loosely Supervised Auxiliary Tasks<br>:star:code
- Novel Ensemble Diversification Methods for Open-Set Scenarios
- Contrast To Divide: Self-Supervised Pre-Training for Learning With Noisy Labels<br>:star:code
- Typenet: Towards Camera Enabled Touch Typing on Flat Surfaces Through Self-Refinement<br>:star:code
- Nonnegative Low-Rank Tensor Completion via Dual Formulation With Applications to Image and Video Completion
- MisConv: Convolutional Neural Networks for Missing Data
- MAPS: Multimodal Attention for Product Similarity
- Global Assists Local: Effective Aerial Representations for Field of View Constrained Image Geo-Localization
- Self-Supervised Test-Time Adaptation on Video Data
- FT-DeepNets: Fault-Tolerant Convolutional Neural Networks With Kernel-Based Duplication
- Short-Term Solar Irradiance Prediction From Sky Images With a Clear Sky Model
- Reconstructing Training Data From Diverse ML Models by Ensemble Inversion
- How Good Is Your Explanation? Algorithmic Stability Measures To Assess the Quality of Explanations for Deep Neural Networks
- Seeing Implicit Neural Representations As Fourier Series
- Human-Aided Saliency Maps Improve Generalization of Deep Learning
- Cross-Modal Adversarial Reprogramming
- Learning From the CNN-Based Compressed Domain
- Spatiotemporal Initialization for 3D CNNs With Generated Motion Patterns<br>:house:project
- DAD: Data-Free Adversarial Defense at Test Time
- Geometry-Inspired Top-K Adversarial Perturbations
- Shape-Coded ArUco: Fiducial Marker for Bridging 2D and 3D Modalities
- Interpretable Semantic Photo Geolocation<br>:star:code
- Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes<br>:star:code
- Geometry-Aware Hierarchical Bayesian Learning on Manifolds
- Transferable 3D Adversarial Textures Using End-to-End Optimization
- Improving Fractal Pre-Training<br>:star:code:house:project