Awesome
<!-- * @Author: fzy * @Date: 2020-03-09 21:53:10 * @LastEditors: Zhenying * @LastEditTime: 2020-12-03 18:58:12 * @Description: -->Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised & Semi Supervised Temporal Action Detection & Temporal Action Proposal Generation & Open-Vocabulary Temporal Action Detection
Contents
<!-- TOC -->- Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
- about pretrained model
- ActivityNet Challenge
- Temporal Action Proposal Generation
- Temporal Action Detection
- Weakly Supervised Temporal Action Detection
- Online Action Detection
- Semi Supervised Temporal Action Detection
- Open-Vocabulary Temporal Action Detection
about pretrained model
- (BSP) Boundary-sensitive Pre-training for Temporal Localization in Videos (ICCV 2021)
- (TSP) TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks (ICCVW 2021)
- (UP-TAL) Unsupervised Pre-training for Temporal Action Localization Tasks (CVPR 2022) code
- Contrastive Language-Action Pre-training for Temporal Localization (arxiv 2022)
- Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization (NeurIPS 2021)
ActivityNet Challenge and talks
- (2021) AcitvityNet 2021
- (2021) Transformer在时序行为检测中的应用 & 基于自监督学习的半监督时序行为检测 (DAMO Academy, Alibaba Group)
Papers: Temporal Action Proposal Generation
2023
- (MIFNet) MIFNet: Multiple Instances Focused Temporal Action Proposal Generation (Neurocomputing 2023)
- (SMBG) Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator (arxiv 2023) code
- (MCBD) [Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation](Tip 2023) code
2022
- (BCNet) Temporal Action Proposal Generation with Background Constraint (AAAI 2022)
- (PRSA-Net) Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation (BMVC 2022) code
- (TDN) Modeling long-term video semantic distribution for temporal action proposal generation (Neurocomputing 2022)
- (AOE-Net) AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation (IJCV 2022)
2021
- (BSN++) BSN++: Complementary Boundary Regressor with Scale-Balanced RelationModeling for Temporal Action Proposal Generation (AAAI 2021) Author's Zhihu
- (RTD-Net) Relaxed Transformer Decoders for Direct Action Proposal Generation (ICCV 2021) code Zhihu
- (TCANet) Temporal Context Aggregation Network for Temporal Action Proposal Refinement (CVPR 2021) Zhihu
- Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation (arxiv 2021)
- (TAPG) Temporal Action Proposal Generation with Transformers (arxiv 2021)
- (AEN) Agent-Environment Network for Temporal Action Proposal Generation (ICASSP 2021)
- (AEI) AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation (BMVC 2021) code
2020
- VALSE talk by Tianwei Lin (2020.03.18) link (7y8g)
- (RapNet) Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network (AAAI 2020) pre-paper 2019 ActivityNet task-1 2nd
- (DBG) Fast Learning of Temporal Action Proposal via Dense Boundary Generator (AAAI 2020) paper code.TensorFlow
- (BC-GNN) Boundary Content Graph Neural Network for Temporal Action Proposal Generation (ECCV 2020) paper
- Bottom-Up Temporal Action Localization with Mutual Regularization (ECCV 2020) code.TensorFlow
- (TSI) TSI: Temporal Scale Invariant Network for Action Proposal Generation (ACCV 2020)
2019
- (SRG) SRG: Snippet Relatedness-based Temporal Action Proposal Generator (IEEE Trans 2019) paper
- (DPP) Deep Point-wise Prediction for Action Temporal Proposal (ICONIP 2019) paper code.PyTorch
- (BMN) BMN: Boundary-Matching Network for Temporal Action Proposal Generation (ICCV 2019) paper code.PaddlePaddle code.PyTorch_unofficial
- (MGG) Multi-granularity Generator for Temporal Action Proposal (CVPR 2019) paper
- Investigation on Combining 3D Convolution of Image Data and Optical Flow to Generate Temporal Action Proposals (2019 CVPR Workshop) paper
- (CMSN) CMSN: Continuous Multi-stage Network and Variable Margin Cosine Loss for Temporal Action Proposal Generation (arxiv 2019) paper
- A high performance computing method for accelerating temporal action proposal generation (arxiv 2019) paper
- Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2 (ActvityNet challenge 2019) paper
- Joint Learning of Local and Global Context for Temporal Action Proposal Generation (TCSVT 2019)
2018
- (CTAP) CTAP: Complementary Temporal Action Proposal Generation (ECCV 2018) paper code.TensorFlow
- (BSN) BSN: Boundary Sensitive Network for Temporal Action Proposal Generation (ECCV 2018) paper code.TensorFlow code.PyTorch
- (SAP) SAP: Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning (AAAI 2018) paper code.Torch
2017
- (TURN TAP) TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals (ICCV 2017) paper code.TensorFlow
- (SST) SST: Single-Stream Temporal Action Proposals (CVPR 2017) paper code.theano code.TensorFlow
- YoTube: Searching Action Proposal via Recurrent and Static Regression Networks (IEEE Trans 2017) paper
- A Pursuit of Temporal Accuracy in General Activity Detection (arxiv 2017) paper code.PyTorch
before
Papers: Temporal Action Detection
2024
- (DenoiseLoc) Boundary Denoising for Video Activity Localization (ICLR 2024) code
- (LITA) LITA: Language Instructed Temporal-Localization Assistant (arXiv 2024) code
- (PLOT-TAL) (few-shot) PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization (Arxiv 2024)
- Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions (CVPR 2024) code
- (zero-shot) (T3AL) Test-Time Zero-Shot Temporal Action Localization (CVPR 2024) code
- (UniMD) UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection (ECCV 2024) code
- Adapting Short-Term Transformers for Action Detection in Untrimmed Videos (CVPR 2024)
- (AdaTAD) End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames (CVPR 2024) code
- Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding (ECCV 2024) code
- (TE-TAD) TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression (CVPR 2024) code
- (ADI-Diff) Action Detection via an Image Diffusion Process (CVPR 2024)
- (DualDETR) Dual DETRs for Multi-Label Temporal Action Detection (CVPR 2024) code
- An Effective-Efficient Approach for Dense Multi-Label Action Detection (arXiv 2024)
- (Spatio-Temporal) End-to-End Spatio-Temporal Action Localisation with Video Transformers (CVPR 2024)
- (DyFADet) DyFADet: Dynamic Feature Aggregation for Temporal Action Detection (ECCV 2024) code
- (causaltad) Harnessing Temporal Causality for Advanced Temporal Action Detection (arxiv 2024) code
- (LTP) Long-Term Pre-training for Temporal Action Detection with Transformers (arxiv 2024)
- (Pred-DETR) Prediction-Feedback DETR for Temporal Action Detection (arxiv 2024)
- Introducing Gating and Context into Temporal Action Detection (ECCV W 2024)
- (ContextDet) ContextDet: Temporal Action Detection with Adaptive Context Aggregation (arXiv 2024)
2023
- (AMNet) Action-aware Masking Network with Group-based Attention for Temporal Action Localization (WACV 2023)
- (ContextLoc++) ContextLoc++: A Unified Context Model for Temporal Action Localization (TPAMI 2023)
- Temporal action detection with dynamic weights based on curriculum learning (Neurocomputing 2023)
- (GAP) Post-Processing Temporal Action Detection (CVPR 2023) code
- (TriDet) TriDet: Temporal Action Detection with Relative Boundary Modeling (CVPR 2023) code
- Temporal Action Localization with Enhanced Instant Discriminability (extend version)
- (TemporalMaxer) TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization (ArXiv 2023) code
- (DiffTAD) DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion (ICCV 2023) code
- Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection (CVPR 2023)
- Boundary-Denoising for Video Activity Localization (Arxiv 2023)
- (ASL) Action Sensitivity Learning for Temporal Action Localization (ICCV 2023)
- (MMNet) A Multi-Modal Transformer Network for Action Detection (Pattern Recognition 2023)
- Truncated attention-aware proposal networks with multi-scale dilation for temporal action detection (Pattern Recognition 2023)
- (MSST) A Multitemporal Scale and Spatial–Temporal Transformer Network for Temporal Action Localization (IEEE Transactions on Human-Machine Systems 2023)
- Exploring Action Centers for Temporal Action Localization (TMM 2023)
- (ETAD) ETAD: Training Action Detection End to End on a Laptop (CVPRW 2023) code
- (BasicTAD) BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection (CVIU 2023) code
- (Re2TAL) Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization (CVPR 2023) code
- (SoLa) Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks (CVPR 2023)
- (APN) Progression-Guided Temporal Action Detection in Videos (Arxiv 2023) code
- (Self-DETR) Self-Feedback DETR for Temporal Action Detection (ICCV 2023)
- (UnLoc) UnLoc: A Unified Framework for Video Localization Tasks (ICCV 2023) code
- Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models (ICCV 2023 Workshop)
- (BAPG) Boundary-Aware Proposal Generation Method for Temporal Action Localization (Arxiv 2023)
- (MENet) Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection (ICCV 2023)
- (MRAV-FF) Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization (Arxiv 2023)
- (BDRC-Net) Boundary Discretization and Reliable Classification Network for Temporal Action Detection (Arxiv 2023) code
- (STAN) STAN: Spatial-Temporal Awareness Network for Temporal Action Detection (ACM MM W 2023)
- (RefineTAD) RefineTAD: Learning Proposal-free Refinement for Temporal Action Detection (ACM MM 2023)
- SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization (arXiv 2023) code
2022
- (DCAN) DCAN: Improving Temporal Action Detection via Dual Context Aggregation (AAAI 2022)
- (TVNet) TVNet: Temporal Voting Network for Action Localization (arxiv 2022) code
- (ActionFormer) ActionFormer: Localizing Moments of Actions with Transformers (ECCV 2022) code
- (SegTAD)SegTAD: Precise Temporal Action Detection via Semantic Segmentation (arxiv 2022)
- (OpenTAL) OpenTAL: Towards Open Set Temporal Action Localization (CVPR 2022) code
- (TALLFormer) TALLFormer: Temporal Action Localization with Long-memory Transformer (CVPR 2022)
- An Empirical Study of End-to-End Temporal Action Detection (CVPR 2022) code
- (BREM) Estimation of Reliable Proposal Quality for Temporal Action Detection (ACM MM 2022)
- Structured Attention Composition for Temporal Action Localization (Tip 2022) code
- (RCL) RCL: Recurrent Continuous Localization for Temporal Action Detection (CVPR 2022)
- (RefactorNet) Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization (CVPR 2022)
- (MS-TCT) MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection (CVPR 2022) code
- (OATD) One-stage Action Detection Transformer (EPICKITCHENS-100 2022 V. 26.35 N. 25.83)
- Context-aware Proposal Network for Temporal Action Detection (CVPR-2022 ActivityNet Challenge winning solution)
- Dual relation network for temporal action localization (Pattern Recognition 2022)
- Learning Disentangled Classification and Localization Representations for Temporal Action Localization (AAAI 2022)
- (DDM) Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection (CVPR 2022) code
- Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach (CVPR 2022 Challenge)
- (HTNet) HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers (arxiv 2022)
- (STPT) An Efficient Spatio-Temporal Pyramid Transformer for Action Detection (ECCV 2022)
- (TAGS) Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning (ECCV 2022) code
- Prompting Visual-Language Models for Efficient Video Understanding (ECCV 2022) code
- (ReAct) ReAct: Temporal Action Detection with Relational Queries (ECCV 2022) code
- (TadTR) End-to-end Temporal Action Detection with Transformer (TIP 2022) code
- (TAL-MTS) Temporal Action Localization with Multi-temporal Scales (arxiv 2022)
- (AdaPerFormer) Adaptive Perception Transformer for Temporal Action Localization (arxiv 2022) code
- (PointTAD) PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points (NeurIPS 2022) code (multi action detection, eg: multiTHUMOS, charades)
- (SoLa) Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks (arxiv 2022)
- (Re2TAL) Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization (arxiv 2022)
- (MUPPET) Multi-Modal Few-Shot Temporal Action Detection (arxiv 2022) code
- Deep Learning-Based Action Detection in Untrimmed Videos: A Survey (TPAMI 2022)
2021
- (activity graph transformer) Activity Graph Transformer for Temporal Action Localization (arxiv 2021) project code
- Coarse-Fine Networks for Temporal Activity Detection in Videos (CVPR 2021) code
- (MLAD) Modeling Multi-Label Action Dependencies for Temporal Action Localization (CVPR 2021)
- (PcmNet) PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization (Tip 2021)
- (AFSD) Learning Salient Boundary Feature for Anchor-free Temporal Action Localization (CVPR 2021) code
- Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization (arxiv 2021)
- Read and Attend: Temporal Localisation in Sign Language Videos (CVPR 2021) (Sign Language Videos)
- Low Pass Filter for Anti-aliasing in Temporal Action Localization (arxiv 2021)
- FineAction: A Fined Video Dataset for Temporal Action Localization (One track of DeeperAction Workshop@ICCV2021) Homepage
- Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations (CVPR 2021)
- Proposal Relation Network for Temporal Action Detection (CVPRW 2021)
- Exploring Stronger Feature for Temporal Action Localization (CVPRW 2021)
- (SRF-Net) SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection (ICASSP 2021)
- RGB Stream Is Enough for Temporal Action Detection (arxiv 2021) code
- (AVFusion) Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization (arxiv 2021) Code
- Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 2021 (HACS challenge 2021)
- Enriching Local and Global Contexts for Temporal Action Localization (ICCV 2021)
- (CSA) Class Semantics-based Attention for Action Detection (ICCV 2021)
- (SP-TAD) Towards High-Quality Temporal Action Detection with Sparse Proposals (arxiv 2021) Code
- Few-Shot Temporal Action Localization with Query Adaptive Transformer (BMVC 2021) code (Few-Shot)
- Graph Convolutional Module for Temporal Action Localization in Videos (TPAMI 2021)
- MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection (arxiv 2021)
- (VSGN) Video Self-Stitching Graph Network for Temporal Action Localization (ICCV 2021) code
- (MUSES) Multi-shot Temporal Event Localization: a Benchmark (CVPR 2021) project code dataset
2020
- (G-TAD) G-TAD: Sub-Graph Localization for Temporal Action Detection (CVPR 2020) paper code.PyTorch video
- (AGCN-P-3DCNNs) Graph Attention based Proposal 3D ConvNets for Action Detection (AAAI 2020) paper
- (PBRNet) Progressive Boundary Refinement Network for Temporal Action Detection (AAAI 2020) paper
- (TsaNet) Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos (ICME 2020) paper
- Constraining Temporal Relationship for Action Localization (arxiv 2020) paper
- (CBR-Net) CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1) (ActivityNet Challenge 2020) paper
- Temporal Action Localization with Variance-Aware Networks (arxiv 2020)
- Boundary Uncertainty in a Single-Stage Temporal Action Localization Network (arxiv 2020, Tech report)
- Revisiting Anchor Mechanisms for Temporal Action Localization (Tip 2020) code.PyTorch
- (C-TCN) Deep Concept-wise Temporal Convolutional Networks for Action Localization (ACM MM 2020) code.PaddlePaddle
- (MLTPN) Multi-Level Temporal Pyramid Network for Action Detection (PRCV 2020)
- (SALAD) SALAD: Self-Assessment Learning for Action Detection (arxiv 2020)
2019
- (CMS-RC3D) Contextual Multi-Scale Region Convolutional 3D Network for Activity Detection (ICCVBIC 2019) paper
- (TGM) Temporal Gaussian Mixture Layer for Videos (ICML 2019) paper code.PyTorch
- (Decouple-SSAD) Decoupling Localization and Classification in Single Shot Temporal Action Detection (ICME 2019) paper code.TensorFlow
- Exploring Feature Representation and Training strategies in Temporal Action Localization (ICIP 2019) paper
- (PGCN) Graph Convolutional Networks for Temporal Action Localization (ICCV 2019) paper code.PyTorch
- (S-2D-TAN) Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization (ICCV 2019) (winner solution for the HACS Temporal Action Localization Challenge at ICCV 2019) paper
- (2D-TAN) Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language (AAAI 2020) paper code.PyTorch
- (LCDC) Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection (ICCV 2019) paper slide code.TensorFlow
- (BLP) BLP -- Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization (ICASSP 2019) paper
- (GTAN) Gaussian Temporal Awareness Networks for Action Localization (CVPR 2019) paper
- Temporal Action Localization using Long Short-Term Dependency (arxiv 2019) paper
- Relation Attention for Temporal Action Localization (IEEE Trans TMM 2019) paper
- (AFO-TAD) AFO-TAD: Anchor-free One-Stage Detector for Temporal Action Detection (arxiv 2019) paper
- (DBS) Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos (AAAI 2019) paper
2018
- Diagnosing Error in Temporal Action Detectors (ECCV 2018) paper
- (ETP) Precise Temporal Action Localization by Evolving Temporal Proposals (ICMR 2018) paper
- (Action Search) Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization (ECCV 2018) paper code.TensorFlow
- (TAL-Net) Rethinking the Faster R-CNN Architecture for Temporal Action Localization (CVPR 2018) paper
- One-shot Action Localization by Learning Sequence Matching Network (CVPR 2018) paper
- Temporal Action Detection by Joint Identification-Verification (arxiv 2018) paper
- (TPC) Exploring Temporal Preservation Networks for Precise Temporal Action Localization (AAAI 2018) paper
- (SAP) A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning (AAAI 2018) paper code.Torch
2017
- (TCN) Temporal Context Network for Activity Localization in Videos (ICCV 2017) paper code.caffe
- (SSN) Temporal Action Detection with Structured Segment Networks (ICCV 2017) paper code.PyTorch
- (R-C3D) R-C3D: Region Convolutional 3D Network for Temporal Activity Detection (ICCV 2017) paper code.caffe code.PyTorch
- (TCNs) Temporal Convolutional Networks for Action Segmentation and Detection (CVPR 2017) paper code.TensorFlow
- (SMS) Temporal Action Localization by Structured Maximal Sums (CVPR 2017) paper code
- (SCC) SCC: Semantic Context Cascade for Efficient Action Detection (CVPR 2017) paper
- (CDC) CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos (CVPR 2017) paper code project
- (SS-TAD) End-to-End, Single-Stream Temporal ActionDetection in Untrimmed Videos (BMVC 2017) paper code.PyTorch
- (CBR) Cascaded Boundary Regression for Temporal Action Detection (BMVC 2017) paper code.TensorFlow
- (SSAD) Single Shot Temporal Action Detection (ACM MM 2017) paper
before
- (PSDF) Temporal Action Localization with Pyramid of Score Distribution Features (CVPR 2016) paper
- Temporal Action Detection using a Statistical Language Model (CVPR 2016) paper code
- (S-CNN) Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs (CVPR 2016) paper code project
- End-to-end Learning of Action Detection from Frame Glimpses in Videos (CVPR 2016) paper code
Papers: Weakly Supervised Temporal Action Detection
2024
- Weakly-Supervised Temporal Action Localization by Inferring Snippet-Feature Affinity (AAAI 2024)
- (HR-Pro) HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation (AAAI 2024) code
- STAT: Towards Generalizable Temporal Action Localization (Arxiv 2024)
- (TSPNet) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization (CVPR 2024) code
- (M2PT) Weakly-Supervised Temporal Action Localization with Multi-Modal Plateau Transformers (CVPR Workshop 2024)
- (EPNet) Ensemble Prototype Network For Weakly Supervised Temporal Action Localization (TNNLS 2024)
- (FuSTAL) Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action Localization (arXiv 2024) code
- (PVLR) Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization (ACM MM 2024) code
- (zero-shot) Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization (ICPR 2024) code
- (SMBD) Stepwise Multi-grained Boundary Detector for Point-supervised Temporal Action Localization (ECCV 2024)
- Zero-shot Action Localization via the Confidence of Large Vision-Language Models (arXiv 2024)
- Can MLLMs Guide Weakly-Supervised Temporal Action Localization Tasks? (arXiv 2024)
2023
- (ASCN) A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization (TMM 2023)
- (TFE-DCN) Temporal Feature Enhancement Dilated Convolution Network for Weakly-Supervised Temporal Action Localization (WACV 2023)
- (JCDNet) JCDNet: Joint of Common and Definite phases Network for Weakly Supervised Temporal Action Localization (Arxiv 2023)
- (P-MIL) Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization (CVPR 2023) code
- Two-Stream Networks for Weakly-Supervised Temporal Action Localization With Semantic-Aware Mechanisms (CVPR 2023)
- Boosting Weakly-Supervised Temporal Action Localization with Text Information (CVPR 2023) code
- (PivoTAL) PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization (CVPR 2023)
- Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels (CVPR 2023) code
- (MTP) Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization (TOMM 2023)
- (VQK-Net) Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
- (DFE) Dual-Feature Enhancement for Weakly Supervised Temporal Action Localization (ICASSP 2023)
- (FBA-Net) Collaborative Foreground, Background, and Action Modeling Network for Weakly Supervised Temporal Action Localization (TCSVT 2023)
- (Bi-SCC) Weakly Supervised Temporal Action Localization With Bidirectional Semantic Consistency Constraint (TNNLS 2023)
- (F3-Net) Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization (TMM 2023) code
- (LPR) Learning Proposal-aware Re-ranking for Weakly-supervised Temporal Action Localization (TCSVT 2023)
- (STCL-Net) Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization (TPAMI 2023)
- Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization (CVPR 2023)
- Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling (ICCV 2023)
- Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization (TCSVT 2023)
- (SPL-Loc) Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization (arXiv 2023)
- (DDG-Net) DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization (ICCV 2023) code
- Proposal-based Temporal Action Localization with Point-level Supervision (BMVC 2023)
- (LPR) LPR: learning point-level temporal action localization through re-training (MMSJ 2023)
- (POTLoc) POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization (arXiv 2023)
- (ADM-Loc) ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization (arXiv 2023)
- Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach (ICCV 2023) code
- Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization (arXiv 2023)
2022
- (ACGNet) ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization (AAAI 2022)
- (RSKP) Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation (CVPR 2022) code
- (ASM-Loc) ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization (CVPR 2022) code
- (FTCL) Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization (CVPR 2022) code
- (C3BN) Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization (arxiv 2022)
- (DCC) Exploring Denoised Cross-video Contrast for Weakly-supervised Temporal Action Localization (CVPR 2022)
- (HAAN) Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions (ECCV 2022) code
- (STALE) (Zero-Shot) Zero-Shot Temporal Action Detection via Vision-Language Prompting (ECCV 2022) code
- (SMEN) Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization (TCSVT 2022)
- Dilation-Erosion for Single-Frame Supervised Temporal Action Localization (arxiv 2022)
- (AMS) Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization (TMM 2022)
- (DELU) Dual-Evidential Learning for Weakly-supervised Temporal Action Localization (ECCV 2022) code
2021
- (HAM-Net) A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization. (AAAI 2021)
- Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization (ICLR 2021)
- Weakly-supervised Temporal Action Localization by Uncertainty Modeling (AAAI 2021) code
- (TS-PCA) The Blessings of Unlabeled Background in Untrimmed Videos (CVPR 2021) code
- (ACSNet) ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization (AAAI 2021)
- (CoLA) CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning (CVPR 2021)
- Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context (AAAI 2021)
- ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization (arxiv 2021, submitted to Tip) code
- (AUMN) Action Unit Memory Network for Weakly Supervised Temporal Action Localization (CVPR 2021)
- (ASL) Weakly Supervised Action Selection Learning in Video (CVPR 2021)
- (ActShufNet) Action Shuffling for Weakly Supervised Temporal Localization (arxiv 2021)
- Few-Shot Action Localization without Knowing Boundaries (arxiv 2021)
- Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection (CVPR 2021)
- Two-Stream Consensus Network: Submission to HACS Challenge 2021Weakly-Supervised Learning Track (CVPRW 2021)
- Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling (CVPRW 2021)
- Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization (ACM MM 2021) code
- Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization (ICCV 2021) code
- Deep Motion Prior for Weakly-Supervised Temporal Action Localization (submit to Tip 2021) project
- Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization (ICCV 2021)
- (BackTAL) Background-Click Supervision for Temporal Action Localization (TPAMI 2021) code
- (ACN) Action Coherence Network for Weakly-Supervised Temporal Action Localization (TMM 2021)
2020
- (WSGN) Weakly Supervised Gaussian Networks for Action Detection (WACV 2020) paper
- Weakly Supervised Temporal Action Localization Using Deep Metric Learning (WACV 2020) paper
- Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks (WACV 2020) paper
- (DGAM) Weakly-Supervised Action Localization by Generative Attention Modeling (CVPR 2020) paper code.PyTorch
- (EM-MIL) Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning (ECCV 2020) paper
- Relational Prototypical Network for Weakly Supervised Temporal ActionLocalization (AAAI 2020) paper
- (BaS-Net) Background Suppression Networkfor Weakly-supervised Temporal Action Localization (AAAI 2020) paper code.PyTorch
- Background Modeling via Uncertainty Estimation for Weakly-supervised Action Localization (arxiv 2020) paper code.PyTorch
- (A2CL-PT) Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization (ECCV 2020) paper code.PyTorch
- Weakly Supervised Temporal Action Localization with Segment-Level Labels (arxiv 2020)
- (ECM) Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization (arxiv 2020 -> TPAMI 2022) paper
- Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization (ECCV 2020 spotlight)
- Learning Temporal Co-Attention Models for Unsupervised Video Action Localization (CVPR 2020)
- Action Completeness Modeling with Background Aware Networks for Weakly-Supervised Temporal Action Localization (ACM MM 2020)
- (D2-Net) D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddingsand Denoised Activations (arxiv 2020) (THUMOS'14 mAP@0.5: 35.9)
- (SF-Net) SF-Net: Single-Frame Supervision for Temporal Action Localization (ECCV 2020) code.PyTorch
- Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses (arxiv 2020)
- Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection (TMM 2020)
- ActionBytes: Learning From Trimmed Videos to Localize Actions (CVPR 2020)
2019
- (AdapNet) AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization (IEEE Transactions on Neural Networks and Learning Systems) paper
- Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization (IEEE Transactions on Image Processing) paper
- Weakly-Supervised Temporal Localization via Occurrence Count Learning (ICML 2019) paper code.TensorFlow
- (MAAN) Marginalized Average Attentional Network for Weakly-Supervised Learning (ICLR 2019) paper code.PyTorch
- Weakly-supervised Action Localization with Background Modeling (ICCV 2019) paper
- (TSM) Temporal Structure Mining for Weakly Supervised Action Detection (ICCV 2019) paper
- (CleanNet) Weakly Supervised Temporal Action Localization through Contrast basedEvaluation Networks (ICCV 2019) paper
- (3C-Net) 3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization (ICCV 2019) paper code.PyTorch
- (CMCS) Completeness Modeling and Context Separation for Weakly SupervisedTemporal Action Localization (CVPR 2019) paper code.PyTorch
- (RefineLoc) RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization (arxiv 2019) paper homepage
- (ASSG) Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization (ACM MM 2019) paper
- (TSRNet) Learning Transferable Self-attentive Representations for Action Recognition in Untrimmed Videos with Weak Supervision (AAAI 2019) paper
- (STAR) Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection (AAAI 2019) paper
2018
- Weakly Supervised Temporal Action Detection with Shot-Based Temporal Pooling Network (ICONIP 2018)
- (W-TALC) W-TALC: Weakly-supervised Temporal Activity Localization and Classification (ECCV 2018) code.PyTorch
- (AutoLoc) AutoLoc: Weakly-supervised Temporal Action Localization (ECCV 2018) code
- (STPN) Weakly Supervised Action Localization by Sparse Temporal Pooling Network (CVPR 2018) code
- Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector (ACM MM 2018)
- (CPMN) Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization (accv 2018)
2017
- (Hide-and-Seek) Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization (ICCV 2017)
- (UntrimmedNets) UntrimmedNets for Weakly Supervised Action Recognition and Detection (CVPR 2017) code
Papers: Online Action Detection
2024
- (JOADAA) JOADAA: joint online action detection and action anticipation (WACV 2024)
- Object Aware Egocentric Online Action Detection (CVPRW 2024)
- ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos (ECCV 2024)
- (MATR) Online Temporal Action Localization with Memory-Augmented Transformer (ECCV 2024) code
- (HAT) HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization (ECCV 2024) code
2023
- (recognation) (GliTr) GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction (WACV 2023)
- (E2E-LOAD) E2E-LOAD: End-to-End Long-form Online Action Detection (ICCV 2023) code
- (MiniROAD) MiniROAD: Minimal RNN Framework for Online Action Detection (ICCV 2023) code
- (MAT) Memory-and-Anticipation Transformer for Online Action Understanding (ICCV 2023) code
- Online Action Detection with Learning Future Representations by Contrastive Learning (ICME 2023)
- (HCM) HCM: Online Action Detection With Hard Video Clip Mining (TMM 2023)
- (DFAformer) DFAformer: A Dual Filtering Auxiliary Transformer for Efficient Online Action Detection in Streaming Videos (PRCV 2023)
2022
- (Colar) Colar: Effective and Efficient Online Action Detection by Consulting Exemplars (CVPR 2022) code
- (GateHUB) GateHUB: Gated History Unit with Background Suppression for Online Action Detection (CVPR 2022)
- A Circular Window-based Cascade Transformer for Online Action Detection (TPAMI 2022)
- (TeSTra) Real-time Online Video Detection with Temporal Smoothing Transformers (ECCV 2022) code
- (SimOn) SimOn: A Simple Framework for Online Temporal Action Localization (arxiv 2022) code
- (survey) Online human action detection and anticipation in videos: A survey
- Uncertainty-Based Spatial-Temporal Attention for Online Action Detection (ECCV 2022)
- (PPKD) Privileged Knowledge Distillation for Online Action Detection (PRhttps://arxiv.org/abs/2011.09158 2022)
2021
- (WOAD) WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos (CVPR 2021)
- (OadTR) OadTR: Online Action Detection with Transformers (ICCV 2021) code
- (LSTR) Long Short-Term Transformer for Online Action Detection (NeurIPS 2021) code
- pre awesome
Semi-Supervised
2024
- (APL) Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization (ECCV 2024)
2023
- (NPL) Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization (ICCV 2023) code
2022
- (AL-STAL) Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization (Displays 2022)
- (SPOT) Semi-Supervised Temporal Action Detection with Proposal-Free Masking (ECCV 2022) code
2021
- (SSTAP) Self-Supervised Learning for Semi-Supervised Temporal Action Proposal (CVPR 2021) code
- Temporal Action Detection with Multi-level Supervision (ICCV 2021) code
- (KFC) KFC: An Efficient Framework for Semi-Supervised Temporal Action Localization (Tip 2021)
2019
- Learning Temporal Action Proposals With Fewer Labels (ICCV 2019)
- (TTC-Loc) Towards Train-Test Consistency for Semi-supervised Temporal Action Localization (arxiv 2019)
Open-Vocabulary Temporal Action Detection
2024
- One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features (FG 2024)
- Open-Vocabulary Temporal Action Localization using Multimodal Guidance (arXiv 2024)
- (OV-TAL) Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization (arXiv 2024) code
- Open-vocabulary Temporal Action Localization using VLMs (arXiv 2024)
- (OV-OAD) Does Video-Text Pretraining Help Open-Vocabulary Online Action Detection? (NeurIPS 2024)
2023
- (CELL) Cascade Evidential Learning for Open-World Weakly-Supervised Temporal Action Localization (CVPR 2023)
- (OW-TAL) OW-TAL: Learning Unknown Human Activities for Open-World Temporal Action Localization (PR 2023)
2022
- Open-Vocabulary Temporal Action Detection with Off-the-Shelf ImageText Features (arxiv 2022)
- (OpenTAL) OpenTAL: Towards Open Set Temporal Action Localization (CVPR 2022) code