Awesome

<!-- * @Author: fzy * @Date: 2020-03-09 21:53:10 * @LastEditors: Zhenying * @LastEditTime: 2020-12-03 18:58:12 * @Description: -->

Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation

Temporal Action Detection & Weakly Supervised & Semi Supervised Temporal Action Detection & Temporal Action Proposal Generation & Open-Vocabulary Temporal Action Detection


Contents

<!-- TOC -->

About pretrained models

  1. (BSP) Boundary-sensitive Pre-training for Temporal Localization in Videos (ICCV 2021)
  2. (TSP) TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks (ICCVW 2021)
  3. (UP-TAL) Unsupervised Pre-training for Temporal Action Localization Tasks (CVPR 2022) code
  4. Contrastive Language-Action Pre-training for Temporal Localization (arxiv 2022)
  5. Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization (NeurIPS 2021)

ActivityNet Challenge and talks

  1. (2021) ActivityNet 2021
  2. (2021) Applications of Transformers in Temporal Action Detection & Semi-Supervised Temporal Action Detection Based on Self-Supervised Learning (DAMO Academy, Alibaba Group)

Papers: Temporal Action Proposal Generation

2023

  1. (MIFNet) MIFNet: Multiple Instances Focused Temporal Action Proposal Generation (Neurocomputing 2023)
  2. (SMBG) Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator (arxiv 2023) code
  3. (MCBD) Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation (TIP 2023) code

2022

  1. (BCNet) Temporal Action Proposal Generation with Background Constraint (AAAI 2022)
  2. (PRSA-Net) Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation (BMVC 2022) code
  3. (TDN) Modeling long-term video semantic distribution for temporal action proposal generation (Neurocomputing 2022)
  4. (AOE-Net) AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation (IJCV 2022)

2021

  1. (BSN++) BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation (AAAI 2021) Author's Zhihu
  2. (RTD-Net) Relaxed Transformer Decoders for Direct Action Proposal Generation (ICCV 2021) code Zhihu
  3. (TCANet) Temporal Context Aggregation Network for Temporal Action Proposal Refinement (CVPR 2021) Zhihu
  4. Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation (arxiv 2021)
  5. (TAPG) Temporal Action Proposal Generation with Transformers (arxiv 2021)
  6. (AEN) Agent-Environment Network for Temporal Action Proposal Generation (ICASSP 2021)
  7. (AEI) AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation (BMVC 2021) code

2020

  1. VALSE talk by Tianwei Lin (2020.03.18) link (7y8g)
  2. (RapNet) Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network (AAAI 2020) pre-paper 2019 ActivityNet task-1 2nd
  3. (DBG) Fast Learning of Temporal Action Proposal via Dense Boundary Generator (AAAI 2020) paper code.TensorFlow
  4. (BC-GNN) Boundary Content Graph Neural Network for Temporal Action Proposal Generation (ECCV 2020) paper
  5. Bottom-Up Temporal Action Localization with Mutual Regularization (ECCV 2020) code.TensorFlow
  6. (TSI) TSI: Temporal Scale Invariant Network for Action Proposal Generation (ACCV 2020)

2019

  1. (SRG) SRG: Snippet Relatedness-based Temporal Action Proposal Generator (IEEE Trans 2019) paper
  2. (DPP) Deep Point-wise Prediction for Action Temporal Proposal (ICONIP 2019) paper code.PyTorch
  3. (BMN) BMN: Boundary-Matching Network for Temporal Action Proposal Generation (ICCV 2019) paper code.PaddlePaddle code.PyTorch_unofficial
  4. (MGG) Multi-granularity Generator for Temporal Action Proposal (CVPR 2019) paper
  5. Investigation on Combining 3D Convolution of Image Data and Optical Flow to Generate Temporal Action Proposals (2019 CVPR Workshop) paper
  6. (CMSN) CMSN: Continuous Multi-stage Network and Variable Margin Cosine Loss for Temporal Action Proposal Generation (arxiv 2019) paper
  7. A high performance computing method for accelerating temporal action proposal generation (arxiv 2019) paper
  8. Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2 (ActivityNet Challenge 2019) paper
  9. Joint Learning of Local and Global Context for Temporal Action Proposal Generation (TCSVT 2019)

2018

  1. (CTAP) CTAP: Complementary Temporal Action Proposal Generation (ECCV 2018) paper code.TensorFlow
  2. (BSN) BSN: Boundary Sensitive Network for Temporal Action Proposal Generation (ECCV 2018) paper code.TensorFlow code.PyTorch
  3. (SAP) SAP: Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning (AAAI 2018) paper code.Torch

2017

  1. (TURN TAP) TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals (ICCV 2017) paper code.TensorFlow
  2. (SST) SST: Single-Stream Temporal Action Proposals (CVPR 2017) paper code.theano code.TensorFlow
  3. YoTube: Searching Action Proposal via Recurrent and Static Regression Networks (IEEE Trans 2017) paper
  4. A Pursuit of Temporal Accuracy in General Activity Detection (arxiv 2017) paper code.PyTorch

before

  1. (DAPs) DAPs: Deep Action Proposals for Action Understanding (ECCV 2016) paper code

Papers: Temporal Action Detection

2024

  1. (DenoiseLoc) Boundary Denoising for Video Activity Localization (ICLR 2024) code
  2. (LITA) LITA: Language Instructed Temporal-Localization Assistant (arXiv 2024) code
  3. (PLOT-TAL) (few-shot) PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization (Arxiv 2024)
  4. Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions (CVPR 2024) code
  5. (zero-shot) (T3AL) Test-Time Zero-Shot Temporal Action Localization (CVPR 2024) code
  6. (UniMD) UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection (ECCV 2024) code
  7. Adapting Short-Term Transformers for Action Detection in Untrimmed Videos (CVPR 2024)
  8. (AdaTAD) End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames (CVPR 2024) code
  9. Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding (ECCV 2024) code
  10. (TE-TAD) TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression (CVPR 2024) code
  11. (ADI-Diff) Action Detection via an Image Diffusion Process (CVPR 2024)
  12. (DualDETR) Dual DETRs for Multi-Label Temporal Action Detection (CVPR 2024) code
  13. An Effective-Efficient Approach for Dense Multi-Label Action Detection (arXiv 2024)
  14. (Spatio-Temporal) End-to-End Spatio-Temporal Action Localisation with Video Transformers (CVPR 2024)
  15. (DyFADet) DyFADet: Dynamic Feature Aggregation for Temporal Action Detection (ECCV 2024) code
  16. (causaltad) Harnessing Temporal Causality for Advanced Temporal Action Detection (arxiv 2024) code
  17. (LTP) Long-Term Pre-training for Temporal Action Detection with Transformers (arxiv 2024)
  18. (Pred-DETR) Prediction-Feedback DETR for Temporal Action Detection (arxiv 2024)
  19. Introducing Gating and Context into Temporal Action Detection (ECCV W 2024)
  20. (ContextDet) ContextDet: Temporal Action Detection with Adaptive Context Aggregation (arXiv 2024)

2023

  1. (AMNet) Action-aware Masking Network with Group-based Attention for Temporal Action Localization (WACV 2023)
  2. (ContextLoc++) ContextLoc++: A Unified Context Model for Temporal Action Localization (TPAMI 2023)
  3. Temporal action detection with dynamic weights based on curriculum learning (Neurocomputing 2023)
  4. (GAP) Post-Processing Temporal Action Detection (CVPR 2023) code
  5. (TriDet) TriDet: Temporal Action Detection with Relative Boundary Modeling (CVPR 2023) code
  6. (TemporalMaxer) TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization (ArXiv 2023) code
  7. (DiffTAD) DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion (ICCV 2023) code
  8. Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection (CVPR 2023)
  9. Boundary-Denoising for Video Activity Localization (Arxiv 2023)
  10. (ASL) Action Sensitivity Learning for Temporal Action Localization (ICCV 2023)
  11. (MMNet) A Multi-Modal Transformer Network for Action Detection (Pattern Recognition 2023)
  12. Truncated attention-aware proposal networks with multi-scale dilation for temporal action detection (Pattern Recognition 2023)
  13. (MSST) A Multitemporal Scale and Spatial–Temporal Transformer Network for Temporal Action Localization (IEEE Transactions on Human-Machine Systems 2023)
  14. Exploring Action Centers for Temporal Action Localization (TMM 2023)
  15. (ETAD) ETAD: Training Action Detection End to End on a Laptop (CVPRW 2023) code
  16. (BasicTAD) BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection (CVIU 2023) code
  17. (Re2TAL) Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization (CVPR 2023) code
  18. (SoLa) Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks (CVPR 2023)
  19. (APN) Progression-Guided Temporal Action Detection in Videos (Arxiv 2023) code
  20. (Self-DETR) Self-Feedback DETR for Temporal Action Detection (ICCV 2023)
  21. (UnLoc) UnLoc: A Unified Framework for Video Localization Tasks (ICCV 2023) code
  22. Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models (ICCV 2023 Workshop)
  23. (BAPG) Boundary-Aware Proposal Generation Method for Temporal Action Localization (Arxiv 2023)
  24. (MENet) Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection (ICCV 2023)
  25. (MRAV-FF) Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization (Arxiv 2023)
  26. (BDRC-Net) Boundary Discretization and Reliable Classification Network for Temporal Action Detection (Arxiv 2023) code
  27. (STAN) STAN: Spatial-Temporal Awareness Network for Temporal Action Detection (ACM MM W 2023)
  28. (RefineTAD) RefineTAD: Learning Proposal-free Refinement for Temporal Action Detection (ACM MM 2023)
  29. SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization (arXiv 2023) code

2022

  1. (DCAN) DCAN: Improving Temporal Action Detection via Dual Context Aggregation (AAAI 2022)
  2. (TVNet) TVNet: Temporal Voting Network for Action Localization (arxiv 2022) code
  3. (ActionFormer) ActionFormer: Localizing Moments of Actions with Transformers (ECCV 2022) code
  4. (SegTAD) SegTAD: Precise Temporal Action Detection via Semantic Segmentation (arxiv 2022)
  5. (OpenTAL) OpenTAL: Towards Open Set Temporal Action Localization (CVPR 2022) code
  6. (TALLFormer) TALLFormer: Temporal Action Localization with Long-memory Transformer (CVPR 2022)
  7. An Empirical Study of End-to-End Temporal Action Detection (CVPR 2022) code
  8. (BREM) Estimation of Reliable Proposal Quality for Temporal Action Detection (ACM MM 2022)
  9. Structured Attention Composition for Temporal Action Localization (TIP 2022) code
  10. (RCL) RCL: Recurrent Continuous Localization for Temporal Action Detection (CVPR 2022)
  11. (RefactorNet) Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization (CVPR 2022)
  12. (MS-TCT) MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection (CVPR 2022) code
  13. (OATD) One-stage Action Detection Transformer (EPICKITCHENS-100 2022 V. 26.35 N. 25.83)
  14. Context-aware Proposal Network for Temporal Action Detection (CVPR-2022 ActivityNet Challenge winning solution)
  15. Dual relation network for temporal action localization (Pattern Recognition 2022)
  16. Learning Disentangled Classification and Localization Representations for Temporal Action Localization (AAAI 2022)
  17. (DDM) Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection (CVPR 2022) code
  18. Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach (CVPR 2022 Challenge)
  19. (HTNet) HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers (arxiv 2022)
  20. (STPT) An Efficient Spatio-Temporal Pyramid Transformer for Action Detection (ECCV 2022)
  21. (TAGS) Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning (ECCV 2022) code
  22. Prompting Visual-Language Models for Efficient Video Understanding (ECCV 2022) code
  23. (ReAct) ReAct: Temporal Action Detection with Relational Queries (ECCV 2022) code
  24. (TadTR) End-to-end Temporal Action Detection with Transformer (TIP 2022) code
  25. (TAL-MTS) Temporal Action Localization with Multi-temporal Scales (arxiv 2022)
  26. (AdaPerFormer) Adaptive Perception Transformer for Temporal Action Localization (arxiv 2022) code
  27. (PointTAD) PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points (NeurIPS 2022) code (multi-label action detection, e.g., MultiTHUMOS, Charades)
  28. (SoLa) Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks (arxiv 2022)
  29. (Re2TAL) Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization (arxiv 2022)
  30. (MUPPET) Multi-Modal Few-Shot Temporal Action Detection (arxiv 2022) code
  31. Deep Learning-Based Action Detection in Untrimmed Videos: A Survey (TPAMI 2022)

2021

  1. (activity graph transformer) Activity Graph Transformer for Temporal Action Localization (arxiv 2021) project code
  2. Coarse-Fine Networks for Temporal Activity Detection in Videos (CVPR 2021) code
  3. (MLAD) Modeling Multi-Label Action Dependencies for Temporal Action Localization (CVPR 2021)
  4. (PcmNet) PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization (TIP 2021)
  5. (AFSD) Learning Salient Boundary Feature for Anchor-free Temporal Action Localization (CVPR 2021) code
  6. Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization (arxiv 2021)
  7. Read and Attend: Temporal Localisation in Sign Language Videos (CVPR 2021) (Sign Language Videos)
  8. Low Pass Filter for Anti-aliasing in Temporal Action Localization (arxiv 2021)
  9. FineAction: A Fine-Grained Video Dataset for Temporal Action Localization (One track of DeeperAction Workshop@ICCV2021) Homepage
  10. Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations (CVPR 2021)
  11. Proposal Relation Network for Temporal Action Detection (CVPRW 2021)
  12. Exploring Stronger Feature for Temporal Action Localization (CVPRW 2021)
  13. (SRF-Net) SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection (ICASSP 2021)
  14. RGB Stream Is Enough for Temporal Action Detection (arxiv 2021) code
  15. (AVFusion) Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization (arxiv 2021) Code
  16. Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 2021 (HACS challenge 2021)
  17. Enriching Local and Global Contexts for Temporal Action Localization (ICCV 2021)
  18. (CSA) Class Semantics-based Attention for Action Detection (ICCV 2021)
  19. (SP-TAD) Towards High-Quality Temporal Action Detection with Sparse Proposals (arxiv 2021) Code
  20. Few-Shot Temporal Action Localization with Query Adaptive Transformer (BMVC 2021) code (Few-Shot)
  21. Graph Convolutional Module for Temporal Action Localization in Videos (TPAMI 2021)
  22. MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection (arxiv 2021)
  23. (VSGN) Video Self-Stitching Graph Network for Temporal Action Localization (ICCV 2021) code
  24. (MUSES) Multi-shot Temporal Event Localization: a Benchmark (CVPR 2021) project code dataset

2020

  1. (G-TAD) G-TAD: Sub-Graph Localization for Temporal Action Detection (CVPR 2020) paper code.PyTorch video
  2. (AGCN-P-3DCNNs) Graph Attention based Proposal 3D ConvNets for Action Detection (AAAI 2020) paper
  3. (PBRNet) Progressive Boundary Refinement Network for Temporal Action Detection (AAAI 2020) paper
  4. (TsaNet) Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos (ICME 2020) paper
  5. Constraining Temporal Relationship for Action Localization (arxiv 2020) paper
  6. (CBR-Net) CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1) (ActivityNet Challenge 2020) paper
  7. Temporal Action Localization with Variance-Aware Networks (arxiv 2020)
  8. Boundary Uncertainty in a Single-Stage Temporal Action Localization Network (arxiv 2020, Tech report)
  9. Revisiting Anchor Mechanisms for Temporal Action Localization (TIP 2020) code.PyTorch
  10. (C-TCN) Deep Concept-wise Temporal Convolutional Networks for Action Localization (ACM MM 2020) code.PaddlePaddle
  11. (MLTPN) Multi-Level Temporal Pyramid Network for Action Detection (PRCV 2020)
  12. (SALAD) SALAD: Self-Assessment Learning for Action Detection (arxiv 2020)

2019

  1. (CMS-RC3D) Contextual Multi-Scale Region Convolutional 3D Network for Activity Detection (ICCVBIC 2019) paper
  2. (TGM) Temporal Gaussian Mixture Layer for Videos (ICML 2019) paper code.PyTorch
  3. (Decouple-SSAD) Decoupling Localization and Classification in Single Shot Temporal Action Detection (ICME 2019) paper code.TensorFlow
  4. Exploring Feature Representation and Training strategies in Temporal Action Localization (ICIP 2019) paper
  5. (PGCN) Graph Convolutional Networks for Temporal Action Localization (ICCV 2019) paper code.PyTorch
  6. (S-2D-TAN) Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization (ICCV 2019) (winner solution for the HACS Temporal Action Localization Challenge at ICCV 2019) paper
    • (2D-TAN) Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language (AAAI 2020) paper code.PyTorch
  7. (LCDC) Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection (ICCV 2019) paper slide code.TensorFlow
  8. (BLP) BLP -- Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization (ICASSP 2019) paper
  9. (GTAN) Gaussian Temporal Awareness Networks for Action Localization (CVPR 2019) paper
  10. Temporal Action Localization using Long Short-Term Dependency (arxiv 2019) paper
  11. Relation Attention for Temporal Action Localization (IEEE Trans TMM 2019) paper
  12. (AFO-TAD) AFO-TAD: Anchor-free One-Stage Detector for Temporal Action Detection (arxiv 2019) paper
  13. (DBS) Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos (AAAI 2019) paper

2018

  1. Diagnosing Error in Temporal Action Detectors (ECCV 2018) paper
  2. (ETP) Precise Temporal Action Localization by Evolving Temporal Proposals (ICMR 2018) paper
  3. (Action Search) Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization (ECCV 2018) paper code.TensorFlow
  4. (TAL-Net) Rethinking the Faster R-CNN Architecture for Temporal Action Localization (CVPR 2018) paper
  5. One-shot Action Localization by Learning Sequence Matching Network (CVPR 2018) paper
  6. Temporal Action Detection by Joint Identification-Verification (arxiv 2018) paper
  7. (TPC) Exploring Temporal Preservation Networks for Precise Temporal Action Localization (AAAI 2018) paper
  8. (SAP) A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning (AAAI 2018) paper code.Torch

2017

  1. (TCN) Temporal Context Network for Activity Localization in Videos (ICCV 2017) paper code.caffe
  2. (SSN) Temporal Action Detection with Structured Segment Networks (ICCV 2017) paper code.PyTorch
  3. (R-C3D) R-C3D: Region Convolutional 3D Network for Temporal Activity Detection (ICCV 2017) paper code.caffe code.PyTorch
  4. (TCNs) Temporal Convolutional Networks for Action Segmentation and Detection (CVPR 2017) paper code.TensorFlow
  5. (SMS) Temporal Action Localization by Structured Maximal Sums (CVPR 2017) paper code
  6. (SCC) SCC: Semantic Context Cascade for Efficient Action Detection (CVPR 2017) paper
  7. (CDC) CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos (CVPR 2017) paper code project
  8. (SS-TAD) End-to-End, Single-Stream Temporal Action Detection in Untrimmed Videos (BMVC 2017) paper code.PyTorch
  9. (CBR) Cascaded Boundary Regression for Temporal Action Detection (BMVC 2017) paper code.TensorFlow
  10. (SSAD) Single Shot Temporal Action Detection (ACM MM 2017) paper

before

  1. (PSDF) Temporal Action Localization with Pyramid of Score Distribution Features (CVPR 2016) paper
  2. Temporal Action Detection using a Statistical Language Model (CVPR 2016) paper code
  3. (S-CNN) Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs (CVPR 2016) paper code project
  4. End-to-end Learning of Action Detection from Frame Glimpses in Videos (CVPR 2016) paper code

Papers: Weakly Supervised Temporal Action Detection

2024

  1. Weakly-Supervised Temporal Action Localization by Inferring Snippet-Feature Affinity (AAAI 2024)
  2. (HR-Pro) HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation (AAAI 2024) code
  3. STAT: Towards Generalizable Temporal Action Localization (Arxiv 2024)
  4. (TSPNet) Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization (CVPR 2024) code
  5. (M2PT) Weakly-Supervised Temporal Action Localization with Multi-Modal Plateau Transformers (CVPR Workshop 2024)
  6. (EPNet) Ensemble Prototype Network For Weakly Supervised Temporal Action Localization (TNNLS 2024)
  7. (FuSTAL) Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action Localization (arXiv 2024) code
  8. (PVLR) Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization (ACM MM 2024) code
  9. (zero-shot) Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization (ICPR 2024) code
  10. (SMBD) Stepwise Multi-grained Boundary Detector for Point-supervised Temporal Action Localization (ECCV 2024)
  11. Zero-shot Action Localization via the Confidence of Large Vision-Language Models (arXiv 2024)
  12. Can MLLMs Guide Weakly-Supervised Temporal Action Localization Tasks? (arXiv 2024)

2023

  1. (ASCN) A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization (TMM 2023)
  2. (TFE-DCN) Temporal Feature Enhancement Dilated Convolution Network for Weakly-Supervised Temporal Action Localization (WACV 2023)
  3. (JCDNet) JCDNet: Joint of Common and Definite phases Network for Weakly Supervised Temporal Action Localization (Arxiv 2023)
  4. (P-MIL) Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization (CVPR 2023) code
  5. Two-Stream Networks for Weakly-Supervised Temporal Action Localization With Semantic-Aware Mechanisms (CVPR 2023)
  6. Boosting Weakly-Supervised Temporal Action Localization with Text Information (CVPR 2023) code
  7. (PivoTAL) PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization (CVPR 2023)
  8. Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels (CVPR 2023) code
  9. (MTP) Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization (TOMM 2023)
  10. (VQK-Net) Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
  11. (DFE) Dual-Feature Enhancement for Weakly Supervised Temporal Action Localization (ICASSP 2023)
  12. (FBA-Net) Collaborative Foreground, Background, and Action Modeling Network for Weakly Supervised Temporal Action Localization (TCSVT 2023)
  13. (Bi-SCC) Weakly Supervised Temporal Action Localization With Bidirectional Semantic Consistency Constraint (TNNLS 2023)
  14. (F3-Net) Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization (TMM 2023) code
  15. (LPR) Learning Proposal-aware Re-ranking for Weakly-supervised Temporal Action Localization (TCSVT 2023)
  16. (STCL-Net) Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization (TPAMI 2023)
  17. Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization (CVPR 2023)
  18. Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling (ICCV 2023)
  19. Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization (TCSVT 2023)
  20. (SPL-Loc) Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization (arXiv 2023)
  21. (DDG-Net) DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization (ICCV 2023) code
  22. Proposal-based Temporal Action Localization with Point-level Supervision (BMVC 2023)
  23. (LPR) LPR: learning point-level temporal action localization through re-training (MMSJ 2023)
  24. (POTLoc) POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization (arXiv 2023)
  25. (ADM-Loc) ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization (arXiv 2023)
  26. Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach (ICCV 2023) code

2022

  1. (ACGNet) ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization (AAAI 2022)
  2. (RSKP) Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation (CVPR 2022) code
  3. (ASM-Loc) ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization (CVPR 2022) code
  4. (FTCL) Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization (CVPR 2022) code
  5. (C3BN) Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization (arxiv 2022)
  6. (DCC) Exploring Denoised Cross-video Contrast for Weakly-supervised Temporal Action Localization (CVPR 2022)
  7. (HAAN) Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions (ECCV 2022) code
  8. (STALE) (Zero-Shot) Zero-Shot Temporal Action Detection via Vision-Language Prompting (ECCV 2022) code
  9. (SMEN) Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization (TCSVT 2022)
  10. Dilation-Erosion for Single-Frame Supervised Temporal Action Localization (arxiv 2022)
  11. (AMS) Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization (TMM 2022)
  12. (DELU) Dual-Evidential Learning for Weakly-supervised Temporal Action Localization (ECCV 2022) code

2021

  1. (HAM-Net) A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization (AAAI 2021)
  2. Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization (ICLR 2021)
  3. Weakly-supervised Temporal Action Localization by Uncertainty Modeling (AAAI 2021) code
  4. (TS-PCA) The Blessings of Unlabeled Background in Untrimmed Videos (CVPR 2021) code
  5. (ACSNet) ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization (AAAI 2021)
  6. (CoLA) CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning (CVPR 2021)
  7. Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context (AAAI 2021)
  8. ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization (arxiv 2021, submitted to TIP) code
  9. (AUMN) Action Unit Memory Network for Weakly Supervised Temporal Action Localization (CVPR 2021)
  10. (ASL) Weakly Supervised Action Selection Learning in Video (CVPR 2021)
  11. (ActShufNet) Action Shuffling for Weakly Supervised Temporal Localization (arxiv 2021)
  12. Few-Shot Action Localization without Knowing Boundaries (arxiv 2021)
  13. Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection (CVPR 2021)
  14. Two-Stream Consensus Network: Submission to HACS Challenge 2021 Weakly-Supervised Learning Track (CVPRW 2021)
  15. Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling (CVPRW 2021)
  16. Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization (ACM MM 2021) code
  17. Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization (ICCV 2021) code
  18. Deep Motion Prior for Weakly-Supervised Temporal Action Localization (submitted to TIP 2021) project
  19. Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization (ICCV 2021)
  20. (BackTAL) Background-Click Supervision for Temporal Action Localization (TPAMI 2021) code
  21. (ACN) Action Coherence Network for Weakly-Supervised Temporal Action Localization (TMM 2021)

2020

  1. (WSGN) Weakly Supervised Gaussian Networks for Action Detection (WACV 2020) paper
  2. Weakly Supervised Temporal Action Localization Using Deep Metric Learning (WACV 2020) paper
  3. Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks (WACV 2020) paper
  4. (DGAM) Weakly-Supervised Action Localization by Generative Attention Modeling (CVPR 2020) paper code.PyTorch
  5. (EM-MIL) Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning (ECCV 2020) paper
  6. Relational Prototypical Network for Weakly Supervised Temporal Action Localization (AAAI 2020) paper
  7. (BaS-Net) Background Suppression Network for Weakly-supervised Temporal Action Localization (AAAI 2020) paper code.PyTorch
  8. Background Modeling via Uncertainty Estimation for Weakly-supervised Action Localization (arxiv 2020) paper code.PyTorch
  9. (A2CL-PT) Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization (ECCV 2020) paper code.PyTorch
  10. Weakly Supervised Temporal Action Localization with Segment-Level Labels (arxiv 2020)
  11. (ECM) Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization (arxiv 2020 -> TPAMI 2022) paper
  12. Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization (ECCV 2020 spotlight)
  13. Learning Temporal Co-Attention Models for Unsupervised Video Action Localization (CVPR 2020)
  14. Action Completeness Modeling with Background Aware Networks for Weakly-Supervised Temporal Action Localization (ACM MM 2020)
  15. (D2-Net) D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations (arxiv 2020) (THUMOS'14 mAP@0.5: 35.9)
  16. (SF-Net) SF-Net: Single-Frame Supervision for Temporal Action Localization (ECCV 2020) code.PyTorch
  17. Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses (arxiv 2020)
  18. Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection (TMM 2020)
  19. ActionBytes: Learning From Trimmed Videos to Localize Actions (CVPR 2020)

2019

  1. (AdapNet) AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization (IEEE Transactions on Neural Networks and Learning Systems) paper
  2. Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization (IEEE Transactions on Image Processing) paper
  3. Weakly-Supervised Temporal Localization via Occurrence Count Learning (ICML 2019) paper code.TensorFlow
  4. (MAAN) Marginalized Average Attentional Network for Weakly-Supervised Learning (ICLR 2019) paper code.PyTorch
  5. Weakly-supervised Action Localization with Background Modeling (ICCV 2019) paper
  6. (TSM) Temporal Structure Mining for Weakly Supervised Action Detection (ICCV 2019) paper
  7. (CleanNet) Weakly Supervised Temporal Action Localization through Contrast based Evaluation Networks (ICCV 2019) paper
  8. (3C-Net) 3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization (ICCV 2019) paper code.PyTorch
  9. (CMCS) Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization (CVPR 2019) paper code.PyTorch
  10. (RefineLoc) RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization (arxiv 2019) paper homepage
  11. (ASSG) Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization (ACM MM 2019) paper
  12. (TSRNet) Learning Transferable Self-attentive Representations for Action Recognition in Untrimmed Videos with Weak Supervision (AAAI 2019) paper
  13. (STAR) Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection (AAAI 2019) paper

2018

  1. Weakly Supervised Temporal Action Detection with Shot-Based Temporal Pooling Network (ICONIP 2018)
  2. (W-TALC) W-TALC: Weakly-supervised Temporal Activity Localization and Classification (ECCV 2018) code.PyTorch
  3. (AutoLoc) AutoLoc: Weakly-supervised Temporal Action Localization (ECCV 2018) code
  4. (STPN) Weakly Supervised Action Localization by Sparse Temporal Pooling Network (CVPR 2018) code
  5. Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector (ACM MM 2018)
  6. (CPMN) Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization (ACCV 2018)

2017

  1. (Hide-and-Seek) Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization (ICCV 2017)
  2. (UntrimmedNets) UntrimmedNets for Weakly Supervised Action Recognition and Detection (CVPR 2017) code

Papers: Online Action Detection

2024

  1. (JOADAA) JOADAA: joint online action detection and action anticipation (WACV 2024)
  2. Object Aware Egocentric Online Action Detection (CVPRW 2024)
  3. ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos (ECCV 2024)
  4. (MATR) Online Temporal Action Localization with Memory-Augmented Transformer (ECCV 2024) code
  5. (HAT) HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization (ECCV 2024) code

2023

  1. (recognition) (GliTr) GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction (WACV 2023)
  2. (E2E-LOAD) E2E-LOAD: End-to-End Long-form Online Action Detection (ICCV 2023) code
  3. (MiniROAD) MiniROAD: Minimal RNN Framework for Online Action Detection (ICCV 2023) code
  4. (MAT) Memory-and-Anticipation Transformer for Online Action Understanding (ICCV 2023) code
  5. Online Action Detection with Learning Future Representations by Contrastive Learning (ICME 2023)
  6. (HCM) HCM: Online Action Detection With Hard Video Clip Mining (TMM 2023)
  7. (DFAformer) DFAformer: A Dual Filtering Auxiliary Transformer for Efficient Online Action Detection in Streaming Videos (PRCV 2023)

2022

  1. (Colar) Colar: Effective and Efficient Online Action Detection by Consulting Exemplars (CVPR 2022) code
  2. (GateHUB) GateHUB: Gated History Unit with Background Suppression for Online Action Detection (CVPR 2022)
  3. A Circular Window-based Cascade Transformer for Online Action Detection (TPAMI 2022)
  4. (TeSTra) Real-time Online Video Detection with Temporal Smoothing Transformers (ECCV 2022) code
  5. (SimOn) SimOn: A Simple Framework for Online Temporal Action Localization (arxiv 2022) code
  6. (survey) Online human action detection and anticipation in videos: A survey
  7. Uncertainty-Based Spatial-Temporal Attention for Online Action Detection (ECCV 2022)
  8. (PPKD) Privileged Knowledge Distillation for Online Action Detection (PR 2022) https://arxiv.org/abs/2011.09158

2021

  1. (WOAD) WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos (CVPR 2021)
  2. (OadTR) OadTR: Online Action Detection with Transformers (ICCV 2021) code
  3. (LSTR) Long Short-Term Transformer for Online Action Detection (NeurIPS 2021) code
  4. pre awesome

Semi-Supervised

2024

  1. (APL) Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization (ECCV 2024)

2023

  1. (NPL) Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization (ICCV 2023) code

2022

  1. (AL-STAL) Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization (Displays 2022)
  2. (SPOT) Semi-Supervised Temporal Action Detection with Proposal-Free Masking (ECCV 2022) code

2021

  1. (SSTAP) Self-Supervised Learning for Semi-Supervised Temporal Action Proposal (CVPR 2021) code
  2. Temporal Action Detection with Multi-level Supervision (ICCV 2021) code
  3. (KFC) KFC: An Efficient Framework for Semi-Supervised Temporal Action Localization (TIP 2021)

2019

  1. Learning Temporal Action Proposals With Fewer Labels (ICCV 2019)
  2. (TTC-Loc) Towards Train-Test Consistency for Semi-supervised Temporal Action Localization (arxiv 2019)

Open-Vocabulary Temporal Action Detection

2024

  1. One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features (FG 2024)
  2. Open-Vocabulary Temporal Action Localization using Multimodal Guidance (arXiv 2024)
  3. (OV-TAL) Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization (arXiv 2024) code
  4. Open-vocabulary Temporal Action Localization using VLMs (arXiv 2024)
  5. (OV-OAD) Does Video-Text Pretraining Help Open-Vocabulary Online Action Detection? (NeurIPS 2024)

2023

  1. (CELL) Cascade Evidential Learning for Open-World Weakly-Supervised Temporal Action Localization (CVPR 2023)
  2. (OW-TAL) OW-TAL: Learning Unknown Human Activities for Open-World Temporal Action Localization (PR 2023)

2022

  1. Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features (arxiv 2022)
  2. (OpenTAL) OpenTAL: Towards Open Set Temporal Action Localization (CVPR 2022) code