Home

Awesome

:dart: ProxyDet

official implementation for "ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection"

<p align="center"> <img src='.assets/teaser.png' align="center" height="300px"> </p>

Joonhyun Jeong<sup>1,2</sup>, Geondo Park<sup>2</sup>, Jayeon Yoo<sup>3</sup>, Hyungsik Jung<sup>1</sup>, Heesu Kim<sup>1</sup><br>

<sup>1</sup> <sub>NAVER Cloud, ImageVision</sub><br /> <sup>2</sup> <sub>KAIST</sub><br /> <sup>3</sup> <sub>Seoul National University</sub><br />

Paper Youtube Project Page

:hammer_and_wrench: Install

See install instructions.

:clapper: Demo

python3 demo.py \
    --config-file configs/ProxyDet_R50_Lbase_INL.yaml \
    --input .assets/desk.jpg \
    --output out.jpg \
    --vocabulary custom \
    --custom_vocabulary headphone,webcam,paper,coffe \
    --confidence-threshold 0.3 \
    --zeroshot_weight_path datasets/metadata/lvis_v1_clip_a+cname.npy \
    --opts MODEL.WEIGHTS models/proxydet_r50_w_inl.pth
<img src=".assets/out.jpg" alt="output" >

:fire: Training

See training instructions.

:chart_with_upwards_trend: evaluation

See evaluation instructions.

:newspaper_roll: News

:pushpin: TODO

:link: Citation

If you find that this project helps your research, please consider citing as below:

@article{jeong2023proxydet,
  title={ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open Vocabulary Object Detection},
  author={Jeong, Joonhyun and Park, Geondo and Yoo, Jayeon and Jung, Hyungsik and Kim, Heesu},
  journal={arXiv preprint arXiv:2312.07266},
  year={2023}
}

License

ProxyDet
Copyright (c) 2024-present NAVER Cloud Corp.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.