Awesome
GVT: Good Visual Tokenizer for LLMs
This repo contains assets in our paper What makes for Good Visual Tokenizers for Large Language Models?
Model
Comming Soon
GVTBench
We provide the Object Counting (OC) and Multi-Class Identification (MCI) on MS-COCO and VCR datasets in GVTBench.