Awesome
Unofficial Repo for LLaVA-Reasoner-DPO
This is an unofficial repo for the paper: Improve Vision Language Model Chain-of-thought Reasoning
Release
- [10.22] we will provide third party implementation for arxiv paper
setup
# setup environment, need to fill in the required fields
source setup/setup_env.sh
# data
source setup/setup_train_data.sh
sft
cd llava_reasoner
bash scripts_sft/sft_direct+cot_preview.sh \
$SAVE_DIR/sft/llava_reasoner_sft_preview