Unofficial Repo for LLaVA-Reasoner-DPO

This is an unofficial repo for the paper: Improve Vision Language Model Chain-of-thought Reasoning

Release

setup

# Set up the environment; fill in the required fields first
source setup/setup_env.sh

# Set up the training data
source setup/setup_train_data.sh

sft

cd llava_reasoner
bash scripts_sft/sft_direct+cot_preview.sh \
$SAVE_DIR/sft/llava_reasoner_sft_preview
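
The sft command above writes to a path under $SAVE_DIR, so an empty variable would silently resolve to /sft/llava_reasoner_sft_preview. A minimal pre-flight sketch follows. It assumes SAVE_DIR is one of the fields exported by setup/setup_env.sh, which the repo does not confirm here; the function name check_save_dir is hypothetical and not part of the repo's scripts.

```shell
# Hypothetical helper, not part of the repo.
# Assumption: SAVE_DIR is exported by setup/setup_env.sh.
check_save_dir() {
    # Fail early if SAVE_DIR is unset or empty.
    if [ -z "${SAVE_DIR:-}" ]; then
        echo "SAVE_DIR is not set; source setup/setup_env.sh first" >&2
        return 1
    fi
    # Create the parent directory for the sft checkpoint if it is missing.
    mkdir -p "$SAVE_DIR/sft"
}
```

One way to use it is to run check_save_dir before the training script and only start training if it succeeds, for example: check_save_dir && bash scripts_sft/sft_direct+cot_preview.sh "$SAVE_DIR/sft/llava_reasoner_sft_preview"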