Clinical Camel

⚠️ Upcoming Code Release ⚠️

The code for model inference, training, evaluation, and DBKE (Dialogue-Based Knowledge Encoding) is being prepared and will be released soon. Please stay tuned for updates!

Model Description

Clinical Camel is a transformer-based language model built on the LLaMA 13B architecture. It is designed as a research-focused medical conversational model.

Access the live demo. This corresponds to an earlier version of the Clinical Camel model.

Model Conversion

Clinical Camel is distributed as delta weights. Apply them to the original LLaMA-13B weights with the apply_delta script from FastChat to obtain the Clinical Camel model.

The delta weights can be found at this Hugging Face link.
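As a rough illustration of what applying delta weights means, the sketch below adds each delta tensor to the matching base tensor by parameter name. It is a toy stand-in that uses plain Python lists, not FastChat's actual implementation (which lives in the `fastchat.model.apply_delta` module and operates on real checkpoints). The function name, parameter names, and values here are hypothetical.

```python
def apply_delta(base_weights, delta_weights):
    """Return target weights where target[name] = base[name] + delta[name].

    Both arguments map parameter names to flat lists of floats. This is a
    toy representation chosen for illustration, not a real checkpoint format.
    """
    missing = set(delta_weights) - set(base_weights)
    if missing:
        raise KeyError(f"base model lacks parameters: {sorted(missing)}")

    target = {}
    for name, delta in delta_weights.items():
        base = base_weights[name]
        if len(base) != len(delta):
            raise ValueError(f"shape mismatch for {name}")
        # Element-wise sum recovers the fine-tuned parameter.
        target[name] = [b + d for b, d in zip(base, delta)]
    return target


# Toy values, not real model weights.
base = {"layer.weight": [0.5, -1.0, 2.0], "layer.bias": [0.0]}
delta = {"layer.weight": [0.25, 0.5, -2.0], "layer.bias": [0.125]}
target = apply_delta(base, delta)
print(target)
```

Because only the difference from the base model is shipped, the original LLaMA-13B weights are still required to reconstruct the full model.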

Data

The model was trained on a diverse dataset which includes:

Training

The training code and inference model are based on FastChat. We would like to extend our gratitude to the developers of FastChat for making their code available for use.

The model was trained for 2 epochs. The specific training parameters and configurations used for the Clinical Camel model are detailed below:

| Parameter | Value |
| --- | --- |
| Learning Rate | 2E-5 |
| Batch Size | 4 |
| Epochs | 2 |
| Optimizer | AdamW (Torch) |
| Max Gradient Norm | 1 |
| Weight Decay | 0 |
| Warm-up Steps | 0 |
| Warm-up Ratio | 0.03 |
| Gradient Accumulation Steps | 8 |
| Per Device Training Batch Size | 4 |
| Maximum Sequence Length | 2048 |
| Learning Rate Scheduler | Cosine |
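To make the interaction between these settings concrete, here is a minimal sketch of a cosine schedule with a ratio-based linear warm-up, plus the effective batch size per optimizer step. It assumes Hugging Face Trainer-style semantics, where a warm-up step count of 0 means the warm-up ratio is used; the README does not confirm this. The total step count and device count are hypothetical, since neither is stated here.

```python
import math

# Values taken from the table above.
PEAK_LR = 2e-5
WARMUP_RATIO = 0.03
PER_DEVICE_BATCH = 4
GRAD_ACCUM_STEPS = 8


def effective_batch_size(num_devices):
    """Sequences contributing to one optimizer step.

    num_devices is an assumption; the device count is not given.
    """
    return PER_DEVICE_BATCH * GRAD_ACCUM_STEPS * num_devices


def lr_at_step(step, total_steps):
    """Linear warm-up to PEAK_LR, then cosine decay towards zero."""
    warmup_steps = math.ceil(total_steps * WARMUP_RATIO)
    if step < warmup_steps:
        return PEAK_LR * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return PEAK_LR * 0.5 * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    total = 1000  # hypothetical number of optimizer steps
    for step in (0, 15, 30, 500, 1000):
        print(step, lr_at_step(step, total))
    print(effective_batch_size(num_devices=1))
```

With a per-device batch of 4 and 8 accumulation steps, each device contributes 32 sequences per optimizer step; the overall figure scales with however many devices were used.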

Model Comparison

The Clinical Camel model was compared to several other popular models in a variety of benchmarks. These include USMLE self-assessment scores, performance on multi-step management problems, and responses to standardized safety questions.

USMLE Self-Assessment Scores

In the USMLE Self-Assessment, Clinical Camel achieved the highest score on Step 1 and Step 3, while GPT-3.5 scored highest on Step 2.

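| Exam | GPT-3.5 (%) | Clinical Camel (%) | Chat Doctor (%) | PMC-LLaMA | MedAlpaca 13b (%) | Vicuna (%) |
| --- | --- | --- | --- | --- | --- | --- |
| Step 1 | 36.1 | 53.2 | 11.7 | 1.1 | 12.2 | 21.3 |
| Step 2 | 56.9 | 51.4 | 18.5 | 2.7 | 27.5 | 20.4 |
| Step 3 | 55.7 | 58.2 | 14.8 | 3.3 | 26.2 | 30.3 |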

Multi-Step Management Problems

On the multi-step management problems, Clinical Camel scored second to GPT-3.5 on both CFPC sets and ahead of the other models compared.

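| Benchmark | GPT-3.5 (%) | Clinical Camel (%) | Chat Doctor (%) | PMC-LLaMA | MedAlpaca 13b (%) | Vicuna (%) |
| --- | --- | --- | --- | --- | --- | --- |
| CFPC EM | 81.3 | 74.9 | 53.2 | 13.9 | 31.2 | 68.6 |
| CFPC FM | 85.0 | 82.1 | 67.0 | 14.3 | 44.7 | 74.8 |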

Publication

The work behind the Clinical Camel model has been published on arXiv. See the paper for a fuller account of the model and its performance.

Future Updates

We are preparing the remainder of the code used in this project for release. Please check back for additional updates and resources. Your patience is appreciated as we work to provide a comprehensive and usable repository.

License

This project is licensed under AGPL-3.0. Please see the LICENSE file for more details.