Awesome

<div align="center"> <img width="550px" height="200px" src="https://github.com/FL33TW00D/ratchet/raw/master/.github/ratchet.png"> <p><a href="https://huggingface.co/spaces/FL33TW00D-HF/ratchet-whisper">Demo Site</a> | <a href="https://discord.gg/XFe33KQTG4">Discord</a> | <a href="https://github.com/users/FL33TW00D/projects/3">Roadmap</a></p> <p align="center"> A web-first, cross-platform ML developer toolkit </p> <br> </div>

We are on a mission to bring fast, cross platform GPU accelerated inference on native + browser.

[!NOTE]
Ratchet is currently in active development. We are working on the engine, adding more models and improving compatibility. Please, reach out if you'd like to help!

Getting Started

The easiest way to experience Ratchet is to check out our Hugging Face spaces:

Whisper
Phi

To dig deeper, check out the examples

We welcome contributions from the community. If you have any ideas or suggestions, please feel free to open an issue or pull request.

Javascript

// Asynchronous loading & caching with IndexedDB
let model = await Model.load(AvailableModels.WHISPER_TINY, Quantization.Q8, (p: number) => setProgress(p))
let result = await model.run({ input });

Rust

Rust crate & CLI coming soon...

Philosophy

We want a toolkit for developers to make integrating performant AI functionality into existing production applications easy. The following principles will help us accomplish this:

Inference only
WebGPU/CPU only
First class quantization support
Lazy computation
Inplace by default

Supported Models

Whisper
Phi 2 & 3
Moondream

Upcoming Models

Gemini 2 2B