Home

Awesome

.github/workflows/rust.yml test Crates.io Version Crates.io Total Downloads docs.rs GitHub commit activity Matrix

alt text

valentinus

next generation vector db built with lmdb bindings

dependencies

getting started

git clone https://github.com/kn0sys/valentinus && cd valentinus

optional environment variables

varusagedefault
LMDB_USERworking directory of the user for database$USER
LMDB_MAP_SIZESets max environment size, i.e. size in memory/disk of all data20% of available memory
ONNX_PARALLEL_THREADSparallel execution mode for this session1
VALENTINUS_CUSTOM_DIMembeddings dimensions for custom modelsall-mini-lm-6 -> 384
VALENTINUS_LMDB_ENVenvironment for the database (i.e. test, prod)test

tests

mkdir all-MiniLM-L6-v2_onnx
cd all-MiniLM-L6-v2_onnx && wget https://huggingface.co/nigel-christian/all-MiniLM-L6-v2_onnx/resolve/main/config.json
wget https://huggingface.co/nigel-christian/all-MiniLM-L6-v2_onnx/resolve/main/model.onnx
wget https://huggingface.co/nigel-christian/all-MiniLM-L6-v2_onnx/resolve/main/special_tokens_map.json
wget https://huggingface.co/nigel-christian/all-MiniLM-L6-v2_onnx/resolve/main/tokenizer_config.json
wget https://huggingface.co/nigel-christian/all-MiniLM-L6-v2_onnx/resolve/main/tokenizer.json
wget https://huggingface.co/nigel-christian/all-MiniLM-L6-v2_onnx/resolve/main/vocab.txt

RUST_TEST_THREADS=1 cargo test

examples

see examples

reference

inspired by this chromadb python tutorial