One model for both halves of RAG retrieval; a strong default per size. Contact baa.ai for the optimal pick for your corpus.
AI & ML interests
Model Quantization
Recent Activity
View all activity
Organization Card
Smaller. Smarter. Sovereign.
Making frontier models run anywhere
We publish high-quality quantized models for Apple Silicon and GGUF. Our models use a proprietary optimisation method that delivers superior quality at your target memory budget.
Browse our models, or connect with us below.
models 84
baa-ai/GLM-5.2-RAM-307GB-GGUF
Text Generation • Updated
baa-ai/GLM-5.2-RAM-333GB-MLX
Updated
baa-ai/Merino-Pro-4bit
Sentence Similarity • Updated • 54
baa-ai/Merino-Pro
Sentence Similarity • Updated • 61
baa-ai/Merino-Nano
Sentence Similarity • Updated • 24
baa-ai/Merino-XL-v2
Sentence Similarity • Updated • 24
baa-ai/Merino-XL
Sentence Similarity • Updated • 24
baa-ai/Merino-Large-v2
Sentence Similarity • Updated • 23
baa-ai/Merino-Large
Sentence Similarity • Updated • 22
baa-ai/Merino-Small
Sentence Similarity • Updated • 25 • 1
datasets 0
None public yet