llama.cpp based quantized gguf inference
This space hosts the Granite4 model family from 350m up to 32b. Select the model of your choice in the additional inputs section below.
This space hosts the Granite4 model family from 350m up to 32b. Select the model of your choice in the additional inputs section below.