granite-4.0-h-small-mlx

granite-4.0-h-small-mlx

Publisher
lmstudio-community
Architecture
granitemoehybrid(sparse)
Quantization
4bit (4.0 bpw)
Format
mlx
Max Context
131k tokens
Type
llm
File Size
16.89 GB
Est. RAM
18.4 GB
Est. VRAM
18.4 GB
Capabilities
trained_for_tool_use

Benchmark Results

Compare Models