llama-3.2-1b-instruct

Publisher: mlx-community
Architecture: llama
Parameters: 1B
Quantization: 4bit (4.0 bpw)
Format: mlx
Max Context: 131k tokens
Type: llm
File Size: 679.6 MB
Est. RAM: 2.2 GB
Est. VRAM: 2.2 GB
Capabilities: trained_for_tool_use
