Qwen3 32B FP8

Qwen3 32B with FP8 quantization.

qwen3-32b-fp8
STABLE
Model Deactivated
40,960-token context
Starting at $0.10/M input tokens
Starting at $0.45/M output tokens
Streaming
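
As a rough illustration of how the per-token prices above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the constants are hypothetical helpers, not part of any LLM Gateway API. It assumes the listed "starting at" rates apply and ignores cached-token pricing, which the listing does not state.

```python
# Listed "starting at" prices, in USD per million tokens.
INPUT_PRICE_PER_M = 0.10
OUTPUT_PRICE_PER_M = 0.45


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes flat per-token pricing at the listed rates; the actual
    price may differ by provider.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # 30,000 prompt tokens and 2,000 completion tokens.
    print(f"${estimate_cost(30_000, 2_000):.4f}")
```

Under these assumptions, a request with 30,000 input tokens and 2,000 output tokens costs roughly $0.0039, with about three quarters of that coming from the input side.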


All Providers for Qwen3 32B FP8

LLM Gateway routes each request to the best available provider that can handle your prompt size and parameters.

NovitaAI
Context: 41.0k
Deactivated since Jun 5, 2026
Input: $0.10/M tokens
Cached: not listed
Output: $0.45/M tokens