LLM Models
LLM Models
| Model Name | Model ID | Capabilities | Max Context |
|---|---|---|---|
| Qwen3.5-397B-A17B | qwen35-397b-a17b-instruct-general | Text, Code, OCR/Vision | 256K tokens |
| Qwen3.5-397B-A17B Creative | qwen35-397b-a17b-instruct-creative | Same as above — sampling preset for creativity | 256K tokens |
| Qwen3.5-397B-A17B Thinking | qwen35-397b-a17b-thinking-general | Same as above — thinking mode enabled | 256K tokens |
| Qwen3.5-397B-A17B Thinking Coder | qwen35-397b-a17b-thinking-coding | Same as Thinking — sampling preset for code | 256K tokens |
Sampling parameter presets
The Creative and Thinking Coder variants enforce specific sampling parameters on the backend. If you need full control over sampling, use the base instruct-general or thinking-general models instead.