Model Tutorials
This section provides tutorials for running specific models with vLLM Ascend.
- Qwen2.5-Omni-7B
- Qwen2.5-7B
- Qwen3-Dense (Qwen3-0.6B/8B/32B)
- Qwen-VL-Dense (Qwen2.5-VL-3B/7B, Qwen3-VL-2B/4B/8B/32B)
- Qwen3-30B-A3B
- Qwen3-235B-A22B
- Qwen3-VL-30B-A3B-Instruct
- Qwen3-VL-235B-A22B-Instruct
- Qwen3-Coder-30B-A3B
- Qwen3-Embedding
- Qwen3-VL-Embedding
- Qwen3-Reranker
- Qwen3-VL-Reranker
- Qwen3-8B-W4A8
- Qwen3-32B-W4A4
- Qwen3-Next
- Qwen3-Omni-30B-A3B-Thinking
- DeepSeek-V3/3.1
- DeepSeek-V3.2
- DeepSeek-R1
- GLM-4.5/4.6/4.7
- GLM-5
- Kimi-K2-Thinking
- PaddleOCR-VL