Hardware Platforms
xLLM supports multiple accelerator backends for large-scale model inference. This section collects the hardware-specific entry points for environment setup, runtime device selection, launch scripts, and model support.
Platform Guides
- NVIDIA GPU - CUDA backend setup and launch entry points.
- Ascend NPU - Ascend NPU setup, runtime environment, and HCCL launch notes.
- Cambricon MLU - MLU backend setup and launch entry points.
- Hygon DCU - Hygon DCU backend setup and launch entry points.
Common Workflow
- Prepare the platform-specific container image using the explicit commands in each platform guide.
- Build xLLM inside the container, or use a release image that already includes xllm.
- Start the service with the matching device backend in Launch xllm.
- Check model and modality coverage in the Model Support List.
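
Before following the workflow above, it can help to confirm which platform guide applies to the current host. The following is a minimal sketch, not part of xLLM: it assumes each vendor's usual management utility (`nvidia-smi`, `npu-smi`, `cnmon`, `hy-smi`) is on `PATH` when the corresponding driver stack is installed. The mapping and the `detect_platforms` helper are hypothetical names introduced for illustration only.

```python
import shutil
from typing import Callable, List, Optional

# Assumption: these vendor monitoring tools are installed with the driver stack.
# They are not xLLM commands or flags; the mapping is illustrative only.
VENDOR_TOOLS = {
    "nvidia-smi": "NVIDIA GPU",
    "npu-smi": "Ascend NPU",
    "cnmon": "Cambricon MLU",
    "hy-smi": "Hygon DCU",
}


def detect_platforms(
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return the platform guide names whose vendor tool is found on PATH."""
    return [platform for tool, platform in VENDOR_TOOLS.items() if which(tool)]


if __name__ == "__main__":
    found = detect_platforms()
    if found:
        print("Matching platform guide(s):", ", ".join(found))
    else:
        print("No known accelerator tool found on PATH; pick a guide manually.")
```

The `which` parameter is injectable so the lookup can be exercised without real hardware. A missing tool does not prove the accelerator is absent, so treat the result as a hint for choosing a guide rather than a definitive check.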