Rise VAST: AI Compute Management Platform
Unified Management · Full Observability: manage heterogeneous compute assets with complete clarity
Platform Overview
Supported Chips
Unified Collection: Heterogeneous Compute Management
RiseUnion and 4Paradigm have formed a strategic partnership to deliver Rise VAST (HAMi Enterprise Edition), an enterprise-class AI compute pooling platform that combines HAMi scheduling with 4Paradigm's AI platform expertise.
Unified management across geographies and architectures (x86/c86/ARM) with flexible shared and dedicated pool combinations
NVIDIA, Ascend, Hygon, Cambricon, Iluvatar, KunlunXin, Enflame, MetaX, Moore Threads, and more: 10+ vendors under one framework
GPU virtualization and isolation through the Kubernetes Device Plugin mechanism. Compute limiting and VRAM isolation are transparent to workloads and require no application code changes
Shared and dedicated compute pools with tenant and project-based quota allocation. On-demand resource request, use, and release
vGPU management for edge nodes, compatible with both CAMP and EDGE cloud-native deployments, with a standalone deployment option
Web UI, REST API, and MCP protocol access, abstracting away underlying heterogeneous complexity
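As a rough illustration of the device-plugin approach described above, the sketch below builds a Pod manifest that asks for a slice of a GPU purely through resource limits, with no change to the application itself. The resource names (nvidia.com/gpu, nvidia.com/gpumem, nvidia.com/gpucores) follow the open-source HAMi project's conventions for NVIDIA devices; whether Rise VAST exposes exactly the same names is an assumption, and the helper function, Pod name, and image are hypothetical.

```python
import json


def vgpu_pod_manifest(name, image, gpu_count=1, vram_mib=4096, core_percent=30):
    """Build a Pod manifest that requests a fraction of a GPU via resource limits."""
    if not 0 < core_percent <= 100:
        raise ValueError("core_percent must be in (0, 100]")
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "containers": [{
                "name": name,
                "image": image,
                "resources": {"limits": {
                    # Resource names assumed from open-source HAMi (NVIDIA devices).
                    "nvidia.com/gpu": gpu_count,         # number of vGPUs
                    "nvidia.com/gpumem": vram_mib,       # VRAM cap in MiB
                    "nvidia.com/gpucores": core_percent,  # compute cap in percent
                }},
            }],
        },
    }


# Hypothetical workload name and image.
manifest = vgpu_pod_manifest("infer-demo", "example.com/infer:latest")
print(json.dumps(manifest, indent=2))
```

The container image stays unchanged; only the scheduling request differs, which is what "transparent to workloads" amounts to in this sketch.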
Multi-cluster Multi-tenant Unified Management
Unified management of heterogeneous GPU clusters across regions and architectures, with LAN-based inter-cluster coordination and 6 unifications covering the full pipeline
Unified management across Beijing, Inner Mongolia, and other sites over a 100G interconnect
x86 / c86 / ARM mixed deployment with edge node vGPU management
Flexible shared and dedicated pool combinations with tenant/project quota allocation
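One way to picture shared and dedicated pools with tenant quotas is the minimal sketch below. It is not the platform's implementation: the class names, the unit of allocation, and the admission rules (pool ownership, tenant quota, pool capacity) are all assumptions chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Pool:
    capacity: int                       # total GPU units in the pool
    dedicated_to: Optional[str] = None  # owning tenant, or None for a shared pool
    used: int = 0


@dataclass
class QuotaManager:
    pools: Dict[str, Pool]
    tenant_quota: Dict[str, int]        # tenant -> maximum units across all pools
    tenant_used: Dict[str, int] = field(default_factory=dict)

    def request(self, tenant: str, pool_name: str, units: int) -> bool:
        pool = self.pools[pool_name]
        used = self.tenant_used.get(tenant, 0)
        if pool.dedicated_to not in (None, tenant):
            return False  # dedicated pool owned by another tenant
        if used + units > self.tenant_quota.get(tenant, 0):
            return False  # would exceed the tenant quota
        if pool.used + units > pool.capacity:
            return False  # pool has no room left
        pool.used += units
        self.tenant_used[tenant] = used + units
        return True

    def release(self, tenant: str, pool_name: str, units: int) -> None:
        self.pools[pool_name].used -= units
        self.tenant_used[tenant] = self.tenant_used.get(tenant, 0) - units


mgr = QuotaManager(
    pools={"shared": Pool(capacity=8),
           "team-a-only": Pool(capacity=4, dedicated_to="team-a")},
    tenant_quota={"team-a": 6, "team-b": 2},
)
print(mgr.request("team-a", "team-a-only", 4))  # granted
print(mgr.request("team-b", "team-a-only", 1))  # rejected: pool is dedicated
print(mgr.request("team-a", "shared", 3))       # rejected: 4 + 3 exceeds quota of 6
mgr.release("team-a", "team-a-only", 2)
print(mgr.request("team-a", "shared", 3))       # granted after the release
```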
Full Observability & Intelligent O&M
Full-stack Observability
Microscope-level visibility from the GPU device layer to the K8s task layer, and from physical resources to tenant quotas. Collects the per-GPU SM compute time-slice ratio, VRAM time-sharing utilization, and inter-board communication traffic. Automatically builds a three-tier Pod → GPU → compute-unit mapping so that utilization can be pinpointed within seconds.
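The three-tier mapping can be pictured as a nested lookup from Pod to GPU to compute unit. The sketch below is only an illustration under stated assumptions: the sample tuples, the unit names, and the use of a simple average as the roll-up are hypothetical, not the platform's data model.

```python
from collections import defaultdict

# Hypothetical samples: (pod, gpu, compute_unit, utilization in percent).
samples = [
    ("train-0", "GPU-a", "sm-slice-0", 90),
    ("train-0", "GPU-a", "sm-slice-1", 70),
    ("infer-1", "GPU-a", "sm-slice-2", 10),
    ("infer-1", "GPU-b", "sm-slice-0", 20),
]


def build_mapping(rows):
    """Build the Pod -> GPU -> compute-unit tree from flat samples."""
    tree = defaultdict(lambda: defaultdict(dict))
    for pod, gpu, unit, util in rows:
        tree[pod][gpu][unit] = util
    return tree


def pod_gpu_utilization(tree):
    """Average utilization per (pod, gpu) pair across its compute units."""
    return {
        (pod, gpu): sum(units.values()) / len(units)
        for pod, gpus in tree.items()
        for gpu, units in gpus.items()
    }


def least_utilized(tree):
    """Return the ((pod, gpu), utilization) pair with the lowest average."""
    return min(pod_gpu_utilization(tree).items(), key=lambda kv: kv[1])


tree = build_mapping(samples)
print(pod_gpu_utilization(tree))
print(least_utilized(tree))
```

Keeping the tree keyed by Pod first makes it cheap to answer "which GPU slice is this workload wasting", which is the question the mapping is meant to serve.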
Domestic Chip Dynamic Partitioning
Removes the fixed-spec partitioning limits of vendor device plugins. Partitions are allocated dynamically on demand, with no restart required. Configuration goes from complex manual operations to one-click deployment, and utilization rises from 30-50% to 80-90%.
Domestic Chip VRAM Isolation & Alignment
For AI Core and VRAM partitioning combinations on the Ascend 910B and 910C series, enforces strict VRAM boundary checks to prevent out-of-bounds access. Also provides auto-alignment to valid specs, inter-container resource isolation, and real-time VRAM monitoring.
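Auto-alignment and boundary checking can be sketched as follows. The spec table is hypothetical (real Ascend partition templates differ by chip model and are not reproduced here), and choosing the smallest spec that covers the request is an assumed policy, not a documented one.

```python
# Hypothetical valid (ai_cores, vram_gib) partition specs, for illustration only.
VALID_SPECS = [(1, 8), (2, 16), (4, 32), (8, 64)]


def align_request(ai_cores, vram_gib, specs=VALID_SPECS):
    """Return the smallest valid spec covering the request, or None if none fits."""
    candidates = [s for s in specs if s[0] >= ai_cores and s[1] >= vram_gib]
    if not candidates:
        return None
    # Prefer the spec that reserves the least VRAM, then the fewest cores.
    return min(candidates, key=lambda s: (s[1], s[0]))


def within_bounds(offset, size, limit):
    """True if the byte range [offset, offset + size) lies inside the partition."""
    return offset >= 0 and size > 0 and offset + size <= limit


print(align_request(1, 10))   # needs more VRAM than the 1-core spec offers
print(align_request(3, 4))    # needs more cores than the 2-core spec offers
print(align_request(16, 8))   # no spec is large enough

GIB = 1024 ** 3
print(within_bounds(0, 8 * GIB, 8 * GIB))   # exactly fills the partition
print(within_bounds(GIB, 8 * GIB, 8 * GIB))  # runs past the end
```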
Platform Health Dashboard
Utilization watermark, fragmentation rate, and faulty GPU distribution views by vendor, architecture, and GPU resource pool. One dashboard for complete compute health awareness.
Auto Fault Isolation
Alerts on XID fault codes. When a GPU faults or resource usage hits a threshold, the platform triggers a P0 alert and automatically isolates the faulty card so that the fault does not propagate to business workloads.
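A minimal sketch of the isolation flow is shown below. The set of XID codes treated as critical and the usage threshold are assumptions for illustration (48 and 79 are NVIDIA's codes for a double-bit ECC error and a GPU that has fallen off the bus), and the sketch reads the text as "alert on threshold, alert and isolate on fault". The class and method names are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

CRITICAL_XIDS = {48, 79}    # assumed policy: which XID codes trigger isolation
USAGE_THRESHOLD_PCT = 95    # assumed resource-usage alert threshold


@dataclass
class FaultIsolator:
    schedulable: Dict[str, bool]  # GPU id -> may receive new workloads
    alerts: List[Tuple[str, str, str]] = field(default_factory=list)

    def on_xid(self, gpu: str, xid: int) -> None:
        """Raise a P0 alert and isolate the card when a critical XID arrives."""
        if xid not in CRITICAL_XIDS:
            return
        self.alerts.append(("P0", gpu, f"XID {xid}"))
        self.schedulable[gpu] = False

    def on_usage(self, gpu: str, used_pct: int) -> None:
        """Raise a P0 alert when usage reaches the threshold (no isolation)."""
        if used_pct >= USAGE_THRESHOLD_PCT:
            self.alerts.append(("P0", gpu, f"usage {used_pct}%"))

    def healthy(self) -> List[str]:
        return sorted(g for g, ok in self.schedulable.items() if ok)


iso = FaultIsolator(schedulable={"GPU-a": True, "GPU-b": True})
iso.on_xid("GPU-a", 79)    # critical: alert and isolate
iso.on_xid("GPU-b", 13)    # not in the assumed critical set: ignored
iso.on_usage("GPU-b", 97)  # threshold reached: alert only
print(iso.healthy())
print(iso.alerts)
```

In a real cluster the isolation step would typically mark the device unschedulable rather than flip a flag, but the control flow is the same.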
Enterprise Alert Platform
6-step closed loop: metric collection → rule engine → alert generation → noise suppression → tiered notification → incident review. Multi-channel delivery (email, SMS, DingTalk, WeCom) with on-call rotation integration.
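The six steps above can be sketched as a chain of small functions. Everything concrete here is an assumption made for illustration: the metric names, thresholds, severity-to-channel routing, and the first-occurrence deduplication used as noise suppression.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Alert:
    key: str        # deduplication key, e.g. "GPU-a:gpu_temp_c"
    severity: str
    message: str


RULES = [("gpu_temp_c", 90, "P0"), ("vram_used_pct", 95, "P1")]  # assumed rules
CHANNELS = {"P0": ["sms", "dingtalk"], "P1": ["email"]}          # assumed routing


def collect():
    # Step 1, metric collection: static, made-up samples.
    return [
        {"gpu": "GPU-a", "gpu_temp_c": 93, "vram_used_pct": 40},
        {"gpu": "GPU-a", "gpu_temp_c": 94, "vram_used_pct": 41},
        {"gpu": "GPU-b", "gpu_temp_c": 60, "vram_used_pct": 97},
    ]


def evaluate(samples):
    # Step 2, rule engine: yield every threshold breach.
    for s in samples:
        for metric, threshold, severity in RULES:
            if s[metric] >= threshold:
                yield s, metric, severity


def generate(breaches):
    # Step 3, alert generation.
    return [Alert(f"{s['gpu']}:{m}", sev, f"{m}={s[m]} on {s['gpu']}")
            for s, m, sev in breaches]


def suppress(alerts):
    # Step 4, noise suppression: keep only the first alert per key.
    seen, kept = set(), []
    for a in alerts:
        if a.key not in seen:
            seen.add(a.key)
            kept.append(a)
    return kept


def notify(alerts):
    # Step 5, tiered notification: route by severity.
    return [(channel, a) for a in alerts for channel in CHANNELS[a.severity]]


def review(deliveries):
    # Step 6, incident review: count deliveries per severity.
    return Counter(a.severity for _, a in deliveries)


deliveries = notify(suppress(generate(evaluate(collect()))))
for channel, alert in deliveries:
    print(channel, alert.severity, alert.message)
print(dict(review(deliveries)))
```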
Full-stack Observability: Node → GPU → Task → Model
Full-pipeline visibility from GPU device layer to K8s task layer, providing microscope-level observability for heterogeneous GPU clusters
Model: E2E Latency · TTFT · Token Throughput · Req/Resp · Resource Status
Task: Eviction · Compute/VRAM Monitor · Logs · Status · Interconnect Info
GPU: Enable/Disable · Fault Recovery · XID Alerts · Resource Monitor · Temp/Power
Node: Enable/Disable · Resources · Status · Driver Version · OS
NVIDIA · Ascend · Hygon · KunlunXin · Iluvatar · More
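The inference metrics listed above (E2E latency, TTFT, token throughput) can be derived from per-request timestamps. The sketch below assumes a trace with the send time and the arrival time of each output token; the class name and the choice to measure throughput over the decode phase only are assumptions, not the platform's definitions.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RequestTrace:
    sent_at: float            # when the request was sent, in seconds
    token_times: List[float]  # arrival time of each output token (non-empty)

    @property
    def ttft(self) -> float:
        """Time to first token."""
        return self.token_times[0] - self.sent_at

    @property
    def e2e_latency(self) -> float:
        """Time from sending the request to receiving the last token."""
        return self.token_times[-1] - self.sent_at

    @property
    def decode_throughput(self) -> float:
        """Tokens per second after the first token has arrived."""
        span = self.token_times[-1] - self.token_times[0]
        return (len(self.token_times) - 1) / span if span > 0 else 0.0


trace = RequestTrace(sent_at=10.0, token_times=[10.5, 10.75, 11.0, 11.25, 11.5])
print(trace.ttft)               # 0.5
print(trace.e2e_latency)        # 1.5
print(trace.decode_throughput)  # 4.0
```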
Technology Deep Dive
Understand the core technology choices and design philosophy behind Rise VAST
User-space GPU Virtualization
Rise VAST uses user-space virtualization, which is less invasive, more secure, and more portable across platforms than kernel-space approaches. Compute isolation and VRAM management require zero kernel modifications.
The Real Value of GPU Pooling
Breaking four common myths: pooling is unnecessary, more GPUs is enough, heterogeneous cards are incompatible, and virtualization always hurts performance. See how pooling unifies fragmented resources.
Why K8s Native Scheduling Falls Short
Kubernetes treats GPUs as homogeneous integer resources, blind to topology, slicing, and VRAM isolation needs. Learn how Rise VAST fills these critical gaps for AI workloads.
AI Cloud-Native Heterogeneous Architecture
Multi-vendor chips, multi-framework parallelism, multi-cluster cross-region coordination — managing heterogeneous compute in the cloud-native era requires systematic architecture, not piecemeal tooling.
Chip Compatibility Certifications
Rise VAST has completed adaptation certification with the following domestic AI chips, ensuring stable management and unified scheduling
Open Source Ecosystem & Industry Standards
HAMi · CNCF Sandbox
Rise VAST is built on HAMi (CNCF Sandbox). RiseUnion is a core maintainer, driving heterogeneous GPU virtualization standardization across 5 major releases (v2.4–v2.8).
Industry Standards Leadership
Elected chair of AIIC Compute Pooling Working Group
Guided by MIIT, first-batch chair unit
Led drafting of "Heterogeneous Compute Pooling" standard
Group standard officially published (T/CESA1463-2025)
Co-authored Diverse Computing Optimization Action Plan
Co-released with xFusion
National Cloud Computing Standards Conference
"Cloud Standards Empowerment" AI cloud initiative