FAQ
Frequently Asked Questions
Product selection, deployment, commercial terms, customers, and company credentials — straight answers to the questions enterprises ask before adopting Rise VAST, CAMP, ModelX, and MAX.
Product Selection
How to choose between VAST, CAMP, ModelX, and MAX — boundaries and stack composition
01 Which product should I choose: Rise VAST, CAMP, ModelX, or MAX?
Follow the "bottom-up, additive" principle: Rise VAST handles GPU virtualization and heterogeneous compute pooling — the foundation everything else builds on. Rise CAMP sits on VAST and adds intelligent task scheduling, cluster and multi-tenant management, quota management, compute resource monitoring, and fault diagnosis. Rise ModelX delivers a unified training-and-inference model serving platform with an AI Gateway. Rise MAX packages all three layers plus hardware and pre-loaded models as a turnkey appliance.
Quick decision logic:
Quick decision logic:
- Greenfield, want to skip integration risk → Rise MAX appliance
- Existing GPU cluster, low utilization → Start with VAST to drive utilization up
- Existing GPU cluster, scheduling chaos → CAMP for fine-grained scheduling
- Need an AI development environment → CAMP for AI dev services
- Model inference for internal and external consumers → ModelX standalone
- Large production cluster, full MLOps → VAST + CAMP + ModelX stacked
02 Do I have to buy all three layers together? Can I roll them out incrementally?
Each layer is independently purchasable and deployable. The typical evolution path:
- Phase 1 — deploy VAST to push existing cluster utilization from 30% to 70%+, prove the cost savings
- Phase 2 — add CAMP for multi-team scheduling governance, boosting task scheduling efficiency
- Phase 3 — layer in ModelX to unify internal model serving
03 I already have a GPU cluster. Where's the highest-ROI starting point?
For most customers, starting with VAST gives the highest ROI. Here's why: the stock K8s Device Plugin only does whole-GPU allocation. In real clusters, inference services, Notebooks, and small training jobs can't saturate a full card — industry-average GPU utilization stays around 30%. VAST's vGPU virtualization lets multiple Pods share a single card, typically pushing utilization to 70%+. Effectively, your existing GPU count "doubles." This can significantly improve the return on hardware investment.
Further reading: GPU Virtualization Technology Guide · GPU Pooling for Accelerated Training
Further reading: GPU Virtualization Technology Guide · GPU Pooling for Accelerated Training
04 If I only need model serving, do I have to buy VAST first?
No. Rise ModelX deploys standalone on any standard Kubernetes + GPU cluster, delivering model registry, inference engines, AI Gateway, and token governance out of the box. But VAST makes it work better — VAST's vGPU partitioning lets a single physical card host multiple inference services concurrently, and combined with ModelX's smart routing, overall serving ROI improves significantly.
Deployment & Delivery
Delivery models, PoC timelines, upgrade paths, and operational requirements
01 What delivery models are supported? Software-only / appliance / public cloud?
Three delivery models:
- Software subscription — customer provides hardware, we handle install, tuning, upgrades, and support. Best for customers with existing GPU clusters or strong hardware preferences.
- Rise MAX turnkey appliance — integrated hardware + software + pre-loaded models, operational in under an hour. Best for customers who want fast time-to-value.
- End-to-end private cloud deployment — for large customers, we handle data center planning, hardware procurement, network buildout, and software deployment.
02 What's a typical PoC timeline? What does the deployment architecture and minimum footprint look like?
Typical PoC runs 2-4 weeks covering environment prep, functional validation, performance testing, and report delivery — with one-click install, core platform components deploy in minutes.
The platform uses a "control-plane and compute-plane separation" architecture: the control plane runs on Kubernetes-native components. Minimum footprint: a single node is enough to kick off a PoC — the control plane and GPU compute plane can co-locate on the same machine, bare-metal or virtual machine, as long as GPU resources are attached. PoCs can run on a subset of an existing cluster or a standalone test environment.
The platform uses a "control-plane and compute-plane separation" architecture: the control plane runs on Kubernetes-native components. Minimum footprint: a single node is enough to kick off a PoC — the control plane and GPU compute plane can co-locate on the same machine, bare-metal or virtual machine, as long as GPU resources are attached. PoCs can run on a subset of an existing cluster or a standalone test environment.
03 What does the customer need to provide for deployment? How high are the operational requirements?
Software deployment requires: standard Kubernetes cluster (1.20+), GPU nodes, network and storage, and ops access (SSH/bastion). Appliance deployment requires: standard rack space, power (220V / 32A dual feed), and network uplink. Day-to-day ops have a low bar — every component has a web console and full observability. Routine operations (creating tenants, adjusting quotas, viewing monitoring) don't require deep Kubernetes expertise. Deep tuning and incident response are handled remotely by certified RiseUnion engineers. Operations training and certification programs are available.
04 What's the typical production delivery timeline?
Recommended production topology: 3-node HA control plane, compute plane scales independently and supports clusters of up to 500 GPU physical servers. Larger environments should add additional clusters. Storage platforms, container registries, and related infrastructure should be sized based on workload and scale.
Typical delivery takes 2-4 weeks, covering architecture design, environment preparation and validation, production deployment, functional verification, performance testing, report delivery, and on-site training. Ongoing 7×24 support from certified RiseUnion engineers follows.
Typical delivery takes 2-4 weeks, covering architecture design, environment preparation and validation, production deployment, functional verification, performance testing, report delivery, and on-site training. Ongoing 7×24 support from certified RiseUnion engineers follows.
05 Does the platform support multi-cluster, multi-region, and hybrid cloud management?
Yes. Rise CAMP provides federated cluster management, unifying multi-region, multi-data-center, and multi-vendor Kubernetes clusters under a single control plane with cross-cluster scheduling, resource pooling, and unified observability. Common patterns:
- Multiple GPU clusters across geographically distributed data centers that need centralized management and unified compute service delivery
- Train in DC, infer at edge — combined with Rise Edge for centralized training + edge inference + OTA model distribution
- Hybrid cloud burst — overflow to public cloud when local capacity is saturated, transparent to applications
06 What's the upgrade path? Will version updates affect production workloads?
All components support rolling upgrades, canary releases, and one-click rollback — workloads stay running throughout. Two to three major releases per year (with new features), plus continuous security patches, all bundled into annual maintenance. Pre-upgrade compatibility validation ensures any historical version can upgrade smoothly to the latest. Upgrades are performed remotely or on-site by RiseUnion engineers — no breaking changes that require cluster rebuilds. Self-service upgrades are also supported with detailed runbooks and rollback procedures for every release.
07 Does ModelX come with model inference best practices? How are non-standard models supported?
Yes. With Rise ModelX, we provide inference deployment best practices for mainstream open-source models, including recommended engine selection (vLLM / TGI / TensorRT-LLM, etc.), VRAM and compute sizing guidance, concurrency and batching tuning, and AI Gateway load-balancing and autoscaling configurations. For popular model families like DeepSeek, Qwen, and LLaMA, we have production-validated deployment templates ready to use.
For non-standard models (custom-trained, privately fine-tuned, or uncommon-framework models), we offer three tiers of support:
For non-standard models (custom-trained, privately fine-tuned, or uncommon-framework models), we offer three tiers of support:
- Self-service onboarding — ModelX accepts any inference backend that exposes an OpenAI-compatible API; customers can integrate following our documentation.
- Remote assistance — RiseUnion engineers help with model adaptation, engine tuning, and performance baselining.
- Deep customization — for custom operators, bespoke inference pipelines, or other complex scenarios, both parties jointly assess the technical approach; additional fees may apply, with specifics agreed during PoC or implementation.
Commercial & Partnership
Licensing model, technical support, trial policy, and training
01 How is licensing calculated? Per-GPU or per-node?
Per-GPU license purchase or subscription, tiered by GPU class. Contact sales for pricing (contact@riseunion.io).
02 Is vendor technical support included? What's the SLA?
Three-tier first-party support:
Default SLA covers business hours 9×5; 24×7 year-round support is available as an upgrade. All support is delivered by RiseUnion engineers directly — no outsourcing, no subcontracting. Quarterly health checks, annual optimization reviews, and proactive upgrades are included.
- L1 ticket response — 30-minute response for critical issues, 4-hour for normal issues
- L2 remote engagement — root cause within 4 hours
- L3 on-site support — 24-hour arrival for major incidents
Default SLA covers business hours 9×5; 24×7 year-round support is available as an upgrade. All support is delivered by RiseUnion engineers directly — no outsourcing, no subcontracting. Quarterly health checks, annual optimization reviews, and proactive upgrades are included.
03 What's the trial / PoC policy?
Free PoC, 2-4 weeks, including solution design, environment deployment, functional validation, performance testing, and a final report. RiseUnion engineers provide on-site or remote support throughout. All test data and findings belong to the customer. For longer trials (1-3 months), we sign a trial agreement and provide a time-limited license. PoC pass rate exceeds 90% — most customers validate the GPU utilization gains and cost reductions during the trial itself.
04 Are training and certification programs available?
Three-tier training program:
- User training — for developers and ML engineers, 1 day, covering job submission, model serving, and monitoring.
- Operations training — for IT operations, 2-3 days, covering daily operations, quota management, and troubleshooting.
- Paid advanced training — for architects and power users, 3-5 days, covering scheduling policy tuning, performance optimization, and customization.
05 I'm a reseller — how do I become a partner?
We have a comprehensive partner program. Contact contact@riseunion.io for details.
Customers & References
Industry verticals, reference cases, and ecosystem partnerships
01 What industries are your existing customers in? Any production-scale deployments?
Products are running in production at scale across finance, government, transportation, education, healthcare, and energy. Publicly disclosed reference customers include BYD (autonomous driving model training), Huatai Futures (financial AI inference), National Supercomputing Center in Jinan (public compute services), Second Affiliated Hospital of Chongqing Medical University (medical AI), and CEPREI Laboratory (research MaaS platform). Detailed cases and industry solutions are on the Industry Cases page.
02 Are there public references in my industry? Can I talk to existing customers?
All public references are listed on the Industry Cases page, organized by vertical, with technical solution, deployment scale, and outcomes. For prospective customers, with the reference customer's authorization, we arrange phone or on-site discussions with same-industry customers to help evaluate implementation risk and benefit. Contact sales (contact@riseunion.io) for reference introductions.
Company & Credentials
Technical background, compliance certifications, and open source contributions
01 What's the company's technical background? What certifications do you hold?
Beijing RiseUnion Technology Co., Ltd. is a National High-tech Enterprise, Beijing Specialized SME, and chair of the AIIC Compute Pooling Working Group. We led the drafting of the Heterogeneous Compute Virtualization and Pooling System Requirements industry standard and co-authored the Diverse Computing Optimization Action Plan. Certifications include ISO 9001, ISO 27001, ISO 20000, and CMMI3, with 10+ software copyrights and patents. Products are deployed in MLPS Level 3 environments at multiple finance and government customers.
Learn more: About RiseUnion
Learn more: About RiseUnion
02 What's your open source involvement?
RiseUnion is one of the core contributors to HAMi (Heterogeneous AI Computing Virtualization Middleware), a CNCF Sandbox project and the most active GPU virtualization open source project in China. We continuously contribute upstream and participate in project governance, while the enterprise-grade features (multi-GPU virtualization, domestic chip support, production-grade HA) are productized in Rise VAST Enterprise. Beyond HAMi, the team also contributes to Volcano, Kubernetes, and other upstream communities.
Further reading: HAMi v2.8.0 Release · HAMi Source Code Analysis
Further reading: HAMi v2.8.0 Release · HAMi Source Code Analysis
03 Funding and growth?
RiseUnion has raised tens of millions of RMB in funding (Plum Ventures sole investor) and received strategic investment from 4Paradigm. Rise VAST is co-developed with 4Paradigm as an enterprise AI compute management product. The company continues heavy R&D investment to maintain technical leadership in AI compute management and scheduling.
Didn't find your answer?
Contact our technical and sales team for custom solution design and PoC support.
contact@riseunion.io