NVIDIA Partnership
MemVerge is a member of the program designed to help startups evolve faster through cutting-edge technology and access to the latest technical resources from NVIDIA. Unlike traditional accelerators, NVIDIA Inception supports all stages of a startup’s life cycle to provide members the best technical tools and latest resources.
Achieving K8S and Public Cloud Operational Efficiency using a New Checkpoint/Restart Feature for GPUs
Watch this video to see how CUDA 12.x driver enhancements will enable the open-source CRIU project to checkpoint and restart a GPU-based compute node. The video includes a technical overview and demonstrates this new capability. This transparent checkpoint/hot restart feature can, in turn, facilitate node maintenance, node rightsizing, and workload migration/bursting for greater operational efficiencies and minimal production interruptions.