Dynamic MIG Partitioning in Kubernetes | by Michele Zanotti | Jan, 2023
Maximize GPU utilization and reduce infrastructure costs.

Photo by Growtika on Unsplash

To minimize infrastructure expenses, it’s crucial to use GPU accelerators as efficiently as possible. One way to achieve this is to divide the GPU into smaller partitions, called slices, so that containers can request only the resources they strictly need. Some workloads require only a small fraction of the GPU’s compute and memory, so having the ability in Kubernetes to divide a single GPU into multiple slices, which can be…