Sokovan Container Orchestrator for Accelerated AI:ML Workloads and Massive scale GPU Computing

Jun 30, 2023

Sokovan Container Orchestrator for Accelerated AI:ML Workloads and Massive scale GPU Computing

Jeongkyu
Shin

Jeongkyu Shin

Founder / Researcher / CEO

Joongi
Kim

Joongi Kim

Co-Founder / CTO

backend.ai

Jun 30, 2023

Sokovan Container Orchestrator for Accelerated AI:ML Workloads and Massive scale GPU Computing

Jeongkyu
Shin

Jeongkyu Shin

Founder / Researcher / CEO

Joongi
Kim

Joongi Kim

Co-Founder / CTO

backend.ai

Overview

Sokovan is a Python-based container orchestrator that addresses the challenges of running resource-intensive batch workloads in a containerized environment. It offers acceleration-aware, multi-tenant, batch-oriented job scheduling and fully integrates multiple hardware acceleration technologies into various system layers. It consists of two layers of schedulers. The cluster-level scheduler allows users to customize job placement strategies and control the density and priority of workloads. The node-level scheduler optimizes per-container performance by automatically detecting and mapping underlying hardware accelerators to individual containers, improving the performance of AI workloads compared to Slurm and other existing tools. Sokovan has been deployed on a large scale in various industries for a range of GPU workloads, including AI training and services. It helps container-based MLOps platforms unleash the potential of the latest hardware technologies.

Speakers:
Jeongkyu Shin
Joongi Kim

Request Slide

We're here for you!

Complete the form and we'll be in touch soon

We value your privacy

We use cookies to enhance your browsing experience, analyze site traffic, and understand where our visitors are coming from. By clicking "Accept All", you consent to our use of cookies. Learn more