Skip to content

GH200 drone build templates (pytorch, torchvision, triton, vllm...)

Notifications You must be signed in to change notification settings

feuler/gh200-drone-builds

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Build process - gh200 with pytorch uvm patch

docker build examples with "Drone + Gitea" for gh200 pytorch (with uvm), torchvision, triton, xformers

Steps

  1. build pytorch version required for vllm version

  2. build torchvision version matching the torch version with the custom built torch pre-installed

  3. build xformers (optional) with custom torch and torchvision whl packages pre-installed

  4. build triton

    • pre-install custom torch whl package
    • checkout triton version defined in pytorch version file ".ci/docker/ci_commit_pins/triton.txt" (version number in ".ci/docker/triton_version.txt")
    • clone & checkout & build llvm version defined in triton version file "triton/cmake/llvm-hash.txt"
    • use llvm build in env as llvm paths (lib/include/root)
    • build triton
  5. build vllm package with above whl packages pre-installed and use "use_existing_torch.py" vllm script

  6. build vllm-docker container by installing above whl packages for torch, torchvision, xformers, triton and vllm

Current build environment for all involved builds

  • CUDA: 12.4.1
  • Python: 3.11
  • Docker image: nvidia/cuda:12.4.1-cudnn-devel-ubuntu22.04

About

GH200 drone build templates (pytorch, torchvision, triton, vllm...)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published