- We have compiled ML training examples to familiarize new users with our recommended running environments and packages. These include functionality to log training runs on MLflow, a platform that facilitates important aspects of the training cycle, such as metric monitoring, reproducibility, and deployment. We have put together a "quick start" guide, which are working on expanding to be full-fledged user documentation. We have single-GPU training capabilities in place and are currently working on interfacing with multiple GPUs and submission to multi-GPU nodes via our schedulers.
0 commit comments