AWS Sagemaker distributed training using PyTorch¶
This plugin shows an example of using Sagemaker custom training, with Pytorch distributed training.
Installation¶
To use the flytekit aws sagemaker plugin simply run the following:
pip install flytekitplugins-awssagemaker==0.16.0
Creating a dockerfile for Sagemaker custom training [Required]¶
The dockerfile for Sagemaker custom training is similar to any regular dockerfile, except for the difference in using the Nvidia cuda base.