AWS SageMaker Inference Agent#

Tags: AWS, Integration, Advanced

The AWS SageMaker inference agent allows you to deploy models, and create and trigger inference endpoints. You can also fully remove the SageMaker deployment.

Installation#

To use the AWS SageMaker inference agent, run the following command:

pip install flytekitplugins-awssagemaker

Example usage#

For a usage example, see AWS SageMaker inference agent example usage.

Local testing#

To test an agent locally, create a class for the agent task that inherits from SyncAgentExecutorMixin or AsyncAgentExecutorMixin. These mixins can handle synchronous and synchronous tasks, respectively, and allow flytekit to mimic FlytePropeller’s behavior in calling the agent. For more information, see “Testing agents locally”.

Flyte deployment configuration#

Note

If you are using a managed deployment of Flyte, you will need to contact your deployment administrator to configure agents in your deployment.

To enable the AWS SageMaker inference agent in your Flyte deployment, refer to the AWS SageMaker inference agent setup guide.