Create, invoke, update, and delete a serverless endpoint - Amazon SageMaker
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Create, invoke, update, and delete a serverless endpoint

Unlike other SageMaker real-time endpoints, Serverless Inference manages compute resources for you, reducing complexity so you can focus on your ML model instead of on managing infrastructure. The following guide highlights the key capabilities of serverless endpoints: how to create, invoke, update, describe, or delete an endpoint. You can use the SageMaker console, the Amazon SDKs, the Amazon SageMaker Python SDK, or the Amazon CLI to manage your serverless endpoints.