Serverless endpoint operations
Unlike other SageMaker AI real-time endpoints, Serverless Inference manages compute resources for you, reducing
  complexity so you can focus on your ML model instead of on managing infrastructure. The
  following guide highlights the key capabilities of serverless endpoints: how to create, invoke,
  update, describe, or delete an endpoint. You can use the SageMaker AI console, the Amazon SDKs,
  the Amazon SageMaker Python SDK