What Is Amazon SageMaker?
Amazon SageMaker is a fully managed machine learning service. With SageMaker, data scientists and developers can quickly and easily build and train machine learning models, and then directly deploy them into a production-ready hosted environment. It provides an integrated Jupyter authoring notebook instance for easy access to your data sources for exploration and analysis, so you don't have to manage servers. It also provides common machine learning algorithms that are optimized to run efficiently against extremely large data in a distributed environment. With native support for bring-your-own-algorithms and frameworks, SageMaker offers flexible distributed training options that adjust to your specific workflows. Deploy a model into a secure and scalable environment by launching it with a few clicks from SageMaker Studio or the SageMaker console. Training and hosting are billed by minutes of usage, with no minimum fees and no upfront commitments.
This guide includes information and tutorials on SageMaker features. For additional information, see
Amazon SageMaker developer resources
Topics
Amazon SageMaker Features
Amazon SageMaker includes the following features:
- SageMaker Studio
-
An integrated machine learning environment where you can build, train, deploy, and analyze your models all in the same application.
- SageMaker Canvas
-
An auto ML service that gives people with no coding experience the ability to build models and make predictions with them.
- SageMaker Ground Truth Plus
-
A turnkey data labeling feature to create high-quality training datasets without having to build labeling applications and manage the labeling workforce on your own.
- SageMaker Studio Lab
-
A free service that gives customers access to Amazon compute resources in an environment based on open-source JupyterLab.
- SageMaker Training Compiler
-
Train deep learning models faster on scalable GPU instances managed by SageMaker.
- SageMaker Studio Universal Notebook
-
Easily discover, connect to, create, terminate and manage Amazon EMR clusters in single account and cross account configurations directly from SageMaker Studio.
- SageMaker Serverless Endpoints
-
A serverless endpoint option for hosting your ML model. Automatically scales in capacity to serve your endpoint traffic. Removes the need to select instance types or manage scaling policies on an endpoint.
- SageMaker Inference Recommender
-
Get recommendations on inference instance types and configurations (e.g. instance count, container parameters and model optimizations) to use your ML models and workloads.
- SageMaker Model Registry
-
Versioning, artifact and lineage tracking, approval workflow, and cross account support for deployment of your machine learning models.
- SageMaker Projects
-
Create end-to-end ML solutions with CI/CD by using SageMaker projects.
- SageMaker Model Building Pipelines
-
Create and manage machine learning pipelines integrated directly with SageMaker jobs.
- SageMaker ML Lineage Tracking
-
Track the lineage of machine learning workflows.
- SageMaker Data Wrangler
-
Import, analyze, prepare, and featurize data in SageMaker Studio. You can integrate Data Wrangler into your machine learning workflows to simplify and streamline data pre-processing and feature engineering using little to no coding. You can also add your own Python scripts and transformations to customize your data prep workflow.
- SageMaker Feature Store
-
A centralized store for features and associated metadata so features can be easily discovered and reused. You can create two types of stores, an Online or Offline store. The Online Store can be used for low latency, real-time inference use cases and the Offline Store can be used for training and batch inference.
- SageMaker JumpStart
-
Learn about SageMaker features and capabilities through curated 1-click solutions, example notebooks, and pretrained models that you can deploy. You can also fine-tune the models and deploy them.
- SageMaker Clarify
-
Improve your machine learning models by detecting potential bias and help explain the predictions that models make.
- SageMaker Edge Manager
-
Optimize custom models for edge devices, create and manage fleets and run models with an efficient runtime.
- SageMaker Ground Truth
-
High-quality training datasets by using workers along with machine learning to create labeled datasets.
- Amazon Augmented AI
-
Build the workflows required for human review of ML predictions. Amazon A2I brings human review to all developers, removing the undifferentiated heavy lifting associated with building human review systems or managing large numbers of human reviewers.
- SageMaker Studio Notebooks
-
The next generation of SageMaker notebooks that include Amazon IAM Identity Center (successor to Amazon Single Sign-On) (IAM Identity Center) integration, fast start-up times, and single-click sharing.
The next generation of SageMaker notebooks that include fast start-up times and single-click sharing.
- SageMaker Experiments
-
Experiment management and tracking. You can use the tracked data to reconstruct an experiment, incrementally build on experiments conducted by peers, and trace model lineage for compliance and audit verifications.
- SageMaker Debugger
-
Inspect training parameters and data throughout the training process. Automatically detect and alert users to commonly occurring errors such as parameter values getting too large or small.
- SageMaker Autopilot
-
Users without machine learning knowledge can quickly build classification and regression models.
- SageMaker Model Monitor
-
Monitor and analyze models in production (endpoints) to detect data drift and deviations in model quality.
- SageMaker Neo
-
Train machine learning models once, then run anywhere in the cloud and at the edge.
- SageMaker Elastic Inference
-
Speed up the throughput and decrease the latency of getting real-time inferences.
- Reinforcement Learning
-
Maximize the long-term reward that an agent receives as a result of its actions.
- Preprocessing
-
Analyze and preprocess data, tackle feature engineering, and evaluate models.
- Batch Transform
-
Preprocess datasets, run inference when you don't need a persistent endpoint, and associate input records with inferences to assist the interpretation of results.
Amazon SageMaker Pricing
As with other Amazon products, there are no contracts or minimum commitments for using
Amazon SageMaker. For more information about the cost of using SageMaker, see
SageMaker Pricing
Are You a First-time User of Amazon SageMaker?
If you are a first-time user of SageMaker, we recommend that you do the following:
-
Read How Amazon SageMaker Works – This section provides an overview of SageMaker, explains key concepts, and describes the core components involved in building AI solutions with SageMaker. We recommend that you read this topic in the order presented.
-
Set Up Amazon SageMaker Prerequisites – This section explains how to set up your Amazon account.
-
Amazon SageMaker Autopilot simplifies the machine learning experience by automating machine learning tasks. If you are new to SageMaker, it provides the easiest learning path. It also serves as an excellent ML learning tool that provides visibility into the code with notebooks generated for each of the automated ML tasks. For an introduction to its capabilities, see Automate model development with Amazon SageMaker Autopilot. To get started building, training, and deploying machine learning models, Autopilot provides:
-
Get Started with Amazon SageMaker – This section walks you through training your first model using SageMaker Studio, or the SageMaker console and the SageMaker API. You use training algorithms provided by SageMaker.
-
Explore other topics – Depending on your needs, do the following:
-
Submit Python code to train with deep learning frameworks – In SageMaker, you can use your own training scripts to train models. For information, see Use Machine Learning Frameworks, Python, and R with Amazon SageMaker.
-
Use SageMaker directly from Apache Spark – For information, see Use Apache Spark with Amazon SageMaker.
-
Use SageMaker to train and deploy your own custom algorithms – Package your custom algorithms with Docker so you can train and/or deploy them in SageMaker. To learn how SageMaker interacts with Docker containers, and for the SageMaker requirements for Docker images, see Using Docker containers with SageMaker.
-
-
View the API Reference – This section describes the SageMaker API operations.
How Amazon SageMaker Works
SageMaker is a fully managed service that enables you to quickly and easily integrate machine learning-based models into your applications. This section provides an overview of machine learning and explains how SageMaker works. If you are a first-time user of SageMaker, we recommend that you read the following sections in order: