How Amazon DataSync Discovery works - Amazon DataSync
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

How Amazon DataSync Discovery works

Learn the key concepts and terminology related to Amazon DataSync Discovery.

DataSync Discovery architecture

The following diagram illustrates how DataSync Discovery collects information and provides recommendations for migrating data from an on-premises storage system to Amazon.

The first connection is for communicating with the source storage location. The second connection is for transferring between locations. The third and final connection is with the destination storage location.
Reference Description
1 A DataSync agent connects to your on-premises storage system's management interface (using port 443, for example). You then run a discovery job to collect information about your system.
2 The agent sends the information that it collects to DataSync Discovery through a public service endpoint.
3 Using the information that it collects, DataSync Discovery recommends Amazon storage services that you can migrate your data to.

Concepts and terminology

Familiarize yourself with DataSync Discovery features.

Agent

An agent is a virtual machine (VM) appliance that DataSync Discovery uses to access the management interface of your on-premises storage system. The agent collects (reads) information about how your storage resources are performing and being used.

You can deploy an agent in your storage environment on VMware ESXi, Linux Kernel-based Virtual Machine (KVM), or Microsoft Hyper-V hypervisors. For storage in a virtual private cloud (VPC) in Amazon, you can deploy an agent as an Amazon EC2 instance.

A DataSync Discovery agent is no different than an agent that you can use for DataSync transfers, but we don't recommend using the same agent for these scenarios.

To get started, see Deploying your Amazon DataSync agent.

Discovery job

You run a discovery job to collect information about your on-premises storage system through the storage system's management interface.

You can run a discovery job between 1 hour and 31 days. You'll get more accurate Amazon storage recommendations the longer your discovery job runs.

For more information, see Working with DataSync discovery jobs.

Storage system resource information

DataSync Discovery can give you performance and utilization information about your on-premises storage system's resources. For example, get an idea about how much storage capacity is being used in a specific storage volume compared to how much capacity you originally provisioned.

You can view this information as your discovery job collects it by using the following:

For more information, see Viewing storage resource information collected by Amazon DataSync Discovery.

Amazon storage recommendations

Using the information that it collects about your on-premises storage system's resources, DataSync Discovery recommends Amazon storage services to help plan your migration to Amazon.

You can view recommendations by using the DescribeStorageSystemResources operation.

For more information, see Getting recommendations from Amazon DataSync Discovery.

Limitations