Architecture guidelines and decisions
This section will provides a brief overview of the Amazon services typically used for SAP workloads and some of the key points to understand when designing your architecture for hosting SAP on Amazon. If you are already familiar with these Amazon services, you can skip this section.
Regions and Availability Zones
The Amazon Global Infrastructure
Regions
Amazon has a global footprint and ensures that customers are served across the world. Amazon maintains multiple Regions in North America, South American, Europe, Asia Pacific, and the Middle East.
An Amazon Region is a collection of Amazon resources in a geographic area. Each Region is isolated and independent. For a list of Region names and codes, see Regional endpoints
Regions provide fault tolerance, stability, and resilience. They enable you to create redundant resources that remain available and unaffected in the unlikely event of an outage.
Amazon Regions consist of multiple Availability Zones (AZs), typically 3. An Availability Zone is a fully isolated partition of the Amazon infrastructure. It consists of discrete data centers housed in separate facilities, with redundant power, networking, and connectivity.
You retain complete control and ownership over the Amazon Region in which your data is physically located, making it easy to meet Regional compliance and data residency requirements.
Availability Zones
Availability Zones (AZs) enable customers to operate production applications and databases that are more highly available than would be possible from a single data center. Distributing your applications across multiple zones provides the ability to remain resilient in the face of most failure modes, including natural disasters or system failures.
Each Availability Zone can be multiple data centers. At full scale, it can contain hundreds of thousands of servers. They are fully isolated partitions of the Amazon global infrastructure. With its own powerful infrastructure, an Availability Zone is physically separated from any other zones. There is a distance of several kilometers, although all are within 100 km (60 miles of each other). This distance provides isolation from the most common disasters that could affect data centers (i.e. floods, fire, severe storms, earthquakes, etc.).
All Availability Zones (AZs) within a Region are interconnected with high-bandwidth and low-latency networking, over fully redundant and dedicated metro fiber. This ensures high-throughput, low-latency networking between zones. The network performance is sufficient to accomplish synchronous replication between zones.
Amazon Availability Zones (AZs) enable our customers to run their applications in a highly-available manner. To be highly available, an application needs to run in more than one location simultaneously with the exact same data, thus allowing for a seamless fail over with minimal downtime, in the event of a disaster.
Services
Our general policy is to deliver Amazon services, features, and instance types to all Amazon Regions within 12 months of general availability, based on customer demand, latency, data sovereignty, and other factors. You can share your interest for local Region delivery, request service roadmap information, or gain insight on service interdependency (under NDA) by contacting your Amazon sales representative
Due to the nature of the service, some Amazon services are delivered globally rather than Regionally, such as Route 53, Amazon Chime, Amazon WorkDocs, Amazon WorkMail, WorkSpaces, and Amazon WorkLink.
Other services, such as Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Elastic Block Store (Amazon EBS) are zonal services. When you create an Amazon EC2 or Amazon EBS resource for launch, you need to specify the required Availability Zone within a Region.
Selecting the Amazon Regions
When selecting the Amazon Region(s) for your SAP environment deployment, you should consider the following:
-
Proximity to on-premises data centers, systems, and end users to minimize network latency.
-
Data residency and compliance requirements.
-
Availability of the Amazon products and services that you plan to use in the Region. For more details, see Region Table
. -
Availability of the Amazon EC2 instance types that you plan to use in the Region. For more details, see Amazon EC2 Instance Types for SAP
. -
Pricing variation between different Amazon Regions. For more details, see SAP on Amazon Pricing and Optimization guide
.
Multi-Region considerations
When deploying across multiple Regions, an important consideration is the associated cost and management effort for core services required in each Region such as networking, security, and audit services.
Network latency
If you decide on a multiple Region approach, you should consider the impact of any increase in the network latency to the secondary Region from your on-premises locations.
Cross-Regional data transfer
Amazon provides several methods of data transfer between Regions. These methods are relevant when designing an SAP Architecture for disaster recovery. You should consider any data residency requirements when transferring data to another Amazon Region, the costs associated with the data transfer (cross-Region peering and/or Amazon S3 replication), and storage in the secondary Region.
Tier 0 services
When using an Amazon Region, there are a number of Tier 0 services that you need before deploying an SAP workload. These include DNS, Active Directory, and/or LDAP as well as any Amazon or ISV-provided security and compliance products and services.
Amazon accounts
While there is no one-size-fits-all answer for how many Amazon accounts a particular customer should have, most organizations want to create more than one Amazon account. Multiple accounts provide the highest level of resource and billing isolation.
In the context of SAP workloads, it is common for customers to deploy the Production environment in a separate Amazon account. It helps isolate the production environment from the rest of the SAP landscape.
Amazon Organizations
Amazon Landing Zone
Note: The Amazon Landing Zone solution is delivered by Amazon Solutions Architects or Professional Services consultants to create a customized baseline of Amazon accounts, networks, and security policies.
Consider using the Amazon Landing Zone solution if you are looking to set up a configurable landing zone with rich customization options through custom add-ons such as, Active Directory, and change management through a code deployment and configuration pipeline.
Amazon Control Tower
Consider using the Amazon Control Tower to set up a new Amazon environment based on a landing zone with pre-configured blueprints. You can interactively govern your accounts with pre-configured guardrails.
Compute
Amazon Elastic Compute Cloud
When the Amazon EC2 instances are deployed across two or more Availability Zones within a single Region then Amazon offers an SLA
Instance types
A range of Amazon EC2 instance types
For the tiers with specific instance type requirements and without flexibility to change during a failure scenario, consider having a capacity reservation using Reserved Instances
Reserved Instances
Reserved Instances
When you deploy Amazon EC2 across multiple Availability Zones for high availability, we recommend that you use zonal Reserved Instances. In addition to the savings over the on-demand instance pricing, a zonal Reserved Instance provides a capacity reservation in the specified Availability Zone. This ensures that the required capacity is readily available as and when you need it.
For billing purposes, the consolidated billing
Savings Plans
Savings Plans
Savings Plans offer significant savings over on-demand, just like the Amazon EC2 Reserved Instances, in exchange for a commitment to use a specific amount of compute power (measured in $/hour) for a one- or three-year period.
On-Demand Capacity Reservations
On-Demand Capacity Reservations
Instance Family Availability across Availability Zones
Certain Amazon EC2 instance families (for example, X1 and High Memory) are not available across all Availability Zones within a Region. You should confirm the instance types required for your SAP workload and check if they are available in your target Availability Zones.
Amazon EC2 auto recovery
Amazon EC2 auto recovery
You can enable auto recovery for Amazon EC2 instances by creating an Amazon CloudWatch alarm which monitors the instance status. Examples of problems that cause system status checks to fail include:
-
Loss of network connectivity
-
Loss of system power
-
Software issues on the physical host
-
Hardware issues on the physical host that impact network reachability
Though it typically takes under 15 minutes for a failed instance to restart, Amazon EC2 auto recovery does not offer an SLA. Therefore, if the recovery of the application that’s running on the failed host is critical (for example, SAP Database or SAP Central Services), you should consider using clustering across two Availability Zones
High Memory Bare Metal Dedicated Hosts
Amazon EC2 High Memory Instances
High Memory instances support Dedicated Host Recovery
We recommend a second-High Memory instance in a different Availability Zone of your chosen Region to protect against zone failure.
Amazon EC2 maintenance
When Amazon maintains the underlying host for an instance, it schedules the instance for maintenance. There are two types of maintenance events:
-
During network maintenance, scheduled instances lose network connectivity for a brief period of time. Normal network connectivity to your instance is restored after maintenance is complete.
-
During power maintenance, scheduled instances are taken offline for a brief period, and then rebooted. When a reboot is performed, all of your instance’s configuration settings are retained.
Additionally, we frequently upgrade our Amazon EC2 fleet with many patches and upgrades being applied to instances transparently. However, some updates require a short reboot. Reboots such as these should be infrequent but necessary to apply upgrades that strengthen our security, reliability, and operational performance.
There are two kinds of reboots that can be required as part of Amazon EC2 scheduled maintenance:
-
Instance reboots are reboots of your virtual instance and are equivalent to an operating system reboot.
-
System reboots require reboots of the underlying physical server hosting an instance.
You can view any upcoming scheduled events for your instances in the Amazon Management Console or using the API tools or command line.
If you do not take any action, the impact on your instance is the same in both cases: during your scheduled maintenance
Alternatively, you can migrate your instance to a new host by performing a stop and start on your instance. For more information, see Stop and start your instance
Networking
Amazon Virtual Private Cloud and subnets
An Amazon Virtual Private Cloud
When you create a VPC, you must specify a range of IPv4 addresses for the VPC in the form of a Classless Inter-Domain Routing (CIDR) block, for example, 10.0.0.0/16. This is the primary CIDR block for your VPC.
You can create a VPC within your chosen Amazon Region and it will be available across all Availability Zones within that Region.
To add a new subnet
To provide future flexibility, we recommend that your subnet and connectivity design support all of the available Availability Zones in your account within the Region, regardless of the number of zones that you initially plan to use within a Region.
Latency across Availability Zones
All Availability Zones (AZs) are interconnected with high-bandwidth, low-latency networking, over fully redundant and dedicated metro fiber. This results in single-digit millisecond latency between resources in different Availability Zones in the same Region.
For high availability, we recommend deploying production SAP workloads across multiple Availability Zones, including the SAP Application Server Layer. If you have SAP transactions or batch jobs that involve significant database calls, we recommend that you run these transactions on SAP Application Servers located in the same Availability Zone as the database and that you use SAP Logon Groups (SMLG) for end users and batch server group (SM61) for background processing jobs. This ensures that latency-sensitive parts of the SAP workload run on the correct application servers.
On premises to Amazon connectivity
You can connect to your VPC through a Site-to-Site virtual private network
Site-to-Site VPN connections are to specific Regions. For Direct Connect-based connections, Direct Connect Gateway
When establishing connectivity to Amazon from on premises, ensure that you have resilient connections either through the use of multiple Direct Connect Links, multiple VPN connections, or a combination of the two.
The Amazon Direct Connect Resiliency Toolkit
VPC endpoints
A VPC endpoint
VPC endpoints are available for all of the core Amazon services that are required to support an SAP-based workload, including Amazon EC2 API, Amazon S3, and Amazon Elastic File System.
Cross-Region peering
Amazon Virtual Private Cloud
Amazon Transit Gateway
Load balancing
Elastic Load Balancing
A Network Load Balancer
A load balancer serves as the single point of contact for clients. The load balancer distributes incoming traffic across multiple targets, such as Amazon EC2 instances.
A listener checks for connection requests from clients, using the protocol and port that you configure, and forwards requests to a target group.
Each target group routes requests to one or more registered targets, such as Amazon EC2 instances, using the TCP protocol and the specified port number. You can configure health checks on a per target group basis. Health checks are performed on all targets registered to a target group that is specified in a listener rule for your load balancer.
For TCP traffic, the Network Load Balancer selects a target using a flow hash algorithm based on the protocol, source IP address, source port, destination IP address, destination port, and TCP sequence number. Each individual TCP connection is routed to a single target for the life of the connection.
DNS
Amazon Route 53
Amazon Route 53 Resolver
Storage
Object storage
Amazon Simple Storage Service
To protect against data loss, you can perform backups (such as database backups or file backups) to Amazon S3. Additionally, Amazon EBS Snapshots
Amazon S3 Replication enables automatic, asynchronous copying of objects across Amazon S3 buckets. Buckets that are configured for object replication can be owned by the same Amazon account or by different accounts.
Amazon S3 replication
You can replicate objects between the same or different Amazon Regions.
-
Cross-Region replication (CRR) is used to copy objects across Amazon S3 buckets in different Amazon Regions.
-
Same-Region replication (SRR) is used to copy objects across Amazon S3 buckets in the same Amazon Region.
Cross-Region replication incurs the following costs
-
Data Transfer charges for the data transferred between the first and second Amazon Regions
-
Amazon S3 charges for the data stored in Amazon S3 in the two different Amazon Regions
Additionally, you can enable Amazon S3 Replication Time Control
Amazon S3 RTC incurs the following costs in addition to the costs listed above for cross-Region replication:
Same-Region replication incurs the following costs
-
Charges for the data stored in Amazon S3
Block storage
Amazon Elastic Block Store (Amazon EBS
Amazon EBS volumes are placed in a specific Availability Zone where they are automatically replicated to protect you from the failure of a single component. All Amazon EBS volume types offer durable snapshot capabilities and are designed for 99.999% availability per volume
Amazon EBS volumes are designed for an annual failure rate (AFR) of between 0.1% - 0.2%, where failure refers to a complete or partial loss of the volume, depending on the size and performance of the volume. This makes Amazon EBS volumes 20 times more reliable than typical commodity disk drives, which fail with an AFR of around 4%. For example, if you have 1,000 Amazon EBS volumes running for 1 year, you should expect 1 to 2 will have a failure.
Amazon EBS offers a number of different volume types
Amazon EBS Multi-Attach enables you to attach a single Provisioned IOPS SSD (io1) volume to up to 16 Amazon Nitro-based instances
Amazon EBS snapshots
You can back up the data on your Amazon EBS volumes to Amazon S3 by taking point-in-time snapshots
Amazon EBS Snapshots can be copied
Copying Snapshots across Regions incurs the following costs
-
Data Transfer charges for the data transferred between the first and second Amazon Regions
-
Amazon EBS Snapshot charges for the data stored in Amazon S3 in the two different Amazon Regions
Restoring snapshots
New volumes created from existing Amazon EBS snapshots load lazily in the background. This means that after a volume is created from a snapshot, there is no need to wait for all of the data to transfer from Amazon S3 to your Amazon EBS volume before your attached instance can start accessing the volume and all its data.
This preliminary action takes time and can significantly increase the latency of I/O operations. If your instance accesses data that hasn’t yet been loaded, the volume immediately downloads the requested data from Amazon S3, and then continues loading the rest of the volume data in the background.
Fast snapshot restore
Amazon EBS fast snapshot restore
File storage
Amazon EFS
Amazon Elastic File System
Amazon EFS file systems can be shared across accounts and VPCs
Amazon DataSync
Amazon FSx
Amazon FSx for Windows File Server
With Single-AZ file systems, Amazon FSx automatically replicates your data within an Availability Zone, continuously monitors for hardware failures and automatically replaces infrastructure components in the event of a failure. Amazon FSx also takes highly durable backups of your file system daily using Windows’ Volume Shadow Copy Service that are stored in Amazon S3. You can take additional backups at any point.
Multi-AZ file systems support all the availability and durability features of Single-AZ file systems. In addition, they are designed to provide continuous availability to data, even when an Availability Zone is unavailable. In a Multi-AZ deployment, Amazon FSx automatically provisions and maintains a standby file server in a different zone. Any changes written to disk in your file system are synchronously replicated across Availability Zones to the standby.
Amazon FSx File systems can be shared across Accounts and VPCs
Additionally, Amazon FSx can also be used for providing Continuously Available (CA) File Shares for Microsoft SQL Server
Monitoring and audit
Amazon CloudWatch
Amazon CloudWatch is a monitoring and observability service built for DevOps engineers, developers, site reliability engineers (SREs), and IT managers. CloudWatch provides you with data and actionable insights to monitor your applications, respond to system-wide performance changes, optimize resource utilization, and get a unified view of operational health. CloudWatch collects monitoring and operational data in the form of logs, metrics, and events, providing you with a unified view of Amazon resources, applications, and services that run on Amazon and on-premises servers. You can use CloudWatch to detect anomalous behavior in your environments, set alarms, visualize logs and metrics side by side, take automated actions, troubleshoot issues, and discover insights to keep your applications running smoothly.
Amazon CloudTrail
Amazon CloudTrail is a service that enables governance, compliance, operational auditing, and risk auditing of your Amazon account. With CloudTrail, you can log, continuously monitor, and retain account activity related to actions across your Amazon infrastructure. CloudTrail provides event history of your Amazon account activity, including actions taken through the Amazon Management Console, Amazon SDKs, command line tools, and other Amazon services. This event history simplifies security analysis, resource change tracking, and troubleshooting. In addition, you can use CloudTrail to detect unusual activity in your Amazon accounts. These capabilities help simplify operational analysis and troubleshooting.