Data integration - General SAP Guides
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Data integration

RISE with SAP Extensibility for Data Integration with Amazon is a technical framework that enables data flow between SAP systems, Amazon services, and third-party solutions. This integration architecture provides standardized APIs, connectors, and protocols to establish secure communication channels, addressing the critical need for seamless enterprise data integration in modern cloud environments

The RISE with SAP Extensibility for Data Integration outlines two primary data handling and integration mechanisms:

  1. Data Extraction

  2. Data Federation

Data Extraction

Data extraction from SAP is a crucial step in making the data usable for reporting, analysis, and integration with other systems.

Data extraction

Extracting data from SAP using Amazon Managed Services

Data extraction using Managed Services

Amazon AppFlow

The Amazon AppFlow SAP OData connector provides the ability to fetch, create, and update records exposed by SAP S/4HANA and SAP on premises systems through OData APIs. When you connect Amazon AppFlow to ODP providers, you can create flows that run full data transfers or incremental updates. Incremental updates for ODP data are efficient because they transfer only those records that changed since the prior flow run.

The SAP OData connector for Amazon AppFlow user guide provides detailed instructions for setting up AppFlow, covering everything from establishing connectivity to transferring data from SAP.

Amazon Glue

Amazon Glue is a serverless data integration service that makes it easy for analytics users to discover, prepare, move, and integrate data from multiple sources. With Amazon Glue, you can discover and connect to SAP using OData and manage your data in a centralized data catalog. You can visually create, run, and monitor extract, transform, and load (ETL) pipelines to load SAP data into your data lakes and data warehouses.

The Connecting to SAP OData using Glue user guide offers comprehensive instructions for setting up Glue ETL jobs, configuring SAP OData connections, and reading data from SAP, including handling incremental transfers. Note: Key differences of Amazon AppFlow vs Amazon Glue for SAP OData

Amazon AppFlow and Amazon Glue serve distinct roles in data integration, with each offering unique advantages for different use cases. While Amazon Glue excels in complex ETL operations, data discovery, preparation, and extraction, particularly for specialized scenarios like SAP ODP-based extraction. AppFlow is designed as a more streamlined, no-code solution for API-based data transfers.

Amazon Glue requires more hands-on management, including code deployment and maintenance, but offers greater flexibility and control over data transformation processes. In contrast, AppFlow provides a user-friendly, point-and-click interface with pre-built connectors for popular SaaS applications, enabling bidirectional data transfers without requiring complex API implementations.

Amazon SDK for ABAP

Amazon SDK for ABAP simplifies the use of Amazon services alongside SAP applications with a client library of modules that are consistent and familiar to ABAP developers. With its capability to consume Amazon Services through ABAP code, it allows you to build ETL data pipelines and replication from SAP ABAP Stack to Amazon destinations such as S3 buckets, Redshifts, and others.

Extracting data from SAP using SAP services like SAP Datasphere and SAP Data Services

Data extraction using SAP Services

SAP Datasphere

SAP Datasphere offers various connection types such as SAP ABAP Connections, SAP ECC Connections, SAP S/4HANA Cloud Connections supporting RFC and ODP protocols. Refer to SAP Datasphere documentation to choose most appropriate connectivity to extract SAP data. Using premium outbound integration for [Amazon Simple Storage Connection (Amazon S3), configure SAP Datasphere replication flow to ingest data to Amazon S3.

SAP Data Services

SAP Data Services offer various connections to extract data from SAP ECC data. Refer to SAP Data Services documentation to choose most appropriate connectivity. SAP Data Services offers Amazon Redshift Datastore and Amazon S3 datastore to ingest data to Amazon. It also offers options for Amazon S3 file location protocol such as encryption type, compression type, batch-size, number of threads, Amazon S3 storage class, etc.

Extracting data from SAP using Amazon Partner Solutions

Amazon Partner Solutions offer ready to deploy solutions with enhanced features, such as pre-built connectors, specialized data pipelines, and advanced optimization techniques that reduce complexity and improve the speed of deployment.

To find and deploy a solution that fits your specific needs, you can explore the Amazon Partner Solutions Finder or browse through the Amazon Marketplace, where you can search for and quickly deploy partner solutions tailored to your unique SAP use case.

Further Resources

The Guidance for SAP Data Integration and Management on Amazon provides the essential data foundation to build data and analytics solutions. It shows how to integrate data from SAP ERP source systems and Amazon in real-time or batch mode, with change data capture, using Amazon services, SAP products, and Amazon Partner Solutions. This Guidance includes an overview reference architecture showing how to ingest SAP systems to Amazon in addition to five detailed architectural patterns that complement SAP-supported mechanisms (such as OData, ODP, SLT, and BTP) using Amazon services that are highlighted above, SAP products, and Amazon Partner Solutions.

Data Federation

Data federation is a data management strategy that enables seamless integration and effective utilization of data from disparate data sources. When there is business requirement to have a consolidated data for transactional, analytics, machine learning, it is preferred for the data to be accessed from the source rather than replicated to avoid latency, inconsistency and extra storage cost.

In the context of SAP and Amazon services, it allows organizations to access, combine, and analyze data from both SAP systems and Amazon cloud services seamlessly.

Data Federation

Data federation using Amazon Managed Services

Amazon Athena

Amazon Athena is a serverless, scalable and flexible interactive query service by Amazon that allows to analyze data directly in Amazon S3. The data stored in Amazon S3 from multiple sources can be further transformed into tables and views using Amazon Athena and queried to extract meaningful information in a structured way.

Data in Athena can be accessed from SAP Datasphere through data federation from SAP Datasphere connections. Users can also access SAP Datasphere tables and views from Athena by querying SAP HANA using an Athena Federated Query.

Data can also be federated to the SAP HANA Cloud by configuring Athena as a remote source using the Smart Data Access – Athena adapter. The Athena Federated Query connection can also be used to read data from a stand-alone SAP HANA Cloud environment.

Amazon Redshift

Amazon Redshift is a fully managed, peta-byte scale data warehouse service from Amazon. Customers have built their data warehouses and build data models for analytics and reporting.

Data federation from Amazon Redshift into SAP Datasphere is possible with SAP HANA Smart Data Integration (SDI) or the SAP Data Provisioning Agent. Amazon Redshift data can also be federated through the Athena Federated Query data source connector.

Further resources

The Guidance for Data Federation between SAP and Amazon outlines the process of federating data between SAP and Amazon cloud analytics services, enabling you to establish a data mesh architecture. By federating data between SAP and Amazon. you can easily transform and visualize your data in a scalable, secure, and cost-effective way, helping you inform your decision-making.