Amazon Web Services Data Exchange datashares - Amazon Redshift
Services or capabilities described in Amazon Web Services documentation might vary by Region. To see the differences applicable to the China Regions, see Getting Started with Amazon Web Services in China (PDF).

Amazon Web Services Data Exchange datashares

You can use Amazon Web Services Data Exchange datashares to manage billing for Amazon Redshift data sharing.

An Amazon Web Services Data Exchange datashare is a unit of licensing for sharing your data through Amazon Web Services Data Exchange. Amazon manages all billing and payments associated with subscriptions to Amazon Web Services Data Exchange and use of Amazon Redshift data sharing. Approved data providers can add Amazon Web Services Data Exchange datashares to Amazon Web Services Data Exchange products. When customers subscribe to a product with Amazon Web Services Data Exchange datashares, they get access to the datashares in the product.

Amazon Web Services Data Exchange for Amazon Redshift makes it convenient to license access to your Amazon Redshift data through Amazon Web Services Data Exchange. When a customer subscribes to a product with Amazon Web Services Data Exchange datashares, Amazon Web Services Data Exchange automatically adds the customer as a data consumer on all Amazon Web Services Data Exchange datashares included with the product. Invoices are automatically generated, and payments are centrally collected and automatically disbursed through Amazon Marketplace Entitlement Service.

Providers can license data in Amazon Redshift at a granular level, such as schemas, tables, views, and user-defined functions. You can use the same Amazon Web Services Data Exchange datashare across multiple Amazon Web Services Data Exchange products. Any objects added to the Amazon Web Services Data Exchange datashare is available to consumers. Producers can view all Amazon Web Services Data Exchange datashares managed by Amazon Web Services Data Exchange on their behalf using Amazon Redshift API operations, SQL commands, and the Amazon Redshift console. Customers who subscribe to a product Amazon Web Services Data Exchange datashares have read-only access to the objects in the datashares.

Customers who want to consume third-party producer data can browse the Amazon Web Services Data Exchange catalog to discover and subscribe to datasets in Amazon Redshift. After their Amazon Web Services Data Exchange subscription is active, they can create a database from the datashare in their cluster and query the data in Amazon Redshift.

How Amazon Web Services Data Exchange datashares work

Managing Amazon Web Services Data Exchange datashares as a producer administrator

If you are a data producer (also known as a provider on Amazon Web Services Data Exchange), you can create Amazon Web Services Data Exchange datashares that connect to your Amazon Redshift databases. To add Amazon Web Services Data Exchange datashares to products on Amazon Web Services Data Exchange, you must be a registered Amazon Web Services Data Exchange provider.

For more information on how to get started with Amazon Web Services Data Exchange datashares, see Sharing licensed Amazon Redshift data on Amazon Web Services Data Exchange.

Using Amazon Web Services Data Exchange datashares as a consumer with an active Amazon Web Services Data Exchange subscription

If you are a consumer with an active Amazon Web Services Data Exchange subscription (also known as a subscriber on Amazon Web Services Data Exchange), you can browse the Amazon Web Services Data Exchange catalog on the Amazon Web Services Data Exchange console to discover products containing Amazon Web Services Data Exchange datashares.

After you subscribe to a product that contains Amazon Web Services Data Exchange datashares, create a database from the datashare within your cluster. You can then query the data in Amazon Redshift directly without extracting, transforming, and loading the data.

For more information on how to get started with Amazon Web Services Data Exchange datashares, see Sharing licensed Amazon Redshift data on Amazon Web Services Data Exchange.

For Amazon Web Services Data Exchange datashares, consider the following:

  • When a producer cluster is deleted, Amazon Redshift deletes the datashares created by the producer cluster. When a producer cluster is backed up and restored, the created datashares still persist on the restored cluster. For data subscribers to be able to continue accessing the data, create the Amazon Web Services Data Exchange datashares again and publish them to the product's data sets. The consumer database on the consumer cluster points to the datashare from the original cluster where the snapshot is taken. To query the shared data from the restored cluster, the consumer administrator creates a different database, or drops and recreates an existing consumer database to use the newly created Amazon Web Services Data Exchange datashare from the newly restored cluster.

  • When a consumer cluster is deleted and restored from a snapshot, the previous access shared to this cluster remains valid and visible. consumer administrator must drop any stale consumer databases created from the inactive datashares and recreate the consumer database from the datashare after the producer re-grants the permissions. As the namespace GUID is different on a restored cluster from the original cluster, re-grant datashare permissions when the producer cluster is restored from backup.

  • We recommend that you don't delete your cluster if you have any Amazon Web Services Data Exchange datashares. Performing this type of alteration can breach data product terms in Amazon Web Services Data Exchange.

Datashare producers and consumers

Data producers (also known as data sharing producers or datashare producers) are clusters that you want to share data from. producer administrators and database owners can create datashares using the CREATE DATASHARE command. You can add objects such as schemas, tables, views, and SQL user-defined functions (UDFs) from a database that you want the producer cluster to share with consumer clusters.

Data producers (also known as providers on Amazon Web Services Data Exchange) for Amazon Web Services Data Exchange datashares can license data through Amazon Web Services Data Exchange. Approved providers can add Amazon Web Services Data Exchange datashares to Amazon Web Services Data Exchange products.

When a customer subscribes to a product with Amazon Web Services Data Exchange datashares, Amazon Web Services Data Exchange automatically adds the customer as a data consumer on all Amazon Web Services Data Exchange datashares included with the product. Amazon Web Services Data Exchange also removes all customers from Amazon Web Services Data Exchange datashares when their subscription ends. Amazon Web Services Data Exchange also automatically manages billing, invoicing, payment collection, and payment distribution for paid products with Amazon Web Services Data Exchange datashares. For more information, see Amazon Web Services Data Exchange datashares. To register as an Amazon Web Services Data Exchange data provider, see Getting started as a provider.

Data consumers (also known as data sharing consumers or datashare consumers) are clusters that receive datashares from producer clusters.

Amazon Redshift clusters that share data can be in the same or different Amazon Web Services accounts or different Amazon Web Services Regions, so you can share data across organizations and collaborate with other parties. consumer administrators receive the datashares that they are granted usage for and review the contents of each datashare. To consume shared data, the consumer administrator creates an Amazon Redshift database from the datashare. The administrator then assigns permissions for the database to users and roles in the consumer cluster. After permissions are granted, users and roles can list the shared objects as part of the standard metadata queries, along with the local data on the consumer cluster. They can start querying immediately.

If you are a consumer with an active Amazon Web Services Data Exchange subscription (also known as subscribers on Amazon Web Services Data Exchange), you can find, subscribe to, and query granular, up-to-date data in Amazon Redshift without the need to extract, transform, and load the data. For more information, see Amazon Web Services Data Exchange datashares.

Considerations when using Amazon Web Services Data Exchange for Amazon Redshift

When using Amazon Web Services Data Exchange for Amazon Redshift, consider the following:

  • Both producers and consumers must use the RA3 instance types to use Amazon Redshift datashares. Producers must use the RA3 instance types with the latest Amazon Redshift cluster version.

  • Both the producer and consumer clusters must be encrypted.

  • You must be registered as an Amazon Web Services Data Exchange provider to list products on Amazon Web Services Data Exchange, including products that contain Amazon Web Services Data Exchange datashares. For more information, see Getting started as a provider.

  • You don't need to be a registered Amazon Web Services Data Exchange provider to find, subscribe to, and query Amazon Redshift data through Amazon Web Services Data Exchange.

  • To control access to your data, create Amazon Web Services Data Exchange datashares with the publicly accessible setting turned on. To alter an Amazon Web Services Data Exchange datashare to turn off the publicly accessible setting, set the session variable to allow ALTER DATASHARE SET PUBLICACCESSIBLE FALSE. For more information, see ALTER DATASHARE usage notes.

  • Producers can't manually add or remove consumers from Amazon Web Services Data Exchange datashares because access to the datashares is granted based on having an active subscription to an Amazon Web Services Data Exchange product that contains the Amazon Web Services Data Exchange datashare.

  • Producers can't view the SQL queries that consumers run. They can only view metadata, such as the number of queries or the objects consumers query, through Amazon Redshift tables that only the producer can access. For more information, see Monitoring and auditing data sharing in Amazon Redshift.

  • We recommend that you make your datashares publicly accessible. If you don't, subscribers on Amazon Web Services Data Exchange with publicly accessible consumer clusters won't be able to use your datashare.

  • We recommend that you don't delete an Amazon Web Services Data Exchange datashare shared to other Amazon Web Services accounts using the DROP DATASHARE statement. If you do, the Amazon Web Services accounts that have access to the datashare will lose access. This action is irreversible. Performing this type of alteration can breach data product terms in Amazon Web Services Data Exchange. If you want to delete an Amazon Web Services Data Exchange datashare, see DROP DATASHARE usage notes.

  • For cross-Region data sharing, you can create Amazon Web Services Data Exchange datashares to share licensed data.

  • When consuming data from a different Region, the consumer pays the Cross-Region data transfer fee from the producer Region to the consumer Region.