Apache Spark
Apache Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. You can use Apache Spark to build stream processing applications that consume the data in your Kinesis data streams.
To consume Kinesis data streams using Apache Spark Structured Streaming, use the
Amazon Kinesis Data Streams connector
To consume Kinesis data streams using Spark Streaming, see Spark
Streaming + Kinesis Integration