Amazon EMR 7.2.0 - Hadoop release notes
Amazon EMR 7.2.0 - Hadoop changes
Type | Description |
---|---|
New Feature |
HADOOP-18850 |
New Feature |
Support S3 Glacier read restored objects |
Improvement |
Dynamically Scaling S3A maximum connections based upon the cores when running Spark Jobs |
Improvement |
HADOOP-18797 |
Improvement |
HADOOP-19047 |
Improvement |
Add "Export", "Publish" & "AbortPublish" method in MagicV2 Committer |
Improvement |
Implement ReadFullyIntoBuffers FileSystem API in S3A |
Bug Fix |
HADOOP-18793 |
Upgrade |
HADOOP-19008 |
Upgrade |
HADOOP-18613 |
Amazon EMR 7.2.0 - Hadoop features
See the following list for new Hadoop features in Amazon EMR 7.2.0.
-
S3 glacier support to read restored objects – With Amazon EMR release 7.2.0 and higher, you can read restored Glacier objects from the S3 location of the table with the
S3A
protocol. This read operation ignores archived Glacier files if they still in progress of being restored. -
Support concurrent writes with the S3A magic committer – Amazon EMR 7.2.0 introduces support for concurrent writes with the S3A magic committer. This feature was added to the OSS Hadoop community as part of HADOOP-18797
.