AWS Announces Next Generation Graviton3 Processors

AWS Announces Next Generation Graviton3 Processors

In 2018 Amazon Web Services unveiled the first generation of its AWS Graviton chip-powered instances.

At the AWS re: Invent 2021 conference in Las Vegas, AWS announced the third generation of Graviton processors – the AWS Graviton3.

The intention was to build a processor that was optimized specifically for modern workloads. To improve the performance, reduce the cost, and increase the efficiency for the majority of workloads in the cloud.

AWS Graviton3 will power all new Amazon Elastic Compute 2 (EC2) C7g instances that are now available in preview.

Best Magento Hosting now

What is AWS Graviton3?

Graviton3 Performance Gains

Comprehensive performance improvements across computing dimensions

Compute Performance

0%

General workload improvement

Floating-Point

2X

Scientific computing boost

Cryptographic

2X

Security operations

ML Performance

3X

AI/ML workload acceleration

Memory Bandwidth

0%

Memory-intensive apps

Energy Efficiency

0%

Power per operation

Key Insight: The 3X ML performance boost with bfloat16 support positions Graviton3 as a game-changer for AI workloads, while maintaining the same 100W power envelope through sophisticated architectural optimizations.

What is AWS Graviton3?

AWS Graviton3 is the latest in the Graviton family of processors. It is 60% more efficient than its predecessor.

Graviton3 will deliver up to 25% more compute performance. It offers up to 2x as much floating-point & cryptographic performance.

On the machine learning side, Graviton3 includes support for bfloat16 data. Which is a truncated 32-bit floating-point value.

Graviton3 is ideal for machine learning where speed is more important than precision. It will deliver up to 3x better machine learning performance.

Overall the performance claim is for a more modest 25%.

AWS Graviton3 performance & acceleration

Graviton3 achieves 50% higher memory bandwidth. This is done while keeping the entire CPU power consumption in that same ~100W range.

Amazon is using a chiplet design with 7 different dies. They are packaged using advanced packaging. The size of the bumps connecting each die is <55um. Whereas every CPU from Intel and AMD is still at >100um.

Amazon is keeping costs down on the IP side by leveraging ARM’s stock cores and Synopsys / Cadence IP. The price-performance advantage of Graviton3 is huge as it extends beyond just vertical integration.

Graviton servers feature a Nitro card capable of managing three Graviton3 processors.

Graviton3 Specifications

Graviton3 vs Graviton2

Architectural Evolution Through 25 Billion Extra Transistors

83%
More Transistors
60%
Higher IPC
50%
More Memory BW
Transistor Count +83%
Graviton3
55B
Graviton2
30B
Instructions per Cycle +60%
Graviton3
8 IPC
Graviton2
5 IPC
Memory Bandwidth per Core +50%
Graviton3
Enhanced
Graviton2
Baseline
Clock Speed
2.6 GHz
vs 2.5 GHz +4%
Core Count
64 cores
Same as G2 UNCHANGED
Graviton3 Architecture
Neoverse V1/N2 Cores
DDR5 Memory (First in Cloud)
256-bit SVE Support
2x Data per Clock Cycle
Graviton2 Architecture
Neoverse N1 Cores
DDR4 Memory
128-bit NEON
Standard Processing Width

AWS invested 25 billion extra transistors not in adding cores, but in making each core substantially more capable - addressing the memory bandwidth bottleneck that limited Graviton2 workloads.

Graviton3 Specifications

  • 2.6 GHz clock speed
  • 300 GB/sec max memory bandwidth
  • DDR5 RAM
  • 64 cores
  • Seven silicon die chiplet-based design
  • 256-bit SVE (Scalable Vector Extension)
  • 55 billion transistors (Graviton2: 30 billion)

Benefits of Graviton3

  • 25% higher performance
  • 50% more memory bandwidth
  • Support for bfloat16 data
  • 2x floating-point
  • 2x cryptographic performance
  • 3x machine learning performance
  • 60% less energy consumption
  • Enhanced security

Pointer Authentication on AWS Graviton3: Enhancing Security

Graviton3 Advanced Architecture

Silicon-level security meets cutting-edge packaging technology

SECURITY

Hardware Pointer Authentication

Cryptographic signing of return addresses with secret keys prevents stack-based attacks

Blocks ROP attacks by validating signatures before execution

SECURITY

Always-On Memory Encryption

Continuous protection of data in memory against side-channel attacks

Critical for multi-tenant cloud security

ARCHITECTURE

7-Die Chiplet Design

Advanced multi-die architecture for improved modularity and manufacturing yield

Enables mix-and-match silicon optimization

ADVANTAGE

Sub-55um Interconnects

Superior to Intel and AMD processors still using greater than 100um connections

50% better memory bandwidth enabled

INTEGRATION

Nitro Card Management

Single Nitro card manages three Graviton3 processors for data center efficiency

Higher density and resource utilization

ML

bfloat16 Data Format

Optimized for machine learning inference where speed matters more than precision

3x better ML performance for inference

PERFORMANCE

256-bit SVE Support

Scalable Vector Extensions for enhanced SIMD parallel processing capabilities

Accelerates scientific and HPC workloads

STRATEGY

Cost-Effective IP Model

Leveraging ARM stock cores with Synopsys and Cadence IP for optimal economics

Focus engineering on system optimization

Comprehensive Cloud Processor Innovation

Graviton3 combines hardware-level security, advanced packaging technology, and strategic IP licensing to deliver superior price-performance that extends beyond vertical integration, setting new standards for cloud computing infrastructure.

AWS Graviton3: Enhancing Security

Graviton3 includes a new pointer authentication feature to improve security. Before return addresses are pushed onto the stack, they’re first signed with a secret key.
The generated key is additional context information, including the current value of the stack pointer.

When the signed addresses are popped off the stack, they’re validated before being used.

An exception is raised if the address isn’t valid. It works to block attacks by overwriting the stack contents with the address of harmful code.

The Graviton 3 also supports always-on memory encryption.

Graviton3 Improvements Over Graviton2

Graviton3 Performance Gains

Real-world benchmark results showing performance improvements across different workload types

TOP PERFORMER

Nginx Web Server

+60%

Exceptional gains for web serving workloads, enabling 60% more concurrent users

Redis Database

+25%

Solid improvement for memory-intensive caching operations

General Workloads

+25% baseline improvement

Energy Efficiency

60% better than Graviton2

Technical Capabilities

300 GB/s
Memory Bandwidth
DDR5 Technology
30 Gbps
Network Bandwidth
C7g with EFA
HPC Ready
Optimized For
ML, EDA, Analytics

Performance gains vary by workload architecture - Nginx benefits from doubled IPC while Redis leverages 50% memory bandwidth increase

Graviton3 Improvements Over Graviton2

Graviton2 was first previewed in late 2019. Most workloads running on Graviton2 could run more efficiently if they had:

  • More memory bandwidth
  • Lower latency access to memory

In Graviton3, rather than using our extra transistors to pack more cores, AWS uses transistors to improve memory performance.

Real world cloud workloads are using more memory bandwidth. They’re becoming more sensitive to memory latency.

Graviton2 instances already had a lot of memory bandwidth per VCPU. More memory bandwidth has been added to Graviton3 instances. Each Graviton3 core has 50% more memory bandwidth than Graviton2.

Graviton3 Improvements Over Graviton2 -More meory bandwidth

Graviton2 was 64-core and at 2.5 GHz clock speed. Graviton3 has greater width, meaning that twice as much data can be processed in a single clock cycle.

The number of instructions that each core can work on concurrently has increased from 5 to 8 per cycle.

Graviton2 had Arm's Neoverse N1 cores. Graviton3 has Neoverse N2 or Neoverse V1 cores.

Graviton3-powered instances are up to 60% more energy efficient than the Graviton2-powered EC2 instances.

Graviton3 Improvements Over Graviton2

A slide shown by DeSantis shows gains from 25% for a Redis application, to 60% for Nginx.

Amazon EC2 C7g Instances

Amazon EC2 C7g instances

Amazon announced the preview of the EC2 C7g instances, the new instance type running the latest Arm-based processors.

AWS Chief Evangelist Jeff Barr said that the instances should be useful for compute-intensive workloads.

Such as:

  • HPC
  • Batch processing
  • Electronic design automation (EDA)
  • Media encoding
  • Scientific modeling
  • Ad serving
  • Distributed analytics
  • CPU-based machine learning inferencing

C7g is the only Graviton3 VM instance that supports Elastic Fabric Adaptor (EFA). It will offer up to 30 Gbps of network bandwidth.

C7g instances are the first in the cloud industry to be equipped with DDR5 memory.

Another feature of the C7g instance is that it is built on the AWS Nitro System.

The new instances will be available in multiple sizes (including bare metal).

Conclusion

AWS Graviton3 will have a significant impact on performance in current generation instances.

It aims to deliver the best performance and the lowest cost for modern workloads.

AWS Graviton3-powered instances, including C7g, are supported by many popular Linux operating systems. Such as Amazon Linux 2, Red Hat Enterprise Linux, Suse Enterprise Linux Server, and Ubuntu.

Many popular applications and services for security, monitoring, management, containers, and CI/CD also support AWS Graviton-based instances.

Users can do the sums and migrate to Graviton3 where applications are compatible.

The C7g instances are available in preview. It will likely be a few months before general availability.

To try the new instances, customers can sign up to join the preview.

Shraddha S.
Shraddha S.
Head of Content

Shraddha Singh has a lot of thoughts about Technology and the Cloud Services Industry. An Indian native and a professional Technical writer, she gets her management skills from IIT-B.


Get the fastest Magento Hosting! Get Started