Side-by-side, with the numbers that matter.

Two rigs, one table. The higher number wins and is highlighted in cobalt; for TDP and price, the lower number wins.

AWS · cloud-model

Intel · accelerator

0 of 25 fields comparable

Spec	AWS Trainium2	Intel Gaudi 3 HL-325L
BF16 TFLOPS	—	1,835
BF16 TFLOPS per chip	667	—
Chips per Trn2 instance	16	—
Chips per UltraServer	64	—
Ethernet port speed (Gbps)	—	200
Ethernet ports	—	24
Form factor	—	OAM
FP8 dense PFLOPS per chip	1.3	—
FP8 sparse PFLOPS per chip	5.2	—
FP8 TFLOPS	—	1,835
HBM (GB)	—	128
HBM bandwidth (TB/s) per chip	2.9	—
HBM type	—	HBM2E
HBM3e capacity (GB) per chip	96	—
Memory bandwidth (TB/s)	—	3.7
MME engines	—	8
NeuronCores per chip	8	—
Process node (nm)	—	5
SRAM (MB)	—	96
TDP (W, lower is better)	—	900
TPC engines	—	64
Trn2 instance FP8 PFLOPS	20.8	—
UltraServer FP8 PFLOPS	83.2	—
UltraServer total HBM bandwidth (TB/s)	185	—
UltraServer total HBM (TB)	6	—
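
The table reports no directly comparable fields, apparently because the two columns describe the same quantities under different names and at different scopes (per chip, per instance, per UltraServer). The sketch below is one way to cross-check the arithmetic: it scales the Trainium2 per-chip figures to the instance and UltraServer totals, then lines the per-chip figures up against Gaudi 3. The dictionary keys and function names are illustrative, not taken from either vendor. It also assumes the Gaudi 3 FP8 figure is a dense number, and that the 6 TB total uses 1 TB = 1,024 GB.

```python
# Per-chip figures copied from the table; key names are illustrative only.
TRAINIUM2_PER_CHIP = {
    "bf16_tflops": 667,
    "fp8_tflops": 1300,   # 1.3 PFLOPS dense, expressed in TFLOPS
    "hbm_gb": 96,
    "hbm_bw_tbs": 2.9,
}
GAUDI3_PER_CHIP = {
    "bf16_tflops": 1835,
    "fp8_tflops": 1835,   # assumed to be a dense figure
    "hbm_gb": 128,
    "hbm_bw_tbs": 3.7,
}
CHIPS_PER_INSTANCE = 16
CHIPS_PER_ULTRASERVER = 64


def trn2_aggregates(chips):
    """Scale Trainium2 per-chip figures to a system with `chips` chips."""
    return {
        "fp8_pflops": TRAINIUM2_PER_CHIP["fp8_tflops"] * chips / 1000,
        "hbm_tb": TRAINIUM2_PER_CHIP["hbm_gb"] * chips / 1024,  # binary TB assumed
        "hbm_bw_tbs": TRAINIUM2_PER_CHIP["hbm_bw_tbs"] * chips,
    }


def per_chip_ratios():
    """Gaudi 3 figure divided by the Trainium2 figure for each shared key."""
    return {
        key: GAUDI3_PER_CHIP[key] / TRAINIUM2_PER_CHIP[key]
        for key in TRAINIUM2_PER_CHIP
    }


instance = trn2_aggregates(CHIPS_PER_INSTANCE)
ultraserver = trn2_aggregates(CHIPS_PER_ULTRASERVER)
ratios = per_chip_ratios()

print(instance["fp8_pflops"], ultraserver["fp8_pflops"])  # 20.8 83.2
for key, value in ratios.items():
    print(f"{key}: Gaudi 3 / Trainium2 = {value:.2f}")
```

The FP8 totals reproduce the table's 20.8 and 83.2 PFLOPS. The bandwidth total comes out at 185.6 TB/s, which suggests the table's 185 is truncated rather than rounded. The ratios compare one chip with one chip; they say nothing about how either part scales across a full system.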