NVIDIA Ampere vs. AMD RDNA 2: Architectural Battle

Two years after the launch of Turing in September 2020, NVIDIA changed the architecture of their graphics cards to Ampere . AMD did not stand aside and soon after that also updated the RDNA architecture to the second version.  





, RX 6000 NVIDIA. , , ( , ), .





, ? ?





. AMD 520 2, Navi. : Instinct MI100 750 2.





AMD Navi 21 Radeon R9 Fury Nano, GCN 3.0 Fiji. 596 2.





2018 AMD 7- TSMC, Vega 20 ( Radeon VII) 331 2. Navi , N7P.





, , NVIDIA, . Ampere, GA102, 628 2. 17% , , TU102: 754 2. NVIDIA GA100: , 7- 826 2. , .





NVIDIA Navi 21 , , โ€” . GA102 28,3 , AMD 5% โ€” 26,8 .





, , , , โ€” , . Navi 21 51,5 , GA102 โ€” 41,1 .





Navi 21 TSMC N7P, N7. GA102 NVIDIA Samsung. NVIDIA 8- ( 8N 8NN). 7 8 : , . 





, .





Ampere GA102 RDNA 2 Navi 21

, , , .





, . , , 3D- .





, , . , , , , , .





, , .





AMD Navi 21 NVIDIA GA102 . ,  





, NVIDIA : , . , . 





, Pascal GP106 ( GeForce GTX 1060) GP104 ( GeForce GTX 1070). , - . .





Pascal GP104 and GP106
Pascal GP104 GP106

NVIDIA . ? , PCI Express . , - , , , . , , (ROP).





, , , , . , 3D- โ€” , - . , , , , .





, , AMD Navi 21 Navi 10, Radeon RX 5700 XT. , , , . , Radeon RX 6800 XT 1825 2250 , , TDP, 300 . Radeon RX 5700 XT: 1605 , 1905 225 .





Ampere RDNA 2 , , AMD TSMC - โ€” Radeon RX 6800 Radeon VII .





Radeon VII โ€” AMD TSMC 7- , 64%. : Ampere GA102, NVIDIA TSMC.





, AMD NVIDIA , : , API, , . : , .





. RDNA 2 , : Graphics Command Processor, โ€” (ACE).





, . , , .





AMD Shader Engine (SE), NVIDIA (GPC): , .





: . . , - : , โ€” , โ€” .





RDNA 2 SE โ€” , :





  • Primitive Setup โ€” , () ;





  • โ€” ;





  • (ROP) โ€” , .





Primitive Setup 1 . , , 1,8 2,2 , . Ampere , .





AMD, NVIDIA . NVIDIA Raster Engines. , 1 , โ€” , .





SE Navi 21 128 ROP; GA102 NVIDIA 112 ROP. , AMD , ROP , . - , . SE/GPC.





Shader Engines AMD , (DCU), Navi 21 DCU SE โ€” , Workgroup Processors (WGP). Ampere GA102 (TPC), 6 TPC. 1 , NVIDIA , AMD, TPC, Navi 21 โ€” SE. , GA102 , 42 Primitive Setup, RDNA 2 AMD โ€” 4. Raster Engine TPC, GA102 7 , Navi 21 โ€” . , NVIDIA . 





โ€” (CU) RDNA 2 (SM) Ampere โ€” -ยซยป. 





, , , , . , , - .





 , , . , , , .





  CU SM AMD NVIDIA . , . 





NVIDIA

Turing Pascal, Ampere โ€” , . , , Turing CUDA SM.





Turing ( ), 16x INT32 16x FP32. 32- : INT , FP โ€” .





NVIDIA , SM Ampere 128 CUDA, , , โ€” , Turing . INT32 , . Ampere NVIDIA , FP32. , CUDA SM , .





INT/FP , SM Ampere 128 FP32 64 FP32 64 INT32 . Turing .





, FP32 . , . GeForce RTX 3080, GA102 68 SM.





, FP32 121% GeForce 2080 Ti, 31%. ? 





: , FP32.





NVIDIA Turing 2018 , , 36% , , INT32. , .





, FP32 , . SM , 32 , , FP32. SM Ampere , Turing.





, GeForce RTX 3080 11- FP32 2080 Ti INT+FP. , .





? SM , , Turing. (, FP16 FP16), โ€‹โ€‹ 32 .





Fine-Grained Structured Sparsity. , , . , , , .





: CUDA, , BVH , SM . , , .





, , Optix API NVIDIA.





, , , . , Turing , .





AMD โ€” RDNA 2?





-

, AMD : - SIMD32, SISD, . , . , AMD . 





โ€” , Ampere. BVH, RDNA 2 SIMD 32.





  , , , , , . : , .





  Ray Accelerator , . , , . , , . , NVIDIA SM, Ampere RDNA 2 , .





  AMD , , .





 , Gears 5 Radeon RX 6800 ( Navi 21 60 CU) 17%, Shadow of the Tomb Raider โ€” 52%. , NVIDIA RTX 3080 ( 68 SM GA102) 23% 40% .





  AMD, , , , .





  , RDNA 2 : , INT4 INT8. , AMD (CDNA) , DirectML.





API Microsoft DirectX 12. . NVIDIA, , DLSS. SM โ€” , DirectML, , . Turing, Ampere , FP16.





RDNA 2 โ€” , 32- 16-. 









AMD SIMD32 , . 32 , , . , , SM Ampere, 32 .





NVIDIA SM 128 FMA FP32 , RDNA 2 64 . FP16 128 FMA , , Ampere FP16.





SM NVIDIA (, 64 FP32 64 INT32) FP16, . AMD SIMD32, , .





, , Ampere : GA102 SM, Navi 21, , , . AMD .





Ampere. , : 2- 50% (Turing TU102 4096 , ), 1- SM . 





, - L1 , , . (, ) :





  • 64 ;





  • 48 ;





  • 16 .





, . NVIDIA Micron GDDR6 . , GDDR6, . , 1 , (PAM), GDDR6X :





PAM2 to GDDR6 (top) and PAM4 to GDDR6X (bottom)
PAM2 GDDR6 () PAM4 GDDR6X ()

GDDR6X 2 , . GeForce RTX 3090 24 GDDR6X, 19 /, 936 /. 52% , GeForce RTX 2080 Ti. HBM2, , GDDR6.





Micron, PAM4 , . AMD : , , , - . RDNA 2 โ€” .





. 128 - ? SRAM L3, AMD - 64 . 16 , 64 .





Infinity Cache 1,94 , 1986,6 /. DRAM, . , BVH , Infinity Cache .





, RDNA 2 , Zen 2: , (victim cache) . , , , DRAM.





, , 128 , Infinity Cache 32 L2. GDDR6 DRAM.





AMD โ€” , โ€” .









GDDR6X GA102 , . - 3- Navi 21 DRAM, .





AMD GDDR6 , , , GeForce RTX 3080 3090, Micron. GDDR6 , GDDR6X 8 .





RDNA 2, , , , Ampere, SRAM , DRAM, .  





. Ampere RDNA 2 mesh- variable rate- DirectX12 Ultimate, NVIDIA .





mesh- , . , .





variable rate-. , . , , , .





(ROP), , variable rate-. NVIDIA ROP 2- .





Turing ROP 512 . ROP , , Ampere ROP GPC. GA102 12 ROP GPC ( 1 ), 112 .





AMD , NVIDIA (. . - L2), ROP / . Navi 21 ROP 8 32- 4 64-.





RTX IO NVIDIA โ€” , , , CUDA.





, : , , , DRAM .





: . NVIDIA , ยซ100- ยป ยซ20- ยป, , , . 





AMD RDNA 2 Radeon RX 6000, Smart Access Memory. RTX IO โ€” , . PCI Express 256 . (BAR), 2008 PCI Express 2.0 , . , , DRAM . 





, , , . Windows Ryzen 5000, 500 Radeon RX 6000.





: 15% 4K. , NVIDIA , RTX 3000 .





-

HDMI 2.1 DisplayPort 1.4a. , 4K 240 HDR 8K 60 . 4:2:0, DSC 1.2a. , . HDMI 2.1 6 / 4K 60 . 





LG CK 48-inch OLED Monitor: 4K @ 120Hz requires HDMI 2.1
48- OLED- LG CK: 4K 120 HDMI 2.1

Ampere RDNA 2 (FreeSync AMD, G-Sync NVIDIA), .





, , 8K AV1, 4K H.264 8K H.265, , . . , - .





,

AMD NVIDIA . , 3D- , .





NVIDIA , Ampere: GA100, GA102 GA104.





GA104 used in GeForce RTX 3060 Ti
GA104 GeForce RTX 3060 Ti

โ€” GA102. GA100 โ€” .





CUDA INT32+FP32 โ€” FP64, load/store - L1/L2. , .





GA102/104, , , NVIDIA: , , . Ampere โ€” .





Arcturus CDNA 750 mm2
Arcturus CDNA 750 2

RDNA 2 , , Ampere. AMD GCN .





, RDNA 2 Big Navi, , CDNA Big Vega: Instinct MI100 Arcturus โ€” 50- 128 . 





NVIDIA Quadro Tesla, Navi 21 . - , Ampere ?





: . 





AMD Radeon RX 6900 XT, Navi 21 ( CU), , GeForce RTX 3090 . GA102 , NVIDIA ยซยป-, Turing .





, , RDNA 2 Xbox Series X/S PlayStation 5, . , GCN Xbox One PlayStation 4, , , , .





, 2013 , , GCN 1.0, . Xbox One X, 2017 , GCN 2.0, 3 . 





โ€” , Xbox One PS4, , AMD? . , RDNA 2 , .





, , . NVIDIA AMD , : Ampere , RDNA 2 โ€” .





, . , : Intel Xe. , , , .








All Articles