Engineering:Ampere (microarchitecture)

Short description: GPU microarchitecture by Nvidia

Ampere is the codename for a graphics processing unit (GPU) microarchitecture developed by Nvidia as the successor to both the Volta and Turing architectures. It was officially announced on May 14, 2020 and is named after French mathematician and physicist André-Marie Ampère.^[1]^[2]

Nvidia announced the Ampere architecture GeForce 30 series consumer GPUs at a GeForce Special Event on September 1, 2020.^[3]^[4] Nvidia announced the A100 80GB GPU at SC20 on November 16, 2020.^[5] Mobile RTX graphics cards and the RTX 3060 based on the Ampere architecture were revealed on January 12, 2021.^[6]

Nvidia announced Ampere's successor, Hopper, at GTC 2022, and "Ampere Next Next" for a 2024 release at GPU Technology Conference 2021.

Details

Architectural improvements of the Ampere architecture include the following:

CUDA Compute Capability 8.0 for A100 and 8.6 for the GeForce 30 series^[7]
TSMC's 7 nm FinFET process for A100
Custom version of Samsung's 8 nm process (8N) for the GeForce 30 series^[8]
Third-generation Tensor Cores with FP16, bfloat16, TensorFloat-32 (TF32) and FP64 support and sparsity acceleration.^[9] The individual Tensor cores have with 256 FP16 FMA operations per second 4x processing power (GA100 only, 2x on GA10x) compared to previous Tensor Core generations; the Tensor Core Count is reduced to one per SM.
Second-generation ray tracing cores; concurrent ray tracing, shading, and compute for the GeForce 30 series
High Bandwidth Memory 2 (HBM2) on A100 40GB & A100 80GB
GDDR6X memory for GeForce RTX 3090, RTX 3080 Ti, RTX 3080, RTX 3070 Ti
Double FP32 cores per SM on GA10x GPUs
NVLink 3.0 with a 50Gbit/s per pair throughput^[9]
PCI Express 4.0 with SR-IOV support (SR-IOV is reserved only for A100)
Multi-instance GPU (MIG) virtualization and GPU partitioning feature in A100 supporting up to seven instances
PureVideo feature set K hardware video decoding with AV1 hardware decoding^[10] for the GeForce 30 series and feature set J for A100
5 NVDEC for A100
Adds new hardware-based 5-core JPEG decode (NVJPG) with YUV420, YUV422, YUV444, YUV400, RGBA. Should not be confused with Nvidia NVJPEG (GPU-accelerated library for JPEG encoding/decoding)

Chips

GA100^[11]
GA102
GA103
GA104
GA106
GA107

Comparison of Compute Capability: GP100 vs GV100 vs GA100^[12]

GPU features	NVIDIA Tesla P100	NVIDIA Tesla V100	NVIDIA A100
GPU codename	GP100	GV100	GA100
GPU architecture	NVIDIA Pascal	NVIDIA Volta	NVIDIA Ampere
Compute capability	6.0	7.0	8.0
Threads / warp	32	32	32
Max warps / SM	64	64	64
Max threads / SM	2048	2048	2048
Max thread blocks / SM	32	32	32
Max 32-bit registers / SM	65536	65536	65536
Max registers / block	65536	65536	65536
Max registers / thread	255	255	255
Max thread block size	1024	1024	1024
FP32 cores / SM	64	64	64
Ratio of SM registers to FP32 cores	1024	1024	1024
Shared Memory Size / SM	64 KB	Configurable up to 96 KB	Configurable up to 164 KB

Comparison of Precision Support Matrix^[13]^[14]

	FP16	FP32	FP64	INT1	INT4	INT8	TF32	BF16	FP16	FP32	FP64	INT1	INT4	INT8	TF32	BF16
	Supported CUDA Core Precisions								Supported Tensor Core Precisions
NVIDIA Tesla P4	No	Yes	Yes	No	No	Yes	No	No	No	No	No	No	No	No	No	No
NVIDIA P100	Yes	Yes	Yes	No	No	No	No	No	No	No	No	No	No	No	No	No
NVIDIA Volta	Yes	Yes	Yes	No	No	Yes	No	No	Yes	No	No	No	No	No	No	No
NVIDIA Turing	Yes	Yes	Yes	No	No	No	No	No	Yes	No	No	Yes	Yes	Yes	No	No
NVIDIA A100	Yes	Yes	Yes	No	No	Yes	No	Yes	Yes	No	Yes	Yes	Yes	Yes	Yes	Yes

Legend:

FPnn: floating point with nn bits
INTn: integer with n bits
INT1: binary
TF32: TensorFloat32
BF16: bfloat16

Comparison of Decode Performance

Concurrent streams	H.264 decode (1080p30)	H.265 (HEVC) decode (1080p30)	VP9 decode (1080p30)
V100	16	22	22
A100	75	157	108

A100 accelerator and DGX A100

The Ampere-based A100 accelerator was announced and released on May 14, 2020.^[9] The A100 features 19.5 teraflops of FP32 performance, 6912 CUDA cores, 40GB of graphics memory, and 1.6TB/s of graphics memory bandwidth.^[15] The A100 accelerator was initially available only in the 3rd generation of DGX server, including 8 A100s.^[9] Also included in the DGX A100 is 15TB of PCIe gen 4 NVMe storage,^[15] two 64-core AMD Rome 7742 CPUs, 1 TB of RAM, and Mellanox-powered HDR InfiniBand interconnect. The initial price for the DGX A100 was $199,000.^[9]

Products using Ampere

GeForce MX series
- GeForce MX570 (mobile) (GA107)
GeForce 20 series
- GeForce RTX 2050 (mobile) (GA107)
GeForce 30 series
- GeForce RTX 3050 Laptop GPU (GA107)
- GeForce RTX 3050 (GA106 or GA107)^[16]
- GeForce RTX 3050 Ti Laptop GPU (GA107)
- GeForce RTX 3060 Laptop GPU (GA106)
- GeForce RTX 3060 (GA106 or GA104)^[17]
- GeForce RTX 3060 Ti (GA104 or GA103)^[18]
- GeForce RTX 3070 Laptop GPU (GA104)
- GeForce RTX 3070 (GA104)
- GeForce RTX 3070 Ti Laptop GPU (GA104)
- GeForce RTX 3070 Ti (GA104 or GA102)^[19]
- GeForce RTX 3080 Laptop GPU (GA104)
- GeForce RTX 3080 (GA102)
- GeForce RTX 3080 12GB (GA102)
- GeForce RTX 3080 Ti Laptop GPU (GA103)
- GeForce RTX 3080 Ti (GA102)
- GeForce RTX 3090 (GA102)
- GeForce RTX 3090 Ti (GA102)
Nvidia Workstation GPUs (formerly Quadro)
- RTX A1000 (mobile) (GA107)
- RTX A2000 (mobile) (GA106)
- RTX A2000 (GA106)
- RTX A3000 (mobile) (GA104)
- RTX A4000 (mobile) (GA104)
- RTX A4000 (GA104)
- RTX A4500 (GA102)
- RTX A5000 (mobile) (GA104)
- RTX A5000 (GA102)
- RTX A5500 (GA102)
- RTX A6000 (GA102)
Nvidia Data Center GPUs (formerly Tesla)
- Nvidia A2 (GA107)
- Nvidia A10 (GA102)
- Nvidia A16 (4 × GA107)
- Nvidia A30 (GA100)
- Nvidia A40 (GA102)
- Nvidia A100 (GA100)
- Nvidia A100 80GB (GA100)
Jetson Orin SoCs
- Jetson Orin AGX
- Jetson Orin NX
- Jetson Orin Nano

Products using Ampere (per Chip)
	GA107	GA106	GA104	GA103	GA102	GA100
GeForce MX series	GeForce MX570 (mobile)	N/A	N/A	N/A	N/A	N/A
GeForce 20 series	GeForce RTX 2050 (mobile)	N/A	N/A	N/A	N/A	N/A
GeForce 30 series	GeForce RTX 3050 Laptop GeForce RTX 3050^[16] GeForce RTX 3050 Ti Laptop	GeForce RTX 3050 GeForce RTX 3060 Laptop GeForce RTX 3060	GeForce RTX 3060^[17] GeForce RTX 3060 Ti GeForce RTX 3070 Laptop GeForce RTX 3070 GeForce RTX 3070 Ti Laptop GeForce RTX 3070 Ti GeForce RTX 3080 Laptop	GeForce RTX 3060 Ti^[18] GeForce RTX 3080 Ti Laptop	GeForce RTX 3070 Ti^[19] GeForce RTX 3080 GeForce RTX 3080 Ti GeForce RTX 3090 GeForce RTX 3090 Ti	N/A
Nvidia Workstation GPUs	RTX A1000 (mobile)	RTX A2000 (mobile) RTX A2000	RTX A3000 (mobile) RTX A4000 (mobile) RTX A4000 RTX A5000 (mobile)	N/A	RTX A4500 RTX A5000 RTX A5500 RTX A6000	N/A
Nvidia Data Center GPUs	Nvidia A2 Nvidia A16	N/A	N/A	N/A	Nvidia A10 Nvidia A40	Nvidia A30 Nvidia A100

References

↑ Newsroom, NVIDIA. "NVIDIA's New Ampere Data Center GPU in Full Production". http://nvidianews.nvidia.com/news/nvidias-new-ampere-data-center-gpu-in-full-production.
↑ "NVIDIA Ampere Architecture In-Depth". May 14, 2020. https://developer.nvidia.com/blog/nvidia-ampere-architecture-in-depth/.
↑ "NVIDIA Delivers Greatest-Ever Generational Leap with GeForce RTX 30 Series GPUs" (in en-US). September 1, 2020. http://nvidianews.nvidia.com/news/nvidia-delivers-greatest-ever-generational-leap-in-performance-with-geforce-rtx-30-series-gpus.
↑ "NVIDIA GeForce Ultimate Countdown" (in en-US). https://www.nvidia.com/en-us/geforce/special-event/.
↑ "NVIDIA Doubles Down: Announces A100 80GB GPU, Supercharging World's Most Powerful GPU for AI Supercomputing" (in en-US). November 16, 2020. https://nvidianews.nvidia.com/news/nvidia-doubles-down-announces-a100-80gb-gpu-supercharging-worlds-most-powerful-gpu-for-ai-supercomputing.
↑ "NVIDIA GeForce Beyond at CES 2023". https://www.nvidia.com/en-us/geforce/special-event/.
↑ "I.7. Compute Capability 8.x" (in en-US). https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#compute-capability-8-x.
↑ Bosnjak, Dominik (September 1, 2020). "Samsung's old 8nm tech at the heart of NVIDIA's monstrous Ampere cards" (in en-US). https://www.sammobile.com/news/samsung-8nm-process-nvidia-geforce-rtx-30-ampere.
↑ ^9.0 ^9.1 ^9.2 ^9.3 ^9.4 Smith, Ryan (May 14, 2020). "NVIDIA Ampere Unleashed: NVIDIA Announces New GPU Architecture, A100 GPU, and Accelerator". AnandTech. https://www.anandtech.com/show/15801/nvidia-announces-ampere-architecture-and-a100-products.
↑ Delgado, Gerardo (September 1, 2020). "GeForce RTX 30 Series GPUs: Ushering In A New Era of Video Content With AV1 Decode" (in en-US). https://www.nvidia.com/en-us/geforce/news/rtx-30-series-av1-decoding/.
↑ Morgan, Timothy Prickett (May 29, 2020). "Diving Deep Into The Nvidia Ampere GPU Architecture" (in en-US). https://www.nextplatform.com/2020/05/28/diving-deep-into-the-nvidia-ampere-gpu-architecture/.
↑ "NVIDIA A100 Tensor Core GPU Architecture: Unprecedented Accerlation at Every Scale" (in en-US). https://www.nvidia.com/content/dam/en-zz/Solutions/Data-Center/nvidia-ampere-architecture-whitepaper.pdf.
↑ "NVIDIA Tensor Cores: Versatility for HPC & AI". https://www.nvidia.com/en-us/data-center/tensor-cores/.
↑ "Abstract". https://docs.nvidia.com/deeplearning/tensorrt/support-matrix/index.html.
↑ ^15.0 ^15.1 Tom Warren; James Vincent (May 14, 2020). "Nvidia's first Ampere GPU is designed for data centers and AI, not your PC". The Verge. https://www.theverge.com/2020/5/14/21258419/nvidia-ampere-gpu-ai-data-centers-specs-a100-dgx-supercomputer.
↑ ^16.0 ^16.1 Igor, Wallossek (February 13, 2022). "The two faces of the GeForce RTX 3050 8GB". https://www.igorslab.de/en/the-two-faces-of-geforce-rtx-3050-8gb-different-chips-and-different-thirsts/.
↑ ^17.0 ^17.1 Shilov, Anton (September 25, 2021). "Gainward and Galax List GeForce RTX 3060 Cards With GA104 GPU". https://www.tomshardware.com/news/ga104-based-geforce-rtx-3060-listed.
↑ ^18.0 ^18.1 Tyson, Mark (February 23, 2022). "Zotac Debuts First RTX 3060 Ti Desktop Cards With GA103 GPU". https://www.tomshardware.com/news/zotac-geforce-rtx-3060-ti-ga103.
↑ ^19.0 ^19.1 WhyCry (October 26, 2022). "ZOTAC launches GeForce RTX 3070 Ti with GA102-150 GPU" (in en-US). https://videocardz.com/newz/zotac-launches-geforce-rtx-3070-ti-with-ga102-150-gpu.

External links

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Ampere (microarchitecture). Read more

[1] Newsroom, NVIDIA. "NVIDIA's New Ampere Data Center GPU in Full Production". http://nvidianews.nvidia.com/news/nvidias-new-ampere-data-center-gpu-in-full-production.

[2] "NVIDIA Ampere Architecture In-Depth". May 14, 2020. https://developer.nvidia.com/blog/nvidia-ampere-architecture-in-depth/.

[3] "NVIDIA Delivers Greatest-Ever Generational Leap with GeForce RTX 30 Series GPUs" (in en-US). September 1, 2020. http://nvidianews.nvidia.com/news/nvidia-delivers-greatest-ever-generational-leap-in-performance-with-geforce-rtx-30-series-gpus.

[4] "NVIDIA GeForce Ultimate Countdown" (in en-US). https://www.nvidia.com/en-us/geforce/special-event/.

[5] "NVIDIA Doubles Down: Announces A100 80GB GPU, Supercharging World's Most Powerful GPU for AI Supercomputing" (in en-US). November 16, 2020. https://nvidianews.nvidia.com/news/nvidia-doubles-down-announces-a100-80gb-gpu-supercharging-worlds-most-powerful-gpu-for-ai-supercomputing.

[6] "NVIDIA GeForce Beyond at CES 2023". https://www.nvidia.com/en-us/geforce/special-event/.

[7] "I.7. Compute Capability 8.x" (in en-US). https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#compute-capability-8-x.

[8] Bosnjak, Dominik (September 1, 2020). "Samsung's old 8nm tech at the heart of NVIDIA's monstrous Ampere cards" (in en-US). https://www.sammobile.com/news/samsung-8nm-process-nvidia-geforce-rtx-30-ampere.

[anand-A100-9] 9.0 ^9.1 ^9.2 ^9.3 ^9.4 Smith, Ryan (May 14, 2020). "NVIDIA Ampere Unleashed: NVIDIA Announces New GPU Architecture, A100 GPU, and Accelerator". AnandTech. https://www.anandtech.com/show/15801/nvidia-announces-ampere-architecture-and-a100-products.

[10] Delgado, Gerardo (September 1, 2020). "GeForce RTX 30 Series GPUs: Ushering In A New Era of Video Content With AV1 Decode" (in en-US). https://www.nvidia.com/en-us/geforce/news/rtx-30-series-av1-decoding/.

[11] Morgan, Timothy Prickett (May 29, 2020). "Diving Deep Into The Nvidia Ampere GPU Architecture" (in en-US). https://www.nextplatform.com/2020/05/28/diving-deep-into-the-nvidia-ampere-gpu-architecture/.

[12] "NVIDIA A100 Tensor Core GPU Architecture: Unprecedented Accerlation at Every Scale" (in en-US). https://www.nvidia.com/content/dam/en-zz/Solutions/Data-Center/nvidia-ampere-architecture-whitepaper.pdf.

[13] "NVIDIA Tensor Cores: Versatility for HPC & AI". https://www.nvidia.com/en-us/data-center/tensor-cores/.

[14] "Abstract". https://docs.nvidia.com/deeplearning/tensorrt/support-matrix/index.html.

[verge-A100-15] 15.0 ^15.1 Tom Warren; James Vincent (May 14, 2020). "Nvidia's first Ampere GPU is designed for data centers and AI, not your PC". The Verge. https://www.theverge.com/2020/5/14/21258419/nvidia-ampere-gpu-ai-data-centers-specs-a100-dgx-supercomputer.

[igor-ga107-16] 16.0 ^16.1 Igor, Wallossek (February 13, 2022). "The two faces of the GeForce RTX 3050 8GB". https://www.igorslab.de/en/the-two-faces-of-geforce-rtx-3050-8gb-different-chips-and-different-thirsts/.

[3060-ga104-17] 17.0 ^17.1 Shilov, Anton (September 25, 2021). "Gainward and Galax List GeForce RTX 3060 Cards With GA104 GPU". https://www.tomshardware.com/news/ga104-based-geforce-rtx-3060-listed.

[3060ti-ga103-18] 18.0 ^18.1 Tyson, Mark (February 23, 2022). "Zotac Debuts First RTX 3060 Ti Desktop Cards With GA103 GPU". https://www.tomshardware.com/news/zotac-geforce-rtx-3060-ti-ga103.

[3070ti-ga102-19] 19.0 ^19.1 WhyCry (October 26, 2022). "ZOTAC launches GeForce RTX 3070 Ti with GA102-150 GPU" (in en-US). https://videocardz.com/newz/zotac-launches-geforce-rtx-3070-ti-with-ga102-150-gpu.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

Anonymous

Search

Engineering:Ampere (microarchitecture)

Namespaces

More

Page actions

Contents

Details

Chips

A100 accelerator and DGX A100

Products using Ampere

See also

References

External links

Navigation

Navigation

Help

Translate

Wiki tools

Wiki tools

Anonymous

Search

Engineering:Ampere (microarchitecture)

Details

Chips

A100 accelerator and DGX A100

Products using Ampere

See also

References

External links

Navigation

Wiki tools

Page tools

Other projects

Categories