As part of the NVIDIA RTX™ family of professional GPUs, the NVIDIA T400 provides the performance, features, reliability, and support that customers expect from NVIDIA enterprise solutions. As businesses look to lower the total cost of their computing solutions, while minimizing impacts to productivity, NVIDIA T400 provides them with a robust, cost effective solution that helps them stay within budget, while providing better performance than integrated graphics.
Turing GPU Architecture
Based on state-of-the-art 12nm FFN (FinFET NVIDIA) high-performance manufacturing process customized for NVIDIA to incorporate 384 CUDA cores, the NVIDIA T400 GPU is an efficient Single Slot professional solution for CAD, DCC, financial service industry (FSI) and visualization professionals in general looking to reach great performance in a compact form factor. The Turing GPU architecture enables the biggest leap in computer real-time graphics rendering since NVIDIA’s invention of programmable shaders in 2001.
Advanced Shading Technologies
The Turing GPU architecture features the following new advanced shader technologies.
Mesh Shading: Compute-based geometry pipeline to speed geometry processing and culling on geometrically complex models and scenes. Mesh shading provides up to 2x performance improvement on geometry-bound workloads.
Variable Rate Shading (VRS): Gain rendering efficiency by varying the shading rate based on scene content, direction of gaze, and motion. Variable rate shading provides similar image quality with 50% reduction in shaded pixels.
Texture Space Shading: Object/texture space shading to improve the performance of pixel shader-heavy workloads such as depth-of-field and motion blur. Texture space shading provides greater throughput with increased fidelity by reusing pre-shaded texels for pixel-shader heavy VR workloads.
Advanced Streaming Multiprocessor (SM) Architecture
Combined shared memory and L1 cache improve performance significantly, while simplifying programing and reducing the tuning required to attain best application performance. Each SM contains 96 KB of L1 shared memory, which can be configured for various capabilities depending on compute or graphics workload. For compute cases, up to 64KB can be allocated to the L1 cache or shared memory, while graphics workload can allocate up to 48 KB for shared memory; 32 KB for L1 and 16KB for texture units. Combining the L1 data cache with the shared memory reduces latency and provide higher bandwidth.
Higher Speed GDDR6 Memory
Built with Turing’s vastly optimized GDDR6 memory subsystem for the industry’s fastest graphics the NVIDIA T400 features 2GB of frame buffer capacity and 80 GB/s of peak bandwidth, more than double the throughput. NVIDIA T400 is the ideal platform for 3D professionals and high demanding and multi display environments, with vast arrays of datasets.
Single Instruction, Multiple Thread (SIMT)
New independent thread scheduling capability enables finer-grain synchronization and cooperation between parallel threads by sharing resources among small jobs.
Double the throughput and reduce storage requirements with 16-bit floating point precision computing to enable the training and deployment of larger neural networks. With independent parallel integer and floating-point data paths, the Turing SM is also much more efficient on workloads with a mix of computation and addressing calculations.
Pixel-level preemption provides more granular control to better support time-sensitive tasks such as VR motion tracking.
Preemption at the instruction-level provides finer grain control over compute tasks to prevent long-running applications from either monopolizing system resources or timing out.
H.264 and HEVC Encode/Decode Enginesi
Deliver faster than real-time performance for transcoding, video editing, and other encoding applications with two dedicated H.264 and HEVC encode engines and a dedicated decode engine that are independent of 3D/compute pipeline.
NVIDIA GPU BOOST 4.0
Automatically maximize application performance without exceeding the power and thermal envelope of the card. Allows applications to stay within the boost clock state longer under higher temperature threshold before dropping to a secondary temperature setting base clock.
Full-Scene Antialiasing (FSAA)
Dramatically reduce visual aliasing artifacts or "jaggies" with up to 64X FSAA (128X with SLI) for unparalleled image quality and highly realistic scenes.
32K Texture and Render Processing
Texture from and render to 32K x 32K surfaces to support applications that demand the highest resolution and quality image processing.
NVIDIA® Quadro® Mosaic Technology
Transparently scale the desktop and applications across up to 4 GPUs and 12 displays from a single workstation while delivering full performance and image quality.
Support up to four 5K monitors @ 60Hz, or dual 8K displays per card. NVIDIA T400 supports 4x displays natively, HDR color for 4K @ 120Hz for 10/12b HEVC decode and up to 4K @ 60Hz for 10b HEVC encode. Each DisplayPort connector is capable of driving ultra-high resolutions of 4096x2160 @ 120 Hz with 30-bit color.
NVIDIA® RTX™ Desktop Manager
NVIDIA RTX™ Desktop Manager software allows you to manage single or multi-monitor workspaces with ease, giving you maximum flexibility and control over your display real estate and desktops.
OpenGL Quad Buffered Stereo Support
Provide a smooth and immersive 3D Stereo experience for professional applications.
NVIDIA® Enterprise Management Tools
Maximize system uptime, seamlessly manage wide-scale deployments and remotely control graphics and display settings for efficient operations.
NVIDIA® CUDA® Parallel Computing Platform
Natively execute standard programming languages like C/C++ and Fortran, and APIs such as OpenCL, OpenACC and Direct Compute to accelerates techniques such as ray tracing, video and image processing, and computation fluid dynamics.
Featuring concurrent execution of floating point and integer operations, adaptive shading technology, and a new unified memory architecture with twice the cache of its predecessor, Turing shaders enable awesome performance increases for today’s professional applications.
NVIDIA Turing Architecture
CUDA Parallel Processing cores
Up to 1.094 TFLOPS
Frame Buffer Memory
2 GB GDDR6
Up to 80 GB/s
Max Power Consumption
PCI Express 3.0 x16
mDP 1.4 (3)
2.713 inches H x 6.137 inches L Single Slot