Nvidia’s TensorRT 8.0 boasts faster conversational AI performance

Nvidia has released TensorRT 8.0 for Nvidia GPUs, including its Jetson modules. The latest version of this AI inference optimization SDK delivers up to 2x the natural language query performance of v7.0, with 1.2 ms latency using BERT. At GTC 2021 in April, Nvidia announced TensorRT 8.0 along with related technologies such as a GUI-based TAO framework that […]

from LinuxGizmos.com https://ift.tt/2Tqg8Re
