NVIDIA TensorRT 7’s Compiler Delivers Real-Time Inference for Smarter Human-to-AI Interactions

TensorRT 7 features a new deep learning compiler designed to automatically optimize and accelerate the complex recurrent and transformer-based neural networks needed for AI speech applications.

NVIDIA introduced inference software that developers everywhere can use to deliver conversational AI applications.

NVIDIA TensorRT 7, the seventh generation of the company’s inference software development kit, enables smarter human-to-AI interactions, powering real-time engagement with applications such as voice agents, chatbots and recommendation engines.

“We have entered a new chapter in AI, where machines are capable of understanding human language in real time,” said NVIDIA founder and CEO Jensen Huang at his GTC China keynote. “TensorRT 7 helps make this possible, providing developers everywhere with the tools to build and deploy faster, smarter conversational AI services that allow more natural human-to-AI interaction.”

Importance of Recurrent Neural Networks

TensorRT 7 speeds up a growing universe of AI models used to make predictions on time-series and sequence data through recurrent loop structures, known as recurrent neural networks (RNNs). Beyond conversational AI speech networks, RNNs help with arrival-time planning for cars or satellites, prediction of events in electronic medical records, financial asset forecasting and fraud detection.
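To make that recurrent loop structure concrete, below is a minimal NumPy sketch (illustrative only, not TensorRT or NVIDIA code): a hidden state is updated at each time step, and that carried-over state is what lets the network make predictions over sequences. All names and sizes here are hypothetical.

    import numpy as np

    # Minimal recurrent cell: the hidden state h is threaded through the
    # time-step loop; this loop is the "recurrent structure" described above.
    def rnn_summary(x_seq, W_x, W_h, b):
        h = np.zeros(W_h.shape[0])
        for x_t in x_seq:                    # one iteration per time-series sample
            h = np.tanh(W_x @ x_t + W_h @ h + b)
        return h                             # sequence summary used for a prediction

    rng = np.random.default_rng(0)
    hidden, features, steps = 8, 4, 16
    summary = rnn_summary(rng.normal(size=(steps, features)),
                          rng.normal(size=(hidden, features)),
                          rng.normal(size=(hidden, hidden)),
                          np.zeros(hidden))
    print(summary.shape)  # (8,)

Running such a loop step by step in a general-purpose framework leaves performance on the table; automating the optimization of exactly this structure is what the new compiler targets.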

With TensorRT’s new deep learning compiler, developers everywhere can automatically optimize networks, such as bespoke automatic speech recognition networks as well as WaveRNN and Tacotron 2 for text-to-speech, and deliver the best possible performance and lowest latencies.

The new compiler also optimizes transformer-based models like BERT for natural language processing.
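The press materials include no code, but the typical workflow this enables is to export a trained network (a speech recognition model, Tacotron 2, BERT and so on) to ONNX and let TensorRT compile it into an optimized engine. Below is a minimal sketch against the TensorRT 7-era Python API; model.onnx and model.plan are placeholder file names, and details such as workspace size are arbitrary.

    import tensorrt as trt

    TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

    def build_engine(onnx_path="model.onnx", use_fp16=True):
        # The ONNX parser requires an explicit-batch network definition.
        flags = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
        builder = trt.Builder(TRT_LOGGER)
        network = builder.create_network(flags)
        parser = trt.OnnxParser(network, TRT_LOGGER)

        with open(onnx_path, "rb") as f:
            if not parser.parse(f.read()):
                for i in range(parser.num_errors):
                    print(parser.get_error(i))
                raise RuntimeError("failed to parse " + onnx_path)

        config = builder.create_builder_config()
        config.max_workspace_size = 1 << 30           # 1 GiB of scratch space
        if use_fp16 and builder.platform_has_fast_fp16:
            config.set_flag(trt.BuilderFlag.FP16)     # mixed precision where safe

        # Layer fusion and kernel selection happen inside this call.
        return builder.build_engine(network, config)

    if __name__ == "__main__":
        with open("model.plan", "wb") as f:
            f.write(build_engine().serialize())

The serialized plan file can then be loaded on any machine with the same GPU architecture and TensorRT version, which is what makes the build-once, deploy-broadly pattern described below practical.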

Accelerating Inference from Edge to Cloud

TensorRT 7 can rapidly optimize, validate and deploy a trained neural network for inference across hyperscale data center, embedded and automotive GPU platforms.
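As a sketch of the deployment half (assuming the model.plan engine built above, static input shapes, one input and one output binding, and the third-party pycuda package for device memory):

    import numpy as np
    import pycuda.autoinit                  # noqa: F401  (creates a CUDA context)
    import pycuda.driver as cuda
    import tensorrt as trt

    TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

    # Deserialize the engine produced at build time.
    with open("model.plan", "rb") as f, trt.Runtime(TRT_LOGGER) as runtime:
        engine = runtime.deserialize_cuda_engine(f.read())
    context = engine.create_execution_context()

    # One host/device buffer pair per binding; assumes static (fully known) shapes.
    buffers, bindings = [], []
    for i in range(engine.num_bindings):
        dtype = trt.nptype(engine.get_binding_dtype(i))
        host = np.zeros(trt.volume(engine.get_binding_shape(i)), dtype=dtype)
        device = cuda.mem_alloc(host.nbytes)
        buffers.append((host, device))
        bindings.append(int(device))

    host_in, dev_in = buffers[0]                 # assumes binding 0 is the input
    host_out, dev_out = buffers[-1]              # assumes the last binding is the output
    host_in[:] = np.random.rand(host_in.size)    # stand-in for real features
    cuda.memcpy_htod(dev_in, host_in)
    context.execute_v2(bindings)                 # synchronous inference
    cuda.memcpy_dtoh(host_out, dev_out)
    print(host_out[:10])

The same engine file and runtime calls apply whether the target is a data center GPU or an embedded or automotive platform; only the hardware underneath changes.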

NVIDIA’s inference platform, which includes TensorRT as well as several NVIDIA CUDA-X AI libraries and NVIDIA GPUs, delivers low-latency, high-throughput inference for applications beyond conversational AI, including image classification, fraud detection, segmentation, object detection and recommendation engines. Its capabilities are widely used by some of the leading enterprise and consumer technology companies, including Alibaba, American Express, Baidu, PayPal, Pinterest, Snap, Tencent and Twitter.

Availability

TensorRT 7 will be available in the coming days for development and deployment, without charge to members of the NVIDIA Developer program, from the TensorRT webpage. The latest versions of plug-ins, parsers and samples are also available as open source from the TensorRT GitHub repository.

Sources: Press materials received from the company and additional information gleaned from the company’s website.

About the Author

DE Editors

DE’s editors contribute news and new product announcements to Digital Engineering.
Press releases may be sent to them via DE-Editors@digitaleng.news.
