Deep Learning with C++: Design and deploy neural networks using CUDA for high-performance AI in C++

Bill Chen (author), Vikash Gupta (author)

Paperback Published on: 30/04/2026

£37.99

We can order this from the publisher
Usually dispatched within 2 weeks

No stock available in any shop.

Synopsis

Build and deploy high-performance deep learning models using C++ for real-time applications where speed and efficiency matter.

Free with your book: DRM-free PDF version + access to Packt's next-gen Reader*

Key Features

Build deep learning models in C++ with PyTorch C++ API and CUDA

Implement CNNs, RNNs, LSTMs, GANs, and Transformers in C++ for real-world applications

Optimize and deploy machine learning models to production with scalable C++ pipelines

Book Description

Deep learning systems often struggle to meet performance demands in real-time and production environments. This book shows you how to build high-performance deep learning systems in C++, enabling efficient and scalable artificial intelligence (AI) in resource-constrained environments where performance matters.

You’ll start by setting up a complete C++ deep learning environment and implementing core neural networks from scratch. As you progress, you’ll build advanced architectures, including Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Long Short-Term Memory Networks (LSTMs), Generative Adversarial Networks (GANs), and Transformers, using C++, CUDA, and PyTorch’s C++ API. The book then focuses on model quantization and compression. It will guide you through the model deployment process in production with robust monitoring and explainability. You’ll also explore distributed training and techniques for real-time inference in performance-critical domains.

By the end of this book, you’ll be able to design, optimize, and deploy deep learning systems in C++ that are production-ready, scalable, and efficient across multiple industries.

*Email sign-up and proof of purchase required

What you will learn

Set up and use CUDA and PyTorch's C++ API for deep learning

Implement CNNs, RNNs, LSTMs, GANs, Transformers, and LLMs in C++

Leverage CUDA for high-performance model training

Perform model compression using quantization, pruning, and distillation

Deploy and monitor models in production using C++ tools

Apply explainability techniques such as LIME, SHAP, and Grad-CAM

Who this book is for

This book is for ML engineers, deep learning practitioners, and data scientists with a C++ background who want to build or learn about high-performance deep learning models. It also serves developers transitioning from Python-based frameworks who are looking for real-time deployment solutions in industries such as finance, autonomous systems, and healthcare.

Publisher information

Publisher: Packt Publishing Limited
ISBN: 9781835880029
Number of pages: 610
Dimensions: 235 x 191 mm
Languages: English
