Recently Viewed
You have no recent viewed item.
You’ve mastered the architecture—now it’s time to own the performance.
Every GPU developer eventually hits the same wall: the profiler indicates you are close to peak throughput, but instinct tells you there is still headroom. What’s missing isn’t another compiler flag—it’s visibility into the hardware’s final truth. That truth lives in SASS, the real machine code executing on NVIDIA GPUs.
Mastering PTX and SASS – Volume II takes you past abstract theory into the territory where nanoseconds matter. This engineering guide moves beyond basic syntax, teaching you how to read, analyze, and tune instruction streams with surgical precision. You will uncover how schedulers pair operations, how register pressure throttles throughput, and how to transform your kernels into clock-cycle-balanced engines of pure efficiency.
This book is for engineers who refuse to settle for “good enough.” It transforms profiling, disassembly, and optimization from an art into a repeatable science—grounded in data, not superstition. From tensor cores to warp shuffles, and from atomic operations to multi-GPU scaling, you will learn how true experts bend hardware to their will. Volume I built the foundation; Volume II shows you how to weaponize it.
About the Burst Books AI Edition:
Forget tired textbooks from years past. These AI-crafted STEM editions advance at the speed of discovery. Each page is generated by intelligence trained on thousands of trusted sources, delivering crystal-clear explanations, flawless equations, and functional examples refreshed through the latest breakthroughs. Explore BurstBooksPublishing on GitHub for technical samples, infographics, and additional study material—a complete hub for hands-on learning.