Status der Bestellung anzeigen

Werden Sie Teil einer Gemeinschaft von Buchliebhabern aus der ganzen Welt und erhalten Sie eine Reihe von Vorteilen. Konto kostenlos anlegen

Kostenloser Versand mit Zásilkovna ab 69.99 €

Österreichische Post 5.49 € GLS-Kurier 4.99 € GLS-Kurier 4.99 € DPD-Kurier 3.99 € DPD-Stelle 2.99 €

Kontakt

Wie einkaufen

Hilfe

Mein Konto

▸ Leer :-(

AI Inference Optimization Engineering

Name: AI Inference Optimization Engineering
Brand: Independently published
SKU: 52770465
Price: 11.79 EUR
Availability: InStock
Author: ChatVariety Team
ISBN: 9798199720021

Quantization, Speculative Decoding, and Hardware-Specific LLM Deployment

ChatVariety Team

Sprache

Englisch

Buch Broschur

Libristo-Code: 52770465

Verlag Independently published, Juni 2026

Slash LLM Deployment Costs and LatencyDeploying Large Language Models (LLMs) in production is a mass... Vollständige Beschreibung

Libristo-Code: 52770465

29 b

Demnächst

Neu

11.79 € inkl. MwSt.

Erwartete Einlagerung Veröffentlichung 07. 06. 2026

30 Tage für die Rückgabe der Ware

Slash LLM Deployment Costs and Latency

Deploying Large Language Models (LLMs) in production is a massive economic and engineering hurdle. AI Inference Optimization Engineering is your comprehensive, hands-on guide to mastering the full stack of modern LLM optimization techniques. From memory-bandwidth solutions to hardware-specific compilation, this book bridges the gap between research-level models and enterprise-grade execution.

What you will master inside this book:

Hardware-Aware Optimization: Dive deep into KV cache mechanics, autoregressive decoding, and GPU memory hierarchies to eliminate latency bottlenecks.
State-of-the-Art Quantization: Apply GPTQ, AWQ, and GGUF compression algorithms to scale down massive neural networks without sacrificing model accuracy.
Advanced Acceleration Methods: Implement speculative decoding with draft models (like Medusa and Eagle), PagedAttention, and FlashAttention to boost throughput by 2-3x.
Production-Grade Serving: Build ultra-low-latency deployment infrastructures using vLLM, Triton Inference Server, and continuous batching.
Cross-Platform Deployment: Optimize models for specific target hardware, including NVIDIA H100 (TensorRT-LLM), Apple Silicon (llama.cpp/Metal), and Qualcomm mobile/edge accelerators.

Whether you are an ML infrastructure engineer, an AI platform architect, or a technical leader looking to scale LLMs cost-effectively, this book provides the production-ready code, equations, and architectural patterns you need to build hyper-efficient AI pipelines.

Schauspielerin & Polyglotte

EWA KASP für

Video abspielen

Libristo bietet die größte Auswahl an fremdsprachiger Literatur an. Deshalb kaufe ich meine Bücher hier ein.

Informationen zum Buch

Vollständiger Name AI Inference Optimization Engineering

Autor ChatVariety Team

Sprache

Englisch

Einband Buch - Broschur

Datum der Veröffentlichung 2026

Anzahl der Seiten 96

EAN 9798199720021

Libristo-Code 52770465

Verlag Independently published

Gewicht 142

Abmessungen 152 x 229 x 5

Kategorie

EDV und Informationstechnologie > Informatik > Künstliche Intelligenz > Natürliche Sprachen und maschinelle Übersetzung

Verschenken Sie dieses Buch noch heute

Es ist ganz einfach

1 Legen Sie das Buch in Ihren Warenkorb und wählen Sie den Versand als Geschenk 2 Wir schicken Ihnen umgehend einen Gutschein 3 Das Buch wird an die Adresse des beschenkten Empfängers geliefert

Häufig gesucht

Categories

Authors

Publishers

Häufig gesucht

Waren

Categories

Authors

Publishers

Versand

Kaufberater

AI Inference Optimization Engineering

Quantization, Speculative Decoding, and Hardware-Specific LLM Deployment

Informationen zum Buch

Kategorie

Verschenken Sie dieses Buch noch heute

Es ist ganz einfach

Häufig gesucht

Categories

Authors

Publishers

AI Inference Optimization Engineering

Quantization, Speculative Decoding, and Hardware-Specific LLM Deployment

Informationen zum Buch

Kategorie

Verschenken Sie dieses Buch noch heute

Es ist ganz einfach

Sie haben kein Konto? Nutzen Sie die Vorteile eines Libristo-Kontos!