Browse Subject Headings
Rearchitecting LLMs : Structural Techniques for Efficient Models
Rearchitecting LLMs : Structural Techniques for Efficient Models
Click to enlarge
Author(s): Martra, Pere
ISBN No.: 9781633434332
Pages: 380
Year: 202610
Format: Trade Paper
Price: $ 83.99
Dispatch delay: Dispatched between 7 to 15 days
Status: Available (Forthcoming)

Get a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book. By default, general purpose LLMs are not optimized for specific domains and business goals. Using techniques like specialized fine-tuning, pruning unnecessary neural components, and knowledge distillation, you can rearchitect your models to cost less, run faster, and deliver more accurate results. This book turns research from the latest AI papers into production-ready practices for domain-specific model optimization. As you work through this practical book, you'll perform hands-on surgery on popular open-source models like Llama-3, Gemma, and Qwen to create cost-effective local small language models (SLMs). Along the way, you'll learn how to combine behavioral analysis with structural modifications, identifying and removing parts that don't contribute to your model's goals, and even use "fair pruning" to reduce model bias at the neuron level. What's inside * Universal techniques for customizing model architecture * End-to-end pipelines for model rearchitecting * Improve bias and explainability with model "cleanup" * Replacing external LLMs with local SLMs About the reader For practicing AI, ML, and data engineers who know Python. About the author Pere Martra is an Applied AI Engineer, the creator of the Optipfair model efficiency library, an international AI speaker, and maintainer of widely-used LLM courses and popular open-source tools.


He is the author of Large Language Models Projects.


To be able to view the table of contents for this publication then please subscribe by clicking the button below...
To be able to view the full description for this publication then please subscribe by clicking the button below...
Browse Subject Headings