Enhancing LLM Performance
Efficacy, Fine-Tuning, and Inference Techniques

Edited by Andy Way,Mehdi Rezagholizadeh,Peyman Passban

ISBN13: 9783031857461

Imprint: Springer International Publishing AG

Publisher: Springer International Publishing AG

Format: Hardback

Published: 27/05/2025

Availability: Not yet available

Description
This book is a pioneering exploration of the state-of-the-art techniques that drive large language models (LLMs) toward greater efficiency and scalability. Edited by three distinguished experts—Peyman Passban, Mehdi Rezagholizadeh, and Andy Way—this book presents practical solutions to the growing challenges of training and deploying these massive models. With their combined experience across academia, research, and industry, the authors provide insights into the tools and strategies required to improve LLM performance while reducing computational demands. This book is more than just a technical guide; it bridges the gap between research and real-world applications. Each chapter presents cutting-edge advancements in inference optimization, model architecture, and fine-tuning techniques, all designed to enhance the usability of LLMs in diverse sectors. Readers will find extensive discussions on the practical aspects of implementing and deploying LLMs in real-world scenarios. The book serves as a comprehensive resource for researchers and industry professionals, offering a balanced blend of in-depth technical insights and practical, hands-on guidance. It is a go-to reference book for students, researchers in computer science and relevant sub-branches, including machine learning, computational linguistics, and more.
Introduction and Fundamentals.- SPEED: Speculative Pipelined Execution for Efficient Decoding.- Efficient LLM Inference on CPUs.- KronA: Parameter-Efficient Tuning with Kronecker Adapter.- LoDA: Low-Dimensional Adaptation of Large Language Models.- Sparse Fine-Tuning for Inference Acceleration of Large Language Models.- TCNCA: Temporal CNN with Chunked Attention for Efficient Training on Long Sequences.- Class-Based Feature Knowledge Distillation.- On the Use of Cross-Attentive Fusion Techniques for Audio-Visual Speaker Verification.- An Efficient Clustering Algorithm for Self-Supervised Speaker Recognition.- Remaining Issues for AI.
  • Natural language & machine translation
  • Machine learning
  • Professional & Vocational
Height:
Width:
Spine:
Weight:0.00
List Price: £119.99