Listen free for 30 days

Listen with offer

  • Large Language Model-Based Solutions

  • How to Deliver Value with Cost-Effective Generative AI Applications
  • By: Shreyas Subramanian
  • Narrated by: Daniel Henning
  • Length: 11 hrs and 42 mins

£0.00 for first 30 days

Pick 1 audiobook a month from our unmatched collection - including bestsellers and new releases.
Listen all you want to thousands of included audiobooks, Originals, celeb exclusives, and podcasts.
Access exclusive sales and deals.
£7.99/month after 30 days. Renews automatically. See here for eligibility.

Large Language Model-Based Solutions

By: Shreyas Subramanian
Narrated by: Daniel Henning
Pre-order: Try for £0.00

£7.99/month after 30 days. Renews automatically. See here for eligibility.

Pre-order Now for £12.89

Pre-order Now for £12.89

Pay using card ending in
By completing your purchase, you agree to Audible's Conditions of Use and authorise Audible to charge your designated card or any other card on file. Please see our Privacy Notice, Cookies Notice and Interest-based Ads Notice.

Summary

In Large Language Model-Based Solutions: How to Deliver Value with Cost-Effective Generative AI Applications, Principal Data Scientist at Amazon Web Services, Shreyas Subramanian, delivers a practical guide for developers and data scientists who wish to build and deploy cost-effective large language model (LLM)-based solutions. In the book, you'll find coverage of a wide range of key topics, including how to select a model, pre- and post-processing of data, prompt engineering, and instruction fine-tuning.

The author sheds light on techniques for optimizing inference, like model quantization and pruning, as well as different and affordable architectures for typical generative AI (GenAI) applications, including search systems, agent assists, and autonomous agents. You'll also find:

● Effective strategies to address the challenge of the high computational cost associated with LLMs

● Assistance with the complexities of building and deploying affordable generative AI apps, including tuning and inference techniques

● Selection criteria for choosing a model, with particular consideration given to compact, nimble, and domain-specific models

©2024 John Wiley & Sons, Inc. (P)2024 Ascent Audio
activate_Holiday_promo_in_buybox_DT_T2

What listeners say about Large Language Model-Based Solutions

Average customer ratings

Reviews - Please select the tabs below to change the source of reviews.