Training a custom LLM will offer greater flexibility, but that comes with high costs and capability requirement [up to $ 90 M+]

byJoaquim Cardoso

9 de maio de 2023

3 minute read

The Health Strategist

research and strategy institute
for continuous transformation,
for in person health and digital health

Joaquim Cardoso MSc
Chief Research and Strategy Officer (CRSO)
May 9, 2023

ONE PAGE SUMMARY

Once leaders identify their golden use cases, they will need to work with their technology teams to make strategic choices about whether to fine-tune existing LLMs or to train a custom model.

1.Fine-Tuning an Existing Model.

Adapting existing open-source or paid models is cost effective — in a 2022 experiment, Snorkel AI found that it cost between $1,915 and $7,418 to fine-tune a LLM model to complete a complex legal classification.

Such an application could save hours of a lawyer’s time, which can cost up to $500 per hour.

Fine-tuning can also jumpstart experimentation, whereas using in-house capabilities will siphon off time, talent, and investment. And it will prepare companies for the future, when generative AI is likely to evolve into a model like cloud services:

a company purchases the solution with the expectation of achieving quality at scale from the cloud provider’s standardization and reliability.

But there are downsides to this approach.

Such models are completely dependent on the functionality and domain knowledge of the core model’s training data; they are also restricted to available modalities, which today are comprised mostly of language models.

And they offer limited options for protecting proprietary data — for example, fine-tuning LLMs that are stored fully on premises.

2.Training a New or Existing Model.

Training a custom LLM will offer greater flexibility, but it comes with high costs and capability requirements:

an estimated $1.6 million to train a 1.5-billion-parameter model with two configurations and 10 runs per configuration, according to AI21 Labs.

To put this investment in context, AI21 Labs estimated that Google spent approximately $10 million for training BERT …

… and OpenAI spent $12 million on a single training run for GPT-3.2 (Note that it takes multiple rounds of training for a successful LLM.)

These costs — as well as data center, computing, and talent requirements — are significantly higher than those associated with other AI models, even when managed through a partnership.

The bar to justify this investment is high, but for a truly differentiated use case, the value generated from the model could offset the cost.

Plan Your Investment

Leaders will need to carefully assess the timing of such an investment, weighing the potential costs of moving too soon on a complex project for which the talent and technology aren’t yet ready against the risks of falling behind.

Today’s generative AI is still limited by its propensity for error and should primarily be implemented for use cases with a high tolerance for variability.

CEOs will also need to consider new funding mechanisms for data and infrastructure — whether, for example, the budget should come from IT, R&D, or another source — if they determine that custom development is a critical and time-sensitive need.

The “fine-tune versus train” debate has other implications when it comes to long-term competitive advantage.

Previously, most research on generative AI was public and models were provided through open-source channels.

Because this research is now moving behind closed doors, open-source models are already falling far behind state-of-the-art solutions.

In other words, we’re on the brink of a generative AI arms race.

Source: Adapted from BCG

This is an excerpt from the paper “The CEO’s Guide to the Generative AI Revolution MARCH 07, 2023 By François Candelon, Abhishek Gupta, Lisa Krayer, and Leonid Zhukov”

Author

Joaquim Cardoso

Deixe um comentário Cancelar resposta

How Well Do Large Language Models- Support Clinician Information Needs?

the health strategist . institute research and strategy institute — for continuous transformationin health, care, cost and tech Joaquim Cardoso MScChief…

byJoaquim Cardoso

ChatGPT creator Sam Altman doesn’t know what the future of A.I. holds

institute for continuous health transformation(InHealth) Joaquim Cardoso MScFounder, Chief Researcher & EditorJanuary 27, 2023 EXECUTIVE SUMMARY OpenAI, the San…

byJoaquim Cardoso

Microsoft launches the new Bing, with ChatGPT built in — [the technology that “will reshape “pretty much every software category” , and even the web.

health, care and tech institute (HCTI) for continuous health transformation, at an affordable cost Joaquim Cardoso MScFounder, Chief…

byJoaquim Cardoso

The Latest

GenerativeAI revisited by Goldman Sachs

Amazon Health Launches $49 Telehealth Service

Amil e Dasa Criam Segunda Maior Rede Hospitalar do Brasil: Fusão Estratégica e Preparação para IPO

Microsoft Discontinues Copilot GPT Builder, Sparks Concern Among Subscribers

Training a custom LLM will offer greater flexibility, but that comes with high costs and capability requirement [up to $ 90 M+]

The Health Strategist

Joaquim Cardoso MSc
Chief Research and Strategy Officer (CRSO)
May 9, 2023

ONE PAGE SUMMARY

Once leaders identify their golden use cases, they will need to work with their technology teams to make strategic choices about whether to fine-tune existing LLMs or to train a custom model.

1.Fine-Tuning an Existing Model.

But there are downsides to this approach.

2.Training a New or Existing Model.

These costs — as well as data center, computing, and talent requirements — are significantly higher than those associated with other AI models, even when managed through a partnership.

Plan Your Investment

Leaders will need to carefully assess the timing of such an investment, weighing the potential costs of moving too soon on a complex project for which the talent and technology aren’t yet ready against the risks of falling behind.

The “fine-tune versus train” debate has other implications when it comes to long-term competitive advantage.

Deixe um comentário Cancelar resposta

Training a custom LLM will offer greater flexibility, but that comes with high costs and capability requirement [up to $ 90 M+]

The Health Strategist

Joaquim Cardoso MScChief Research and Strategy Officer (CRSO) May 9, 2023

ONE PAGE SUMMARY

Once leaders identify their golden use cases, they will need to work with their technology teams to make strategic choices about whether to fine-tune existing LLMs or to train a custom model.

1.Fine-Tuning an Existing Model.

But there are downsides to this approach.

2.Training a New or Existing Model.

These costs — as well as data center, computing, and talent requirements — are significantly higher than those associated with other AI models, even when managed through a partnership.

Plan Your Investment

Leaders will need to carefully assess the timing of such an investment, weighing the potential costs of moving too soon on a complex project for which the talent and technology aren’t yet ready against the risks of falling behind.

The “fine-tune versus train” debate has other implications when it comes to long-term competitive advantage.

Deixe um comentário Cancelar resposta

Related Posts

Joaquim Cardoso MSc
Chief Research and Strategy Officer (CRSO)
May 9, 2023