Instead of fine-tuning an LLM as a first approach, try prompt architecting instead

10:27 AM PDT • September 18, 2023

digitally generated image of a brain-shaped console with buttons and knobs — **Image Credits:** Jorge Greuel (opens in a new window) / Getty Images

Victoria Albrecht

Contributor

Victoria Albrecht is co-founder and CEO of Springbok AI.

Amid the generative AI eruption, innovation directors are bolstering their business’ IT department in pursuit of customized chatbots or LLMs. They want ChatGPT but with domain-specific information underpinning vast functionality, data security and compliance, and improved accuracy and relevance.

The question often arises: Should they build an LLM from scratch, or fine-tune an existing one with their own data? For the majority of companies, both options are impractical. Here’s why.

TL;DR: Given the right sequence of prompts, LLMs are remarkably smart at bending to your will. The LLM itself or its training data need not be modified in order to tailor it to specific data or domain information.

Exhausting efforts in constructing a comprehensive “prompt architecture” is advised before considering more costly alternatives. This approach is designed to maximize the value extracted from a variety of prompts, enhancing API-powered tools.

If this proves inadequate (a minority of cases), then a fine-tuning process (which is often more costly due to the data prep involved) might be considered. Building one from scratch is almost always out of the question.

The sought-after outcome is finding a way to leverage your existing documents to create tailored solutions that accurately, swiftly, and securely automate the execution of frequent tasks or the answering of frequent queries. Prompt architecture stands out as the most efficient and cost-effective path to achieve this.

What’s the difference between prompt architecting and fine-tuning?

If you are considering prompt architecting, you have likely already explored the concept of fine-tuning. Here is the key distinction between the two:

While fine-tuning involves modifying the underlying foundational LLM, prompt architecting does not.

Fine-tuning is a substantial endeavor that entails retraining a segment of an LLM with a large new dataset — ideally your proprietary dataset. This process imbues the LLM with domain-specific knowledge, attempting to tailor it to your industry and business context.

In contrast, prompt architecting involves leveraging existing LLMs without modifying the model itself or its training data. Instead, it combines a complex and cleverly engineered series of prompts to deliver consistent output.

Fine-tuning is appropriate for companies with the most stringent data privacy requirements (e.g., banks)

On the surface, fine-tuning seems efficient: You skip the ordeal of building a new model and simply retrain an existing one with your own data.

Fine-tuning’s surprising hidden cost arises from acquiring the dataset and making it compatible with your LLM and your needs. In comparison, once the dataset is ready, the fine-tuning process (uploading your prepared data, covering the API usage and computing costs) is no drama.

Given the high costs, fine-tuning is recommended only when prompt architecting–based solutions have failed.

Not to mention, a robust prompt architecture is often necessary to make optimal use of the outputs of fine-tuning anyway.

When approaching technology partners for fine-tuning activities, inquire about dataset preparation expertise and comprehensive cost estimates. If they omit them, it should raise a red flag, as it could indicate an unreliable service or a lack of practical experience in handling this task.

Generally, valuable fine-tune cases should undergo a prompt architecture–based proof of concept stage before operational investment.

Build secure solutions tailored to your company’s data

Let’s jump into the example of a research tool: a solution that provides near-instant answers to questions relating to hundreds of documents. This tool can be accessible by employees via a web interface with enterprise-grade security controls and user management. It’s built using an API and tailored to your data and objectives through prompt architecting.

Users can pose questions like “Show me all conversations between Jane Doe and John Smith referencing ‘transaction,’” and the tool scans your documents to provide easily readable results. The system uses a careful system of retrieval mechanisms combined with intelligent prompts to scan through the lengthy text contained in the documents to produce a coherent response.

Dentons [a Springbok AI customer] recently introduced FleetAI: their proprietary ChatGPT version, and a legal industry first, for analyzing and querying uploaded legal documents.

Building a new LLM from scratch is no small task

If, to achieve the same outcomes, you were to build “your own LLM” from scratch, expect an uphill battle. This ambition is often misguided. It can cost at least $150 million and yield experimental outcomes. Aspiring to create a proprietary LLM often competes with established players like Meta, OpenAI, and Google, or the best university research departments.

The number of companies equipped to do this is probably only in the double digits worldwide. What executives usually mean by their “own LLM” is a secure LLM-powered solution tailored to their data. The pragmatic route for most executives seeking their “own LLM” involves solutions tailored to their data via fine-tuning or prompt architecting.

Prompt architecting basic best practices

First you need to create data flow and software architecture diagrams that represent the overall design of a solution, with analytics feedback mechanisms in place.

There should be guidelines for context-based text enhancement, with prompt templates and specified tone and length.

Then the architecture should be adapted to the chosen output mode, such as a dashboard, conversational interface, or template-based document.

Integration with additional data sources is made possible: databases for efficient data retrieval, Salesforce for CRM communication, and optical character recognition (OCR) capabilities for processing text from images or scanned documents.

Finally, output quality measures are implemented. When an output fails the criteria, the text is amended by a feedback loop. It checks for offensive language, inappropriate tone and length, and false information. Once the checks are passed, the message is sent to the user.

There is no guarantee that the LLM will not hallucinate or swerve offtrack. AI can never reach 100% accuracy. Nonetheless, these response accuracy checks strive to nip anomalous output in the bud.

Key takeaways

Innovation directors seek tailored chatbots and LLMs, facing the dilemma of building from scratch or fine-tuning. For most, both options are impractical.

LLMs are impressively adaptive through well-structured prompts. An exhaustive exploration of prompt architectures is recommended before more costly alternatives, especially given that a prompt architecture will be needed to achieve desired results even if you fine-tune or build a model.

More TechCrunch

Bing’s API is down, taking Microsoft Copilot, DuckDuckGo and ChatGPT’s web search feature down too

Romain Dillet

31 mins ago

Bing, Microsoft’s search engine, isn’t working properly right now. At first, we noticed it wasn’t possible to perform a web search at all. Now it seems search results are loading…

Bing’s API is down, taking Microsoft Copilot, DuckDuckGo and ChatGPT’s web search feature down too

Autonomous shipping startup Orca AI tops up with $23M led by OCV Partners and MizMaa Ventures

Mike Butcher

43 mins ago

If you thought autonomous driving was just for cars, think again. The so-called ‘autonomous navigation’ market — where ships steer themselves guided by AI, resulting in fuel and time savings…

Autonomous shipping startup Orca AI tops up with $23M led by OCV Partners and MizMaa Ventures

Biotech & Health

Meet the Finnish biotech startup bringing a long lost mycoprotein to your plate

Natasha Lomas

4 hours ago

The best known mycoprotein is probably Quorn, a meat substitute that’s fast approaching its 40th birthday. But Finnish biotech startup Enifer is cooking up something even older: Its proprietary single-cell…

Meet the Finnish biotech startup bringing a long lost mycoprotein to your plate

Startups

Food supply chain software maker Silo lays off ~30% of staff amid M&A discussions

Sarah Perez

10 hours ago

Silo, a Bay Area food supply chain startup, has hit a rough patch. TechCrunch has learned that the company on Tuesday laid off roughly 30% of its staff, or north…

Food supply chain software maker Silo lays off ~30% of staff amid M&A discussions

Government & Policy

The Biden campaign is looking to hire a seasoned meme lord

Amanda Silberling

11 hours ago

President Joe Biden needs a meme manager.

The Biden campaign is looking to hire a seasoned meme lord

Featured Article

Meta’s new AI council is composed entirely of white men

Meanwhile, women and people of color are disproportionately impacted by irresponsible AI.

Dominic-Madori Davis

Amanda Silberling

Kyle Wiggers

12 hours ago

Meta’s new AI council is composed entirely of white men

Startups

Garry Tan has revealed his ‘secret sauce’ for getting into Y Combinator

Christine Hall

13 hours ago

If you’ve ever wanted to apply to Y Combinator, here’s some inside scoop on how the iconic accelerator goes about choosing companies.

Garry Tan has revealed his ‘secret sauce’ for getting into Y Combinator

Transportation

India’s BluSmart is testing its ride-hailing service in Dubai

Jagmeet Singh

13 hours ago

Indian ride-hailing startup BluSmart has started operating in Dubai, TechCrunch has exclusively learned and confirmed with its executive. The move to Dubai, which has been rumored for months, could help…

India’s BluSmart is testing its ride-hailing service in Dubai

Government & Policy

FCC proposes all AI-generated content in political ads must be disclosed

Devin Coldewey

14 hours ago

Under the envisioned framework, both candidate and issue ads would be required to include an on-air and filed disclosure that AI-generated content was used.

FCC proposes all AI-generated content in political ads must be disclosed

Startups

Refer a founder to Startup Battlefield 200 at Disrupt 2024

TechCrunch Events

14 hours ago

Want to make a founder’s day, week, month, and possibly career? Refer them to Startup Battlefield 200 at Disrupt 2024! Applications close June 10 at 11:59 p.m. PT. TechCrunch’s Startup…

Refer a founder to Startup Battlefield 200 at Disrupt 2024

Apps

Bluesky now has DMs

Aisha Malik

14 hours ago

Social networking startup and X competitor Bluesky is officially launching DMs (direct messages), the company announced on Wednesday. Later, Bluesky plans to “fully support end-to-end encrypted messaging down the line,”…

Venture

Peter Thiel-founded Valar Ventures raised a $300 million fund, half the size of its last one

Marina Temkin

Rebecca Szkutak

14 hours ago

The perception in Silicon Valley is that every investor would love to be in business with Peter Thiel. But the venture capital fundraising environment has become so difficult that even…

Peter Thiel-founded Valar Ventures raised a $300 million fund, half the size of its last one

Featured Article

Spyware found on US hotel check-in computers

Several hotel check-in computers are running a remote access app, which is leaking screenshots of guest information to the internet.

Zack Whittaker

16 hours ago

Spyware found on US hotel check-in computers

Venture

Techstars CEO Maëlle Gavet is out

Dominic-Madori Davis

16 hours ago

Gavet has had a rocky tenure at Techstars and her leadership was the subject of much controversy.

Fundraising

Connected fitness is adrift post-pandemic

Brian Heater

17 hours ago

The struggle isn’t universal, however.

Connected fitness is adrift post-pandemic

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the first months of 2024. Smaller-sized…

Alyssa Stringer

Cody Corrall

17 hours ago

A comprehensive list of 2024 tech layoffs

Enterprise

HoundDog.ai helps developers prevent personal information from leaking

Frederic Lardinois

17 hours ago

HoundDog actually looks at the code a developer is writing, using both traditional pattern matching and large language models to find potential issues.

HoundDog.ai helps developers prevent personal information from leaking

Fintech

Google Pay will now display card perks, BNPL options and more

Sarah Perez

18 hours ago

The changes are designed to enhance the consumer experience of using Google Pay and make it a more competitive option against other payment methods.

Google Pay will now display card perks, BNPL options and more

Vinod Khosla is coming to Disrupt to discuss how AI might change the future

Julie Bort

18 hours ago

Few figures in the tech industry have earned the storied reputation of Vinod Khosla, founder and partner at Khosla Ventures. For over 40 years, he has been at the center…

Vinod Khosla is coming to Disrupt to discuss how AI might change the future

Apps

Truecaller partners with Microsoft to let its AI respond to calls in your own voice

Jagmeet Singh

18 hours ago

AI has already started replacing voice agents’ jobs. Now, companies are exploring ways to replace the existing computer-generated voice models with synthetic versions of human voices. Truecaller, the widely known…

Truecaller partners with Microsoft to let its AI respond to calls in your own voice

Hardware

Meta’s Ray-Ban smart glasses now let you share images directly to your Instagram Story

Aisha Malik

18 hours ago

Meta is updating its Ray-Ban smart glasses with new hands-free functionality, the company announced on Wednesday. Most notably, users can now share an image from their smart glasses directly to…

Meta’s Ray-Ban smart glasses now let you share images directly to your Instagram Story

Apps

Why Spotify is launching its own font, Spotify Mix

Lauren Forristal

19 hours ago

Spotify launched its own font, the company announced on Wednesday. The music streaming service hopes that its new typeface, “Spotify Mix,” will help Spotify distinguish its own unique visual identity. …

Why Spotify is launching its own font, Spotify Mix

Startups

Hydrolix seeks to make storing log data faster and cheaper

Kyle Wiggers

19 hours ago

In 2008, Marty Kagan, who’d previously worked at Cisco and Akamai, co-founded Cedexis, a (now-Cisco-owned) firm developing observability tech for content delivery networks. Fellow Cisco veteran Hasan Alayli joined Kagan…

Hydrolix seeks to make storing log data faster and cheaper

Bolster, creator of the CheckPhish phishing tracker, raises $14M led by Microsoft’s M12

Ingrid Lunden

19 hours ago

A dodgy email containing a link that looks “legit” but is actually malicious remains one of the most dangerous, yet successful, tricks in a cybercriminal’s handbook. Now, an AI startup…

Bolster, creator of the CheckPhish phishing tracker, raises $14M led by Microsoft’s M12

Space

Boeing, NASA indefinitely delay crewed Starliner launch

Aria Alamalhodaei

19 hours ago

If you’ve been looking forward to seeing Boeing’s Starliner capsule carry two astronauts to the International Space Station for the first time, you’ll have to wait a bit longer. The…

Boeing, NASA indefinitely delay crewed Starliner launch

Apps

TikTok turns to generative AI to boost its ads business

Aisha Malik

20 hours ago

TikTok is the latest tech company to incorporate generative AI into its ads business, as the company announced on Tuesday that it’s launching a new “TikTok Symphony” AI suite for…

TikTok turns to generative AI to boost its ads business

Space

Space VC closes $20M Fund II to back frontier tech founders from day zero

Aria Alamalhodaei

20 hours ago

Gone are the days when space and defense were considered fundamentally antithetical to venture investment. Now, the country’s largest venture capital firms are throwing larger portions of their money behind…

Space VC closes $20M Fund II to back frontier tech founders from day zero

Startups

Patronus AI is off to a magical start as LLM governance tool gains traction

Ron Miller

20 hours ago

These days every company is trying to figure out if their large language models are compliant with whichever rules they deem important, and with legal or regulatory requirements. If you’re…

Patronus AI is off to a magical start as LLM governance tool gains traction

Apps

Linktree surpasses 50M users, rolls out its social commerce program to more creators

Lauren Forristal

20 hours ago

Link-in-bio startup Linktree has crossed 50 million users and is rolling out the beta of its social commerce program.

Linktree surpasses 50M users, rolls out its social commerce program to more creators

Fintech

Immigrant banking platform Majority secures $20M following 3x revenue growth

Christine Hall

21 hours ago

For a $5.99 per month, immigrants have a bank account and debit card with fee-free international money transfers and discounted international calling.

Instead of fine-tuning an LLM as a first approach, try prompt architecting instead

Victoria Albrecht

What’s the difference between prompt architecting and fine-tuning?

Fine-tuning is appropriate for companies with the most stringent data privacy requirements (e.g., banks)

Build secure solutions tailored to your company’s data

Building a new LLM from scratch is no small task

Prompt architecting basic best practices

Key takeaways

More TechCrunch

Get the industry’s biggest tech news

TechCrunch Daily News

Startups Weekly

TechCrunch Fintech

TechCrunch Mobility

Tags