What do you do with 15 years of your writing history when LLMs hit the scene? A chatbot!
Meet SwizBot, a friendly chatbot that knows every article I've published on swizec.com since 2007 and is ready to answer your questions. You can try it out at swizbot-ui.vercel.app, if you promise to be nice to my OpenAI credits.
SwizBot starts every conversation by introducing itself and giving a little recap of topics it can talk about. I figured this was the naturalest way to avoid the Empty Input Of Doom problem.
Hate it when people say "Lemme know how i can help" and I have no idea what's even within their wheelhouse. Do I ask for help moving my couch, or a million dollar investment in my next idea? What do you help with!?
But why?
Why would you turn 15 years of writing into a chatbot? Because that's what nerds do on the weekend.
But I think there's something there. I have 2 use-cases in mind:
- Readers can ask questions that I don't have time to answer
- I can go "What did I say about X?" because my brain can only remember so much
There's bound to be a product hiding in 1.
.
Does SwizBot work?
Works pretty well!
It makes little mistakes here and there like calling Slovenia my hometown even though it's a country, or getting confused between standalone opensource projects and functions exported from larger libraries. And SwizBot keeps saying that it knows the contents of my books, but it does not.
I should feed SwizBot my books 🤔
Overall the answers are directionally correct, seem useful, and it gets the conversational tone right most of the time.
I did it! A chatbot that knows ~15 years of my blogging history and can answer questions 😍 pic.twitter.com/b41xinNOhK
— Swizec Teller (@Swizec) May 19, 2023
My partner asked if I was a good person:
I tried to see how SwizBot does with technical content:
Does great with my technical content too! Even suggests opensource libraries I've built pic.twitter.com/brs6BGApMU
— Swizec Teller (@Swizec) May 19, 2023
And a friend asked what's going to be my professional downfall, obviously. The answer is too long to paste here, but it boils down to "Stop investing in yourself".
Does okay with synthesizing my opinions and experiences 🤘
But SwizBot is reluctant to give advice due to OpenAI's protections built into GPT-4. It's never gonna tell you "do this" because that would be irresponsible or whatever.
How SwizBot works
SwizBot is almost embarrassingly simple in what it does.
- The UI is based on chatbot-ui, an open-source ChatGPT UI clone
- The knowledge retrieval is based on semantic search with LLM embeddings
- GPT-4 does the rest
The hardest part was shoe-horning knowledge retrieval into chatbot-ui and getting all my markdown parsed. So many stupid things you do over 15 years break a strict parser 🫠
You can see the full code on GitHub. The interesting part is api/documentContext.ts
Knowledge retrieval at a high level
LLMs don't have memory. You have to provide all relevant context as part of your prompt.
But LLMs also have limited context windows. There's only so much they can "keep in mind" at a time.
You solve this by finding relevant context. Like asking an intern to read 3 books and then help you out. Those 3 books are the context your intern's going to use to synthesize an answer to your question.
Remember: ChatGPT and friends are like an extremely talented intern who is great at research and also on drugs.
When you ask a question, SwizBot:
- Computes an embedding of your question
- Looks at a "database" of embeddings
- Finds 10 nearest blog fragments
- Feeds those fragments as context into the system prompt
- Passes your question and the huge system prompt to GPT-4
- Streams back the response
Finding relevant blog fragments
SwizBot has a "database" of pre-computed embeddings for every h2 section of my blog (title + content). Around 84,000 rows in a huge CSV file.
Yep, a CSV file. Good enough 😁
It uses this short function to sort the whole database by similarity and return the top 10 results. Yes this is slow and I should upgrade to a vector database.
export async function findRelevantSections(question: string) {
const questionEmbedding = await getEmbedding(question);
// similarity key gets rewritten
const haystack = await readEmbeddingsFromCSV();
for (const item of haystack) {
item.similarity = cosineSimilarity(questionEmbedding, item.embedding);
}
haystack.sort((a, b) => b.similarity - a.similarity);
return haystack.slice(0, 10);
}
The system prompt
The system prompt goes ahead of your question and tells GPT-4 how to behave. Mine also feeds it relevant context, which enables the LLM to answer based on what I've written.
That happens in api/chat.ts. Like this:
async function expandPromptWithContext(
prompt: string,
messages: Message[],
req: NextRequest
): Promise<string> {
const context: Doc[] = await fetch(
`${req.nextUrl.origin}/api/documentContext`,
{
method: "POST",
headers: {
"Content-Type": "application/json",
},
body: JSON.stringify(messages),
}
).then((res) => res.json());
let additions: string[] = [
"The following are some relevant documents to use in your answer",
];
for (const { title, content } of context) {
additions.push(`Title: ${title}; Content: ${content}`);
}
return prompt + "\n" + additions.join("\n");
}
Finds the first user message in a conversation, grabs relevant blog fragments, and shoves 'em in there with a "Use this context in your answer".
The base system prompt is alchemy, I mean trial and error. Needs more work I think.
let promptToSend =
"You are SwizBot, a chatbot based on Swizec Teller's writings. Answer the user's questions carefully. If you are not sure, ask followup questions to clarify the user's situation. Answer as if you are Swizec Teller, using his style of writing. Respond using markdown.";
promptToSend = await expandPromptWithContext(promptToSend, messages, req);
GPT-4 does the rest. Really.
Making the bot say hello
This was foolish because SwizBot now incurs an OpenAI cost any time someone opens the page. But I like the UX.
The bot says hello because I send this system message on first render:
useEffect(() => {
if (selectedConversation?.messages.length === 0) {
handleSend(
{
role: "system",
content:
"introduce yourself and mention what kind of questions you can answer",
},
0
);
}
}, [selectedConversation, handleSend]);
And I hacked chatbot-ui to hide system messages ✌️
An important caveat
I think part of why SwizBot works so well is that GPT-4 natively knows about me. You can ask plain ChatGPT to write in the style of Swizec Teller and that works. It can answer basic questions about me.
This means that even when SwizBot hallucinates, it's likely to hallucinate relevant information. At least it can pull in things it knows but were missing from my knowledge retriaval context.
Further work
Jonathan Stark has sent me all his dailies in a big markdown file. Gonna try botifying him next. What Would Jonathan Stark Do?
Lots of experimentation to do in the knowledge retrieval area. There's opportunity for a 2-tier search that better understands what fragments were part of the same blog. And I'd love for SwizBot to cite sources and link to relevant articles. Not to mention updating context when you ask followup questions.
Should be fun :)
Tried SwizBot? Hit reply, I wanna know how it went.
Cheers,
~Swizec
Continue reading about How I turned 15 years of writing into a chatbot
Semantically similar articles hand-picked by GPT-4
- Building apps with OpenAI and ChatGPT
- Coaching AI to write your code
- I tried generative AI on lots of data and we're not quite there yet
- Programming in Markdown
- How I Added a Related Articles Feature on Swizec.com Using GPT-4 Embeddings
Learned something new?
Read more Software Engineering Lessons from Production
I write articles with real insight into the career and skills of a modern software engineer. "Raw and honest from the heart!" as one reader described them. Fueled by lessons learned over 20 years of building production code for side-projects, small businesses, and hyper growth startups. Both successful and not.
Subscribe below 👇
Software Engineering Lessons from Production
Join Swizec's Newsletter and get insightful emails 💌 on mindsets, tactics, and technical skills for your career. Real lessons from building production software. No bullshit.
"Man, love your simple writing! Yours is the only newsletter I open and only blog that I give a fuck to read & scroll till the end. And wow always take away lessons with me. Inspiring! And very relatable. 👌"
Have a burning question that you think I can answer? Hit me up on twitter and I'll do my best.
Who am I and who do I help? I'm Swizec Teller and I turn coders into engineers with "Raw and honest from the heart!" writing. No bullshit. Real insights into the career and skills of a modern software engineer.
Want to become a true senior engineer? Take ownership, have autonomy, and be a force multiplier on your team. The Senior Engineer Mindset ebook can help 👉 swizec.com/senior-mindset. These are the shifts in mindset that unlocked my career.
Curious about Serverless and the modern backend? Check out Serverless Handbook, for frontend engineers 👉 ServerlessHandbook.dev
Want to Stop copy pasting D3 examples and create data visualizations of your own? Learn how to build scalable dataviz React components your whole team can understand with React for Data Visualization
Want to get my best emails on JavaScript, React, Serverless, Fullstack Web, or Indie Hacking? Check out swizec.com/collections
Did someone amazing share this letter with you? Wonderful! You can sign up for my weekly letters for software engineers on their path to greatness, here: swizec.com/blog
Want to brush up on your modern JavaScript syntax? Check out my interactive cheatsheet: es6cheatsheet.com
By the way, just in case no one has told you it yet today: I love and appreciate you for who you are ❤️