What are the differences between OpenAI’s ChatGPT, InstructGPT, fine-tuned models, and Embedding models? 

Are you like me and recently found out that OpenAI has multiple ways to consume their breakthrough GPT models? If so, let’s break down the differences and primary use cases for each of these models:

Image generated by Midjourney for a “Collage of AI Models”

ChatGPT:

  • ChatGPT is designed specifically for conversational AI applications, where the model interacts with users through text-based conversations.
  • It is trained using a combination of supervised fine-tuning and Reinforcement Learning from Human Feedback (RLHF).
  • ChatGPT is useful for building chatbots, virtual assistants, or any system that involves interactive dialogue with users. It excels at generating coherent and contextually relevant responses.
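In practice, the conversational model is consumed through OpenAI's chat endpoint, which takes a list of role-tagged messages rather than a single prompt. Below is a minimal sketch assuming the `openai` Python library (the pre-1.0, ~0.27 interface) and an `OPENAI_API_KEY` environment variable; the prompts are illustrative only.

```python
# Minimal sketch: calling the ChatGPT API via the openai library (v0.27-style).
# Assumes an OPENAI_API_KEY environment variable; prompts are placeholders.
import os


def build_messages(system_prompt: str, user_prompt: str) -> list[dict]:
    """Assemble the role-tagged message list the chat endpoint expects."""
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": user_prompt},
    ]


if __name__ == "__main__":
    import openai  # pip install openai

    openai.api_key = os.environ["OPENAI_API_KEY"]
    response = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=build_messages(
            "You are a helpful virtual assistant.",
            "Summarize the difference between ChatGPT and InstructGPT.",
        ),
    )
    print(response["choices"][0]["message"]["content"])
```

The `system` message sets the assistant's behavior for the whole conversation, while subsequent `user`/`assistant` messages carry the dialogue history, which is how the model stays contextually relevant across turns.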

InstructGPT:

  • InstructGPT is geared towards assisting users with detailed instructions and tasks.
  • It is trained with supervised fine-tuning on human-written demonstrations, followed by Reinforcement Learning from Human Feedback (RLHF), in which human AI trainers rank model outputs to steer the model toward following instructions.
  • InstructGPT is well-suited for generating helpful responses when given specific instructions or when guiding users through a process. It can be used for writing code, answering questions, creating tutorials, and more.

Fine-tuning models:

  • Fine-tuning involves taking a pre-trained language model, such as GPT, and further training it on a specific task or dataset.
  • Fine-tuning allows for customization of the model to perform well on specific tasks, making it more focused and specialized.
  • It is useful when you have a specific dataset and task at hand and want the model to provide accurate, relevant responses tailored to that task. OpenAI exposes fine-tuning for several of its base GPT models.
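The main practical work in fine-tuning is preparing training data. A sketch of the prompt/completion JSONL format OpenAI's (legacy) fine-tuning endpoint expects, using hypothetical support-ticket records; the upload/training step is shown only as a CLI comment.

```python
# Sketch: preparing a JSONL training file in the prompt/completion format
# used by OpenAI's (legacy) fine-tuning endpoint. Records are hypothetical.
import json


def to_jsonl(records: list[dict]) -> str:
    """Serialize prompt/completion pairs, one JSON object per line."""
    return "\n".join(json.dumps(r) for r in records)


records = [
    {"prompt": "Ticket: My invoice is wrong. ->", "completion": " Route to billing."},
    {"prompt": "Ticket: App crashes on login. ->", "completion": " Route to engineering."},
]

jsonl = to_jsonl(records)
# The file can then be uploaded and a job started with the OpenAI CLI, e.g.:
#   openai api fine_tunes.create -t tickets.jsonl -m <base-model>
```

Note the conventions baked into the examples: a fixed separator (`->`) ending each prompt and a leading space on each completion, both of which help the fine-tuned model learn where prompts end and answers begin.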

Embedding models vs. Language models:

  • Embedding models focus on generating fixed-length representations (embeddings) of input text. These embeddings capture semantic and contextual information about the text, which can be useful for various downstream tasks.
  • Language models, like GPT, generate coherent and contextually appropriate text by predicting the next word given the previous context. They have a generative nature and can produce human-like responses.
  • Embedding models are suitable for tasks like sentiment analysis, document classification, and information retrieval, where the fixed-length representations of text are used as input features.
  • Language models, on the other hand, are better suited for tasks like text generation, dialogue systems, and content creation, where the model needs to generate text based on context.

In summary, ChatGPT is ideal for conversational AI applications, InstructGPT is tailored for assisting with detailed instructions and tasks, fine-tuning models allow for customization to specific tasks, and embedding models provide fixed-length representations of text for downstream tasks.

Check out all the offerings listed above on OpenAI’s pricing page.

Four business considerations for anyone in B2B thinking about GenAI adoption

This article aims to give business stakeholders an understanding of the major components of GenAI so they can effectively navigate the GenAI noise and have productive conversations internally and with trusted partners. 

The recent advancements in generative AI are driving a race among businesses to capitalize on and monetize GenAI. While there is no lack of content on GenAI, I’ve found that much of it focuses on consumer productivity hacks, deeply technical research papers on arXiv, or code frameworks and GitHub repositories. My focus is on how business stakeholders should approach embedding GenAI in their companies and products through the lens of revenue growth, costs, risks, and sustainable competitive differentiators.

Section One: Generative AI and Foundation Models

Generative AI is based on what the industry refers to as foundation models – large-scale machine learning models trained on massive datasets, typically text or images. These models learn patterns, structures, and nuances from the data they’re trained on, enabling them to generate content, answer questions, translate languages, and more. Some of the most popular Generative AI use cases now include:

  • Large language models (LLMs) such as ChatGPT
  • Image generators (text-to-image) such as Midjourney or Stable Diffusion
  • Code generation tools (LLMs fine-tuned on code) such as Amazon CodeWhisperer or GitHub Copilot
  • Audio generation tools such as VALL-E

Section Two: Deployment and Consumption of Generative AI

Deployment and consumption of GenAI vary greatly. I’ve highlighted the primary areas of today’s GenAI landscape that business stakeholders should focus on figuring out for their company, with the corresponding parts of the tech stack diagram below marked in green or orange. For most business stakeholders, the question is which of these three models benefits you the most:

  1. Use (consume) an off-the-shelf software solution that embeds GenAI to reduce costs. Few B2B firms have launched GenAI features so far outside of SFDC’s Einstein GPT.
  2. Consume an existing GenAI-as-a-service offering such as ChatGPT and embed (deploy) its API functionality in your company’s products, services, or internal applications to drive revenue or lower costs.
  3. Fine-tune an existing open-source foundation model with proprietary data, and deploy it on a cloud or internal infrastructure. Embed the model outputs in your products, services, or internal applications as a competitive differentiator.
Source: Who Owns the Generative AI Platform? (https://a16z.com/)

Section Three: Pre-trained vs. From Scratch Models vs. Fine-tuned

The decision between using a pre-trained service such as ChatGPT, fine-tuning an open large language model (LLM) with your data, or training and deploying your LLM from scratch hinges on several factors – time, cost, skillset, and specificity of the task.

Pre-trained services offer a cost-effective and timely solution, requiring minimal expertise and effort to integrate into your existing processes. However, they might not always provide the level of customization needed for niche applications.

Training and deploying your own LLM from scratch gives the highest degree of customization. Still, it requires significant resources – a dedicated team of AI experts, lots of data, substantial computational resources, and considerable time investment.

Fine-tuning an open-source LLM from providers such as Hugging Face and Meta AI offers a middle ground. You get the benefits of a pre-trained model plus customization for specific use cases. However, it requires expertise in machine learning, access to relevant data for fine-tuning, and infrastructure to host your model endpoints.
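The "middle ground" above can be sketched with the Hugging Face `transformers` Trainer. Everything here is a hedged outline: GPT-2 stands in for whichever open model you choose, the "proprietary" texts are placeholders, and a real run needs a GPU plus `pip install transformers datasets`.

```python
# Hedged sketch: fine-tuning an open-source causal LM with Hugging Face Trainer.
# Model name and training texts are placeholders, not a recommendation.
def train_eval_split(examples: list[str], eval_fraction: float = 0.1):
    """Hold out the last fraction of examples for evaluation."""
    cut = max(1, int(len(examples) * (1 - eval_fraction)))
    return examples[:cut], examples[cut:]


if __name__ == "__main__":
    from datasets import Dataset
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              DataCollatorForLanguageModeling, Trainer,
                              TrainingArguments)

    texts, eval_texts = train_eval_split(
        ["proprietary document 1 ...", "proprietary document 2 ..."]
    )
    tokenizer = AutoTokenizer.from_pretrained("gpt2")  # stand-in open model
    tokenizer.pad_token = tokenizer.eos_token

    def tokenize(batch):
        return tokenizer(batch["text"], truncation=True, max_length=512)

    train_ds = Dataset.from_dict({"text": texts}).map(tokenize, batched=True)
    model = AutoModelForCausalLM.from_pretrained("gpt2")
    trainer = Trainer(
        model=model,
        args=TrainingArguments(output_dir="ft-model", num_train_epochs=1),
        train_dataset=train_ds,
        data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
    )
    trainer.train()
    trainer.save_model("ft-model")  # then host this directory as an endpoint
```

Even this toy outline surfaces the operational requirements named above: curated proprietary data, ML expertise to choose tokenization and training settings, and infrastructure to host the saved model as an endpoint.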

Section Four: Open vs. Closed Models

When it comes to open versus closed foundation models, the key differences revolve around transparency, control, and cost. Open-source models generally offer more transparency and flexibility – you can examine, modify, and fine-tune the model as you please. However, they may require a more sophisticated skill set to utilize effectively.

On the other hand, closed models are typically proprietary, meaning the inner workings are not fully disclosed. They often come with customer support and might be better suited for business leaders who prefer an off-the-shelf solution. However, they can be more costly and offer less flexibility than their open-source counterparts.

Conclusion

Understanding the tech stack and associated landscape of generative AI is crucial for business leaders to have informed discussions. In general, we’re seeing less of a focus on increasing the number of parameters and more on fine-tuning models with proprietary data. I believe data will be the biggest differentiator as more websites change their terms of use to disallow web scraping for inclusion in the training of third-party models.

We didn’t even get into the business considerations of whether you are creating a sustainable competitive advantage with GenAI, the cost implications of GenAI on your margins, or product-customer fit; I will address those in a future blog post. There are more questions than answers, but it’s clear GenAI is more than hype, and everyone should be prepared for the long game.