PDF download Download Article
Train the AI chatbot to recognize your documents, data, and preferences
PDF download Download Article

Wouldn't ChatGPT be more useful if it knew more about you, your data, your company, or your knowledge level? If you need ChatGPT to provide more relevant answers or work with your data, there are many ways to train the AI chatbot. To train ChatGPT, you can use plugins to bring your data into the chatbot (ChatGPT Plus only) or try the Custom Instructions feature (all versions). If you'd rather create your own custom AI chatbot using ChatGPT as a backbone, you can use a third-party training tool to simplify bot creation, or code your own in Python using the OpenAI API.

Things You Should Know

  • Try ChatGPT Plus plugins to bring in training data from the web, your CMS, PDFs, & other sources.
  • Use Custom Instructions to teach ChatGPT about you and your data using prompts.
  • To train a custom chatbot for your website or app, try a builder like BotSonic or SocialIntents.
  • You can use the ChatGPT API and Python to create an AI chatbot trained on files from your computer.
Section 1 of 4:

Using ChatGPT Plus Plugins

PDF download Download Article
  1. Subscribe to ChatGPT Plus. With a ChatGPT Plus subscription, you can use plugins to connect your own data to ChatGPT. For example, if you want ChatGPT to perform calculations or work with your data in Google Sheets, on a specific website, or using peer-reviewed articles, you can use plugins to give ChatGPT access to the data it needs to process your requests.
    • ChatGPT Plus also gives you access to GPT-4 , the most advanced version of ChatGPT.
  2. Once you have ChatGPT Plus, you can enable plugins easily:
    • Click the three dots on your name at the bottom-left corner of ChatGPT.
    • Click Beta Features .
    • Toggle on the Plugins feature.
    Advertisement
  3. Make sure to click the GPT-4 model at the top to gain access to the plugin list.
  4. You'll see it below "Default."
  5. This launches the Plugin Store, which is only visible to subscribers.
  6. Search or browse for a plugin that supports your training data type, then click Install to get the plugin. If the plugin requires you to log in to a data source, you'll be prompted to do so during installation. Some plugin ideas for training ChatGPT with your data:
    • If you have a Zapier account, you can use the Zapier plugin to train ChatGPT with data from Salesforce, HubSpot, Google Sheets, Google Docs, your email account, your CMS, databases, and many other sources. [1]
    • For web-based data, try the Link Reader, Web Requests, Web Search, or Scraper plugins.
    • If your training data is in one or more PDF files or Google documents, you can install one of the PDF plug-ins, such as ChatWithPDF or AiPDF.
    • You'll also find plugins that can interact with many different data sources, including Chat With Workspace (for Notion), Sembot (for Semrush), and AI2SQL (for SQL databases).
    • To use a video as training data, try the Video Insights plugin.
  7. At the top of your GPT-4 conversation, click the menu, then check the boxes next to the plugins that pertain to your training data.
    • If the plugin requires you to log in to a data source, you'll be prompted to do so during installation.
  8. Once you've enabled the plugin, you can use it to train ChatGPT on the data it needs to provide you with answers. Provide ChatGPT with a link to the data source, and explain what you want it to do with your data.
    • If you've enabled a plugin that can read websites, Sheets, or PDF files, paste a public link to the file into the chat and provide ChatGPT with a prompt. For example, "Use the data in this spreadsheet to tell me which of my sales agents needs uptraining."
  9. Advertisement
Section 2 of 4:

Using Custom Instructions

PDF download Download Article
  1. If you don't have ChatGPT Plus, you can train the free version of ChatGPT by giving it custom instructions. You can use this method to tell ChatGPT more about you, your projects, your business, or any other details to help the chatbot generate more applicable responses. [2] To enable Custom Instructions:
    • In the mobile app, go to Settings > Account > Custom Instructions , and toggle the feature on.
    • On the web, sign in to ChatGPT, click your name at the bottom-left, click Custom Instructions , and then click OK .
  2. Now you can tell ChatGPT anything it needs to know to help you better.
    • For example, if you're using ChatGPT to write cover letters for multiple jobs, you can explain your work history, your skills, interests, qualifications, and other details that you want the chatbot to remember for future cover letters.
    • You may also use this field to provide specific training data for ChatGPT to better field your questions about certain data, such as your sales figures for the year, customer feedback, and more.
    • Include anything that you don't want to repeat in your prompts. For example, if you're using ChatGPT to develop catchy titles for your blog posts, you can specify your writing style and give examples of your favorite titles here instead of repeating yourself in each prompt.
  3. In the second field, you can describe the tone, format, and other preferences for ChatGPT's responses.
    • For example, if you're a student using ChatGPT to summarize scientific articles, you can type something like "Begin your responses with a summary of the article's findings in no more than 5 sentences. Then, summarize each section of the article as a bulleted list. Remain neutral. Remember that I am a student, not an expert, though I want to become an expert on the topic."
    EXPERT TIP

    Tyrone Showers

    Technologist
    Tyrone Showers is a Technologist and the Co-owner of Taliferro Group, an IT consulting company based in Seattle, Washington. With over 35 years of professional experience, he specializes in API Design, e-Commerce, Operational Efficiency, and website development. He has a B.S. in Computer Science from DeVry Institute of Technology.
    Tyrone Showers
    Technologist

    Imagine yourself as a mentor to ChatGPT. To guide ChatGPT toward your writing style, provide writing samples. After receiving responses, offer constructive feedback, highlighting what aligns with your goals and areas for improvement. Be specific in prompts to avoid repetitive phrases or generic beginnings.

  4. Now that you've provided ChatGPT with some training data, it will use it to better answer your questions in all future conversations. If you don't want ChatGPT to use your data in other conversations, you can turn off Custom Instructions once you start a new chat.
  5. Advertisement
Section 3 of 4:

Creating Custom Chatbots for Your App

PDF download Download Article
  1. If your goal is to create your own version of ChatGPT for your website or app, the easiest way to get started is to use a ChatGPT training tool. These tools create custom chatbots using the ChatGPT API, giving your site or app a version of ChatGPT that's actually trained on your data. These tools are not free, but they are simple to use and are the best way to train ChatGPT on your data if you don't know how to code. Some tools to check out:
  2. Depending on the product you're using, you may not need to use your own API key. But if you're on a plan that requires using your own API key, you can create one easily:
    • Go to https://platform.openai.com/account/api-keys . If you're not signed in, you'll need to do so now.
    • Click Create new secret key .
    • Name the key.
    • Click Create secret key .
    • Copy your key to a text file and save it, as you won't be able to view it again once you close this tab.
  3. Most custom chatbot creators support multiple types of training data, including text files, documents, Excel, websites, SQL, and more. To keep things simple, add your training data to a single folder so you can easily upload it to your provider.
  4. Follow your tool's instructions to add the training data to your custom chatbot. You can usually type some training data in manually, such as your bot's name, company name, address, common responses to frequently asked questions, and more. Once the training is complete, you can integrate your custom ChatGPT-based chatbot into your website or app.
  5. Advertisement
Section 4 of 4:

Creating Custom Chatbots in Python

PDF download Download Article
  1. If you know Python, you can easily train ChatGPT on your own data by creating a script. This can be helpful if you or someone you know wants to interact with a version of ChatGPT that can access and work with specific data that's not necessarily on the web. You'll need Python 3 to use the tools in this method. If you haven't already installed it, see How to Install Python .
    • You'll also want to make sure you've upgraded pip if you haven't done so in a while. You can do so using the command python3 -m pip install --upgrade pip (Mac & Linux) or py -m pip install --upgrade pip (Windows). [3]
    • You won't be able to integrate this chatbot into an existing website, but your custom chatbot will be available on the web.
  2. You'll need to install four different libraries to train ChatGPT with your own data:
    • Install the OpenAI library so you can interact with the ChatGPT API: pip3 install openai
    • Install LlamaIndex so you can connect ChatGPT with your data: pip3 install llama-index
    • If you want to use PDFs to train ChatGPT, install PyPDF2: pip3 install PyPDF2 .
    • Install Gradio, which allows you to use a web-based user interface to interact with the chatbot: pip3 install gradio .
  3. If you haven't already generated an API key for this project, you will need to create one using your OpenAI account . Here's how:
    • Go to https://platform.openai.com/account/api-keys .
    • Click Create new secret key .
    • Name the key.
    • Click Create secret key .
    • Copy your key to a text file and save it, as you won't be able to view it again once you close this window.
  4. Create a folder on your computer called training . Create a new subfolder called docs inside that folder. Then, copy your training data to the docs folder inside the training folder. You can use text files, PDFs (if you installed the PyPDF2 library with pip), CSV files, and more to train your model. [4]
    • If you want to use Excel data to train your model, you will need to convert it to CSV first .
    • Because OpenAI charges you for using the API, start with smaller files when training. The larger the files (and the more files you're using to train the model), the more expensive the process will be. [5]
  5. You can use any text editor, including Notepad for Windows or TextEdit for Mac. We'll provide an example script below, but replace 'Your API Key' with your actual API key. [6]
     from 
     llama_index 
     import 
     SimpleDirectoryReader 
     , 
     GPTVectorStoreIndex 
     , 
     GPTListIndex 
     , 
     LLMPredictor 
     , 
     PromptHelper 
     import 
     gradio 
     as 
     gr 
     import 
     sys 
     import 
     os 
     os 
     . 
     environ 
     [ 
     "OPENAI_API_KEY" 
     ] 
     = 
     'Your API Key' 
     def 
     construct_index 
     ( 
     directory_path 
     ): 
     max_input_size 
     = 
     4096 
     num_outputs 
     = 
     512 
     max_chunk_overlap 
     = 
     20 
     chunk_size_limit 
     = 
     600 
     prompt_helper 
     = 
     PromptHelper 
     ( 
     max_input_size 
     , 
     num_outputs 
     , 
     max_chunk_overlap 
     , 
     chunk_size_limit 
     = 
     chunk_size_limit 
     ) 
     llm_predictor 
     = 
     LLMPredictor 
     ( 
     llm 
     = 
     ChatOpenAI 
     ( 
     temperature 
     = 
     0.7 
     , 
     model_name 
     = 
     "gpt-3.5-turbo" 
     , 
     max_tokens 
     = 
     num_outputs 
     )) 
     documents 
     = 
     SimpleDirectoryReader 
     ( 
     directory_path 
     ) 
     . 
     load_data 
     () 
     index 
     = 
     GPTVectorStoreIndex 
     ( 
     documents 
     , 
     llm_predictor 
     = 
     llm_predictor 
     , 
     prompt_helper 
     = 
     prompt_helper 
     ) 
     index 
     . 
     save_to_disk 
     ( 
     'index.json' 
     ) 
     return 
     index 
     def 
     chatbot 
     ( 
     input_text 
     ): 
     index 
     = 
     GPTVectorStoreIndex 
     . 
     load_from_disk 
     ( 
     'index.json' 
     ) 
     response 
     = 
     index 
     . 
     query 
     ( 
     input_text 
     , 
     response_mode 
     = 
     "compact" 
     ) 
     return 
     response 
     . 
     response 
     iface 
     = 
     gr 
     . 
     Interface 
     ( 
     fn 
     = 
     chatbot 
     , 
     inputs 
     = 
     gr 
     . 
     components 
     . 
     Textbox 
     ( 
     lines 
     = 
     7 
     , 
     label 
     = 
     "Enter your text" 
     ), 
     outputs 
     = 
     "text" 
     , 
     title 
     = 
     "Custom AI Chatbot" 
     ) 
     index 
     = 
     construct_index 
     ( 
     "docs" 
     ) 
     iface 
     . 
     launch 
     ( 
     share 
     = 
     True 
     ) 
    
  6. Make sure you're not saving it to the docs folder—it needs to be in the training folder (the same folder that contains the docs folder). You can call this script something like training.py .
  7. You can do so in a Terminal or Command Prompt window. Here's how:
    • At the prompt, use cd to enter your training folder. For example, if you created the training folder on your Windows desktop, you might use cd C:\Users\yourusername\Desktop\training .
    • Type python3 training.py and press Enter or Return . If you gave your script a different name, use that instead.
    • The process may take several minutes, especially if you've added a lot of training data to your docs folder. Once the process is complete, you'll see two URLs (one local, and the other on the web) to access your custom-trained chatbot.
  8. Copy the local URL from the terminal or command prompt window and paste it into a web browser. Or, if you want someone else to be able to access your chatbot, you can send them the public URL—the link will stay active for 72 hours, but make sure you don't turn your computer off in that time.
    • To restart the chatbot after 72 hours, just run python3 training.app again to generate a new URL.
    • To add new training data, copy new files into the docs folder, then run python3 training.app .
  9. Advertisement

Expert Q&A

Ask a Question
      Advertisement

      Tips

      Submit a Tip
      All tip submissions are carefully reviewed before being published
      Name
      Please provide your name and last initial
      Thanks for submitting a tip for review!

      Expert Interview

      Thanks for reading our article! If you’d like to learn more about using chatgpt, check out our in-depth interview with Tyrone Showers .

      About This Article

      Thanks to all authors for creating a page that has been read 52,745 times.

      Is this article up to date?

      Advertisement