Featured

Build Your First Real Time Conversational Digital Twin in Five Minutes

Yujian Tang
August 19, 2024
min read
Contributors
Build AI video with Tavus APIs
Get Started Free
Share

Imagine you could build a video AI assistant that humans can interact with in real time. Now, imagine you could do it in five minutes. Well, now you can. Build a real time conversations with AI digital twins with just an API call.

In this tutorial, we’ll cover three of the main pieces that can get you off to the races:

We cover:

  • Code Snippet to Build Conversational AI Overview
  • How to Choose a Conversational Replica
  • Give Your AI Context
  • Properties for Controlling the Conversation
  • Summary of Building Your First Real Time Conversational AI

Code Snippet to Build Conversational AI Overview

Before we dive into the best practices, here’s what the code looks like. Before you work through this example, you need to pip install requests python-dotenv and sign up for a Tavus Account. Once you've signed up for an account, you can navigate to the key icon on the left side and then click "Create New Key" to get your new API key. Make sure you copy and save the key once you get it!

Get a Tavus API key

import requests

url = "https://tavusapi.com/v2/conversations"

The URL we’ll hit for this API is shown above, and an example payload is shown below. We’ll cover how to use each of the pieces of the API call in this example with a detailed walkthrough of the different parameters in the sections below. 

payload = {

   "replica_id": "<string>",

   "conversation_name": "<string>",

   "conversational_context": "<string>",

   "properties": {

       "max_call_duration": 240,

       "participant_left_timeout": 0,

       "enable_recording": True,

       "recording_s3_bucket_name": "<string>",

       "recording_s3_bucket_region": "<string>",

       "aws_assumed_role_arn": "<string>"

   }

}

Before we make our API call, we need to load our API key. I use dotenv to handle environment variables. Once we have our API key, we put it in the header and set our Content-Type to “application/json”. From there, we are ready to make a POST request to the URL specified above with the payload and headers we’ve created in this code snippet.

from dotenv import load_dotenv

import os

load_dotenv()

TAVUS_API_KEY = os.environ["TAVUS_API_KEY"]

headers = {

   "x-api-key": TAVUS_API_KEY,

   "Content-Type": "application/json"

}

response = requests.request("POST", url, json=payload, headers=headers)

print(response.text)

When we get a response, it will look something like this:

{

   "conversation_id": "c2b10f44",

   "conversation_name": "New Conversation 1722996608090",

   "conversation_url": "https://tavus.daily.co/c2b10f44",

   "status": "active",

   "callback_url": null,

   "created_at": "2024-08-07T02:10:08.103Z"

}

The main thing to pay attention to here is “conversation_url”. You can click this URL and it will take you directly to a meeting room where you or a user can chat with the digital replica in real time.

Choose a Conversational Replica

The first thing to do when creating a conversation is to choose a conversational replica. We can specify this in the first line in the payload under replica_id.

There are two options here:

  1. You can create your own conversational replica (aka your digital twin) with an API call
  2. You can choose a stock replica

If you want to create your own, you can do so with an API call and a simple video.

After you pick a conversational replica, the second line is conversation_name. This is where you can name your conversation.

Give Your AI Context

The next line we see is conversational_context. Conversational context is the last mile situational context you need to give to the AI so  it knows how to conduct the conversation. For example, if you’re building a sales coach, which we will cover an example of in our next blog, you would want to give the AI context on the the sales meeting.

Examples of conversational context could be:

  • This person set a meeting with you to talk about video AI. Ask them about the value propositions and get them to ask you about your problem set.
  • This is a conversation with a friend. Your friend is seeking some advice on how to approach looking for a job.
  • You have set a meeting with this person to discuss a potential real estate investment. Here are the details of the deal: <x y z details>. See if this is a fit for your prospect.

Properties for Controlling the Conversation

The final block of the payload we need to understand is the properties block. There are six different properties available for you to control. Let’s take a look at what each of them does.

  • "max_call_duration" controls how long the maximum length of the call can be *in seconds*, so 3600 as shown in the example is an hour long call
  • "participant_left_timeout" controls how long the call will last after a participant leaves *in seconds* so 60 shown is a one minute timeout
  • "enable_recording" decides whether or not it’s possible for you to record the call. The next three properties are only used when this property is set to true
  • "recording_s3_bucket_name" is the name of the S3 bucket where you want to save your recording
  • "recording_s3_bucket_region" is the region for the S3 bucket where you want to save your recording
  • "aws_assumed_role_arn" is the role you need to assume to save your recording

Summary of Building Your First Real Time AI Video Conversation

In this article we learned how you can quickly get started with building a real time conversational AI. All you need is an internet connection, some Python knowledge, and an API key from Tavus. With these three things in hand, you can simply set off a POST request with a series of parameters, and get a link to a conversation back.

We also covered what many of the important parameters are. Namely, we covered:

  • The replica_id, which allows you to pick a replica to use in a conversation
  • The conversational_context, which sets the specific context for a conversation
  • The six properties of a conversation you can control

Next time, we’ll dive further into building a custom conversational AI by showing you how you can create a "persona", which can provide more than just context for your conversation, but also a background and set of expertise for the replica, for example a sales agent.

Research initiatives

The team is at the forefront of AI video research and pushes model updates every two weeks based on the latest research and customer needs.

Product
min read
This is some text inside of a div block.
min read

Tavus is SOC 2 Certified

Product
min read
This is some text inside of a div block.
min read

Build Your First Real Time Conversational Digital Twin in Five Minutes

A tutorial on how you can build a conversational AI in just a few minutes with details of what each of the parameters do.
Product
5
min read
This is some text inside of a div block.
min read

Build a Custom Personality for Real Time Video AI

Customize your conversational AI's personality using a custom persona with system prompts, context, and your own LLM.
Developer
min read
This is some text inside of a div block.
min read

8+ Best Lip Sync Video APIs [2024]

Lip sync video APIs are making manual lip sync processes a thing of the past. Learn how AI lip sync works and the best APIs on the market.
Industry
min read
This is some text inside of a div block.
min read

Heygen API Review & Alternatives for AI Video Generation [2024]

Explore the Heygen API for AI video creation in 2024: Get insights, compare alternatives, and choose the best tool for your video needs.
Industry
min read
This is some text inside of a div block.
min read

Synthesia API Review & Alternatives for AI Video Generation [2024]

Explore Synthesia API and its 2024 alternatives. Learn about each tool's features, weigh their pros and cons, and find the right API video solution for you.

AI video APIs for digital twins

Build immersive AI-generated video experiences in your application