Digital Replicas That Can Have Real Conversations


Oct 11, 2024 (37 mins)

Hassaan Raza is the cofounder and CEO of Tavus, a video API platform for digital twins. Tavus has raised more than $28M in funding from investors such as Sequoia and Scale VP.

Hassaan's favorite book: Go Like Hell (Author: A. J. Baime)

(00:01) Introduction
(00:38) Overview of AI in video generation
(01:44) AI models used in video generation
(03:35) Capturing intricate facial movements in real-time
(06:46) Data capture and 3D modeling from basic video input
(09:01) Explanation of neural radiance fields and Gaussian splatting
(10:14) Capturing facial expressions for video generation
(15:22) Temporal coherence in video generation
(18:05) Challenges in conversational video, including lip-syncing and emotion alignment
(20:38) Inference challenges in conversational video
(22:47) Bottlenecks in the pipeline: LLMs and time-to-first-token
(26:58) Multimodal models and trade-offs
(27:36) Advice for founders running API businesses
(30:04) Pitfalls to avoid in API businesses
(32:15) Technological breakthroughs in AI
(34:10) Rapid-fire round

--------
Where to find Prateek Joshi:

Newsletter: https://prateekjoshi.substack.com
Website: https://prateekj.com
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19
Twitter: https://twitter.com/prateekvjoshi