Hugo speaks with Ines Montani and Matthew Honnibal, the creators of spaCy and founders of Explosion AI. Collectively, they've had a huge impact on the fields of industrial natural language processing (NLP), ML, and AI through their widely-used open-source library spaCy and their innovative annotation tool Prodigy. These tools have become essential for many data scientists and NLP practitioners in industry and academia alike.
In this wide-ranging discussion, we dive into:
• The evolution of applied NLP and its role in industry
• The balance between large language models and smaller, specialized models
• Human-in-the-loop distillation for creating faster, more data-private AI systems
• The challenges and opportunities in NLP, including modularity, transparency, and privacy
• The future of AI and software development
• The potential impact of AI regulation on innovation and competition
We also touch on their recent transition back to a smaller, more independent-minded company structure and the lessons learned from their journey in the AI startup world.
Ines and Matt offer invaluable insights for data scientists, machine learning practitioners, and anyone interested in the practical applications of AI. They share their thoughts on how to approach NLP projects, the importance of data quality, and the role of open-source in advancing the field.
Whether you're a seasoned NLP practitioner or just getting started with AI, this episode offers a wealth of knowledge from two of the field's most respected figures. Join us for a discussion that explores the current landscape of AI development, with insights that bridge the gap between cutting-edge research and real-world applications.
LINKS
- The livestream on YouTube
- How S&P Global is making markets more transparent with NLP, spaCy and Prodigy
- A practical guide to human-in-the-loop distillation
- Laws of Tech: Commoditize Your Complement
- spaCy: Industrial-Strength Natural Language Processing
- LLMs with spaCy
- Explosion, building developer tools for AI, Machine Learning and Natural Language Processing
- Back to our roots: Company update and future plans, by Matt and Ines
- Matt's detailed blog post: back to our roots
- Ines on twitter
- Matt on twitter
- Vanishing Gradients on Twitter
- Hugo on Twitter
Check out and subcribe to our lu.ma calendar for upcoming livestreams!