Building The Future Show - Radio / TV / Podcast - Ep. 567 w/ Brian Stevens CEO at Neural Magic

Apr 23 2024 46 mins

Together with our community, we engineer sparse LLM, CV, and NLP models that are more efficient and performant in production. Why does this matter? Sparse models are more flexible and can achieve unrivaled latency and throughput performance on your private CPU and GPU infrastructure. Check us out on GitHub and join the Neural Magic Slack Community to get started with software-delivered AI.

http://neuralmagic.com/

Download episode Share

Copy URL

Subscribe on Podcast Addict