Ep. 567 w/ Brian Stevens, CEO at Neural Magic


Apr 23, 2024 | 46 mins

Together with our community, we engineer sparse LLM, CV, and NLP models that are more efficient and performant in production. Why does this matter? Sparse models are more flexible and can achieve unrivaled latency and throughput performance on your private CPU and GPU infrastructure. Check us out on GitHub and join the Neural Magic Slack Community to get started with software-delivered AI.

http://neuralmagic.com/