ProductMay 8, 20268 min read
How Superhuman and Databricks built a 200K QPS inference platform together
Superhuman and Databricks engineers share how they jointly migrated spelling and grammar correction workloads to the Databricks Model Serving Platform, serving over 200k QPS, with 60% throughput gains and sub-second P99 latency.
