DeepInfra
Summary active Updated May 10, 2026This profile is AI-generated. If you spot an error, please help us fix it by sharing a URL to the correct information.
Investors
About
DeepInfra is a Palo Alto-based cloud platform purpose-built for high-throughput AI inference, founded around 2022 12. The company processes nearly five trillion tokens per week and provides infrastructure for enterprises and scaleups running open-source and agent-driven AI workloads 2. DeepInfra is an early infrastructure collaborator in NVIDIA’s open AI ecosystem, including providing infrastructure for NVIDIA’s Nemotron models 2. Revenue has tripled since the beginning of 2026 2. The $107M Series B announced May 4, 2026 will fund global compute capacity expansion, developer tooling, and support for next-generation models 12.
Funding History
| Date | Round | Amount | Lead | Co-investors |
|---|---|---|---|---|
| 2026-05-04 | Series B | $107M | 500 Global, Georges Harik | A.Capital Ventures, Crescent Cove, Felicis, NVIDIA, Peak6, Samsung Next, Supermicro, Upper90 123 |
What Investors Say
No independently sourced investor quotes found at this time.
What Founders Say
The DeepInfra team stated: “Inference is no longer a thin layer on top of an AI stack. It’s the system constraint that will define the majority of workloads.” 1
Sources
-
DeepInfra Blog, “DeepInfra Raises $107M Series B to Scale Inference Infrastructure,” May 4, 2026. Accessed May 2026. https://deepinfra.com/blog/deepinfra-series-b↩↩↩↩
-
GlobeNewsWire, “DeepInfra Closes $107M Series B to Power Production-Scale AI Inference,” May 4, 2026. Accessed May 2026. https://www.globenewswire.com/news-release/2026/05/04/3286977/0/en/deepinfra-closes-107m-series-b-to-power-production-scale-ai-inference.html↩↩↩↩↩↩
-
SiliconANGLE, “Deepinfra lands $107M in funding to build out its dedicated inference cloud for open-source models,” May 4, 2026. Accessed May 2026. https://siliconangle.com/2026/05/04/deepinfra-lands-107m-funding-build-dedicated-inference-cloud-open-source-models/ — Independently confirms 500 Global and Georges Harik co-led; lists all participants and references 500 Global Managing Partner Tony Wang’s commentary on purpose-built inference infrastructure. ↩