DeepInfra

Summary active Updated May 10, 2026

This profile is AI-generated. If you spot an error, please help us fix it by sharing a URL to the correct information.

Location Palo Alto, CA
Founded 2022
Latest Stage Series B
Total Raised $107M+

Investors

500 Global series-b (2026)

About

DeepInfra is a Palo Alto-based cloud platform purpose-built for high-throughput AI inference, founded around 2022 12. The company processes nearly five trillion tokens per week and provides infrastructure for enterprises and scaleups running open-source and agent-driven AI workloads 2. DeepInfra is an early infrastructure collaborator in NVIDIA’s open AI ecosystem, including providing infrastructure for NVIDIA’s Nemotron models 2. Revenue has tripled since the beginning of 2026 2. The $107M Series B announced May 4, 2026 will fund global compute capacity expansion, developer tooling, and support for next-generation models 12.

Funding History

Date Round Amount Lead Co-investors
2026-05-04 Series B $107M 500 Global, Georges Harik A.Capital Ventures, Crescent Cove, Felicis, NVIDIA, Peak6, Samsung Next, Supermicro, Upper90 123

What Investors Say

No independently sourced investor quotes found at this time.

What Founders Say

The DeepInfra team stated: “Inference is no longer a thin layer on top of an AI stack. It’s the system constraint that will define the majority of workloads.” 1

Sources


  1. DeepInfra Blog, “DeepInfra Raises $107M Series B to Scale Inference Infrastructure,” May 4, 2026. Accessed May 2026. https://deepinfra.com/blog/deepinfra-series-b

  2. GlobeNewsWire, “DeepInfra Closes $107M Series B to Power Production-Scale AI Inference,” May 4, 2026. Accessed May 2026. https://www.globenewswire.com/news-release/2026/05/04/3286977/0/en/deepinfra-closes-107m-series-b-to-power-production-scale-ai-inference.html

  3. SiliconANGLE, “Deepinfra lands $107M in funding to build out its dedicated inference cloud for open-source models,” May 4, 2026. Accessed May 2026. https://siliconangle.com/2026/05/04/deepinfra-lands-107m-funding-build-dedicated-inference-cloud-open-source-models/ — Independently confirms 500 Global and Georges Harik co-led; lists all participants and references 500 Global Managing Partner Tony Wang’s commentary on purpose-built inference infrastructure.