Data infrastructure is fund quadrant — but the indie wedge inside it is the best on the site.
Scope: Data warehouses, lakehouses, ETL, transformation, observability, governance, vector databases, embedding pipelines, RAG infrastructure.
Cost-to-build
64/100
Core infrastructure is expensive and team-heavy. Adjacent dbt-shaped tooling is buildable at indie scale.
Deal-velocity
82/100
Engineering acceleration in data infra is among the cleanest on the site — 8–10 week lag, with a very strong link to revenue inflection.
Live signal: 12 data infrastructure startups currently tracked for Q2 2026. See the roster →
Where Data Infrastructure lands
Build
Build it yourself
Fund
Write the cheque
Avoid
Reroute the energy
Wait
Wait or partner
High cost-to-build, high deal-velocity. The market rewards capital and rewards it fast — sourcing inside the pre-fundraise window matters more than picking the right sub-niche.
The honest version
The core data-warehouse layer is closed and capital-intensive. But the indie wedge — the developer-tools-shaped sliver that lives on top of the warehouse — is one of the highest-yield build opportunities on the site. Investors should fund the core layer; founders should build the wedge.
If you are building
Fits when: You are shipping a dbt-shaped adjacent tool, not a new warehouse.
If you are funding
Fits when: You can underwrite enterprise infrastructure GTM and your portfolio model handles long Series A → B compounding.
Marginally. The incumbents are entrenched and the cost-to-build is high. The investable layer is now everything adjacent — testing, governance, lineage, vector — not the warehouse itself.
Vector databases, embedding pipelines, and RAG infrastructure straddle both. Treat them as data-infra by mechanics (revenue ties to ingest / query volumes) but with AI/ML co-tagged competitive dynamics.
AI/ML is the most expensive thing to build and the fastest thing to fund.
Fintech rewards capital and patience, not weekend builders.
Climate tech is finally fundable software — but only the software half.
Cyber is the most reliable mid-velocity fund quadrant on the site.
Every sector we track lives somewhere on the 2×2 — the index page groups all 20 verdicts in one place.