Data Infrastructure · sub-niche
Change data capture tools.
CDC pipelines that don't require a Kafka cluster.
One-quarter buildSteady — one deal per month
Why now
CDC is moving from Kafka-centric to lighter SaaS shape. Reverse-ETL category is hot.
What the signal looks like
Repos with multi-database source adapters, exactly-once delivery libraries, and destination connectors.
Public examples
We name publicprojects + categories only — never founders we track inside the paid product. The buyer’s edge stays inside the product.
- Sequin shape
- Estuary / Materialize
- Debezium-as-a-service
What this displaces
A nightly batch job + 24-hour latency.
Our build-vs-invest call
Hard to differentiate from Airbyte / Fivetran. The wedge is real-time + lightweight + database-specific.
Common questions about this niche
- Buyer?
- Data engineering teams.
- Pricing?
- Per event or per source.
- Defensibility?
- Source-connector quality + reliability.
More inside Data Infrastructure
- Vector database engines — Vector search engines optimized for specific workloads — high-dimensional, hybrid, or local.
- Real-time feature stores — Feature stores with sub-second freshness for online ML.
- Postgres extension marketplaces — Postgres is now the AI database. The extension ecosystem is the next platform.
- Columnar warehouse alternatives — Snowflake / BigQuery alternatives optimized for a specific shape — cheap, fast, or open.