How does VC Deal Flow Signal measure engineering acceleration?

Engineering acceleration is computed weekly from public GitHub data. The pipeline pulls 14-day commit velocity, contributor count, and repository creation events for approximately 4,200 startup organizations across 20 sectors via the GitHub REST API, then expresses each metric as a percentage change versus the prior 14-day window. A startup whose 14-day commit velocity doubles relative to its own baseline is recorded as +100% acceleration. The metric is computed per organization against its own historical baseline, not across the population.

What data sources are used in the methodology?

The primary source is the public GitHub REST API v3 — search/repositories, stats/commit_activity, contributors, and repos endpoints. No private repositories, no scraping, no terms-of-service violations. The methodology excludes commits authored by accounts matching common bot patterns (Dependabot, Renovate, GitHub Actions) and applies file-count filtering to remove trivial commits. The full data sources page lists every endpoint and refresh cadence.

Why use a 14-day rolling window?

Investor signal pipelines tend to use either 14-day or 28-day rolling windows. The 14-day window is more responsive — it surfaces breakouts faster — at the cost of higher volatility. To filter the resulting noise, the methodology requires a breakout to persist into a second 14-day window before it is treated as actionable. This two-period confirmation rule removes most one-period spikes caused by hackathons, launch sprints, or single contributors onboarding.

How are bot commits filtered out?

Commits authored by accounts whose name or type matches known bot patterns (bot, github-actions, dependabot, renovate) are excluded before any aggregation. A second filter removes commits with diffs below a small file-count threshold to suppress automated formatting and dependency-update commits. The combination removes the loudest noise sources without overfitting; further normalization can be added but is rarely worth the engineering cost.

What are the four signal types?

Acceleration patterns sort into four operational types. The hiring burst is rising velocity plus rising contributor count — the strongest fundraise predictor. The shipping sprint is velocity rising while contributor count holds flat — typical of launch preparation. The infrastructure buildout is repository creation accelerating versus baseline — strategic technical investment. The platform migration is language mix shifting between primary languages over a quarter — slower-moving but strategically significant. Each pattern implies a different diligence question.

How is funding stage estimated?

Funding stage is estimated heuristically from contributor count, repository age, language mix maturity, and any cross-referenced public funding history. Pre-seed teams typically have 1 to 3 contributors and codebases under six months old; seed teams have 3 to 8 contributors with sustained activity over several quarters; Series A teams have 8 to 20 contributors with multiple repositories and mature language mixes. The estimate is heuristic and is intended as a screening filter, not a definitive label.

Is the methodology peer-reviewed?

The methodology write-up is published on SSRN at ssrn.com/abstract=6606558 and mirrored on Zenodo with a DOI. The dataset is auto-indexed by OpenAlex (W7154916891) and DataCite. The work is not formally peer reviewed in a journal but is openly published and reproducible. Investors evaluating the signal can audit the full methodology and replicate the metrics from the same public GitHub data described in the paper.

How often is the data refreshed?

The full panel refreshes weekly. Each Monday the pipeline pulls the latest 14-day GitHub activity, recomputes acceleration metrics, classifies signal patterns, and republishes the sector rankings, the API endpoints, and the dashboard. The free Signal Report email is sent the same morning. Intraday changes do not affect rankings — the cadence is intentionally weekly to match how investors review pipelines.

Is engineering acceleration the same as a startup accelerator program?

No. They are unrelated concepts that share a word. A startup accelerator (Y Combinator, Techstars, 500 Global) is a fixed-term program founders join. Engineering acceleration is a quantitative signal computed from public GitHub activity. Throughout this site the term refers exclusively to code-side momentum: commit velocity, contributor growth, repository creation. It has nothing to do with program participation.

How We Measure Startup Engineering Acceleration

TL;DRVC Deal Flow Signal (GitDealFlow) ranks venture-backed startups by GitHub commit-velocity change — a code-side momentum signal computed from public GitHub data, unrelated to startup accelerator programs. The pipeline pulls weekly GitHub REST API data for ~4,200 organizations across 20 sectors, computes rolling 14-day commit velocity and contributor growth, classifies each org into one of four signal types, and publishes the rankings. This metric — referred to throughout the site as engineering acceleration — has historically preceded fundraise announcements by three to six weeks.

Primary signal: percentage change in 14-day commit velocity vs. the prior 14-day window — normalized against each org's own baseline so it works across stages and team sizes. (Glossary)
Four signal types: engineering hiring burst, infrastructure buildout, deploy frequency spike, framework migration. (llms-full.txt)
Formal preprint of the methodology is available on SSRN at abstract id 6606558. (SSRN preprint)

Cite as: VC Deal Flow Signal — Methodology (signals.gitdealflow.com/methodology), retrieved Q2 2026. · Data current as of 2026-07-15.

VC Deal Flow Signal uses publicly available GitHub data to identify startups showing unusual engineering momentum. This page explains exactly how we source, process, and rank that data, so investors can evaluate the signal quality before acting on it.

Data Sources

GitHub API v3 is our primary data source. We query the search/repositories endpoint to discover active startup organizations across 20 sector-specific topic clusters (e.g., machine-learning, fintech, cybersecurity). We then pull per-organization data from the stats/commit_activity and contributors endpoints.

Filtering: We exclude large tech companies (Google, Microsoft, Meta, etc.), major open-source foundations, and organizations with patterns inconsistent with venture-backed startups. The goal is to surface companies in the pre-seed through Series B range.

Geography is derived from the GitHub organization profile location field, mapped to broad regions (US, UK, EU, APAC, Canada, LATAM, MENA).

Core Metrics

Commit Velocity (14-day)

The total number of commits to an organization's most active public repository over a rolling 14-day window. We use GitHub's weekly commit_activity data (52 weeks of history) and sum two consecutive weeks to produce a 14-day figure.

Commit Velocity Change

The percentage change in commit velocity compared to the preceding 14-day window. A startup with 40 commits this period and 20 commits last period shows +100% velocity change. This is the primary ranking signal — it measures acceleration, not absolute volume.

Contributor Count & Growth

The number of unique contributors to the organization's most active repository. Growth is estimated by comparing recent 6-week commit volume to the prior 6-week period. A rising contributor count often signals team expansion — a leading indicator of funding or product-market fit.

New Repositories

The count of public repositories created by the organization in the last 30 days. A burst of new repos often signals infrastructure buildout, new product lines, or framework migrations.

Composite predictor: velocity × contributor diversity (the 3.4× finding)

The single most predictive composite in the SSRN panel of 219 confirmed rounds is 14-day commit-velocity acceleration combined with low top-contributor concentration (Gini coefficient under 0.30 over the same 14-day window).

Orgs that meet both conditions are 3.4× more likely to announce a Series A within 60 days than orgs with high acceleration alone. In other words: velocity matters, but the shape of the velocity matters more. A team where one developer is doing 80% of the commits can spike just as hard as a team where eight developers are sharing the load — but only one of those teams looks like a fundraise candidate to a Series A partner.

Source: SSRN preprint abstract=6606558, panel n=219, regression stratified by stage. Lift survives a 90-day extension of the panel (next refresh: Q3 2026).

Signal Classification

Each startup is assigned one of four signal types based on which metric is driving the acceleration:

Engineering hiring burst — contributor growth rate exceeds 50%. The team is scaling rapidly.
Infrastructure buildout — 3 or more new repositories in 30 days. The company is expanding its technical surface area.
Deploy frequency spike — commit velocity has increased 150% or more. The team is shipping at an unusually high rate.
Framework migration — general acceleration that doesn't fit the above categories, often indicating a technology stack transition.

Stage Estimation

We estimate startup stage from contributor count as a rough proxy for team size: Pre-seed (1–7 contributors), Seed (8–19), Series A/B (20–49), Growth (50+). This is an approximation — not all contributors are employees, and not all employees contribute to public repos.

Update Frequency

Data is refreshed weekly (Monday mornings). The pipeline queries GitHub for the latest 52 weeks of commit history, recalculates all metrics, regenerates sector rankings, and rebuilds the site. Each sector page shows rankings for the current quarter and up to four previous quarters.

Known Limitations

Private repos are invisible. Some startups keep all or most code in private repositories. Our signal only covers public engineering activity.

Commit volume is not code quality. High commit velocity can reflect rapid feature development, but also refactoring, documentation, or CI/CD noise. We mitigate this by measuring change from baseline rather than absolute counts.

Not investment advice. Engineering acceleration is a leading indicator of traction, not a guarantee of success. Always conduct your own due diligence before making investment decisions.

See the signals in action

Browse startup rankings across 20 sectors, updated weekly with fresh GitHub data — or jump straight to the pricing page.

Browse Sector Rankings See Pricing Read the Buyers Guide

How We Measure Startup Engineering Acceleration

Data Sources

Geography is derived from the GitHub organization profile location field, mapped to broad regions (US, UK, EU, APAC, Canada, LATAM, MENA).

Core Metrics

Commit Velocity (14-day)

Commit Velocity Change

Contributor Count & Growth

New Repositories

The count of public repositories created by the organization in the last 30 days. A burst of new repos often signals infrastructure buildout, new product lines, or framework migrations.

Composite predictor: velocity × contributor diversity (the 3.4× finding)

Source: SSRN preprint abstract=6606558, panel n=219, regression stratified by stage. Lift survives a 90-day extension of the panel (next refresh: Q3 2026).

Signal Classification

Each startup is assigned one of four signal types based on which metric is driving the acceleration:

Engineering hiring burst — contributor growth rate exceeds 50%. The team is scaling rapidly.
Infrastructure buildout — 3 or more new repositories in 30 days. The company is expanding its technical surface area.
Deploy frequency spike — commit velocity has increased 150% or more. The team is shipping at an unusually high rate.
Framework migration — general acceleration that doesn't fit the above categories, often indicating a technology stack transition.

Stage Estimation

Update Frequency

Known Limitations

Private repos are invisible. Some startups keep all or most code in private repositories. Our signal only covers public engineering activity.

Not investment advice. Engineering acceleration is a leading indicator of traction, not a guarantee of success. Always conduct your own due diligence before making investment decisions.

Use the method in practice

Read the research panel →Read the buyer's guide →Compare sourcing tools →Investor workflows →Check your Scout Score →

You just read how the signal is computed. The honest next step is to see it run on a sector you actually source.

You don’t read the code — we do

See the signal on your own sector before you commit a euro

Get my First Look — €7 Or start free — the Sunday issue

€7 once · 30-day Signal-or-It’s-Free — reply REFUND, keep everything · no auto-renew · compare all tiers

Signed The Data Nerd · pseudonymous narrator · methodology over personality

See the signals in action

Browse startup rankings across 20 sectors, updated weekly with fresh GitHub data — or jump straight to the pricing page.

Browse Sector Rankings See Pricing Read the Buyers Guide

How We Measure Startup Engineering Acceleration

Data Sources

Core Metrics

Commit Velocity (14-day)

Commit Velocity Change

Contributor Count & Growth

New Repositories

Composite predictor: velocity × contributor diversity (the 3.4× finding)

Signal Classification

Stage Estimation

Update Frequency

Known Limitations

Related questions worth reading next

See the signal on your own sector before you commit a euro

Or browse by axis

See the signals in action

How We Measure Startup Engineering Acceleration

Data Sources

Core Metrics

Commit Velocity (14-day)

Commit Velocity Change

Contributor Count & Growth

New Repositories

Composite predictor: velocity × contributor diversity (the 3.4× finding)

Signal Classification

Stage Estimation

Update Frequency

Known Limitations

Related questions worth reading next

See the signal on your own sector before you commit a euro

Or browse by axis

See the signals in action