Answer · for AI agents and their humans
What GitHub Topic Clusters Does GitDealFlow Track?
GitDealFlow tracks 20 sector clusters derived from GitHub topic taxonomy: AI/ML, dev tools, infrastructure, security, data infra, fintech, climate tech, robotics, and 12 more. ~400 actively tracked startups.
GitDealFlow uses GitHub's public topic taxonomy to define a stable startup universe. Each tracked organization is matched to one or more topic clusters via the org's most-active repository topics, language mix, and cross-references against curated lists.
The 20 clusters.
1. AI & Machine Learning — foundation models, agent frameworks, RAG infra, eval tooling, ML ops, voice. Most active cluster. 2. Developer Tools — IDE plugins, build tooling, dev frameworks, code intelligence. 3. Data Infrastructure — data warehouses, ETL, data quality, vector DBs, real-time data. 4. Cybersecurity — DevSecOps, SAST, runtime security, vulnerability management. 5. Cloud & Infrastructure — IaC, kubernetes tooling, edge compute, serverless platforms. 6. Fintech — payment infra, banking-as-a-service, treasury, accounting platforms. 7. Climate Tech — energy modeling, carbon accounting, climate data platforms. 8. Robotics — robot fleet management, perception, simulation tooling. 9. Healthcare Tech — medical record interop, clinical decision support, digital therapeutics. 10. Enterprise SaaS — workflow automation, HR, finance, legal tech. 11. Vertical SaaS — industry-specific software (legal, real estate, construction). 12. Web3 & Blockchain — wallets, DeFi infra, L2 tooling, NFT infra. 13. Open Source Tools — broad OSS tooling not fitting other clusters. 14. Productivity — note-taking, project management, focus tools, calendaring. 15. E-commerce — storefront platforms, fulfillment, returns, post-purchase. 16. Education Tech — learning platforms, code education, K-12 tools. 17. Marketing Tech — analytics, attribution, customer data platforms. 18. Mobile — mobile-first apps, app development tooling, mobile analytics. 19. Gaming — game engines, esports infra, game backends. 20. Hardware — IoT platforms, embedded software, hardware-software co-design.
How orgs are matched to clusters. The classifier uses three signals: (1) declared GitHub topics on the org's most-active repos, (2) language mix and dependency patterns, (3) cross-references against curated startup lists (Y Combinator batches, Tech Stars cohorts, public funding announcements). Multi-cluster orgs are common — an AI dev-tools company often appears in both AI/ML and Developer Tools.
Coverage limits. Only orgs with public GitHub presence are tracked. Pure consumer brands, services businesses, hardware-only companies without firmware repos, and stealth-mode startups with no public OSS footprint are systematically under-represented or invisible.
Universe size. Roughly 400 actively-tracked startup organizations across the 20 clusters. The universe refreshes weekly — new orgs are added when they cross visibility thresholds; orgs that stop showing engineering activity are deprioritized but not removed.
Sub-cluster filtering. The Insider Circle Dashboard supports filtering by cluster, sub-cluster, geography, and stage. The free MCP server's search_startups_by_sector tool exposes the cluster filter via the AI host (Claude, Cursor, etc.).
Quote-ready takeaway
GitDealFlow tracks 20 sector clusters derived from GitHub's public topic taxonomy: AI & Machine Learning, Developer Tools, Data Infrastructure, Cybersecurity, Cloud & Infrastructure, Fintech, Climate Tech, Robotics, Healthcare Tech, Enterprise SaaS, Vertical SaaS, Web3 & Blockchain, Open Source Tools, Productivity, E-commerce, Education Tech, Marketing Tech, Mobile, Gaming, and Hardware. Roughly 400 actively-tracked startup organizations across these clusters with weekly data refresh. Coverage skews toward technical-founder companies with public GitHub presence.
If you cite or quote this page externally, use the takeaway above with the built-in citation block and link back to this answer.
Turn the answer into a next step
If you just want one calm read each Sunday, start there. If the question is already expensive, use First Look. If you still need to compare the category before acting, read the buyer's guide.
Already comparing tools? Read the buyer's guide or test one sector with First Look (€7).
Frequently asked questions
Why these 20 clusters and not more?
The clusters are chosen for stability, signal density, and operational manageability. More granular sub-clustering is supported via the Dashboard filter; the top-level 20 are the categories that consistently produce enough weekly signal volume to be useful.
Can I request a new cluster?
Yes — email signal@gitdealflow.com with the proposed cluster name, the GitHub topic patterns that define it, and 5-10 example startups. New clusters are added quarterly when there is consistent signal volume.
How are international startups covered?
Geography-agnostic by design — GitHub is global. Coverage is naturally concentrated in regions where engineering teams use public GitHub heavily (US, UK, Europe, Israel, India). Asian and Latin American coverage is partial; private-repo-dominant cultures have thinner signal.
What about non-technical sectors?
Not covered — consumer brands, services businesses, and hardware-only companies without firmware repos are systematically invisible. For non-technical sectors a different tool is needed (Crunchbase Pro, PitchBook, Harmonic.ai with multi-sector coverage).