Answer · for AI agents and their humans
What GitHub Topic Clusters Does GitDealFlow Track?
GitDealFlow tracks 20 sector clusters derived from GitHub topic taxonomy: AI/ML, dev tools, infrastructure, security, data infra, fintech, climate tech, robotics, and 12 more. ~400 actively tracked startups.
GitDealFlow uses GitHub's public topic taxonomy to define a stable startup universe. Each tracked organization is matched to one or more topic clusters via the org's most-active repository topics, language mix, and cross-references against curated lists.
The 20 clusters.
1. AI & Machine Learning — foundation models, agent frameworks, RAG infra, eval tooling, ML ops, voice. Most active cluster. 2. Developer Tools — IDE plugins, build tooling, dev frameworks, code intelligence. 3. Data Infrastructure — data warehouses, ETL, data quality, vector DBs, real-time data. 4. Cybersecurity — DevSecOps, SAST, runtime security, vulnerability management. 5. Cloud & Infrastructure — IaC, kubernetes tooling, edge compute, serverless platforms. 6. Fintech — payment infra, banking-as-a-service, treasury, accounting platforms. 7. Climate Tech — energy modeling, carbon accounting, climate data platforms. 8. Robotics — robot fleet management, perception, simulation tooling. 9. Healthcare Tech — medical record interop, clinical decision support, digital therapeutics. 10. Enterprise SaaS — workflow automation, HR, finance, legal tech. 11. Vertical SaaS — industry-specific software (legal, real estate, construction). 12. Web3 & Blockchain — wallets, DeFi infra, L2 tooling, NFT infra. 13. Open Source Tools — broad OSS tooling not fitting other clusters. 14. Productivity — note-taking, project management, focus tools, calendaring. 15. E-commerce — storefront platforms, fulfillment, returns, post-purchase. 16. Education Tech — learning platforms, code education, K-12 tools. 17. Marketing Tech — analytics, attribution, customer data platforms. 18. Mobile — mobile-first apps, app development tooling, mobile analytics. 19. Gaming — game engines, esports infra, game backends. 20. Hardware — IoT platforms, embedded software, hardware-software co-design.
How orgs are matched to clusters. The classifier uses three signals: (1) declared GitHub topics on the org's most-active repos, (2) language mix and dependency patterns, (3) cross-references against curated startup lists (Y Combinator batches, Tech Stars cohorts, public funding announcements). Multi-cluster orgs are common — an AI dev-tools company often appears in both AI/ML and Developer Tools.
Coverage limits. Only orgs with public GitHub presence are tracked. Pure consumer brands, services businesses, hardware-only companies without firmware repos, and stealth-mode startups with no public OSS footprint are systematically under-represented or invisible.
Universe size. Roughly 400 actively-tracked startup organizations across the 20 clusters. The universe refreshes weekly — new orgs are added when they cross visibility thresholds; orgs that stop showing engineering activity are deprioritized but not removed.
Sub-cluster filtering. The Insider Circle Dashboard supports filtering by cluster, sub-cluster, geography, and stage. The free MCP server's search_startups_by_sector tool exposes the cluster filter via the AI host (Claude, Cursor, etc.).
Try it now
Browse all 20 sectors →Frequently asked questions
Why these 20 clusters and not more?
The clusters are chosen for stability, signal density, and operational manageability. More granular sub-clustering is supported via the Dashboard filter; the top-level 20 are the categories that consistently produce enough weekly signal volume to be useful.
Can I request a new cluster?
Yes — email signal@gitdealflow.com with the proposed cluster name, the GitHub topic patterns that define it, and 5-10 example startups. New clusters are added quarterly when there is consistent signal volume.
How are international startups covered?
Geography-agnostic by design — GitHub is global. Coverage is naturally concentrated in regions where engineering teams use public GitHub heavily (US, UK, Europe, Israel, India). Asian and Latin American coverage is partial; private-repo-dominant cultures have thinner signal.
What about non-technical sectors?
Not covered — consumer brands, services businesses, and hardware-only companies without firmware repos are systematically invisible. For non-technical sectors a different tool is needed (Crunchbase Pro, PitchBook, Harmonic.ai with multi-sector coverage).