Case study · GitHub signal → priced round
Together AI — open training stack to a $106M Salesforce Series A
Together AI's RedPajama dataset and training-stack openness led the $106M Salesforce Series A in March 2024.
At a glance
- Company
- Together AI
- Sector
- AI / training infrastructure
- Primary repo
- github.com/togethercomputer/RedPajama-Data
- Trigger window
- first half 2023, accelerating into 2024
- Stars at trigger
- ~4K stars at trigger window
- Announced raise
- $106M Series A (Salesforce) (2024-03-13)
- Lead investor
- Salesforce Ventures
- Time-to-money read
- OSS dataset release + foundation model partnerships led the Series A
Together AI's strategy is to be the open-stack counterweight to closed model labs. The RedPajama dataset release in 2023 was a category-defining moment — the largest curated open training set at the time.
Partnership announcements through 2023 (foundation-model deals, hardware partnerships) were the leading signal. Each one weak alone; together strong.
Salesforce led the $106M Series A in March 2024. The partnership signal had been compounding for at least four quarters.
Signals that would have flagged this pre-raise
- OSS dataset release:RedPajama dataset in 2023
- Partnership density:Multiple foundation-model + hardware partnerships
- Conference signal:Active presence at ML conferences
Repositories
Frequently asked questions
Is dataset release a generalizable signal?
For training-infra companies — yes. Releasing a credible open dataset is a strong public commitment that draws partnership interest.
Find the next one
VC Deal Flow Signal tracks engineering acceleration weekly across twenty sectors — the same signal shapes that preceded the raise above.
Get the weekly signal report →