There used to be a list of launched companies in the YC faq, but there started to be so many that it was always out of date. We have a list ourselves of course, but we couldn't publish that because it's full of stuff companies don't want disclosed. And it would be a lot of work to tease apart the stuff that's secret from the stuff that isn't. Often I'm not sure myself. So making a publicly viewable list of all the startups seems to fall into the category of things that might be interesting to do, but wouldn't actually help the startups much, and would be a significant amount of work.
I think this comment at 3:51 PM EST on 10/17/2010 may be the official death knell of structured data.
A few years ago, YC would have maintained portfolio data in a spreadsheet or database. But in today's world, it's probably in a wiki.
It might be surprising that an uberhacker like PG doesn't have a custom database of YC startups (which would, by definition, make it trivial to export the "non-secret" stuff). That's why I think this comment speaks volumes about how -- even amongst supernerds who think in a systematic, structured way -- a loose affiliation of hypertext documents is the preferred way to capture and record institutional knowledge.
Or maybe it is in a spreadsheet and PG is just lazy. But I doubt it.
We do have a custom database. What makes it hard to export data is that we have no representation of what info is public. The database knows only that e.g. startup x has raised a series A round, or has been acquired, but it doesn't know whether those things have been announced yet.
I believe the only dataset that YC needs to furnish is the official list of alumni.
Follow-on rounds, exits, and current status can all be provided by the community. But there seems to be no authoritative dataset that shows which startups participated in YC.
But from your comments, I'm guessing that some companies prefer that their mere involvement with YC not be disclosed at all, which makes this a harder task then just periodically exporting the name and URL of every portfolio company.
"Rumors of my death have been greatly exaggerated." - Structured Data
There is a reliable source of YC alumni list from outside of YC, but I hesitate to mention it as it includes founders of startups that are still in stealth. If anyone needs the list PM me.
Although with a bit of thinking I'm sure most people can figure out where to get it from.
Well, in all geekiness, pg could release the structure of 'the list' without revealing the actual data & put an end to all of this structured vs. unstructured speculation. Just sayin'.
I think his point was that if it's in a spreadsheet, it would only take a few minutes to copy the "name" column, excluding all startups where the "launched" column indicates that they're still in stealth.
He wasn't calling using a spreadsheet lazy; he was calling using one and still being unwilling to compile a list of YC startups lazy. Of course, PG is busy enough that I wouldn't blame him anyway (but he does have people working for him to do things like that).
I thought that as well and then I remembered, they don't fund companies or ideas, they fund teams. These teams, may or may not change the idea the had when they applied.
Jed (the source spreadsheet maintainer) does a really good job of keeping the newer companies up to date with public info, but it's hard to find a lot of public info on YC companies before the fact that they were YC was newsworthy.
I agree, but it's probably up to the founders and pg how much information they make public, more so from the founders. It'd be interesting to see someone publish their quarterly earnings akin to what balsamiq did awhile back (http://blogs.balsamiq.com/product//2010/01/03/a-look-back-at...).
Does someone have Jed's contact details, I did a lot of research on this for a blog post a while back, probably makes sense for me send some of the corrections his way.