Hacker News new | comments | show | ask | jobs | submit login
GitHub monoculture (nedbatchelder.com)
156 points by ingve 1060 days ago | hide | past | web | 104 comments | favorite

> Someone said to me, "I couldn't find coverage.py on Github." Right, because it's hosted on Bitbucket.

I was curious, so I went to bitbucket.org to try to find coverage.py. Experience was awkward to say the least:

1) Bitbucket homepage does not have a search bar, nor is there a link from the homepage to the search [1]

2) Guessing that the URL is /search puts you in the team profile for account `search`. finally you see what looks like a search bar at top [2]

3) Searching for coverage.py doesn't show `ned / coverage.py` in the first page! There are lesser-known forks of coverage.py that show up higher than the original project. [3]

I'm sure these will improve over time, but discoverability is still incredibly difficult on bitbucket.

[1] http://i.imgur.com/soRXB98.png [2] http://i.imgur.com/z5YmcL2.png [3] http://i.imgur.com/uhg8rbc.png

Why not use Google? Searching for 'coverage.py', the bitbucked repository is the fifth link for me. The first and second link are more general informational pages which link to the bitbucket repository. Searching for 'coverage.py source repository' gives the bitbucket repository as the first and second result.

Because Bitbucket's own search ought to have more insight into searching its content (but apparently not).

I think the core complaint, though, is that people's first step should be "search using a general search engine" (whether Google or Bing or Baidu or Yandex, etc.) not "search using GitHub". The complaint and expectation is not "someone should have tried to see if it was explicitly on BitBucket", but "people should not assume that you are using a single service". It is effectively similar to "I tried to find your email address and I couldn't" coming down to "I looked on G+ so I could find your Gmail address, and you don't seem to even have an account". There are tons of alternatives to both GitHub and BitBucket, including "I am hosting my repository on my own domain": having everyone assume "well, the content must be accessible within a specific silo" to the point of not even considering "I should use a general purpose web search engine for this use" is an egregious example of "bad monoculture".

Bitbucket also has awkward URLs, where my fork of someone else repo appears as a subdirectory of their project. I still can't get over the fact that the Bitbucket website won't let (easily) visitors search without creating an account. That's a terrible user experience and I'm sure many potential users or customers just choose the tab and go to GitHub.

I obviously can't speak for Atlassian, but I get the impression they really don't care about not being a popular FREE public code hosting site. They definitely want to be the go to site for FREE private code hosting for very small teams though.

> I get the impression they really don't care about not being a popular FREE public code hosting site.

Well it's more that they accepted reality (git and github had the network effect) and moved on.

Which is frankly ridiculous if true.

The number of programers in the world is going to follow an exponential curve. By 2032 there will be more programmers than all the atoms in all the stars of the known universe.

If you are logged in their is a search bar in the upper right hand corner. It has the same problem as your search thing.

Just do a

    pip home coverage
And the home page for the package will pop up just like `homebrew`

I go to GitHub to find a project first because that's where I hope it is. GitHub's UI and culture go a long way towards standardizing repo layout in such a way that I can more easily use it, plus if I find bugs or issues there's the Issues and Fork links right there.

In my admittedly anecdotal experience, GitHub projects are way more likely to have a clear overview with a link to a GitHub Pages site with real docs, they're more likely to define the rules/procedures for sending in contributions, and they're more likely to have an active community of contributors. I know I'm not going to have to hunt for the source link in a paragraph somewhere, because if I know the repo name and author I can clone the source.

If issue tracking could be distributed (yes I know about Fossil and others), and the players like github and bitbucket were compatible with such, then we would no longer have any dependence on any of these services.

The way I'd host my projects would be:

1. I have my own server where I have a git repo, the issues repo, and maybe a wiki repo.

2. My server would host semi-statically generated project browsers; source code browsers, issue browsers, etc. Unlike when I was running Trac, I'd have no need to allow people access to my server, no need to worry about spammers, pages would be low-latency/cachable so that I can allow crawlers as well even on a very small linode host.

3. all of my wiki/issues/git would mirror out to github, bitbucket, and whoever else wants to participate, and updates on those sites mirror back. I already do this now with my git repos using WSGI services and hooks.

4. when someone wants to post/update an issue, they link out to one of those services, or even run a local command line/GUI based issue system (wouldn't that be great??), and everything synchronizes back.

The only missing link is a decent distributed issue system everyone can get behind. Building out the glue to sync between services is not that hard of a task.

> If issue tracking could be distributed (yes I know about Fossil and others), and the players like github and bitbucket were compatible with such, then we would no longer have any dependence on any of these services.

This is definitely the biggest problem, there are a bunch of "distributed" trackers living inside the repository, but so far I've seen nothing which worked well (even ignoring the lack of integration with gh/bb/whatever).

There's the second issue of contribution workflow though, it's not really possible to have a dozen different places where contributions can be proposed, discussed, hashed out and possibly rejected; with the sub-problem of tools integration (e.g. integration bots or CI runs on contribution proposals)

My issue with that is how?

I have seen file(s) in the repo. But branching a big makes little sense

I have seen git-notes, which seems good, but nothing survived across rebases.

There is a zombie StackOverflow thread on this but I would love a good solution -

> My issue with that is how?

Well that's the big nut to crack isn't it?

I totally agree with the author.

Worse is, when answering to the question what do you use for version control software, I hear quite often people saying "github" and not git, and not understanding the difference between the two.

Yes, and some people say “just go on git” or “I found it on git”, meaning “GitHub”.

Whenever Github goes down I always wish someone would invent a decentralized version control system.

Remember at work when Github goes down everyone is just sitting there twiddling their thumbs for hours. As in "yeah my commit is ready just waiting to push it" or "waiting for the issue to be close, then we can test".

It is ironic a bit isn't it. One of Git's major selling points was -- we are tired of one centralized version system, you shouldn't need a single point of failure or be tied to the network in order to develop. And then years later we have one giant global single point of failure across the world. There is some irony there.

(Yeah, yeah, I know I can commit locally. But at some point you'll need to update issues, and publish your changes. This is more a of a generalized point. It would be the same if bitbucket went down and so on. We never it seems have gotten to the mythical peer 2 peer internet of things -- it just doesn't work it seems).

> But at some point you'll need to update issues, and publish your changes.

The second part is solved. You just need a second git server. Maybe Bitbucket, or maybe your own. We just never do it though, for reasons I don't know either. I don't do it, even if I know how. There are multiple tools to push to different servers automatically, but we still go the comfortable route. Maybe it's the psychological thing that "if Github goes down it won't be for long, is it really going to affect me?".

Because of that I don't agree the tools are the problem though. Or even that they are part of some irony. Git delivers on its promise. People just don't use it like that.

The issues part or the dependency with Github to test/deploy software is more of a vendor lock in situation. A lot of people depend on it being up to work, but that's the convenience charge since Github isn't open source. Not much different from people depending on Heroku to be up for their application to function.

On that note, question. Is there an easy way to create mirror servers?

For instance, have a Github repo but also a personal Git server and Bitbucket. Whenever you do `git push`, it'd update all three. And all three would be in sync.

There are a few ways.

You can create a single remote called "all" that has all your servers URLs. Then you can just push to that one, like "git push all", and all your servers will update. Pretty much the simplest, you just need to edit the .git/config.

You can set up a mirror too ("git remote add foobar --mirror=push ..."), but I've never done that.

Or make a hook so that whenever you push to a remote it also pushes to another.

There's also mr[1], which I've heard of but not used.

There are ways aplenty, we just neglect to use them.

[1]: http://myrepos.branchable.com/

It is very easy to setup a shared git server.


If you can mount it (w/ write permissions), you can push to it. Most companies already have a networked storage place.

We don't use github, but when our server is offline, we just spin up a VM from backup, push the latest commit to it and start using it as the new centralized server. I think what you lack is a decent policy, not technology.

Having a VM and two separate places for the canonical repo is bad policy. If you have this centralized server why do you also have Github?

OP doesn't use GitHub if I read his/her comment correctly.

However... having multiple remotes is a very easy way to get multiple distributed backups of your code. (You should do a "git fsck" on all your mirrors at least semi-regularly. The reason has to do with how push and --mirror work.)

Git is distributed, but obviously you're going to have a canonical repository somewhere (git.kernel.org, for instance). Decentralization doesn't mean getting rid of a canonical place where approved patches end up, and you really aren't going to see canonical repositories disappear, we have them for a good reason.

However, there is something to be said about mirroring canonical repositories for accessibility.

Do you even understand git? Literally push the whole repo wherever you want. git remote add blah. It doesn't matter where it's hosted. Many popular projects push their code to multiple services -- it's not harder than pushing to one.

If github goes down and your devs are twiddling their thumbs because they can't push, you have larger problems.

Wait are you trying to use the "do you even lift, bro" meme but referencing git? That's funny.

Yeah, I have been using it for about 5 years. I know how git works

> Many popular projects push their code to multiple services

What does that mean "multiple services"? Multiple machines?

> But at some point you'll need to update issues

You can reference (and thus comment on) and close issues from your commit messages.

> and publish your changes.

That does not require sitting on your ass and doing nothing though.

One of the big selling point of github is how little downtime they have.

But when they do, hundreds of thousands of developers sit there waiting for it to come up.

Who are these people, who are incapable of writing product code or tests, or catching up on documentation writing, or stepping back to think about design, or just reading some of the code to learn it better?

These are the people who made the mistake of signing up to a service with a central point of failure in the whole world. When that goes they have to find out other things to do.

But of course it is their fault for not having anything else to do and so someone on HN tells them that, "Well trust me this is totally fine, you guys and gals just need to do something else at the moment. Don't think of it as a bug, it is a feature built it to help you multitask..."

So where it the 'apache' of Github? Presumably that would be Gitlab, but its not quite aligned with the Github experience.

One could imagine a package you install on your droplet, ec2, linode, what-have-you, which would create a 'site' for your project which included a Wiki, a user management package with role management, automated source code backup to Glacier, automated mirror management to choice of mirrors. Install, set up your various keys (aws, ssh to mirrors, etc) and then push to it your source and make it the origin master.

You make that and then people might start creating free floating git repos, except that it will cost them some small money (like $25/month) which would discourage many people.

Gitlab is a non-starter since it requires developers who can barely manage to use Git to also run their own servers. If that weren't the case, then we'd just have another GitHub-lookalike. I'm old enough to remember when self-hosted Subversion was a popular geek "badge of honor", and the frustration of trying to pull code from some guy's VPS that had been offline all weekend.

Something like Fossil's model seems better, where issue tracker and wiki are instead built into the repository. Sure, essentially you're still running a little mini web server on the local machine, but the presentation is completely different (a single binary rather than a huge Ruby webapp with many dependencies).

The problem isn't really that GitHub goes down, but that we can't mutate or make use of certain pieces of state while it is down (mostly issues/wiki), and nobody can see our changes until it comes back up.

Actually Git comes with all the tools necessary to send pure data in the form of changesets by mail, its just that its usability sucks unless you have a working sendmail/mutt config, which 99% of devs don't bother with any more.

That seems the more interesting thing to fix, rather than just having another self-hosted SourceForge clone or whatever

As a side note, GitHub wikis are actually git repos that use Gollum:

https://github.com/gollum/gollum / git@github.com:gollum/gollum.wiki.git

As such, you can clone wikis, work on them locally, and push changes later just like any other git repo.

Yep, but unless you train your team up on using Mou or just reading raw markdown, you can't actually browse or publish to the wiki ("use or mutate state"), and again the usability is horrible so nobody will do it.

Usability is the huge selling point of Github, so any replacement would need to be as slick

It's Gollum. So if you install the Gollum gem and tell it to serve the directory, it'll serve it up just fine.

Unless Gitlab or a similar product starts offering a way that I can search all the Gitlab projects, it's not going to amount to much. Github has taken over the space not only because it's easy and powerful for creators: it's also a powerful search engine for users. As the article notes, the first thing many of us think when looking for code is "Lets search GitHub".

Google is not efficient at searching for code. If each creator spins up their own project sites using Gitlab or other tool, there needs to be some way of searching the network of Gitlab sites if we want users to actually switch.

One of the nice things about Github and Bitbucket is that they both use Git. You can push to both of them and enjoy the best of both worlds.

    git remote add bitbucket git@bitbucket.org:username/reponame.git
    git remote add github git@github.com:username/reponame.git
You could even add another Git repository you control fully as well.

This article captures just a little of what I've been feeling for some time.

There are a couple of other things about GitHub specifically, and the presumption in some quarters that everyone uses it, that put me off.

Firstly, they create their own world where lots of things aren't quite like normal Git. Some are simple terminology changes, like using "fork" instead of "clone". Some are more intrusive, like tying the pull request mechanism to the GitHub review tool.

Secondly, GitHub terms only let you have one unpaid account. If I want to maintain some personal repos, but also some with one of my professional hats on, it costs me money. If I want to keep my work on a major project separate to a few sidelines, it costs me money.

Thirdly, GitHub is an on-line service, and like anything else in the cloud, that means it comes with unknown security, privacy and reliability implications. For a big open source project, that might or might not be a problem, but I don't know why anyone would expect me to host my own projects there instead of, I don't know, in a Git repository on a server at home or at work, or something crazy like that.

So no, sorry, I won't fork you on GitHub. Nor will I contribute a patch via a pull request. Nor will I raise my helpful bug report on your project if you require me to submit it via GitHub. And I guess I'm lucky that I don't apply for jobs as an employee any more, because I am able to show approximately none of the best professional quality code I write without violating confidentiality agreements, and certainly none of it is in a GitHub repo for public inspection.

I am happy to support open source projects when I have a bit of time free, and I'm happy to demonstrate my credentials and abilities to prospective clients, but GitHub is a hoop that is rarely worth jumping through IME.

Well, a fork isn't the same as a clone, so that is why there are different words.

There is no pull request mechanism not tied to a site, unless you want to send an email with a public git URL yourself. Which you can still do, assuming the project you are working with has specified that. But why should you be able to dictate to other people that they do it this way? Github gives you the public URL, if you want it to, and lots of people do.

Hosting your own source repository also has unknown security, privacy and reliability implications. (Or known ones that are often not very good)

It doesn't really matter to Github users if you don't want to use Github. Doesn't make a lot of sense that you would decline to make bug reports just because the project used Github. That sounds like you are boycotting those projects for some moral reason.

Well, a fork isn't the same as a clone, so that is why there are different words.

Perhaps unintentionally, I think you have just made my first point better than anything I wrote before.

But why should you be able to dictate to other people that they do it this way?

I wouldn't presume to dictate anything to anyone. I'm just explaining why I find it frustrating that so many people now assume others will work their way, as if using GitHub is some sort of universal norm.

Hosting your own source repository also has unknown security, privacy and reliability implications.

So the cloud guys keep saying. And yet we keep hearing about downtime and security breaches in cloud services, while the servers and normal Git repos I use regularly all seem to work just fine. I guess the guys running all those projects must be some kind of sysadmin demigods or something.

It doesn't really matter to Github users if you don't want to use Github.

It does if they want me (or anyone with similar feelings; let's not make this personal) to contribute to their projects.

Doesn't make a lot of sense that you would decline to make bug reports just because the project used Github.

I don't. I decline to make bug reports when making the bug report requires the use of GitHub, which is probably a more dangerous trend that I have observed with increasing frequency in recent months.

That sounds like you are boycotting those projects for some moral reason.

I have no idea where you got that from. I'm not "boycotting" those projects at all. But since I can't create a free GitHub account just to contribute to them without affecting anything else I might do there, and since participating via GitHub is significantly more hassle than contributing to other projects I have supported in various ways, I choose to spend my available time where I can help other projects more efficiently.

> Secondly, GitHub terms only let you have one unpaid account. If I want to maintain some personal repos, but also some with one of my professional hats on, it costs me money. If I want to keep my work on a major project separate to a few sidelines, it costs me money.

What they do allow, though, is adding new "organizations". These are effectively groups that get top level URLs just like users (https://github.com/galvanix/ is one my business partner and I set up).

When you create a repo you can put it in your main account or in any of the organizations you are part of.

If your organization doesn't host any private repos then there is no cost.

Now, this isn't completely disconnected from your main account—you still do everything in that name, but it's close.

> …because I am able to show approximately none of the best professional quality code I write without violating confidentiality agreements, and certainly none of it is in a GitHub repo for public inspection.

Well, ain't that the truth for most of us! :-)

Mercurial is so much easier for day to day usage, I work with both and need to explain so much more when getting a new person up to speed with git. The "detached head" state is very difficult to explain.

I absolutely agree but I still use git.

Since git has largely won the dvcs war and I'm a one man developer having to inter-operate with others it makes sense for me to suck it up and use git myself.

I really dislike git though.

You might try SmartGit/Hg. I've been using it for a year now and find it vastly superior to the Git command line.

As one example mentioned by the gp, the "detached heads" are a breeze: Just click the Lost Heads checkbox and they all show up in the visual log like any other commit. It was when I saw this that the concept of detached heads made sense to me.

Of course, when I've suggested SmartGit to people, a few have replied, "Oh no, I would never use a GUI for this, it's command line only for me."

To each his own, but I don't understand that. To me, source control is an inherently visual activity just like text editing, and a visual tool is a good match.

So for anyone who is willing to consider a visual tool, I highly recommend it.


That looks interesting.

I'm mostly ok now though since I rammed enough of git's weirdness into my head that I can work with it (that and phpstorm/pycharm have nice basic git support).

Mercurial may be easier to use than git, but Github is easier to use than Mercurial. Switching to a workflow where all commits are made to forks and all updates to master come through pull requests largely solves the problems with using bare git.

Which is kinda the problem we're discussing here. Git is far less usable without Github and yet all the workflow niceties that we've come to rely upon are locked up in Github's proprietary implementation. The more developers come to rely upon Github features, the more locked into Github we are and the more difficult it would be if Github were to go away or become obnoxious.

That's completely irrelevant to TFA (github is not the only git host after all)

I wouldn't mind -- strictly from a user standpoint -- if all code was in one place. It's easier to collaborate and contribute to other repositories when they're all in the same SCM and host. If someone has their source in another host (whether it's Git or some other SCM), I'm less likely to bother contributing -- and I would assume this is the case with many others.

Of course, from an "all your eggs in one basket" or business monopolization standpoint, I agree with the author.

But selfishly speaking, I like Github and find that it adds more value to me the more it becomes a monoculture.

I wrote about something similar in in 2010 because these sites are walled gardens: http://kev009.com/wp/2010/10/stop-distributed-version-contro...

I was hoping for something like Diaspora (https://joindiaspora.com/) so we could use our choice of repo or self-hosting but still interact with a global community.

Github isn't the only choice, but I think it's the preferred choice for a code sharing community because they basically give their product for free for open-source projects on public repositories. And it's easy and good looking.

Bitbucket distinguishes itself from Github by making itself free for small teams only, but allowing you free private repositories. Seems like they are more interested in businesses and proprietary code.

I like Github, but just switched our business repos over to bitbucket. Github's pricing for businesses isn't even competitive.

It's not immediately apparent from the front accessible parts of Bitbucket's website, but if you refer users with a provided link you also get up to 3 extra free users.

So BitBucket and Gitorious and others will need to excel and beat GitHub at their game. Competition will eventually solve this the same way we aren't beholden to the same methods and sites we used 10 years ago. The next winner may not even exist yet and I expect that the barriers to entry here are not nearly as high as those in other markets that have been disrupted over the last few years.

GitHub has already won due to networking effects. It's like trying to unseat Facebook as the dominant social network. It will require a sea change in technology used for version control or sharing code.

We've seen other, similar, widely-used platforms lose prominence rather quickly. SourceForge and Google Code are two such examples (both of which also support Git these days). I see no reason why GitHub would be immune from such forces.

It isn't, but something better has to come along first. We'd still be using Sourceforge if it were not for its successors.

This is the key. Also, people need to remember that "better," in the case of taking over market leader position, means that the value add of the improvement must be > the benefits of using the market leader. This is a higher bar than simply newProductX > marketLeader.

It helps that ad-driven SourceForge seems to be doing its best to shove people off the platform.

Git wasn't the first version control system. It won because people liked it best. If something better emerges (by better, I don't mean slightly better, but something where value_over_Github > switching_cost, which is often missed when people discuss what is "better" than the current leader), it will replace it

The competition you describe exists now. Having a new "winner" won't change the point being made in the article. I don't agree with the conclusion he came to, but he's pointing out issues with there being a winner whose site is the de facto place where code / project / people go.

> he's pointing out issues with there being a winner whose site is the de facto place where code / project / people go.

That's likely true. However, it ignores whether or not those issues are outweighed by the substantial benefits to a de facto place that everyone goes, which are generally very large.

I use Bitbucket too. Github doesn't provide free private repos last time I checked, needed for bootstrapped startup. The centralization vs. distribution+independent evolution debate is always an interesting one, though. I think you need high quality centralization at first then competition can be useful; Github set the bar, now lets see what everybody else can bring to the table

I would love it if Github built in a contributors license agreement into my repos so all issues and wiki content were licensed to the repo owner. This would give repo owners the ability to move from one version control service to another without getting locked in by copyright issues on issues and comments made by others.

This is a non-issue and I don't mean that in a small way.

Github has produced a near monopoly on hosting for popular open source projects because it is so liked (as OP himself admits). Soon enough something better will come along that makes Github look like SourceForge and then we will all migrate our projects over there.

Finding out a project is on Bitbucket or uses Mercurial has actually stopped me wanting to contribute. The additional hassle of remembering how Mercurial works, digging out my BitBucket credentials etc, is a pain in the backside.

You can use bitbucket just fine to host git projects.

I use a mixture of bitbucket and github for different projects, not wanting all my eggs in one basket, really. I find them pretty much equal in many respects - except being able to have private repos on bitbucket is very convenient, and github pages is also very useful. I wish bitbucket had something similar.

https://bitbucket.org/dfairhead/streetsign-server in case you're interested.

I have contributed to a git project on bitbucket where the process was well explained (enough for a git beginner): https://bitbucket.org/LukasKnuth/backslide/src/5313b2c63a3da...

Bitbucket is often slower than github, but this has not impact for my small projects. And I like to have some free repository that remain private.

The opposite to me (not github, but git). I wish github support mercurial. I have used hg-git, but still is something else to do.

Bitbucket supports git.

I don't see why this is a bad thing.

There is inherently less cognitive overhead to use something you are already familiar with. Same reason why people wouldn't give Bing the time of day.

All things being equal, and you just had to choose a public place for a remote repo, Github should be your first choice unless you have a very good reason go use something else.

I'd rather not, nor expect consumers of my code, to have to learn a completely different UI than what they are used to in order to use or contribute to my public repo.

I recently started a storing a new project and its services on Bitbucket for a variety of reasons.

- I think it's healthy to always look at the different tools in the space to witness first hand if you're missing out on something

- There's some product integration in the works, so, it only makes sense to get curious about how existing users of the platform may feel once it is there.

So far it's going well, but it's early days.

I actually much prefer bitbucket's issue tracker to github's.

It's interesting to see the progression of public source control hosting. SourceForge was the goto place for many years, but it's really only used for hosting large binaries at this point, and very few people host code there. Google Code started to steal some people away from SF, but again it was more focused on project hosting then just getting code to people.

Github is interesting is that the code is shown front-and-center whenever you visit a project page. Sure there is the Readme.md and wiki stuff you can do, but it makes browsing code and project discovery very easy. Github also made project creation and management super easy and lightweight. The number of questions you need to answer to setup a repo is so minimal. It gets out of your way when setting up a project.

I still feel like Google Code has some things going for it. The biggest being it's defect tracker compared to Github.

I actually don't feel like having the code at the top is helpful for me. I rarely look at code on github. I go to a project page and scroll down past the code to read the readme. That happens easily 37 times out of 38. Yea, I made up those numbers. I know there's been a few times I've clicked on the code but it seems so seldom I'd much rather the readme was front and center and there was a "code" button/tab.

If the project sounds interesting I download it and look at the code locally. The few times I look at the code online is in response to like a stackoverflow question if I don't have the code locally already and I want to look up something.

But that's just me. It's not that big a deal for me to scroll down. If everyone else goes to code more than the readme under it then fine

I find that casually overlooking the root directory very often gives me a good estimate of how active the project is (and in which subdirectories it’s been worked on most recently) and also (very superficially) how clean the code base is (e.g. are there falsely committed backup/swap/a.out files even in the root dir). Of course, in most cases I’d immediately scroll down to the readme.

Unless, however, I really do want to view a file online (say because I want to check the API of some command I’m using and it is not documented otherwise) in which case I really appreciate that it is so easy to find. I really don’t understand why most other web frontends (Bitbucket, Gitweb, GitLab) make the commit log the first thing to see on the summary page. These mean nothing to me from an outside perspective.

Bitbucket shows you the README followed by a list of recent commits on the front page of a project. I have grown to really like their UI as a whole, although both github and bitbucket suffer from placing the clone URL in a tiny little box that's incredibly annoying to actually use.

This basically reduces to two questions:

a) is github monoculture is bad?

I really disagree with author's view on the emphasize on public github project in job interview process is misplaced. Yes, it will not eval how good you are at your previous job, but the problem is it is very very hard to find how good you are at previous job. Doing open source project at spare time is a positive sign of interest in programming. If in the interview process you said I don't have a github account, but I put all my repo at abc.com, I am pretty sure it will work, too.

And I don't care the monoculture if its service is the best in this field. Why use the second best if the best is possble? I would like to be an earlier adoptor if there are better service, even if there are currently no user.

b) how to disrupt github?

That I have no idea. What github does not provide but is really needed? I do not know. Would love to hear ideas.

> b) how to disrupt github?

There are certainly areas that github could use some improvement. One of the basic features that I see as currently missing is actual side-by-side Diffs like those available in Chromium Code Reviews[1]. This makes it so much easier to review code changes and thus improves the experience of collaboratively working on a project. The Github folks are smart people, so I'm sure they know this and yet years after launch this is still not available in Github while they continue adding things like improved map diffs [2] which while nice features on their own, it only serves to show that a good code diff experience is not as important to them now.

Secondly, I find Github search to be lacking in terms of fulfilling its potential. It seems we all agree that Github is the number one place most people put their public code on. This is great, we have one central public repository. And yet searching for, finding and discovering good code on Github is not as easy as it should be. I argue that it should be easy to find solutions to programming problems that others have already tackled. Imagine being able to type "compute sha256, c++" and seeing relevant code snippets right there. I am not saying this is an easy problem to solve, but I think it's definitely worthwhile one to try. Who knows, may be someone will build an awesome code search engine on top of github.

[1] - https://codereview.chromium.org/261573002/diff/60001/google_...

[2] - https://github.com/blog/1772-diffable-more-customizable-maps

If you use Chrome you can install the Octosplitt[1] extension for side-by-side diffs.

[1]: https://chrome.google.com/webstore/detail/mnkacicafjlllhcedh...

Just to mention, Bitbucket has side-by-side Diffs.

That a person is applying for a programming job is the first, and most important, positive sign of interest in programming. These days, in fact, thanks to the ever increasing emphasis on having a github profile as part of the recruitment process, I view having a github profile as being little different from plugging in the Language Alphabet in the right places on a resume. It's empty filler that may or may not have any use.

You don't "disrupt" something because you don't like it, you disrupt it to make it better.

Every industry that's been disrupted was filled with companies that were coasting or in decline. GitHub has a ways to go before it ends up there.

Remember, GitHub disrupted SourceForge because it wasn't improving. Stack Overflow disrupted Experts Exchange for similar reasons.

The one thing GitHub excels at where everyone else doesn't even come close is as a place to read the code. Big bucket, unfortunately, doesn't even come close in code reading.

GitHub is the only place where you can easily read, search, and immediately get the commit history for a file in one place extremely quickly.

Oddly what seems to have sparked this blog was someone failing to find Ned's own "coverage.py" on githubs search. Ned's response was interesting - there is clearly a monoculture - they should have searched on google.

my first though was - but Ned is caught in the far more worrying search monoculture.

And my second thought was the other monocultures to fear more - antibiotics, wheat and soya...

In the end, contributing to someone else's code is hard - it requires not merely coding skills but agreements on standards, willingness to compromise, and willingness never to do so, it requires humility and determination. Anything that makes that easier I welcome - I suspect githubs biggest issue is not having people start repos, it's having > 1 person work in the same repo.

They can solve that, more than happy to live with that monoculture

>they should be a choice, not the only choice.

They are a choice. The author even mentioned Bitbucket. The fact that Github has become the dominant player doesn't make it not a choice. There is no one preventing a competitor from emerging and working to compete with Github (the same way Git emerged and enabled Github to exist in the first place). Being frustrated that people default to Github doesn't mean the choice to use something else doesn't exist. It does, most people have just decided not to.

Most people can't be bothered to choose a domain, set up a web and e-mail servers for it (because if you're doing it, you gotta do it right) and then maintain it all. Even though it's on almost everyone's todo list, I'm sure.

Setting up a GitHub profile and posting a few quips on Twitter is almost effortless. So that's what we use. If someone comes a long and makes a competing product that is just as easy to use, people will switch. It won't be much of a problem.

GitHub got content creators in the door by being easy to use like that. It became the de facto search engine for users looking to find projects because once you have a critical mass of the creators, that's what happens.

If whoever comes along with a competing product runs it in a centralized manner like GitHub, then they'll create the same issue the OP has with GitHub. If they offer a decentralized, easy to use solution, they need to also offer a way for the person looking for a project to search the decentralized net. Without a unified search, nobody is going to oust GitHub from their spot.

It's not just the setup, it's having to monitor and update the servers, deal with outages, etc. Not to mention you are paying for the machine. For most people this has no inherent value over just letting Github do it.

I feel the same about the rise of Macbooks in the dev community. A proprietary platform becoming a standard and excluding people having different use-cases.

This is not a social problem, talking about it like one is just going to end in tears.

This is a technical problem.

There is no way to efficiently look for code across the various repository sites. Too often a search on google turns up nothing, while searching on github (or pypi or another repository of software libraries) finds exactly what you are looking for.

Until there is an effective way to search for code, people will be left no other choice than to search on a few sites (github is by far the largest, so most people will check there, and some may not check anywhere else!).

To the author: if this bothers you, fix it, but don't act like it's a behavioral issue. (Also, bitbucket has been dead for like 5 years, it's a backwater and it's not going to come back.)

No it's a social problem. I do not search for code using function names or snippets of amusing for loops.

I want to find code based on "it's simple, you know my kind of simple, like made it into the BSD manual simple, but I also need it to be in python or bash because I am going to need to back on it but not spend so long on it that it becomes full time (leaves out C++ and haskell)

Ok now it must work with rabbit MQ and preferably install via pipy and run with decent test coverage using py.test and

you get the idea.

Anyway no search system does that - what does that is newsletters and emails and recommendation and ... social interactions

Maybe all the other places to host code should just do their job better. Ever think of that?

There are a plethora of choices, and you can always use Google to find projects. However, it happens to be that GitHub is the best choice, it's a free choice, and there's really no reason NOT to use it other than some stupid prejudice that putting all your code in one place is a bad idea.

Keep backups, and of course make sure GitHub isn't the only place your code exists (since you're using Git, that's not the case). and I don't really see a problem here.

"The D in GitHub stands for Distributed"

It is a valid point of view and the same argument can be used to many other websites and services across the web.

I actually much prefer Unfuddle to GitHub, but everyone is on GitHub, so I use GitHub.

"we are asked to provide a Github username and a Twitter handle. I suggested that a homepage URL is a more powerful and flexible way for authors to invite the curious to learn more about them"

Your website demonstrates who you say you are. Github work demonstrates who you actually are.

Many of the best programmers I've ever worked with have absolutely no code on GitHub. Why is that? It's because they're using their talents and abilities in industry, solving real problems, in positions where they're obligated to not share the code they've produced.

I'd hire those people any day over somebody with numerous GitHub repos filled with quasi-maintained JavaScript libraries or numerous forks of code they didn't write.

No, Github work demonstrates what work you have done on projects that happen to live on Github. Saying that that is "who you actually are" shows a rather narrow idea of what "being" means.

There are other things than code to define who you are; and even if you're only look for code, that code need not live on Github.

If I give prospective employers my github handle they'll think I'm inactive in open source. If I point them to my webpage they can see the links to all the non-github hosted open source I do (Launchpad, self-hosted, etc).

I was hoping this had to do with monocles

Guidelines | FAQ | Support | API | Security | Lists | Bookmarklet | DMCA | Apply to YC | Contact