Here is an example run on sqlite:
Top Committers (of 28 authors):
D. Richard Hipp 13359 commits during 19 years until 2019-09-17
Dan Kennedy 5813 commits during 17 years until 2019-09-16
together these authors have 80+% of the commits (19172/20987)
Files with most commits:
1143 commits: src/sqlite.h.in during 19 years until 2019-09-16
1331 commits: src/where.c during 19 years until 2019-09-03
1360 commits: src/btree.c during 18 years until 2019-08-24
1650 commits: src/vdbe.c during 19 years until 2019-09-16
1893 commits: src/sqliteInt.h during 19 years until 2019-09-14
Files with most authors:
11 authors: src/main.c
11 authors: src/sqliteInt.h
12 authors: configure.ac
12 authors: src/shell.c
15 authors: Makefile.in
By file extension:
.test: 1333 files
.c: 379 files
together these make up 80+% of the files (1712/2138)
Codescene was built on top of an opensource tool by the way. But the UI is nice to have.
I must admit, I was playing around building something similar while learning Rust, just for the fun of it. Now I probably won’t finish that:)
I modified existing tools to scan all branches.
The most interesting tool is https://github.com/src-d/gitbase which turns your git repo into a database you can query. This is more effective than parsing git log output.
I keep a list of similar tools here https://github.com/hbt/git-forks-analysis#other-git-data-min...
Happy to add yours