Hey fzliu, looks like this is struggling to stay up -- repl.co URLs are mainly built for small-scale sharing and local testing, so they'll sometimes have issues when shared on places like HN. We created deployments (dedicated VMs on replit.app URLs) for this sort of thing. If you deploy it, it should hold up much better. Feel free to email me at lincoln@repl.it if you need any help with that.
Probably a collection of all available large language models. Haven't managed to access the site, but from the name and some of the comments I can deduce that much.
Got it to load the landing page at least (although the generation thing itself doesn't work), and it links to the following paper: https://arxiv.org/abs/2307.09793 (On the Origin of LLMs: An Evolutionary Tree and Graph for 15,821 Large Language Models)
> We take advantage of the relatively systematic nomenclature of Hugging Face LLMs to perform hierarchical clustering and identify communities amongst LLMs using n-grams and term frequency-inverse document frequency. Our methods successfully identify families of LLMs and accurately cluster LLMs into meaningful subgroups. We present a public web application to navigate and explore Constellation, our atlas of 15,821 LLMs. Constellation rapidly generates a variety of visualizations, namely dendrograms, graphs, word clouds, and scatter plots.
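For anyone wondering what "n-grams + TF-IDF + hierarchical clustering" on model names might look like, here's a rough stdlib-only sketch. To be clear, this is not the paper's code: the character trigrams, cosine similarity, average linkage, the sample names, and the function names (`char_ngrams`, `tfidf_vectors`, `cluster`) are all my own assumptions for illustration.

```python
import math
from collections import Counter


def char_ngrams(name, n=3):
    """Lowercased character n-grams of a model name (assumed tokenization)."""
    s = name.lower()
    return [s[i:i + n] for i in range(len(s) - n + 1)]


def tfidf_vectors(names, n=3):
    """Sparse, L2-normalized TF-IDF vectors (dicts) over character n-grams."""
    docs = [Counter(char_ngrams(name, n)) for name in names]
    # Document frequency: in how many names does each n-gram occur?
    df = Counter(gram for doc in docs for gram in doc)
    total = len(names)
    vectors = []
    for doc in docs:
        # Smoothed IDF so n-grams that appear everywhere still get a small weight.
        vec = {g: tf * (math.log((1 + total) / (1 + df[g])) + 1)
               for g, tf in doc.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({g: w / norm for g, w in vec.items()})
    return vectors


def cosine(a, b):
    """Cosine similarity of two already-normalized sparse vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(g, 0.0) for g, w in a.items())


def cluster(names, k, n=3):
    """Naive average-linkage agglomerative clustering down to k clusters."""
    vecs = tfidf_vectors(names, n)
    clusters = [[i] for i in range(len(names))]

    def avg_sim(c1, c2):
        pairs = len(c1) * len(c2)
        return sum(cosine(vecs[i], vecs[j]) for i in c1 for j in c2) / pairs

    while len(clusters) > k:
        # Merge the two clusters with the highest average pairwise similarity.
        a, b = max(
            ((x, y) for x in range(len(clusters))
             for y in range(x + 1, len(clusters))),
            key=lambda p: avg_sim(clusters[p[0]], clusters[p[1]]),
        )
        clusters[a] += clusters[b]
        del clusters[b]
    return [sorted(names[i] for i in c) for c in clusters]


if __name__ == "__main__":
    # Made-up sample of model names, just to exercise the sketch.
    sample = [
        "llama-2-13b", "falcon-40b", "llama-2-70b",
        "falcon-7b", "llama-2-13b-chat", "falcon-40b-instruct",
    ]
    for group in cluster(sample, k=2):
        print(group)
```

On that toy sample the llama names and the falcon names end up in separate groups, since the two families share no trigrams at all. The pairwise search is far too slow for 15,821 names; a real run would presumably use a proper linkage implementation and keep the full merge history to draw a dendrogram.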
One thing this shows, besides how they are all related or not, is just how many models are out there that almost no one has ever heard of. Quite impressive.
There are thousands on Hugging Face, but if you look at their leaderboards across all the metrics, almost all of them are garbage. Only a small number really stand out.
chronos-hermes-13b is probably one of the most niche ones that i know about, seems pretty good at detailed roleplaying and storywriting (from others' anecdotes and also my own testing)
i don't like redmond-puffin because it makes the mistake of baking "human" into the training data, which is quite difficult to avoid as it's a pretty common practice... and unfortunately it looks like luna-ai-llama2-uncensored bakes "assistant" into the training data!
so both those models are probably not suitable for simulating interactions between non-human, non-AI entities... the chronos and hermes models only use input/instruction, which is incredibly generic, so you can just eschew the format altogether and they still follow chats correctly with no bias towards human/user/assistant.
that app looks very interesting though, reminds me of something that we were planning to make, but gave up because winui xaml is too cursed