LLM Constellation

lincolnwb · on July 21, 2023

Hey fzliu looks like this is struggling to stay up -- repl.co URLs are mainly built for small-scale sharing and local testing, so they'll sometimes have issues when shared on places like HN. We created deployments (dedicated VMs on replit.app URLs) for this sort of thing. If you deploy it, it should work much better. Feel free to email me at lincoln@repl.it if you need any help with that.

adt · on July 21, 2023

Related table of models:

https://lifearchitect.ai/models-table/

PUSH_AX · on July 21, 2023

I think it's experiencing a hug of death. What is it?

6gvONxR4sf7o · on July 21, 2023

It's the app described here https://arxiv.org/abs/2307.09793

elorant · on July 21, 2023

Probably a collection of all available large language models. Haven't managed to access the site but by the name and some comments I can deduce that much.

capableweb · on July 21, 2023

Got it to load the landing page at least (although the generation thing itself doesn't work), and it links to the following paper: https://arxiv.org/abs/2307.09793 (On the Origin of LLMs: An Evolutionary Tree and Graph for 15,821 Large Language Models)

> We take advantage of the relatively systematic nomenclature of Hugging Face LLMs to perform hierarchical clustering and identify communities amongst LLMs using n-grams and term frequency-inverse document frequency. Our methods successfully identify families of LLMs and accurately cluster LLMs into meaningful subgroups. We present a public web application to navigate and explore Constellation, our atlas of 15,821 LLMs. Constellation rapidly generates a variety of visualizations, namely dendrograms, graphs, word clouds, and scatter plots.

ksaj · on July 21, 2023

One thing this shows, besides how they are all related or not, is just how many models are out there that nearly everyone has never heard of. Quite impressive.

cjtrowbridge · on July 21, 2023

There are thousands on hugging face but if you look at their leaderboards for all the metrics, almost all of them are garbage. A small number really stand out.

LoganDark · on July 21, 2023

chronos-hermes-13b is probably one of the most niche ones that i know about, seems pretty good at detailed roleplaying and storywriting (from others' anecdotes and also my own testing)

biwills · on July 21, 2023

we've been looking at at a lot of custom models for https://faraday.dev (shameless plug)

There's already some interesting fine tuned llama2 models:

- https://huggingface.co/NousResearch/Redmond-Puffin-13B - https://huggingface.co/Tap-M/Luna-AI-Llama2-Uncensored

LoganDark · on July 21, 2023

i don't like redmond-puffin because it makes the mistake of baking "human" into the training data, which is quite difficult to avoid as it's a pretty common practice... and unfortunately it looks like luna-ai-llama2-uncensored bakes "assistant" into the training data!

so both those models are probably not suitable for simulating interactions between non-human, non-AI entities... the chronos and hermes models only do input/instruction which are incredibly generic and you can just eschew the format altogether, and it follows chats correctly with no bias towards human/user/assistant.

that app looks very interesting though, reminds me of something that we were planning to make, but gave up because winui xaml is too cursed

joecool1029 · on July 21, 2023

Doesn't load, what is this?

Bakersfield · on July 21, 2023

Electrostim sounding videos with detailed instructions. You're lucky it didn't load. /s