
It's been on my list to do a proper shootout of all the various new quant formats floating around (my list here: https://llm-tracker.info/books/llms/page/quantization-overvi...), but a lot of them don't have very good production code yet (e.g., a few months ago, when I tried OmniQuant, some of the important bits of code weren't even included and had to be obtained directly from the authors: https://llm-tracker.info/books/llms/page/omniquant).

If you're looking for the best widely deployed quant format atm, it's probably ExLlamaV2's EXL2 - it supports arbitrary bpw w/ a calibration file, and also has 8-bit KV cache support. I haven't tested EXL2 much at lower bpws though.
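
For reference, loading an EXL2 model w/ the 8-bit cache via the exllamav2 Python package looks roughly like this (a sketch from memory - class/method names may have shifted between versions, and the model path is a placeholder):

    from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache_8bit, ExLlamaV2Tokenizer
    from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

    config = ExLlamaV2Config()
    config.model_dir = "/path/to/model-exl2-4.0bpw"  # placeholder path
    config.prepare()

    model = ExLlamaV2(config)
    cache = ExLlamaV2Cache_8bit(model, lazy=True)  # 8-bit KV cache instead of fp16
    model.load_autosplit(cache)

    tokenizer = ExLlamaV2Tokenizer(config)
    generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)
    settings = ExLlamaV2Sampler.Settings()
    print(generator.generate_simple("Hello, my name is", settings, 64))

The quantization itself goes through the repo's convert.py against a calibration dataset (parquet), where IIRC -b sets the target bits per weight.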

Note, both llama.cpp and AirLLM allow layer offloading to system memory (or in AirLLM's case, even to disk?!).
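
E.g. w/ the llama-cpp-python bindings (a sketch - the model path is a placeholder), n_gpu_layers controls how many layers get offloaded to the GPU while the rest stay in system RAM:

    from llama_cpp import Llama

    # Offload only 20 layers to the GPU; remaining layers stay in system memory.
    # -1 would offload everything, 0 keeps the whole model on the CPU.
    llm = Llama(model_path="/path/to/model-q4_k_m.gguf", n_gpu_layers=20)
    print(llm("Q: What is a quant format? A:", max_tokens=64)["choices"][0]["text"])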

r/LocalLLaMA is probably the best place to search if you're looking for people's experiences w/ quants. I know some people have been testing, e.g.: https://www.reddit.com/r/LocalLLaMA/comments/17klaa5/tested_...




> https://llm-tracker.info/books/llms/page/quantization-overvi...

This is a very cool resource, thanks!

Gems like this, even in areas I follow pretty closely, are why I keep coming back to HN.


i humbly request you to add mlc-llm to your quant test when/if you get around to doing it


Sure - I think their quant format is pretty basic, something similar to bnb q4. My plan is to script a framework for testing, so I should do that as well, since the omniquant implementation is in mlc-llm anyway.
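
(By "basic" I mean roughly group-wise round-to-nearest 4-bit with a per-group scale - a toy numpy sketch of the idea, not mlc-llm's or bitsandbytes' actual code:)

    import numpy as np

    def quantize_groupwise_q4(w, group_size=32):
        # Symmetric round-to-nearest 4-bit, one fp scale per group of weights.
        w = w.reshape(-1, group_size)
        scale = np.abs(w).max(axis=1, keepdims=True) / 7.0  # symmetric int4 range: -7..7
        q = np.clip(np.round(w / scale), -7, 7).astype(np.int8)
        return q, scale

    def dequantize(q, scale):
        return q.astype(np.float32) * scale

    w = np.random.randn(4096 * 4096).astype(np.float32)
    q, s = quantize_groupwise_q4(w)
    err = np.abs(dequantize(q, s).reshape(-1) - w).mean()
    print(f"mean abs quantization error: {err:.5f}")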


i was trying to get this to work with mlc-llm. i'd appreciate any pointers


more specifically, on a non-CUDA GPU - Mali on an Orange Pi via OpenCL



