Hacker News new | past | comments | ask | show | jobs | submit login

Congrats on the launch! Curious to know, which OSS models you see works best at the moment?



We've had a decent amount of luck with InternVL 2.0 w/ Llama, and are pretty excited about Llama 3.2

It's still super early in the open source x vision model space. The limiter actually seems to be the vision encoder -- advancements here will pay off huge dividends

https://huggingface.co/spaces/opencompass/open_vlm_leaderboa...


Thank you! Great insight.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: