Hacker News

What's the model behind it? I asked a simple question (one that other LLMs got right without a problem) and this one answered with something completely wrong (and curiously so, since I don't know where the hallucination came from).



Looks like it's using llama3-8b-8192 as the LLM [1], which is a relatively small model, so hallucination is quite likely.

[1]: https://github.com/ai-ng/swift/blob/7d1f993b095abc4a51cf9c70...



