Show HN: New LLM outperforming GPT-3.5
6 points by i-am-agi on Oct 18, 2023 | 1 comment
Refuel LLM (84.2%) outperforms trained human annotators (80.4%), GPT-3.5-turbo (81.3%), PaLM-2 (82.3%), and Claude (79.3%) across a benchmark of 15 text labeling datasets. It is built on a Llama-v2-13b base model and trained on over 2,500 unique datasets (5.24B tokens) spanning categories such as classification, entity resolution, matching, reading comprehension, and information extraction.

Here is the interactive demo: https://labs.refuel.ai/playground. Pretty fun to play with!




How does it score against a similarly fine-tuned GPT-3.5? The fine-tuning costs for 3.5 are negligible; the cost of fine-tuning a Llama is not. Is your plan to just provide fine-tuned models for customers' special use cases? Is there free prompting available?



