gpt3.5 turbo is (most likely) Curie, which is (most likely) 6.7b params. So, ye...

JackRumford · on Sept 14, 2023

These sites say 154B:

why_only_15 · on Sept 12, 2023

gpt3.5 turbo is a new model, not Curie. As others have stated, it probably uses Mixture of Experts which lowers inference cost.
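To make the inference-cost point concrete, here is a rough sketch of how a Mixture of Experts model can have a large total parameter count while only touching a fraction of it per token. Everything here is an assumption for illustration: the function name `moe_active_params`, the split into "shared" and "expert" parameters, and all the numbers are hypothetical and say nothing about what gpt3.5 turbo actually is.

```python
def moe_active_params(shared_params, expert_params, num_experts, top_k):
    """Return (total, active_per_token) parameter counts for a simplified MoE model.

    shared_params: parameters every token passes through (attention, embeddings, ...)
    expert_params: parameters in ONE expert, summed over all MoE layers
    num_experts:   experts available per MoE layer
    top_k:         experts the router selects per token
    """
    if not 1 <= top_k <= num_experts:
        raise ValueError("top_k must be between 1 and num_experts")
    total = shared_params + num_experts * expert_params
    active = shared_params + top_k * expert_params
    return total, active


# Hypothetical numbers, picked only to show the arithmetic (not any real model).
total, active = moe_active_params(
    shared_params=2_000_000_000,
    expert_params=1_000_000_000,
    num_experts=8,
    top_k=2,
)
print(f"total: {total / 1e9:.1f}B, active per token: {active / 1e9:.1f}B")
```

Per-token compute roughly tracks the active count rather than the total, which is why a MoE model can be cheaper to run than a dense model of the same total size. Memory is a different story, since all experts still have to be loaded.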

csjh · on Sept 12, 2023

Is there a source on that? I've never seen anyone think it's below even 70B

ronyfadel · on Sept 12, 2023

Even at 6.7b params, it still does a much better job at translation than llama 2 70b

two_in_one · on Sept 12, 2023

If it's MoE that may explain why it's faster and better...

yumraj · on Sept 12, 2023

sarthaksrinivas · on Sept 12, 2023

jiggawatts · on Sept 12, 2023

I thought it was fairly well established that GPT 3.5 has something like 130B parameters and that GPT 4 is on the order of 600-1,000B.

avion23 · on Sept 13, 2023

I remember:

- gpt-3.5 175b params

- gpt-4 1800b params