karsuren's comments

karsuren · on June 4, 2023

You should probably add an example for using the special commands '.critic' '.resolve' commands that you are projecting as the key 'spice' of your project. There is no 'how to use' or 'how it works' provided for these special commands in your readme.md. One would have to walk through the entire code to hopefully get an idea. I went through the main script - I have an idea on the overall flow, but I still don't know how these special commands work. Other people might also run into the same issue. So, additional documentation can help

mr_kotan · on June 5, 2023

Thanks! You are right. I will add more instructions about the commands.

karsuren · on June 4, 2023

What is the perf of GPT4 vs GPT3.5 in the 'reasoning', 'reflection', 'critisism' and 'resolver' tasks like in your project? I see that you have commented out gpt3.5 and replaced with GPT4 in config yaml. Was GPT3.5 perf too bad? I don't think many people have GPT4 API access. If this requires atleast GPT4 to be effective, it might take a while before anyone else in the community can take it up.

mr_kotan · on June 5, 2023

GPT3.5 works not bad for generating, but it is not very good at self-reflection and self-critisism. Or maybe my prompts are not so good.

karsuren · on June 4, 2023

Claude has 100k context for around 2$ per million tokens.

With GPT4's 4-8k token limit, anything but very small projects in their early phase can benefit from this. Also GPT4 would be far too cost prohibitive

mr_kotan · on June 5, 2023

Good idea about Claude, never tried it myself. But I've heard that GPT-4 is superior at the moment. Is it true? Also thought about GPT-4 prices, but from my experience: If I use it intensively for a 3-4 hours it costs about $5. This is much cheaper than anyones hour rate.

lgas · on June 5, 2023

How does it compare to GPT-4 at coding tasks? I haven't tried it but everything I've heard suggests that it is noticeably worse.