ChatGPT o3 mini high thinks at about 140 tokens/s by my estimation and I sometimes wish it can return answers quicker.
Getting a simple prompt answer would take 2-3 minutes using the AMD system and forget about longer context.
ChatGPT o3 mini high thinks at about 140 tokens/s by my estimation and I sometimes wish it can return answers quicker.
Getting a simple prompt answer would take 2-3 minutes using the AMD system and forget about longer context.