Hacker Newsnew | past | comments | ask | show | jobs | submit | thinkevolve's commentslogin

whats the point of doing this. You have found loop holes to exploit and aced the benchmark.We did something similar with the DAB Benchmark. This exploit seems like an extension of it with lookups for the gold standard for other benchmarks.

UC Berkley will be better placed if the grads spend their time in suggesting ways to make the benchmark better.. Instead of making such simple exploits


You have started a WhatsApp group its gaining traction, but you find it difficult to prevent spammers and self-promotion. We vibe coded a WhatsApp agent that can moderate messages and gamify engagement for the community.

https://youtu.be/q3nniIK7Rpo


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: