Hacker News
sdenton4 | 3 days ago | on: ProofOfThought: LLM-based reasoning using Z3 theor...
Indeed - human judges suck on average. And you can prompt an LLM judge to look for particular kinds of problems, then throw an ensemble of such judges at an output to nitpick it. (Essentially, you bake in a diversity of biases through a collection of prompts.)
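To make the ensemble idea concrete, here is a rough sketch of how it could look. Everything in it is an assumption for illustration: the `Judge` class, the `run_ensemble` helper, the three example focuses, and the `fake_llm` stand-in are all hypothetical, and no real model API is called. Swap `fake_llm` for whatever function sends a prompt to your model and returns its text.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical type: any function that takes a prompt and returns the model's reply.
LLMFn = Callable[[str], str]


@dataclass
class Judge:
    name: str
    focus: str  # the particular kind of problem this judge is prompted to look for

    def review(self, llm: LLMFn, output: str) -> str:
        # Each judge gets its own narrow prompt, which is where the
        # "diversity of biases" comes from.
        prompt = (
            f"You are a reviewer who only looks for {self.focus}.\n"
            "Reply with 'OK' if you find none, otherwise describe the problem.\n\n"
            f"Text to review:\n{output}"
        )
        return llm(prompt).strip()


def run_ensemble(llm: LLMFn, judges: List[Judge], output: str) -> Dict[str, str]:
    """Run every judge on the output and collect the non-OK complaints by judge name."""
    complaints: Dict[str, str] = {}
    for judge in judges:
        verdict = judge.review(llm, output)
        if verdict.upper() != "OK":
            complaints[judge.name] = verdict
    return complaints


# Illustrative judge set; the focuses are made up for this sketch.
JUDGES = [
    Judge("logic", "logical gaps or unsupported inferences"),
    Judge("facts", "factual claims that look wrong or unverifiable"),
    Judge("style", "ambiguous or misleading wording"),
]


def fake_llm(prompt: str) -> str:
    # Stand-in for a real model call: only the logic judge ever complains.
    if "logical gaps" in prompt:
        return "Step 2 does not follow from step 1."
    return "OK"


if __name__ == "__main__":
    print(run_ensemble(fake_llm, JUDGES, "Some model output to check."))
```

The aggregation here just unions the complaints. Voting or weighting judges would be a different design choice, and nothing above says which works better.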