I built a methodology at Fiverr Labs for generating agent prompts from product specs using automated tests instead of manual prompt engineering. You write a behavioral spec, one coding agent generates tests from it, and a second agent iterates on the prompt until the tests pass. Hidden test splits and mutation testing guard against specification gaming.
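Rough shape of the loop in Python, to make the flow concrete. All names here are mine (hypothetical, not from the repo), and the two LLM agents are stubbed as plain functions; mutation testing is omitted for brevity:

```python
# Sketch of the spec -> tests -> prompt compile loop.
# generate_tests / revise_prompt stand in for the two LLM agents.

def generate_tests(requirements):
    # Test-writer agent (stub): one behavioral check per requirement.
    return [lambda out, req=req: req in out for req in requirements]

def revise_prompt(prompt, failures):
    # Prompt-writer agent (stub): patch the prompt to cover failing checks.
    return prompt + " " + " ".join(failures)

def compile_prompt(spec, max_iters=10):
    # Hold out part of the spec as a hidden split the prompt-writer never sees,
    # so overfitting to the visible tests (specification gaming) gets caught.
    cut = len(spec) // 2 + 1
    visible, hidden = spec[:cut], spec[cut:]
    tests = generate_tests(visible)
    prompt = ""
    for _ in range(max_iters):
        out = prompt  # stand-in for running the agent under this prompt
        failures = [req for req, t in zip(visible, tests) if not t(out)]
        if not failures:
            break
        prompt = revise_prompt(prompt, failures)
    hidden_score = sum(req in prompt for req in hidden) / max(len(hidden), 1)
    return prompt, hidden_score

spec = ["greet the user", "ask clarifying questions", "cite sources"]
prompt, hidden_score = compile_prompt(spec)
```

In this toy run the prompt converges on the visible requirements but scores 0 on the hidden split, which is exactly the failure mode the held-out tests are there to surface.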
Evaluated on 4 agent specs across 24 trials — 92% compilation success, $2–3 per compilation. The benchmark and all code are open at https://github.com/f-labs-io/tdad-paper-code
Happy to discuss the methodology, limitations, and directions for follow-up work.