Hacker Newsnew | past | comments | ask | show | jobs | submit | sarkarsh's commentslogin

I've been using Claude Code for months and kept hitting the same issue: tasks would get "completed" but I'd have no visibility into what assumptions the agent made or what it quietly simplified when it got stuck.

  ctlsurf is a notebook that connects to AI agents via MCP. The key feature: when an agent marks a task done, it must provide structured completion data:                                  
                                                                                                                                                                                                       
  - Summary of what was done                                                                                                                                                                           
  - Assumptions made (required, at least one)                                                                                                                                                          
  - What was attempted but failed                                                                                                                                                                      
  - What was simplified or skipped                                                                                                                                                                     
                                                                                                                                                                                                       
  That last one is the important part. Agents often silently give up on parts of tasks, and this forces transparency.                                                                                  
                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                     
  https://app.ctlsurf.com                                                                                                                                                                                  
                                                                                                                                                                                                       
  Interested in feedback, especially from teams using AI agents in production where accountability matters.

Over the past couple of months(not sure how many :), I’ve been building a set of minimal, no-bloat tools aimed at solo entrepreneurs.

I’m trying to figure out if this solves a real problem or if I’ve been tinkering in a vacuum.

Would love your honest feedback.. brutal is fine on whether this belongs in an already crowded space or should be scrapped entirely.


I am working on it. Hopefully soon.


Wonderful! Looking forward to it!


Thanks! This is first version and I am still working on the design.


I have pushed an update. It's getting closer. Here is preview. http://www.crimsonbox.net/new-update-pushed-to-app-store/


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: