Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Why Your Chunking Strategy Makes or Breaks Your AI System (medium.com/utkarshhpatel13)
4 points by savanpatel 21 days ago | hide | past | favorite | 2 comments



Personally I’ve been thinking about this problem for some time and have had a neat idea that im almost tempted to patent, I have yet to test it via an actual implementation but the core idea is quite simple…


IMHO, your article is missing an important point: 90% of implementations today flatten documents to plain text before chunking them. Why not consider the visual appearance that the human gave to the document? Using layout information combined with semantics, you can increase rag performances by +160% (tested via benchmarks), so why do most of us only use text?

Note: multimodal ≠ layout




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: