Pandoc is awesome for converting text-based formats and markup files, but from m...

dotancohen · 2025-01-03T14:29:28 1735914568

I see, thank you!

How do you handle complex office formats with embedded elements? Do you reimplement ODT and other standards?

nicbars · 2025-01-03T20:20:58 1735935658

Thanks for asking! I don’t reimplement those formats myself—LibreOffice does the heavy lifting for parsing and converting office documents with embedded elements. That way, I just leverage an existing engine instead of reinventing the wheel, and it helps preserve formatting as accurately as it can.

dotancohen · 2025-01-03T21:40:31 1735940431

So the web app actually loads LO components? How about MS Office? Other esoteric formats?

You mention elsewhere that all this can be done offline once the web app has loaded, all these components are pulled in?

nicbars · 2025-01-04T02:45:52 1735958752

I’m not actually loading LO into the browser—those parts run on the server side, so you still need an internet connection for complex document conversions. The offline functionality mostly covers simpler features like merging PDFs or converting images to PDF with WebAssembly libraries in the browser. For MS Office files or other advanced formats, I rely on LO’s server-side engine to handle parsing and conversion

dotancohen · 2025-01-04T07:46:28 1735976788

I see, thank you for taking the time to explain.