Hacker News new | past | comments | ask | show | jobs | submit login

Does anyone have some experience with this use-case: I have a MP4 file and a SRT file with subtitles is there a convenient way to re render the video with subtitles drawn on it so I can use it in places where subtitles are not supported? Is it a super cpu intensive process that takes a long time for let's say a typical 2 hour movie?



Burning subtitles into the video implies re-encoding. And if you want maintain video quality, you'll have to choose a pretty good quality setting like crf=24 and this will take a lot of time even on H.264/AVC.

You are looking at multiple hours (>2) for a 2-hour movie from my experience. One way to reduce this is to spin up a DigitalOcean droplet (a $80 per month droplet) that'll give you a lot of processing power, finish your encoding in 2-3 hours,and shut it down. It'll save you a lot of headache and time.

Not sure I answered your question, but, hope that helped!


> Burning subtitles into the video implies re-encoding

In principle, would it be possible to only re-encode the macroblocks affected by the subtitles, leaving the rest of the frame bit-for-bit identical?

It's been a long time since i read a video encoding specification, so i don't know if things still work this way.


Much harder than it sounds because future macroblocks depend on present ones, and motion vectors mean that over a few seconds, transitive dependency can reach the entire image (and does in panned and zoomed shots)

Can possibly work, but depending on movie content might not save all that much encoding.


What would be much better would be to encode an overlay raster stream, which the player can then composite at runtime, like they do with text subtitles.

Much faster to encode. Can be toggled. But players need to support it.


Yeah, but the whole point of the exercise is to see subtitles on players that don't even support subtitles; they're definitely not going to support an overlay stream.


Thanks. I'm surprised that there isn't something like lambda where you can just get a really fancy cpu on demand for a quick encoding job.


Paralleling many of the other comments that give excellent test advice, it's also worth seeing if your CPU has Intel Quicksync support, which can give a hardware-accelerated rendering option to ffmpeg. See below link for reference of the flags.

https://trac.ffmpeg.org/wiki/Hardware/QuickSync

Even the most low-end processors like the Celeron, if they have embeded GPUs, have this encoding acceleration built in and it makes a huge difference versus using the general purpose CPU portion. The generation of CPU will determine the encoders/decoders available.

I learned about this from the below post, and have been using the functionality where ever I can as it applies many of the popular transcoding sofwares out there, for example handbrake also.

https://forums.serverbuilds.net/t/guide-hardware-transcoding...


> I have a MP4 file and a SRT file with subtitles is there a convenient way to re render the video with subtitles drawn on it so I can use it in places where subtitles are not supported?

You're going to have to reencode it, which is a lossy and CPU-intensive process (unless you manage to get hardware encoding working with your video card, which is possible but very fiddly IME).

> Is it a super cpu intensive process that takes a long time for let's say a typical 2 hour movie?

There are a lot of compute/filesize tradeoffs that you can make. If you don't mind a file that's 2-3x larger than your original then you can do it fairly quickly (say 1/3 of realtime). If you don't mind an effectively uncompressed video (so tens or hundreds of gigabytes) then you can do it as fast as your disks will write. If you want something similar to the original without sacrificing too much quality then it'll probably be slower than realtime.


If compression ratio or filesize isn't a priority, NVENC definitely is good enough to consider. The speed is just too good.

It can do 300fps+ (for simply transcoding. Burning in subtitles probably will slow it down a little ibt) already on my very dated GPU, likely would be even faster on newer ones.


What some people might not know is that the transcoding results and quality are hardware dependent. You will get different results depending on whether you're using NVENC, intel's on chip stuff, actual encoding hardware, etc.

Edit: Obviously it's more complicated than this. But I think this is one of the reasons behind the need for things like Netflix's VMAF


Also generations of hardware matter - earlier GeForces and Intel CPUs have a worse encoding block than newer generations.

In any case, they massively lag behind software encoding.


I mean, obviously it would be the case for NVENC. Old(er) versions don't even support B frames.

But I guess you can call it "firmware" or "software" dependant too, because they don't really use/support the same version of NVENC to begin with.

And I doubt the result is significantly different, if any, for software encoders like x264 - of course assuming you use the exactly same version and parameter.


That's right. I did not realize it before: if you do not mind the size, just "dumping" the stream will work quite fast, obviously.


ffmpeg -i video.mp4 -vf subtitles=subtitle.srt -t 300 out.mp4

The above line will hardcode the subtitles onto the video and stop after five minutes. This should give you an idea of how slow re-encoding is going to be on your machine, and what the quality of the result is going to look like.

What you ideally want is that your rendered video has approximately the same bitrate as the original, and the same visual quality. You can play with the -crf parameter to get the quality the same, and the -preset parameter to get the bitrate the same.

For reference, when I'm rendering 720p video I'm getting about 4x speed on my 10th gen Core i5. 1080p should take about twice the time.

I've played around a bit with using the gpu-accelerated encoders in ffmpeg, but I could never get the same quality or bitrate as the cpu encoder, it seems to me that they're tuned for quickly turning raw capture into something that can realistically be streamed and handled.


And when ingesting srt files in ffmpeg. Ffmpeg expects the srt file to be utf8 encoded, but it is sometimes encoded in Windows-1252 which is the original format. So you might need to transcode it first.


Urghk. I had no idea, but then again I would never use .srt myself, it's a terrible format. I'm always using .ass, because then you can get proper left-aligned subtitles, proper font, and a proper box around them.


You can set the charenc option for other character sets.


Is it that a separate subtitle file is not supported or subtitles are not supported at all? Otherwise you can use ffmpeg to merge the srt file and the mp4 file into either a single mp4 or mkv file without re-encoding the video (using the -c copy flag). Many players support subtitles embedded into a .mp4 file.


^this. I'd be curious what player gp has found that can playback the mp4, but not read embedded subtitles. There are a few that have poor support for external (separate file) subtitles.

See eg: https://mutsinzi.com/add-srt-subtitles-to-quicktime/

TL;DR:

ffmpeg -i yourVideo.mp4 -i yourSubtitle.srt -c:v copy -c:a copy -c:s mov_text -metadata:s:s:0 language=eng yourOutputVideo.mp4

I'm uncertain if there are some metadata you could toggle to hint at default subtitle language and/or default to subtitles on (or off). I generally use vlc, so this isn't much of a problem for me.


You can hint the video player with metadata.

To make the first subtitle track default:

  -disposition:s:0 default
Or you can make it forced:

  -disposition:s:0 forced
And to clear the disposition of the second subtitle track (if it was default in the source stream)

  -disposition:s:1 
The problem is that video players are inconsistent in term of how they honour these parameters.


You can do this easily with Handbrake as well - using the "hard burn" option: https://handbrake.fr/docs/en/latest/advanced/subtitles.html


or with VLC.


It will probably take approx 1:1 time. Maybe a bit more but not so much. As another sibling says, you need good quality re-encoding. Obviosly it depends on your CPU/GPU but I guess that is a giod approximation.


Disclaimer: I work for Transloadit

At my job we have an API that lets you do this to "burn in" subtitles onto a video.

A demo for this: https://transloadit.com/demos/video-encoding/add-subtitle-to...


I don't know how Plex does it, but it burns subtitles in near real time and I don't notice any quality hit


Plex uses FFmpeg for transcoding video and audio.

See https://support.plex.tv/articles/200250377-transcoding-media...


So does Airflow, which I'm a happy user of.

https://airflow.app/


Ooo that's interesting. I am going to check it out. What resolution btw? 1080p60? Same codec before and after?


I'm sorry I'm not knowledgeable in that area to tell you what it does, but it works regardless of resolution and quality. 4k or 1080p, it doesn't matter.

What's also impressive is it can stream your media at a lower quality, say 4k to 720p to your phone, also in near real time, on a very old laptop that I repurposed into a NAS.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: