Technically it's possible.

We did do inlaid server-side ads that way for a while.

It just takes an excessive amount of work.

The real solution is just full support for ASS/TTML/VTT subtitles on all platforms. Smart devices usually only support them partially.

For instance, casting to a Chromecast falls back to SRT.
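
Roughly, the fallback works like the sketch below. This is only an illustration: the `SUPPORTED` table, the device names, and `pick_subtitle_format` are made up for the example and are not taken from any real player or from an actual device capability list.

```python
# Hypothetical capability table (illustrative values only, not real device data).
SUPPORTED = {
    "desktop_player": ["ass", "ttml", "vtt", "srt"],
    "chromecast": ["srt"],  # assumption mirroring the SRT fallback described above
}

# Richest formats first, plain SRT last.
PREFERENCE = ("ass", "ttml", "vtt", "srt")


def pick_subtitle_format(device, available):
    """Return the richest available format the device can render, or None."""
    # Unknown devices are assumed to handle only SRT.
    supported = SUPPORTED.get(device, ["srt"])
    for fmt in PREFERENCE:
        if fmt in available and fmt in supported:
            return fmt
    return None


if __name__ == "__main__":
    print(pick_subtitle_format("desktop_player", ["ass", "srt"]))
    print(pick_subtitle_format("chromecast", ["ass", "srt"]))
```

Every target with partial support adds another row to a table like this, plus a conversion step that throws away styling and positioning, which is why full support everywhere would be the cleaner fix.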