The interesting idea to me was the idea of notating captions with stressing / emphasis.

It would be really neat to have automatic transcription that could annotate the result accordingly.