why isn't the cleanup done on the transcription (as opposed to screen record)