That`s very cool. I chose the phone mic, because everybody already has one, no extra device needed. The downside is normalization: from room noise to breathing style. I do use an ML layer to deal with some of these variations but only after basic signal checks. The hard part is to decide when the microphone signal is too ambiguous to use at all...