Check out [0]. You can do 'Voice AI' on small/cheap hardware. It's the most fun you can have in the space ATM :) It's been a while, but I posted a demo here [1].

[0] https://github.com/pipecat-ai/pipecat-esp32

[1] https://www.youtube.com/watch?v=6f0sUEUuruw

Beautiful demo - is it running fully locally or talking to 3rd-party APIs? That box was jaw-droppingly small.

For the best experience, you'll still want it to communicate with 3rd-party APIs to handle the speech-to-text, text-to-speech, and LLM.
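
Roughly, each conversational turn is a chain of three remote calls, with the device only capturing and playing audio. Here's a minimal sketch of that shape in Python. To be clear, this is not Pipecat's actual API: `VoiceTurn`, `fake_stt`, `fake_llm`, and `fake_tts` are all hypothetical names, and the stubs just stand in for whatever hosted services you'd call.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoiceTurn:
    """One turn of a voice loop: audio in -> text -> reply text -> audio out."""
    stt: Callable[[bytes], str]   # speech-to-text service (assumed remote)
    llm: Callable[[str], str]     # language model service (assumed remote)
    tts: Callable[[str], bytes]   # text-to-speech service (assumed remote)

    def run(self, mic_audio: bytes) -> bytes:
        text = self.stt(mic_audio)   # transcribe what the user said
        reply = self.llm(text)       # generate a text response
        return self.tts(reply)       # synthesize audio for the speaker


# Hypothetical stand-ins so the sketch runs without any network access.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_llm(prompt: str) -> str:
    return f"You said: {prompt}"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


turn = VoiceTurn(stt=fake_stt, llm=fake_llm, tts=fake_tts)
speaker_audio = turn.run(b"hello")
```

In a real setup each stub would be a network call, which is why the small box can stay small: it only has to move audio back and forth.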