Hacker News

rob 7 hours ago [ - ]

I hardly type at all now. I use Handy (free) with Parakeet and use its post-LLM processing feature with a custom prompt tailored towards coding, so I can say things like "Have it go to slash remote dash control" and it'll output "/remote-control". Converts brackets, etc.

Everything is almost instant, it's insanely fast, and lets me work on multiple different agents/windows at the same time fast with cmux.

I use the same thing to talk to people on Slack, iMessage, etc now when I'm working from home instead of typing.

I also can help articulate my thoughts better when I'm thinking them literally out loud instead of just sitting silent and typing them on a computer for hours.

It's just something that you need to try and get used to because I also thought it was something I wouldn't like at first.

thefreeman 6 hours ago [ - ]

Can you share more information on the post-LLM processing and the prompt you use? I would like to try this out but don't see any post-LLM options in Handy.

edit: nevermind, found info on the docs about how to enable post processing. Would still be interested in your prompt though if you don't mind sharing!

rob 6 hours ago [ - ]

You have to enable "Experimental Features" under "Advanced."

This is the prompt I use (it's probably overkill and can be condensed):

https://pastebin.com/raw/RUVAqLCU

ezconnect 5 hours ago [ - ]

What is Parakeet?

rafaelm 5 hours ago [ - ]

I believe this is the correct link. I use it too in Handy, for English and Spanish transcriptions: https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3

stijnveken 5 hours ago [ - ]

Maybe they meant narakeet?

https://www.narakeet.com/tools/

dghlsakjg 4 hours ago [ - ]

Parakeet is the name of a speech to text model from Nvidia. Roughly comparable to whisper from openAI.

It's the model doing the work inside the wrapper that an app provides.

rob 4 hours ago [ - ]

Yep, here's the v2 and v3:

https://huggingface.co/nvidia/parakeet-tdt-0.6b-v2

https://huggingface.co/nvidia/parakeet-tdt-0.6b-v3

It's almost instant on my new M5 Max w/ 36GB of memory, but I used both with Handy on my previous 2019 Intel Mac w/ 16GB memory and was completely surprised at just how fast it was for being on-device! Not instant, but only a couple seconds.

dghlsakjg 3 hours ago [ - ]

I’m using it on an M3 max 32gb, and I’m getting 60-70x realtime for recordings and crazy good accuracy. I can get an hour of audio transcribed in a minute. Similar results from Whisper, but half the speed.

Transcription this good used to cost A LOT, now it rounds down to free.