> clearly written by an LLM, the long emdash was even present.

Can we please stop propagating this accusation? Alright, sure, maybe LLMs overuse the em-dash, but it is a valid topographical mark which was in use way before LLMs and is even auto-inserted by default by popular software on popular operating systems—it is never sufficient on its own to identify LLM use (and yes, I just used it—multiple times—on purpose on 100% human-written text).

Sincerily,

Someone who enjoys and would like to be able to continue to use correct punctuation, but doesn’t judge those who don’t.

So do you always put in the ALT+<code> incantation to get an emdash or copy&paste?

I feel the emdash is a tell because you have to go out of your way to use it on a computer keyboard. Something anyone other than the most dedicated punctuation geeks won't do for a random message on the internet.

Things are different for typeset books.

> So do you always put in the ALT+<code> incantation to get an emdash or copy&paste?

There’s no incantation. On macOS it’s either ⌥- (option+hyphen) or ⇧⌥- (shift+option+hyphen) depending on keyboard layout. It’s no more effort than using ⇧ for an uppercase letter. On iOS I long-press the hyphen key. I do the same for the correct apostrophe (’). These are so ingrained in my muscle memory I can’t even tell you the exact keys I press without looking at the keyboard. For quotes I have an Alfred snippet which replaces "" with “” and places the cursor between them.

But here’s the thing: you don’t even have to do that because Apple operating systems do it for you by default. Type -- and it converts to —; type ' in the middle of a word and it replaces it with ’; quotes it also adds the correct start and end ones depending on where you type them.

The reason I type these myself instead of using the native system methods is that those work a bit too well. Sometimes I need to type code in non-code apps (such as in a textarea in a browser) and don’t want the replacements to happen.

> I feel the emdash is a tell because you have to go out of your way to use it on a computer keyboard.

You do not. Again, on Apple operating systems these are trivial and on by default.

> Something anyone other than the most dedicated punctuation geeks won't do for a random message on the internet.

Even if that were true—which, as per above, it’s not, you don't have to be that dedicated to type two hyphens in a row—it makes no sense to conflate those who care enough about their writing to use correct punctuation and those who don’t even care enough to type the words themselves. They stand at opposite ends of the spectrum.

Again, using em-dashes as one signal is fine; using it as the principal or sole signal is not.

On Linux, I configured my Caps Lock key to function as a compose key, and then use my ~/.XCompose file to make it easier.

I also set things up such that hitting Caps Lock twice in a row sends an Escape character, which makes using Vim a tiny bit nicer.

On Windows, I use autohotkey and have a bunch of keyboard shortcuts for producing characters that I use fairly often but are difficult to type.

My keyboard has no keypad so I’m not sure there’s another way.

You type -- and it gets auto converted.

no I use -- and ---. not all of us use Microsoft word for serious writing.

Fact is that I maybe saw it in 10% of blogs and news articles before Chatgpt. And now it pops up in emails, slack messages, HN/reddit comments and probably more than half of blog posts?

Yes it's not a guarantee but it is at least a very good signal that something was at least partially LLM written. It is also a very practical signal, there are a few other signs but none of them are this obvious.

> Fact is that I maybe saw it in 10% of blogs and news articles before Chatgpt.

I believe you. But also be aware of the Frequency Illusion. The fact that someone mentions that as an LLM signal also makes you see it more.

https://en.wikipedia.org/wiki/Frequency_illusion

> Yes it's not a guarantee but it is at least a very good signal that something was at least partially LLM written.

Which is perfectly congruent with what I said with emphasis:

> it is never sufficient on its own to identify LLM use

I have no quarrel with using it as one signal. My beef is when it’s used as the principal or sole signal.

Dubious. The only signal this gives that in aggregate people use AI. On individual basis, presence of em dashes means nothing.

> And now it pops up in emails, slack messages, HN/reddit comments and probably more than half of blog posts?

Yeah, maybe that's the one thing people who didn't know how to do it before have learnt from "AI" output.