Exactly. The singularity is already here. It's just "programmers + AI" as a whole, rather than independent self-improvements of the AI.

I wonder how a "programmers + AI" self-improving loop is different from an "AI only" one.

The AI only one presumably has a much faster response time. The singularity is thus not here because programmer time is still the bottleneck, whereas as I understand in the singularity time is no longer a bottleneck component.

AGI will be faster as it doesn't need initial question.

AGI will also be generic.

LLM is already very impressive though