The problem with that editing model for me is that it makes text objects much more cumbersome.

In Vim you can for example do "dap" to delete around a paragraph, but you cannot easily invert it ("pad") because 'p' is too common and is already bound.

You can also easily do the "select first" in Vim by first pressing 'v' to start a visual selection, so I just don't see the point.