This isn't ram. this is unified memory. It's shared between GPU and CPU. Soldered VRAM for GPUs have been the norm for probably 20 years because of the latency and reliability required, so why is this any different?
The only way to achieve what you're after is to do any of;
- Give up on unified memory and switch to a traditional platform (which there are thousands of alternatives for)
- Cripple the GPU for games and some productivity software by raising latency beyond the norm.
- Change to a server-class chip for 5x the price.
This is an amazing chip giving server-class specs in a cheap mobile platform, that fill a special nieche in the market for for both productivity and local AI at a very competitive price. What you're arguing for makes no sense.