I've learned that we can best reproduce the errors and storage delays when running a robocopy task that generates a lot of storage throughput while the host is also running many other VM's. The problem doesn't happen except under high load.
The issue is still unacceptable.
VMware support just suggested changing the PVSCSI to LSI, but I have not tried it yet.