State-of-the-art Zero-shot Speech Synthesis with Vall-E