Redesiging Neural Architectures for Sequence to Sequence Learning