Learning Visual Curiosity For An Agent Through Language And Embodiment