Unsupervised Contrastive Learning Of Sound Event Representations