Pair expansion for learning multilingual semantic embeddings using disjoint visually-grounded speech audio datasets

Publication
Annual Conference of the International Speech Communication Association (InterSpeech)

Related