Pregunta de entrevista de Go Vivace

Why and when do we use multi-headed attention module in Natural Language Processing