OpenAI shows how their CLIP model is able to learn concepts with multimodal neurons which have also been found in humans. This may be why CLIP is able to preform so well in zero-shot transfer tasks.