Text this: Language-guided invariance probing of vision–language models.