Text this: Double‐Attention Transformer for Cross‐Modal Image Captioning