Text this: Extracting cross-modal semantic incongruity with attention for multimodal sarcasm detection.