Text this: Two-Stage Dynamic Fusion Framework for Multimodal Classification Tasks.