Text this: Federated Learning for Heterogeneous Multimodal Emotion Recognition on Edge Devices.