Text this: Incorporating semantic consistency for improved semi-supervised image captioning.