Learning mDTD Extraction Patterns for Semi-Structured Web Information Extraction.