Intelligent information extraction from scholarly document databases.