Text this: An open natural language processing (NLP) framework for EHR-based clinical research