Text this: Graph-theoretic Techniques For Web Content Mining