The internet is a big place and most people’s interaction with it is regulated by a few companies paid to sell you things. My team has been building tools for the DARPA Memex project to democratize search for all, with tools that go beyond the surface web and pull out rich structured data to analyze. In this presentation, we dive into using our Python based open source tool stack for finding information and utilizing the rest of the Python ecosystem for analysis. With an interface to crawling, extraction, topic modeling,search indexing, and image analysis.