# tiigle

A set of tools for website crawling, scraping and unsupervised clustering.

## Architecture considerations

### Crawling and Scraping

Based on Scrapy.

### Unsupervised Clustering

Sources:

- unsupervised clustering of movies based on abstracts
- unsupervised clustering of news articles

## Output

- Webpages and websites clustered by content
- Structured search (Google on steroids)
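
As a rough illustration of the unsupervised clustering step, the sketch below groups already-scraped page texts by content using TF-IDF vectors and a small cosine-similarity k-means. It is not the project's implementation: the function names (`tfidf_vectors`, `cluster_pages`), the farthest-first seeding, and the sample `pages` are all assumptions made for the example, and it uses only the standard library so it runs without Scrapy or any other dependency. In practice the input texts would come from the crawler.

```python
import math
import re
from collections import Counter


def normalise(vec):
    """Scale a sparse vector (dict of term -> weight) to unit length."""
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else dict(vec)


def tfidf_vectors(docs):
    """Return one unit-length sparse TF-IDF vector per document."""
    counts = [Counter(re.findall(r"[a-z]+", d.lower())) for d in docs]
    n = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for c in counts for term in c)
    return [
        normalise({t: tf * (math.log((1 + n) / (1 + df[t])) + 1)
                   for t, tf in c.items()})
        for c in counts
    ]


def similarity(a, b):
    """Dot product of two sparse vectors (cosine, if both are unit length)."""
    return sum(w * b.get(t, 0.0) for t, w in a.items())


def cluster_pages(docs, k, max_iter=20):
    """Assign each document a cluster label in range(k) (hypothetical helper)."""
    if not 1 <= k <= len(docs):
        raise ValueError("k must be between 1 and the number of documents")
    vectors = tfidf_vectors(docs)

    # Deterministic farthest-first seeding: start from the first document,
    # then repeatedly add the document least similar to the chosen seeds.
    seeds = [0]
    while len(seeds) < k:
        rest = [i for i in range(len(vectors)) if i not in seeds]
        seeds.append(min(
            rest,
            key=lambda i: max(similarity(vectors[i], vectors[s]) for s in seeds),
        ))
    centroids = [dict(vectors[s]) for s in seeds]

    labels = None
    for _ in range(max_iter):
        # Assignment step: pick the most similar centroid for each document.
        new_labels = [
            max(range(k), key=lambda c: similarity(v, centroids[c]))
            for v in vectors
        ]
        if new_labels == labels:
            break  # converged
        labels = new_labels
        # Update step: each centroid becomes the normalised sum of its members.
        for c in range(k):
            total = Counter()
            for v, label in zip(vectors, labels):
                if label == c:
                    for t, w in v.items():
                        total[t] += w
            if total:  # keep the old centroid if a cluster ends up empty
                centroids[c] = normalise(total)
    return labels


# Made-up page texts standing in for scraped content.
pages = [
    "movie film actor director cinema",
    "film actor cinema screenplay",
    "football match goal league",
    "league goal striker football",
]
labels = cluster_pages(pages, k=2)
print(labels)
```

With these sample texts the two film pages end up in one cluster and the two football pages in the other. For real crawls, a library such as scikit-learn would likely be a better fit than this hand-rolled version; the standard-library approach is used here only to keep the sketch self-contained.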