Implementation of an Image Search Engine

In this project I present a scalable, integrated text-based image retrieval system (search engine) and evaluate its effectiveness. The search engine crawls and indexes all the pages in the given domains and retrieves the images found on those pages, along with all the relevant keywords that can be used to identify them. The keywords are loaded into a database along with several statistics indicating where each keyword occurs in the page. Only thumbnail versions of the images are downloaded to the server, which saves disk space. Several heuristics and metrics are used for identifying the images and their relevance...
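
As a rough illustration of the keyword extraction described above, the following minimal sketch collects, for each image on a page, the words that might identify it, records where each word was found, and turns those locations into a simple score. It is not the project's actual implementation: the three locations (alt text, file name, page title), the weights in LOCATION_WEIGHTS, and the names ImageKeywordParser and extract_image_keywords are all assumptions made for the example. Crawling, the database, and thumbnail generation are left out.

```python
import os
import re
from collections import defaultdict
from html.parser import HTMLParser
from urllib.parse import urlparse

# Hypothetical weights per keyword location (assumed, not from the project).
LOCATION_WEIGHTS = {"alt": 3.0, "title": 2.0, "filename": 1.0}


def tokenize(text):
    """Split text into lowercase alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


class ImageKeywordParser(HTMLParser):
    """Collects candidate keywords for every <img> on a page."""

    def __init__(self):
        super().__init__()
        self.page_title_words = []
        self.images = {}  # image src -> {keyword: set of locations}
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "img":
            attrs = dict(attrs)
            src = attrs.get("src")
            if not src:
                return
            hits = self.images.setdefault(src, defaultdict(set))
            # Words from the alt attribute (may be missing or empty).
            for word in tokenize(attrs.get("alt") or ""):
                hits[word].add("alt")
            # Words from the file name, without directory or extension.
            stem = os.path.splitext(os.path.basename(urlparse(src).path))[0]
            for word in tokenize(stem):
                hits[word].add("filename")

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.page_title_words.extend(tokenize(data))


def extract_image_keywords(html):
    """Return (src, keyword, locations, score) rows for one HTML page."""
    parser = ImageKeywordParser()
    parser.feed(html)
    parser.close()
    rows = []
    for src, hits in parser.images.items():
        # Page-title words apply to every image on the page.
        for word in parser.page_title_words:
            hits[word].add("title")
        for word, locations in hits.items():
            score = sum(LOCATION_WEIGHTS[loc] for loc in locations)
            rows.append((src, word, sorted(locations), score))
    return rows
```

Each returned row corresponds to what could be one database record: the image, a keyword, the places it occurred, and a relevance score. A keyword found in several locations scores higher than one found only in the file name, which is one simple way location statistics could feed a ranking.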

