Crawler Module

Fetches webpages from websites by following hyperlinks.
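
The link-following idea can be sketched as a breadth-first traversal. This is a minimal illustration rather than a production crawler: the `fetch` callable, the `max_pages` limit and the names `LinkExtractor` and `crawl` are assumptions introduced here, and politeness rules such as robots.txt handling and rate limiting are left out.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(seed_url, fetch, max_pages=100):
    """Breadth-first crawl starting at seed_url.

    fetch is a hypothetical callable taking a URL and returning the page's
    HTML as a string, or None if the page could not be retrieved.
    Returns a dict mapping each fetched URL to its HTML.
    """
    pages = {}
    queue = deque([seed_url])
    seen = {seed_url}  # URLs already queued, so no page is fetched twice
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue
        pages[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            absolute = urljoin(url, href)  # resolve relative links
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return pages
```

Passing `fetch` in as a parameter keeps the traversal logic separate from the network code, so the sketch can be exercised against an in-memory dictionary of pages (for example `crawl(seed, site.get)`).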

Page Repository

Stores the pages obtained by the crawler for later use and subsequent indexing; may also support de-duplication.
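
One possible way to support de-duplication is to key stored pages by a hash of their content. The sketch below assumes an in-memory store and exact-match duplicates only; the class name `PageRepository` and its methods are hypothetical, and re-crawls of an already stored URL are not handled.

```python
import hashlib


class PageRepository:
    """In-memory page store that rejects byte-identical duplicate content."""

    def __init__(self):
        self._pages = {}    # url -> page content
        self._digests = {}  # SHA-256 hex digest -> first url seen with it

    def add(self, url, content):
        """Store content under url.

        Returns True if the page was stored, False if identical content
        was already stored under some URL.
        """
        digest = hashlib.sha256(content.encode("utf-8")).hexdigest()
        if digest in self._digests:
            return False
        self._digests[digest] = url
        self._pages[url] = content
        return True

    def get(self, url):
        """Return the stored content for url, or None if it is not stored."""
        return self._pages.get(url)

    def __len__(self):
        return len(self._pages)
```

Exact hashing only catches identical copies; near-duplicate detection would need a different technique and is outside this sketch.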

Indexing module

Analyses the content of the webpages and constructs the information needed for the indexes.

Offers the data structures needed to support query operations and the return of results.
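
A common data structure for this purpose is an inverted index, which maps each term to the pages that contain it. The sketch below assumes pages have already been reduced to plain text; the function names and the simple alphanumeric tokenizer are illustrative choices, not something the notes prescribe.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lower-case the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_inverted_index(pages):
    """Build an inverted index from a dict mapping url -> plain text.

    Returns a dict mapping each term to a dict of {url: term frequency}.
    """
    index = defaultdict(dict)
    for url, text in pages.items():
        for term in tokenize(text):
            index[term][url] = index[term].get(url, 0) + 1
    return dict(index)
```

Storing the term frequency alongside each URL keeps enough information for a later ranking step without re-reading the pages.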

Query module

Resolves the string typed by the user into the form required to retrieve information from the indexes.
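
As a rough illustration, the query string can be normalised into a list of terms and then matched against an index. The sketch assumes an index shaped as term -> {url: frequency}, a tiny illustrative stop-word list, and AND semantics (every term must appear); all names here are hypothetical.

```python
import re

# Illustrative stop-word list; a real system would use a fuller one.
STOPWORDS = {"a", "an", "and", "of", "the", "to"}


def parse_query(query):
    """Turn the raw query string into a list of lower-case index terms."""
    terms = re.findall(r"[a-z0-9]+", query.lower())
    return [t for t in terms if t not in STOPWORDS]


def lookup(index, query):
    """Return the set of URLs that contain every term of the query.

    index is assumed to map term -> {url: term frequency}.
    """
    terms = parse_query(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], {}))
    for term in terms[1:]:
        result &= set(index.get(term, {}))  # keep only URLs with this term too
    return result
```

The query must be tokenized the same way as the indexed pages, otherwise terms that look identical to the user would fail to match.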

Ranking module

Determines how, and in what order, to present the results obtained from the index.
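
To make the ordering step concrete, the sketch below sorts candidate pages by the summed frequency of the query terms. This is a deliberately simple scoring rule chosen for illustration, not the method of any particular search engine; the index shape (term -> {url: frequency}) and the function name `rank` are assumptions.

```python
def rank(index, terms, candidates):
    """Order candidate URLs by summed term frequency, highest first.

    index is assumed to map term -> {url: term frequency}.
    Ties are broken alphabetically by URL so the order is deterministic.
    """
    def score(url):
        return sum(index.get(term, {}).get(url, 0) for term in terms)

    return sorted(candidates, key=lambda url: (-score(url), url))
```

Real ranking modules usually combine several signals, such as content relevance and link structure, but the overall shape of scoring each candidate and then sorting stays the same.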

