Search Engine on Wikipedia dump with support for field queries
- Python 2.6 or above
- Python libraries:
- Porter Stemmer
- XML Parser
- NLTK
Index can be generated using:
./index.sh "path_to_wiki_dump"
For Searching:
python search.py
Sample Query
- Plain query
- Field query: "C:Plane B:Bus T:Air"
Term Field Abbreviations: b:Body, t:Title e:External Link, c:Category
You can download a small dump to test run from here.