Deakin University SIT772: Search Algorithm Design with Models

Verified

Added on  2023/06/07

|13
|1108
|105
Report
AI Summary
This report details the design of a search algorithm using both Boolean and Vector models, including the creation of an inverted index with stop word removal and Porter stemming. It explains the merged inverted list with in-document frequencies and dictionary/posting file creation. The report then tests the posting file and demonstrates Boolean queries (AND, OR, NOT) and the Vector Space Model using cosine similarity to rank documents based on relevance to a given query. Finally, it evaluates the performance of Google and Bing search engines based on recall and precision, determining Google's superiority. The report is based on SIT772 Database and Information Retrieval Assessment Task 2 Trimester 2 2018 from Deakin University.
chevron_up_icon
1 out of 13
circle_padding
hide_on_mobile
zoom_out_icon
Loading PDF…
[object Object]