SIT772 - Database and Information Retrieval Assignment 2 Solutions
VerifiedAdded on 2022/11/16
|32
|2201
|499
Homework Assignment
AI Summary
This document presents a comprehensive solution for SIT772 Assignment 2, focusing on information retrieval techniques. The solution addresses the problem of designing a search algorithm using both Vector and Boolean models. It includes the removal of stop words, stemming, and the creation of an inverted index with within-document frequencies. The solution also covers the merging of the inverted list, dictionary and posting file. The document tests the inverted index with keywords and compares the Boolean and Vector models using cosine similarity for document retrieval. Furthermore, the solution includes an evaluation of search engines (Google and Yahoo) based on their ability to retrieve course information. The assignment covers key concepts like data retrieval, search algorithms, inverted indexes, and search engine evaluation.