SIT772 - Database and Information Retrieval: Assignment 2
VerifiedAdded on 2023/03/21
|25
|2043
|81
Homework Assignment
AI Summary
This document presents a comprehensive solution to a data science assignment focusing on information retrieval techniques. The assignment begins with stemming and stop word removal applied to three documents, followed by the creation of a merged inverted list with within-document frequencies and a dictionary. The core of the solution explores the Boolean and vector models for query processing. The Boolean model utilizes logical operators to retrieve documents, while the vector model employs cosine similarity to rank documents based on relevance. The document then evaluates the performance of Google and Bing search engines by analyzing their results for queries related to the price of a new Xbox One. The analysis includes identifying relevant and irrelevant documents returned by each search engine, highlighting the differences in precision between the two engines. The assignment demonstrates the practical application of information retrieval concepts and provides a comparative analysis of different search engine approaches.
1 out of 25