logo

Creating Inverted Index, Boolean and Vector Queries

   

Added on  2023-06-07

13 Pages1108 Words105 Views
COVER PAGE
Creating Inverted Index, Boolean and Vector Queries_1
Contents
COVER PAGE................................................................................................................................................1
Question 1...................................................................................................................................................3
1 creating inverted index.........................................................................................................................3
2 Boolean and vector queries..................................................................................................................9
Question 2.................................................................................................................................................10
Bibliography...............................................................................................................................................12
Creating Inverted Index, Boolean and Vector Queries_2
Question 1
Documents;
Computer vision
As a scientific discipline, computer vision is concerned with the theory and technology for building
artificial systems that obtain information from images or multi-dimensional data
Search Engine
Google launched a dedicated dataset search website to help journalists and researchers crunch through
the numbers
Database
Alternative Database Models. Over time, database expectations have stretched beyond relational
models to include non-relational database system
1 creating inverted index
a. Stop Words removal
Documents without stop words become;
Computer vision
scientific discipline computer vision concerned theory technology building artificial systems
obtain information images multi-dimensional data
Search Engine
Google launched dedicated dataset search website help journalists researchers crunch numbers
Database
Alternative Database Models database expectations stretched beyond relational models include
non-relational database system
Porter Stemming algorithm
The new stemmed documents become;
Doc 1-Computer vision
scientif disciplin comput vision concern theori technolog build artifici system obtain inform imag
multi dimension data
Doc 2-Search Engine
Googl launch dedic dataset search websit help journalist research crunch number
Doc 3-Database
Creating Inverted Index, Boolean and Vector Queries_3
Altern Databas Model databas expect stretch beyond relat model includ non relat databas
system
b. Merged inverted list with in document frequencies
Doc 1-Computer vision
scientif disciplin comput vision concern
theori technolog build artifici system obtain
inform imag multi dimension data
Doc 2-Search Engine
Googl launch dedic dataset search websit
help journalist research crunch number
Doc 3-Database
Altern Databas Model over databas expect
stretch beyond relat model includ non relat
databas system
Step 1: List normalized tokens for each document
Term Doc ID
scientif 1
disciplin 1
comput 1
vision 1
concern 1
theori 1
technolog 1
build 1
artifici 1
system 1
obtain 1
inform 1
imag 1
multi 1
dimension 1
data 1
Googl 2
launch 2
dedic 2
dataset 2
search 2
websit 2
help 2
journalist 2
research 2
crunch 2
number 2
altern 3
databas 3
expect 3
stretch 3
beyond 3
relat 3
model 3
includ 3
non 3
relat 3
databas 3
system 3
Creating Inverted Index, Boolean and Vector Queries_4

End of preview

Want to access all the pages? Upload your documents or become a member.