Creating an Inverted Index and Removing Stop Words

   

Added on  2023-04-19


Contents
Question 1
Creating an inverted index
Remove stop words
Porter Stemming algorithm
Merged inverted list
Posting file
Testing
Boolean Model and Vector Model
Question 2
Bibliography
Question 1
Creating an inverted index
Document 1
Information retrieval is the activity of obtaining information resources relevant to an
information need from a collection of information resources. Searches can be based on full-text
or other content-based indexing.
Document 2
Information retrieval is finding material of an unstructured nature that satisfies an information
need from within large collections.
Document 3
Information systems is the study of complementary networks of hardware and software that
people and organizations use to collect, filter, process, create, and distribute data.
Remove stop words
Results
Document 1
Information retrieval activity obtaining information resources relevant information collection
information resources Searches based full-text content-based indexing
Document 2
Information retrieval finding material unstructured nature satisfies information within large
collections
Document 3
Information systems study complementary networks hardware software people organizations
collect filter process create distribute data
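
The stop-word step above can be sketched in Python as follows. This is a minimal illustration, not the tool that produced the results: the stop list is an assumption inferred from the words missing in the three outputs, and the names STOP_WORDS and remove_stop_words are hypothetical.

```python
import re

# Assumed stop list, inferred from the words that disappear in the results.
STOP_WORDS = {
    "a", "an", "and", "be", "can", "from", "is", "need",
    "of", "on", "or", "other", "that", "the", "to", "use",
}


def remove_stop_words(text):
    """Return the tokens of `text` that are not in STOP_WORDS, in order."""
    # A token is a run of letters, optionally joined by internal hyphens,
    # so "full-text" stays one token and trailing punctuation is dropped.
    tokens = re.findall(r"[A-Za-z]+(?:-[A-Za-z]+)*", text)
    # Compare in lowercase but keep the original spelling in the output.
    return [t for t in tokens if t.lower() not in STOP_WORDS]


doc2 = (
    "Information retrieval is finding material of an unstructured nature "
    "that satisfies an information need from within large collections"
)
print(" ".join(remove_stop_words(doc2)))
```

Matching is done on the lowercased token so that a capitalized stop word at the start of a sentence is also removed, while the surviving words keep their original case for the next step.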
Porter Stemming algorithm
Results
Document 1
inform retriev activ obtain inform resourc relev inform collect inform resourc search base full
text content base index
Document 2
inform retriev find materi unstructur natur satisfi inform within larg collect
Document 3
inform system studi complementari network hardwar softwar peopl organ collect filter
process creat distribut data
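
Once the documents are reduced to stems, an inverted index maps each term to the documents that contain it. The sketch below is one possible way to build it, under stated assumptions: the stems are lowercased before indexing so that capitalized and lowercase forms share one entry, "inform" is used as the Porter stem of "information" throughout, and each posting stores a term frequency. The names STEMMED_DOCS and build_inverted_index are hypothetical.

```python
from collections import defaultdict

# Stemmed documents, keyed by document number (lowercased; assumption).
STEMMED_DOCS = {
    1: ("inform retriev activ obtain inform resourc relev inform collect "
        "inform resourc search base full text content base index"),
    2: ("inform retriev find materi unstructur natur satisfi inform "
        "within larg collect"),
    3: ("inform system studi complementari network hardwar softwar peopl "
        "organ collect filter process creat distribut data"),
}


def build_inverted_index(docs):
    """Map each term to {doc_id: term frequency}, with sorted terms and ids."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            # Count how often the term occurs in this document.
            index[term][doc_id] = index[term].get(doc_id, 0) + 1
    # Sort terms alphabetically and postings by document id.
    return {term: dict(sorted(postings.items()))
            for term, postings in sorted(index.items())}


index = build_inverted_index(STEMMED_DOCS)
for term in ("collect", "inform", "retriev"):
    print(term, index[term])
```

Keeping the postings sorted by document number makes it straightforward to intersect or merge two lists in a single pass when evaluating Boolean queries.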