Thread: about inverted indexing program

  1. #1
    Registered User
    Join Date
    Apr 2002

    about inverted indexing program


    I was trying to implement an indexing program that uses an inverted index file system, i.e. the progaram provides a weighting each document based on keyword method, once the keyword is searched the program should pickup a document that has the highest frequency of that keyword.

    I am having a go but I would need some guidance from someone from the board with information retrievial experience, I would be
    happy to be guided by examples.

    thanks in advance

  2. #2
    Registered User hk_mp5kpdw's Avatar
    Join Date
    Jan 2002
    Northern Virginia/Washington DC Metropolitan Area
    Well, lets say you need to go through a certain directory file-by-file and count the number of occurances of a specific word in those files. If you were to store, for instance, the count and the filename results into a multimap<int,string,greater<int> > container then this would result in a list of filenames where those files with the higher counts would be at the start of the multimap. Is this what you are talking about?
    "Owners of dogs will have noticed that, if you provide them with food and water and shelter and affection, they will think you are god. Whereas owners of cats are compelled to realize that, if you provide them with food and water and shelter and affection, they draw the conclusion that they are gods."
    -Christopher Hitchens

Popular pages Recent additions subscribe to a feed

Similar Threads

  1. Need help with a program, theres something in it for you
    By engstudent363 in forum C Programming
    Replies: 1
    Last Post: 02-29-2008, 01:41 PM
  2. Replies: 4
    Last Post: 02-21-2008, 10:39 AM
  3. Using variables in system()
    By Afro in forum C Programming
    Replies: 8
    Last Post: 07-03-2007, 12:27 PM
  4. Replies: 3
    Last Post: 03-04-2005, 02:46 PM
  5. My program, anyhelp
    By @licomb in forum C Programming
    Replies: 14
    Last Post: 08-14-2001, 10:04 PM