Blog Archive

Thursday, June 4, 2009

An indexing system for your wikipedia/... research ;)

Hey, i like to say, always we find nice articles, and when it's related to our current task, it's wonderful.

Last post i wrote about making research with the contents of the wikipedia in a different way, in offline-mode, i mean, you download a whole database of a specific language(or more), and do your research for whatever you would like to.

This as we known improves a lot the speed and don't make the wikipedia web admins crazy with your "bot's" grabbing information from their web services.

You could make your own search engine/Indexing system, but it takes time because you need to optimize the way you index the strings/objects of you content's, i good start is from:

Hyperestraier

By Ben Okopnik in http://linuxgazette.net

I could point just the project link and let you search for information related to it, but i found that article really useful.

The projects page is here.

At the first time i thought that it was just a library, as you known implement something independent of the library always take time, because of it i didn't dealed with it till now.

No comments:

Post a Comment