> Dear Group,
> Currently we are working on a school project for a Distributed Systems
> Course. We have decided to do an analysis and possibly an
> implementation of a Search Engine. It would help us a lot if we could
> obtain the full source code(and maybe the documentation) of a search engine
> that can be operated on UNIX/Linux.
> The implementation needs for the search engine we need is as follows :
> 1. Must be implemented using a multithreaded design.
> 2. Should be either an Integrated or Meta Search Engine.
> If you have or know any information on documents, source codes, etc.
Please follow up your queries on the newsgroup that you presumed to
elicit free advice!
There are two disparate meanings attributed to a generic "where is
source code for a search engine?" question. The first has to be a
disappointing "there is none available." The source code for the
major search engines, almost always based on the inktomi kernel
(see www.inktomi.com), is carefully guarded and never released
non-commercially (see www.searchenginewatch.com/resources/). However,
the good news is that quality software for CGI-capable search
engines (meant for internally hashing and searching of an intranet
or local web site) is freely available and in a constant state of
development and experimentation.
The most well known of this latter category is glimpse/webglimpse at:
http://www.webglimpse.net/
Quote:> Thank you.
You're welcome! Good luck.
=Brian