[seqfan] Re: help needed with Mediawiki's Lucene-search extension

Olivier Gerard olivier.gerard at gmail.com
Fri Dec 18 07:35:37 CET 2009


On Fri, Dec 18, 2009 at 02:34, N. J. A. Sloane <njas at research.att.com> wrote:
>
> The trouble is, Russ Cox's search takes its data from
> "cat25", which is the big flat file that contains
> ALL the sequences, in the internal format.
>
> wc cat25
>  2612489  21440539 176452802 cat25
>
> But once the wiki is stabilized, cat25 will go away.
> It would require a huge amount of work to modify
> Russ's program so that it takes its data from the
> 170,000 individual wiki pages, all written in
> wiki language.  (Of course we considered this,
> it was our first choice.  But it won't work.)
>

As I suggested, it could be a relatively straightforward job
to write a script that produces, and updates daily, the successor
of cat25 (say cat26, or catwiki) from the pages.

It might require a dedicated back-end machine
to run it at sufficiently close intervals.
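
A minimal sketch of such a script, in Python, under several assumptions:
the wiki pages have already been exported as one text file per sequence
into a directory (the "*.txt" naming is hypothetical), and convert_page()
is only a placeholder, since the actual conversion from wiki language to
the internal format is not described here. The function names and paths
are illustrative, not part of any existing tool.

```python
import os
import tempfile
from pathlib import Path


def convert_page(text: str) -> str:
    """Placeholder: turn one page's text into flat-file lines.

    The real wiki-to-internal-format conversion is not specified,
    so this only normalises the trailing newline.
    """
    return text.rstrip("\n") + "\n"


def rebuild_flat_file(pages_dir: str, out_path: str) -> int:
    """Concatenate all exported pages into one flat file; return page count."""
    # Sort by file name so the output order is stable from run to run.
    pages = sorted(Path(pages_dir).glob("*.txt"))
    out = Path(out_path)
    # Write to a temporary file in the same directory, then rename it,
    # so a search program never reads a half-written flat file.
    fd, tmp_name = tempfile.mkstemp(dir=str(out.parent), prefix=out.name + ".")
    with os.fdopen(fd, "w", encoding="utf-8") as tmp:
        for page in pages:
            tmp.write(convert_page(page.read_text(encoding="utf-8")))
    os.replace(tmp_name, out)
    return len(pages)
```

Something like rebuild_flat_file("pages", "catwiki") could then be run
from a daily scheduled job. Rebuilding the whole file each time is the
simplest choice; an incremental update that only touches changed pages
would be faster but needs more bookkeeping.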


Olivier



