[seqfan] Re: Broken link hunt
michel.marcus at free.fr
michel.marcus at free.fr
Sun Aug 2 20:41:49 CEST 2020
I think your file.txt should have the OEIS link line, to be able to search for the link title on the web.
And the A_number, to know where the corrected URL must be entered.
Best.
MM
----- Mail original -----
De: "Elijah Beregovsky" <elijah.beregovsky at gmail.com>
À: "Sequence Fanatics Discussion list" <seqfan at list.seqfan.eu>
Envoyé: Dimanche 2 Août 2020 18:14:27
Objet: [seqfan] Broken link hunt
Hi, Seqfans!
Everyone knows that there are loads of rotten links in the OEIS. For the
past couple of days I've been trying to locate and fix as many as I can.
But then my father suggested I automate this process, so I did exactly
that. I made a (not very sophisticated) crawler that finds and stores in a
file all links throwing Error 404. (
https://github.com/BIGfoot496/OEIS-crawler) After approximately an hour of
searching it returned a file with over a hundred links (in attachment).
That's definitely not all of the dead links and I'm going to run the code
for a much longer time, but this is already too much work for me to do it
alone. Let's fix them!
Elijah
PS: I wouldn't reject coding help, because the crawler isn't nearly optimal
yet. It only catches 404s and slows down significantly after working for
some time.
--
Seqfan Mailing list - http://list.seqfan.eu/
More information about the SeqFan
mailing list