[seqfan] Re: Broken link hunt

michel.marcus at free.fr michel.marcus at free.fr
Sun Aug 2 20:41:49 CEST 2020


I think your file.txt should have the OEIS link line, to be able to search for the link title on the web. 
And the A_number, to know where the corrected URL must be entered. 
Best. 
MM 

----- Mail original -----

De: "Elijah Beregovsky" <elijah.beregovsky at gmail.com> 
À: "Sequence Fanatics Discussion list" <seqfan at list.seqfan.eu> 
Envoyé: Dimanche 2 Août 2020 18:14:27 
Objet: [seqfan] Broken link hunt 

Hi, Seqfans! 
Everyone knows that there are loads of rotten links in the OEIS. For the 
past couple of days I've been trying to locate and fix as many as I can. 
But then my father suggested I automate this process, so I did exactly 
that. I made a (not very sophisticated) crawler that finds and stores in a 
file all links throwing Error 404. ( 
https://github.com/BIGfoot496/OEIS-crawler) After approximately an hour of 
searching it returned a file with over a hundred links (in attachment). 
That's definitely not all of the dead links and I'm going to run the code 
for a much longer time, but this is already too much work for me to do it 
alone. Let's fix them! 
Elijah 

PS: I wouldn't reject coding help, because the crawler isn't nearly optimal 
yet. It only catches 404s and slows down significantly after working for 
some time. 


-- 
Seqfan Mailing list - http://list.seqfan.eu/ 




More information about the SeqFan mailing list