[seqfan] Re: Trying to use the author field of the oeis for FindStat

Neil Sloane njasloane at gmail.com
Mon Jun 1 23:53:38 CEST 2015


I concur with Robert Israel's response. Write a better parser!

Most of the suggested changes would either change the meaning of the line
or are unnecessary.  Here are my responses to the individual
suggestions:

('A005985', '_Colin Mallows_; revised Jun 13 2005'),
Not that important but ";" -> "," would be good. NO, that changes the
meaning

('A117239', '_Eric W. Weisstein_, Mar 05, 2006')
I'd appreciate removing the comma after 05.  NO, we have always made that
comma optional

('A184184', '_Emeric Deutsch_, Feb 16 2011 (based on communication from
_Vladeta Jovovic_)'),
or leave it as is. LEAVE IT AS IS

('A135533', '_N. J. A. Sloane_, based on a message from Guy Steele and D.
E. Knuth, Mar 01 2008'),
Same problem, but worse. LEAVE IT THE WAY IT IS.

('A199352', '_R. H. Hardin_ Nov 05 2011'),
I'd appreciate a comma after the second underscore. LEAVE IT THE WAY IT IS,
there are probably a lot of these, and there is no ambiguity here

('A018178', '_N. J. A. Sloane_.'),
Please remove the period. LEAVE IT THE WAY IT IS. That period is fine.

('A016437', 'rwgk(AT)cci.lbl.gov (R.W.
Grosse-Kunstleve)'),
I'd appreciate
'R. W. Grosse-Kunstleve (rwgk(AT)cci.lbl.gov)'
I WILL FIX THIS (but not right away)




Best regards
Neil

Neil J. A. Sloane, President, OEIS Foundation.
11 South Adelaide Avenue, Highland Park, NJ 08904, USA.
Also Visiting Scientist, Math. Dept., Rutgers University, Piscataway, NJ.
Phone: 732 828 6098; home page: http://NeilSloane.com
Email: njasloane at gmail.com


On Mon, Jun 1, 2015 at 2:05 AM, <israel at math.ubc.ca> wrote:

> Basically, the problem is that the entries were written by many different
> people over many years, and we have not always been diligent about following
> the formatting rules.  It might be best to make your software more
> flexible at parsing names and dates.
>
> Anything between "_"'s is an author. Month names (in all their possible
> variants) are pretty easy to recognize; a four-digit integer from 1990 to
> the current year is a year; a one or two digit integer before or after a
> month name is a day. Anything else can probably be ignored, as a first
> approximation.
>
> Cheers,
> Robert
>
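The heuristic Robert describes above could be sketched roughly as follows in Python. This is only a first approximation under stated assumptions, not FindStat's or the OEIS's actual code: the function name parse_author_field, the this_year parameter, and the choice to take the first month name and the first in-range year found are all hypothetical choices made for illustration.

```python
import re
from datetime import date

# Month names in their common variants ("Apr", "april", "Sept", ...).
MONTH_RE = re.compile(
    r"\b(jan(?:uary)?|feb(?:ruary)?|mar(?:ch)?|apr(?:il)?|may|june?|july?|"
    r"aug(?:ust)?|sep(?:t(?:ember)?)?|oct(?:ober)?|nov(?:ember)?|"
    r"dec(?:ember)?)\b",
    re.IGNORECASE,
)
MONTHS = {m: i + 1 for i, m in enumerate(
    ["jan", "feb", "mar", "apr", "may", "jun",
     "jul", "aug", "sep", "oct", "nov", "dec"])}

# Anything between a pair of underscores is an author.
AUTHOR_RE = re.compile(r"_([^_]+)_")


def parse_author_field(field, this_year=None):
    """Return (authors, (year, month, day)); missing date parts are None."""
    if this_year is None:
        this_year = date.today().year

    authors = AUTHOR_RE.findall(field)
    # Remove author names so that a name like "Mar..." is not read as a month.
    rest = AUTHOR_RE.sub(" ", field)

    year = month = day = None
    m = MONTH_RE.search(rest)
    if m:
        month = MONTHS[m.group(1)[:3].lower()]
        # A one- or two-digit integer right after or right before the month
        # name is taken to be the day.
        after = re.match(r"\s*(\d{1,2})(?!\d)", rest[m.end():])
        before = re.search(r"(?<!\d)(\d{1,2})\s*$", rest[:m.start()])
        if after:
            day = int(after.group(1))
        elif before:
            day = int(before.group(1))

    # A four-digit integer from 1990 to the current year is a year.
    for y in re.findall(r"(?<!\d)(\d{4})(?!\d)", rest):
        if 1990 <= int(y) <= this_year:
            year = int(y)
            break

    return authors, (year, month, day)


print(parse_author_field("_Eric W. Weisstein_, Mar 05, 2006"))
print(parse_author_field("_R. H. Hardin_ Nov 05 2011"))
print(parse_author_field("_Patrick De Geest_, april 1998."))
```

Everything that is neither an underscored name nor part of a date (semicolons, trailing periods, "revised", "based on a message from ...") is simply ignored, which is why the variants listed in this thread would not need to be edited for a parser of this kind.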
>
> On May 31 2015, Rubey Martin wrote:
>
>> Hi Alonso!
>>
>>> Some of those latter eight need to be corrected. I'll get back to you
>>> later today. Al
>>
>>>> These entries look OK (authors are separated by " and " or "," possibly
>>>> followed by a date). However, the following are harder to parse:
>>
>> ('A032537', '_Patrick De Geest_, april 1998.'),
>>
>> I'd appreciate "april 1998." -> "Apr 1998"
>>
>> ('A005985', '_Colin Mallows_; revised Jun 13 2005'),
>>
>> Not that important but ";" -> "," would be good.
>>
>> ('A117239', '_Eric W. Weisstein_, Mar 05, 2006')
>>
>> I'd appreciate removing the comma after 05
>>
>> ('A184184', '_Emeric Deutsch_, Feb 16 2011 (based on communication from
>> _Vladeta Jovovic_)'),
>>
>> I'm not sure what's best here.  Either
>>
>> '_Emeric Deutsch_, _Vladeta Jovovic_, Feb 16 2011'
>>
>> or leave it as is.
>>
>> ('A135533', '_N. J. A. Sloane_, based on a message from Guy Steele and D.
>> E. Knuth, Mar 01 2008'),
>>
>> Same problem, but worse.  Either
>>
>> '_N. J. A. Sloane_, Guy Steele, D. E. Knuth, Mar 01 2008'
>>
>> or
>>
>> '_N. J. A. Sloane_, Mar 01 2008 (based on a message from Guy Steele and
>> D. E. Knuth)')
>>
>> FindStat would then remove parenthetical remarks to obtain the author.
>>
>> ('A199352', '_R. H. Hardin_ Nov 05 2011'),
>>
>> I'd appreciate a comma after the second underscore.
>>
>> ('A018178', '_N. J. A. Sloane_.'),
>>
>> Please remove the period.
>>
>> ('A016437', 'rwgk(AT)cci.lbl.gov (R.W.
>> Grosse-Kunstleve)'),
>>
>> I'd appreciate
>>
>> 'R. W. Grosse-Kunstleve (rwgk(AT)cci.lbl.gov)'
>>
>> Many many many thanks!
>>
>> Martin
>>
>> _______________________________________________
>>
>> Seqfan Mailing list - http://list.seqfan.eu/
>>
>>
>>


