patches for UserAgent/*
Joost van Baal
joostvb at logreport.org
Wed Aug 18 13:41:59 CEST 2004
Hi Egon,
On Sun, Aug 15, 2004 at 01:53:03AM +0200, Egon Willighagen wrote:
>
> attached are three patches for Robot.pm, Browser.pm and OS.pm.
>
> My local www report had too many "Unknown" entries in the reports so I did some
> research and hacked a bit in my local Lire 1.5 installation...
>
> On Browser.pm:
>
> It did not recognize recent browsers, like Firefox, Epiphany or Safari...
> Nor did it recognize Java VMs that can retrieve over HTTP... The test
> for 'gecko' is no longer a valid marker for Mozilla browsers, as there are many
> browsers based on the Gecko engine, *like* Mozilla... but that does not
> make them Mozilla...
Did you send this patch to the AWStats maintainers too?
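The point about 'gecko' can be illustrated with a minimal sketch. It is written in Python rather than the Perl of Browser.pm, and it is not the code from the patch: the function name, the pattern lists, the fallback label, and the sample user agent strings are all assumptions made for illustration. The idea is only that specific product tokens are tested before the generic engine token.

```python
# Illustrative sketch only; names and patterns are hypothetical,
# not taken from Browser.pm or from the patch discussed here.

# Browsers built on the Gecko engine that carry their own product token.
GECKO_BASED = [
    ("firefox", "Firefox"),
    ("epiphany", "Epiphany"),
]


def classify_browser(user_agent):
    """Return a browser name, testing specific tokens before 'gecko'."""
    ua = user_agent.lower()
    for needle, name in GECKO_BASED:
        if needle in ua:
            return name
    # Safari's string contains "like Gecko", so it must be tested
    # before the generic engine token as well.
    if "safari" in ua:
        return "Safari"
    if "gecko" in ua:
        return "Other Gecko-based browser"
    return "Unknown"


# Sample strings, assumed for the example.
firefox_ua = "Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7) Gecko/20040803 Firefox/0.9.3"
safari_ua = "Mozilla/5.0 (Macintosh; U; PPC Mac OS X; en) AppleWebKit/125.4 (KHTML, like Gecko) Safari/125.9"
plain_gecko_ua = "Mozilla/5.0 (X11; U; Linux i686; en-US; rv:1.7) Gecko/20040803"

print(classify_browser(firefox_ua))
print(classify_browser(safari_ua))
print(classify_browser(plain_gecko_ua))
```

A test on 'gecko' alone would label all three sample strings the same way, which is the misclassification described above.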
> On OS.pm:
>
> After some heavy debugging, I found that the keys method for hashes does not
> return them in the order in which they were entered in the hashtable... and it
> turned out that x11 ("Unknown X11 System") was tested before things like "linux" :(
> Therefore, I separated these more generic types into a separate hash.
This one didn't apply cleanly to the version in CVS: $Id: OS.pm,v 1.1
2004/05/24 15:24:33 flacoste Exp $ . After some tweaking it applied.
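The two-hash approach described for OS.pm can be sketched roughly as follows. This is a Python illustration, not the Perl from the patch; the table names, the function name, and the "freebsd" entry are assumptions, while "linux", "x11" and "Unknown X11 System" come from the description above. Note that Python dictionaries keep insertion order whereas Perl's keys does not, so the sketch relies only on the order of the two tables, never on the order of entries within a table.

```python
# Illustrative sketch only; table and function names are hypothetical.

# Specific operating systems: always tested first.
SPECIFIC_OS = {
    "linux": "Linux",
    "freebsd": "FreeBSD",  # assumed entry, for illustration
}

# Generic fallbacks: tested only when nothing specific matched.
GENERIC_OS = {
    "x11": "Unknown X11 System",
}


def classify_os(user_agent):
    """Match specific patterns before generic ones."""
    ua = user_agent.lower()
    # The tuple fixes the order of the two passes explicitly.
    for table in (SPECIFIC_OS, GENERIC_OS):
        for needle, name in table.items():
            if needle in ua:
                return name
    return "Unknown"


print(classify_os("Mozilla/5.0 (X11; U; Linux i686; en-US)"))
print(classify_os("Mozilla/5.0 (X11; U; SomeOtherOS)"))
```

With a single table and unordered iteration, the first sample string could match "x11" before "linux"; splitting the generic patterns out removes that dependency on iteration order.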
> On Robot.pm:
>
> Robots whose user agent string started with "Mozilla/x.0_(compatible" were
> never recognized, as these were assumed to always be browsers, instead of robots...
> In a local log I found two exceptions: "Yahoo! Slurp" and "Grub.org" which
> is a distributed crawler... Because of these two (and likely more) exceptions,
> I made a new hash for this kind of robot that mimics a true browser...
> Furthermore, I added 4 robots found in this local log, which were not
> available in the perl module...
Did you send this to the http://www.robotstxt.org/wc/active.html
maintainer, Martijn Koster?
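The extra hash for browser-mimicking robots might work along these lines. This is a hedged Python sketch, not the Perl from Robot.pm: the table name, function name, return values, and sample user agent strings are assumptions, and only the two robot names come from the description above. The separate table is consulted before the "Mozilla ... (compatible" shortcut that treats the client as a browser.

```python
# Illustrative sketch only; names and return values are hypothetical.

# Robots whose user agent string looks like a browser's.
MIMICKING_ROBOTS = {
    "yahoo! slurp": "Yahoo! Slurp",
    "grub.org": "Grub.org",
}


def classify_agent(user_agent):
    """Return (kind, name); check mimicking robots before assuming a browser."""
    ua = user_agent.lower()
    for needle, name in MIMICKING_ROBOTS.items():
        if needle in ua:
            return ("robot", name)
    # Only now fall back to the browser assumption.
    if ua.startswith("mozilla/") and "(compatible" in ua:
        return ("browser", None)
    return ("unknown", None)


# Sample strings, assumed for the example.
slurp_ua = "Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)"
msie_ua = "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"

print(classify_agent(slurp_ua))
print(classify_agent(msie_ua))
```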
> I have not checked whether I still have CVS write access, but otherwise,
> consider these patches GPL (that's the Lire license, correct?) and I hereby
> transfer any copyright on these patches to the foundation.
OK.
> PS. the new HTML output is not in 1.5, is it?
I believe not, it's planned for Lire 2.0. See
http://logreport.org/lire/roadmap-2.0.php . It is in CVS though, IIRC.
Thanks a _lot_ for the patches, I've just applied them. They'll get
distributed with Lire 2.0 which is expected to get shipped within a
couple of weeks.
Thanks again, Bye,
Joost
--
. . http://logreport.com/
| '.| /^LogReport$/
| Lire http://logreport.org/