high ascii in apache logs

Joost van Baal joostvb at logreport.org
Wed Aug 22 18:45:20 CEST 2001


Hi,

In an apache logfile I've got, this line occurs:

cache23.bora.net - - [14/Jun/2001:03:44:21 +0200] "GET / HTTP/1.0" 200 6577 "http://search.naver.com/search.naver?where=nexearch&history=001003dns008nexearch000000000000&query=powered+by+dns" "Mozilla/4.0 (compatible; MSIE 5.5; Windows NT 5.0; ¾ÆÆ®¹Ìµð¾î(artmedia.org))"

By current lire from stable cvs, this occurs as

cache23.bora.net bora.net - 200 6577 GET / - / HTTP/1.0 992483061 http://search.naver.com/search.naver?where=nexearch&history=001003dns008nexearch000000000000&query=powered+by+dns Mozilla/4.0_(compatible;_MSIE_5.5;_Windows_NT_5.0;_¾ÆÆ®¹Ìµð¾î(artmedia.org))

in the .dlf file, and as

        <name>Mozilla/4.0_(compatible;_MSIE_5.5;_Windows_NT_5.0;_¾ÆÆ®¹Ìµð¾î(artmedia.org))</name>

in the xml file.  When lr_xml2ascii feeds this to Xalan-C to typeset
an ascii report, Xalan-C fails:

 XSLException Type is : TranscodingException
 Message is : An error occurred while transcoding!

I believe it's the high bit ascii which makes xalan choke.

What should we do about this?  Should we allow high ascii in the xml?
Is this valid xml?

Bye,

Joost


-- 
To UNSUBSCRIBE, email to development-request at logreport.org with a subject of
"unsubscribe". Trouble? Send an email with subject "help" to
development-request at logreport.org



More information about the Development mailing list