Brian Foster <blf at utvinternet.ie> writes:
> after a bit of head-scratching, the easiest approach
> seems to be a bit of pre-processing; that is, make
> the two types of spaces unique.
Frankly, the easiest approach is to bite the bullet and go for a
regexp approach. Digging out a bit of old Perl code that did this
job (if for slightly differently formatted log lines):
while (<STDIN>) {
/^([0-9.]+) ([^ ]) ([^ ]) (\[.+\]) \"([^\"]+)\" ([0-9]+) ([0-9]+) \"([^\"]+)\" \"([^\"]+)\"/;
$hitter = $1;
$datestr = $4;
$httpcmd = $5;
$httpres = $6;
$size = $7;
$referer = $8;
$agentid = $9;
}
Brendan
--
Brendan Halpin, Department of Sociology, University of Limerick, Ireland
Tel: w +353-61-213147 f +353-61-202569 h +353-61-338562; Room F2-025 x 3147
mailto:brendan.halpin at ul.iehttp://www.ul.ie/sociology/brendan.halpin.html
Maintained by the ILUG website team. The aim of Linux.ie is to
support and help commercial and private users of Linux in Ireland. You can
display ILUG news in your own webpages, read backend
information to find out how. Networking services kindly provided by HEAnet, server kindly donated by
Dell. Linux is a trademark of Linus Torvalds,
used with permission. No penguins were harmed in the production or maintenance
of this highly praised website. Looking for the
Indian Linux Users' Group? Try here. If you've read all this and aren't a lawyer: you should be!