|
CSE Home |
About Us |
Search |
Contact Info |
The web server creates a set of log files as it runs. At CSE, we currently create these logs:
For www.cs.washington.edu, all of these logs can be read from any host that mounts /cse/www, which includes most or all of the "research" Unix hosts (but not "instructional" hosts; instructional users should see the section on split logs for information on how to get log reports). CSEResearch Windows hosts can get at the logs by mounting the share \\ntdfs\cs\cse\.
Details of the format of common_log are here (local users only).
Here is Apache documentation for mod_log_config.
The error log can be very handy for debugging CGI scripts and detecting broken links.
The anonymized_log (when provided) is provided for research and instructional purposes by users that aren't members of the CSE research community.
error_log is kept for one day after it is created. The log for the previous day can be found in /cse/www/logs/error_log.<date>. common_log and usertrack_log are kept for seven days, in /cse/www/logs/common_log.<date>. This "date" filename extension is in the format YYYYMMDD- for example, 20000202 is the exension for February 2, 2000. suexec_log is entirely removed nightly.
Each night near midnight, a set of split logs are generated. A split log is a file of entries from common_log that contains entries for only a single top-level document tree, and are named according to the tree they represent. For example, rose.log contains entries for accesses to /homes/rose/. Besides the trees in /homes/, split logs are also created for each tree in /research/projects/.
Split logs are stored in /cse/www/logs/split/, and are kept for one day only.
Instructional users, who don't have access to the file systems containing the logs, can subscribe to a daily log report: the split log corresponding to their content will be mailed nightly to each such user for whom a file called .splitinfo is found in their www/ directory. The contents of the file are ignored.
Each night, a report summarizing the logs is generated. Server statistics are maintained here. This report is accessable to local users only.
Those statistics are generated by analog. Local documentation for analog is here.
Publishers of web documents on www.cs.washington.edu can request a weekly report on accesses to their document. It works like this:
The .statinfo files are sought in all the subdirectories of the following URL trees:
which means that /homes/turing/.statinfo will be sought but e.g. /homes/turing/awards/.statinfo will not be. For Dr. Turing, that means that his .statinfo file should be created in ~turing/www/. The report gives a count of the accesses for the past week to each document, plus the names of the refering documents (when available).
So, if, for example, there were fifty accesses to http://www.cs.washington.edu/homes/rose/ in a week, forty of which were from http://www.fbi.gov/least-wanted.html, five of which were from http://www.cs.washington.edu/lab/staff.html, and five with no referer info, the report would contain text roughly like this:
/homes/rose/index.html (50 accesses)
<= http:/www/fbi.gov/least-wanted.html (40 accesses)
<= http://www.cs.washington.edu/lab/staff.html (5 accesses)
To start receiving such reports, a user must create a .statinfo file in a subdirectory of the URL /homes/ or the URL /research/projects/. To turn it off, remove the .statinfo file. To get reports on some other tree not rooted in /homes/ or /research/projects/ contact the webmaster to request that the script be modified to look for .statinfo files there.
|
Computer Science & Engineering University of Washington Box 352350 Seattle, WA 98195-2350 (206) 543-1695 voice, (206) 543-2969 FAX [comments to webmaster] | |