README.webchecker.txt 1.62 KB
Newer Older
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34
Utility able to call wget and varnishlog to extract Headers and return all failures
according expected caching policy.

This utility is configurable through a dictionnary like
   configuration = {'url': 'http://www.example.com',
                   'working_directory': '/home/me/tmp/crawled_content',
                   'varnishlog_binary_path': 'varnishlog',
                   'header_list': {'Last-Modified': True,
                                   'Cache-Control': ('max-age=300', 'max-age=3600',),
                                   'Vary' : ('Accept-Language, Cookie', 'Accept-Language,Cookie'),
                                   'Expires': True,
                                  },
                   'email_address': 'me@example.com',
                   'smtp_host': 'localhost',
                   'debug_level': 'debug',
                  }

url : website to check
working_directory : fetched data will be downloaded
varnishlog_binary_path :  path to varnishlog
header_list : Key == Header id.
              value: if equals to True, it means that header needs to be present in RESPONSE
                     if it is a tuple, the Header value must sastify at least one of the proposed values
email_address : email address to send result
smtp_host : smtp host to use
debug_level : log level of this utility (debug =>very verbose,
                                         info=>normal,
                                         warning=>nothing)

This utility requires wget => 1.12
And a callable varnishlog.
The utility must be run on same server where varnish is running.

web_checker reads varnishlogs to detect if a Query goes to the backend.