24232 patterns, 10469 right anchor strings, 39250 test IPs.
More QA, almost entirely banner tests and class adjustments.
PLEASE NOTE: the sendmail "embedded regex map" distributions below are
among the last I'll be doing; please upgrade to the latest sendmail m4
package, and enable DNSBL lookups instead of the maps. If you need more
time to handle the transition, please let me know, and I'll keep doing
these builds until then. Thanks!
PLEASE NOTE: the patterns_xwalk file and rightanchors file now contain
a new token, 'mixed' for patterns_xwalk and 'MIXED' for rightanchors.
It will increasingly be used to distinguish between two classes: the
existing generic class ("we're not sure what this is, but it is
generic") and the new mixed class ("this is a naming convention shared
by both known dynamic and known static hosts"). I will be going back
through the history of the dataset, looking for cases where a pattern
had a non-generic class that was subsequently reduced to generic, and
changing these to 'mixed'. The token is restricted to a very small
number of strings and anchors at present, but expect more.
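If your own tooling reads the class tokens out of these files, something
like the following rough sketch may help when adding the new token. It
only normalizes a token string; it does not parse either file, since the
file layouts are not described here. The HostClass enum, the
parse_class helper, and the 'dynamic', 'static' and 'generic' spellings
are assumptions made for illustration. Only 'mixed' and 'MIXED' are
spelled out above.

```python
from enum import Enum


class HostClass(Enum):
    # Hypothetical token spellings; only 'mixed'/'MIXED' are confirmed above.
    DYNAMIC = "dynamic"
    STATIC = "static"
    GENERIC = "generic"  # "not sure what this is, but it is generic"
    MIXED = "mixed"      # naming shared by known dynamic and known static hosts


def parse_class(token: str) -> HostClass:
    """Map a class token to a HostClass, ignoring case and whitespace.

    patterns_xwalk uses 'mixed' and rightanchors uses 'MIXED', so both
    spellings resolve to the same member. Unknown tokens raise ValueError.
    """
    return HostClass(token.strip().lower())


if __name__ == "__main__":
    for raw in ("mixed", "MIXED", "generic"):
        print(raw, "->", parse_class(raw).name)
```

Folding case up front means the same code path can serve both files, and
raising on an unknown token makes any further new tokens show up
immediately instead of being silently treated as generic.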
Download them here:
Posted by schampeo at November 27, 2007 10:32 AM