pyzor.digest
Handle digesting the messages.
-
class pyzor.digest.DataDigester(msg, spec=None)
Bases: object
The major workhorse class.
-
atomic_num_lines = 4
-
digest
-
email_ptrn = re.compile('\\S+@\\S+')
-
longstr_ptrn = re.compile('\\S{10,}')
-
min_line_length = 8
-
unwanted_txt_repl = ''
-
url_ptrn = re.compile('[a-z]+:\\S+', re.IGNORECASE)
-
value
-
ws_ptrn = re.compile('\\s')
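To see what the four patterns listed above match, the following minimal sketch recompiles them with the standard `re` module and applies them to a sample line. It does not import pyzor. The helper `strip_unwanted`, the sample text, and the order in which the patterns are applied are assumptions made for illustration, not the library's documented behaviour.

```python
import re

# Patterns and replacement copied from the DataDigester class attributes.
email_ptrn = re.compile(r'\S+@\S+')
url_ptrn = re.compile(r'[a-z]+:\S+', re.IGNORECASE)
longstr_ptrn = re.compile(r'\S{10,}')
ws_ptrn = re.compile(r'\s')
unwanted_txt_repl = ''


def strip_unwanted(line):
    """Hypothetical helper: replace every match of each pattern in one line.

    The application order is an assumption, not taken from the documentation.
    """
    for ptrn in (longstr_ptrn, email_ptrn, url_ptrn, ws_ptrn):
        line = ptrn.sub(unwanted_txt_repl, line)
    return line


sample = "Mail a@b.io or see FTP:x/y abcdefghijkl now"

print(email_ptrn.findall(sample))    # the e-mail-like token
print(url_ptrn.findall(sample))      # scheme-prefixed token, case-insensitive
print(longstr_ptrn.findall(sample))  # runs of 10+ non-whitespace characters
print(strip_unwanted(sample))        # prints: Mailorseenow
```

Note that `longstr_ptrn` also matches any e-mail address or URL of ten or more characters, so in practice the patterns overlap; the short tokens in the sample were chosen so that each pattern matches exactly one of them.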
-
-
class pyzor.digest.HTMLStripper(collector)
Bases: html.parser.HTMLParser
Strip all tags from the HTML.
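The general technique of stripping tags with an `html.parser.HTMLParser` subclass can be sketched as follows. `TagStripperSketch` is a hypothetical stand-in, not pyzor's implementation, and it assumes that `collector` is a list to which the parser appends the text it finds.

```python
from html.parser import HTMLParser


class TagStripperSketch(HTMLParser):
    """Hypothetical stand-in for HTMLStripper: keeps text, drops tags."""

    def __init__(self, collector):
        super().__init__()
        # Assumption: collector is a caller-owned list that receives the text.
        self.collector = collector

    def handle_data(self, data):
        # HTMLParser calls this for text between tags; skip blank chunks.
        data = data.strip()
        if data:
            self.collector.append(data)


collected = []
parser = TagStripperSketch(collected)
parser.feed("<p>Hello <b>world</b></p>")
parser.close()
print(collected)  # prints: ['Hello', 'world']
```

Because the tag handlers are left at their default no-op implementations, only the text content reaches the collector.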
-
class pyzor.digest.PrintingDataDigester(msg, spec=None)
Bases: pyzor.digest.DataDigester
Extends DataDigester: prints out what we’re digesting.