pyzor.digest¶
Handle digesting the messages.
-
class
pyzor.digest.DataDigester(msg, spec=None)[source]¶ Bases:
objectThe major workhouse class.
-
atomic_num_lines= 4¶
-
digest¶
-
email_ptrn= <_sre.SRE_Pattern object>¶
-
longstr_ptrn= <_sre.SRE_Pattern object>¶
-
min_line_length= 8¶
-
unwanted_txt_repl= ''¶
-
url_ptrn= <_sre.SRE_Pattern object>¶
-
value¶
-
ws_ptrn= <_sre.SRE_Pattern object>¶
-
-
class
pyzor.digest.HTMLStripper(collector)[source]¶ Bases:
HTMLParser.HTMLParserStrip all tags from the HTML.
-
class
pyzor.digest.PrintingDataDigester(msg, spec=None)[source]¶ Bases:
pyzor.digest.DataDigesterExtends DataDigester: prints out what we’re digesting.