Spamassassin 3.3.0 a FuzzyOcr

Miroslav Zidek linux na mzidek.net
Pondělí Únor 15 22:13:33 CET 2010


Antonín Kolísek píše v Út 02. 02. 2010 v 07:09 +0100:

>  chtěl bych se zeptat zda někomu z Vás kdo používáte Spamassassin
> funguje plugin FuzzyOcr. U mne
> nebyl u verze SA 3.2.X vůbec problém, ale pod novou verzí SA 3.3.0
> nemohu FOCR za
> žádnou cenu rozběhnout.

je treba stahnout testovaci verzi -
http://fuzzyocr.own-hero.net/wiki/Installation-3.6.x

problem dela modul pro logovani - po zakomentovani
focr_logfile /etc/mail/spamassassin/FuzzyOcr.log 

spamassassin --lint chyby prestane reportovat, a testovaci maily jsou
oznacene jako spam: 

spamassassin --debug FuzzyOcr < ocr-animated.eml > /dev/null
dava vysledek


---- snip -----------

Feb 15 22:10:26.301 [32141] info: FuzzyOcr: Scanset "ocrad" found word
"price" with fuzz of 0.0000
Feb 15 22:10:26.302 [32141] info: FuzzyOcr: line: "alert www price up
lls www alert"
Feb 15 22:10:26.311 [32141] info: FuzzyOcr: Scanset "ocrad" found word
"company" with fuzz of 0.0000
Feb 15 22:10:26.311 [32141] info: FuzzyOcr: line: "company red
reeflaboratories lnc"
Feb 15 22:10:26.313 [32141] info: FuzzyOcr: Scanset "ocrad" found word
"alert" with fuzz of 0.0000
Feb 15 22:10:26.313 [32141] info: FuzzyOcr: line: "alert www price up
lls www alert"
Feb 15 22:10:26.335 [32141] info: FuzzyOcr: Scanset "ocrad" found word
"news" with fuzz of 0.0000
Feb 15 22:10:26.336 [32141] info: FuzzyOcr: line: "will be releasing big
newstomorrow morning and we hopeto seethis kind"
Feb 15 22:10:26.341 [32141] dbg: FuzzyOcr: Enough OCR Hits without space
stripping, skipping second matching pass...
Feb 15 22:10:26.341 [32141] info: FuzzyOcr: Scanset "ocrad" generates
enough hits (4), skipping further scansets...
Feb 15 22:10:26.341 [32141] info: FuzzyOcr: Message is spam, score =
9.000

------- snip ------------


pokusim se prijit i na to logovani a kdyz tak dam vedet


MZ




Další informace o konferenci Linux