[Imc-tech] Perl and SA

Arun Bhalla bhalla at uiuc.edu
Thu Jun 10 18:47:04 CDT 2004


And um, I renamed the /tmp/bayes/old directory to /tmp/bayes/upgrade,
so it would be more intuitive. ;-)

A good way to test whether SpamAssassin is happy with the
"upgraded" DB files after dropping them in their expected location:
spamassassin -d {old-email}  | spamassassin -D

Arun

Arun Bhalla writes:
> On a whim, I tried db4.0_upgrade, and it seems to work.  (You're right
> that db_dump/db4.0_load doesn't work -- I imagine the versions dump
> in slightly different formats, and db4.0_load never complains.)
> 
> My work is at /tmp/bayes/old.  I imagine you could drop them in place.
> 
> \u@\h:\w\$ sa-learn -D --dbpath `pwd`  --dump all |& less
> [...]
> debug: bayes: 26649 tie-ing to DB file R/O /tmp/bayes/old/bayes_toks
> debug: bayes: 26649 tie-ing to DB file R/O /tmp/bayes/old/bayes_seen
> debug: bayes: found bayes db version 2
> 0.000          0          2          0  non-token data: bayes db version
> 0.000          0     117688          0  non-token data: nspam
> 0.000          0      64911          0  non-token data: nham
> 0.000          0     143062          0  non-token data: ntokens
> 0.000          0 1086293280          0  non-token data: oldest atime
> 0.000          0 1086732234          0  non-token data: newest atime
> 0.000          0 1086731609          0  non-token data: last journal sync ati
> me
> 0.000          0 1086638818          0  non-token data: last expiry atime
> 0.000          0     345600          0  non-token data: last expire atime del
> ta
> 0.000          0      25528          0  non-token data: last expire reduction
>  count
> 0.002          4       1058 1086727778  H*r:66.218.67
> 0.551       2077        932 1086724589  largest
> 0.016        122       4096 1086721186  N:H*x:FN.N
> 0.696        129         31 1086725653  anti
> [...]
> 
> 
> Looks good!
> 
> Arun
> 
> 
> "Zachary C. Miller" writes:
> > Another unexpected victim of the Perl upgrade was Spamassassin's
> > bayesian filtering.
> > 
> > Perl 5.6.1 used libdb2.
> > Perl 5.8.4 uses libdb4.0. 
> > 
> > Files created by these versions are completely incompatible. 
> > 
> > If you find that you have some problems with Perl and any other old
> > DB_File files please let me know and I'll see what I can do about
> > fixing them.
> > 
> > I made some attempt to use db4.0_upgrade or db_dump/db4.0_load but
> > neither worked. So in the end I had to just delete the
> > /home/spamfilt/.spamassassin/bayes_* files and let the autolearner
> > start from scratch.
> > 
> > This is why there has been increased spam volume over the past several
> > days (since I upgraded SA). Spam volume should slowly but surely go
> > down over the next few weeks as the bayesian filter relearns about all
> > the variations of spelling for viagra and such. 
> > 
> > Unrelatedly, since yesterday there was a typo in
> > /etc/spamassassin/local.cf which was my fault and which I have now
> > fixed. That one invalidating some (but not all) of SA's rules and the
> > errors caused by it filled up /var. Doh!
> > 
> > -- 
> > Zachary C. Miller - @= - http://wolfgang.groogroo.com/
> > IMSA 1995 - UIUC 2000 - Just Another Leftist Muppet - Ya Basta!
> >  Social Justice, Community, Nonviolence, Decentralization, Feminism,
> >  Sustainability, Responsibility, Diversity, Democracy, Ecology
> > _______________________________________________
> > Imc-tech mailing list
> > Imc-tech at urbana.indymedia.org
> > http://lists.cu.groogroo.com/cgi-bin/listinfo/imc-tech
> > 
> 
> --
> Arun Bhalla
> 
> _______________________________________________
> Imc-tech mailing list
> Imc-tech at urbana.indymedia.org
> http://lists.cu.groogroo.com/cgi-bin/listinfo/imc-tech
> 

--
Arun Bhalla



More information about the Imc-tech mailing list