[Imc-tech] Perl and SA
Arun Bhalla
bhalla at uiuc.edu
Thu Jun 10 18:47:04 CDT 2004
And um, I renamed the /tmp/bayes/old directory to /tmp/bayes/upgrade,
so it would be more intuitive. ;-)
A good way to test whether SpamAssassin is happy with the
"upgraded" DB files after dropping them in their expected location:
spamassassin -d {old-email} | spamassassin -D
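
For anyone repeating this, the steps from this thread (work on copies in a scratch directory, run the upgrade tool on each file, then check the result) could be sketched roughly as below. This is only a sketch under assumptions: the function name upgrade_bayes_copies and the UPGRADE_CMD variable are made up for illustration, and it assumes the upgrade tool accepts a DB file path as its argument, as db4.0_upgrade did here.

```shell
#!/bin/sh
# Sketch: upgrade *copies* of the Bayes DB files, leaving the originals alone.
# upgrade_bayes_copies and UPGRADE_CMD are hypothetical names for illustration.
# UPGRADE_CMD defaults to db4.0_upgrade, the tool mentioned in this thread.
upgrade_bayes_copies() {
    src_dir=$1    # directory holding the old bayes_* files
    work_dir=$2   # scratch directory, e.g. /tmp/bayes/upgrade
    upgrade_cmd=${UPGRADE_CMD:-db4.0_upgrade}

    mkdir -p "$work_dir" || return 1
    for f in "$src_dir"/bayes_toks "$src_dir"/bayes_seen; do
        # Stop early if an expected file is missing.
        [ -f "$f" ] || { echo "missing: $f" >&2; return 1; }
        # Copy first, then upgrade the copy in place.
        cp -p "$f" "$work_dir"/ || return 1
        "$upgrade_cmd" "$work_dir/$(basename "$f")" || return 1
    done
}
```

Something like `upgrade_bayes_copies /home/spamfilt/.spamassassin /tmp/bayes/upgrade` followed by `sa-learn -D --dbpath /tmp/bayes/upgrade --dump all` would then give the same kind of dump as in the quoted message, before anything is dropped into place.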
Arun
Arun Bhalla writes:
> On a whim, I tried db4.0_upgrade, and it seems to work. (You're right
> that db_dump/db4.0_load doesn't work -- I imagine the versions dump
> in slightly different formats, and db4.0_load never complains.)
>
> My work is at /tmp/bayes/old. I imagine you could drop them in place.
>
> \u@\h:\w\$ sa-learn -D --dbpath `pwd` --dump all |& less
> [...]
> debug: bayes: 26649 tie-ing to DB file R/O /tmp/bayes/old/bayes_toks
> debug: bayes: 26649 tie-ing to DB file R/O /tmp/bayes/old/bayes_seen
> debug: bayes: found bayes db version 2
> 0.000 0 2 0 non-token data: bayes db version
> 0.000 0 117688 0 non-token data: nspam
> 0.000 0 64911 0 non-token data: nham
> 0.000 0 143062 0 non-token data: ntokens
> 0.000 0 1086293280 0 non-token data: oldest atime
> 0.000 0 1086732234 0 non-token data: newest atime
> 0.000 0 1086731609 0 non-token data: last journal sync atime
> 0.000 0 1086638818 0 non-token data: last expiry atime
> 0.000 0 345600 0 non-token data: last expire atime delta
> 0.000 0 25528 0 non-token data: last expire reduction count
> 0.002 4 1058 1086727778 H*r:66.218.67
> 0.551 2077 932 1086724589 largest
> 0.016 122 4096 1086721186 N:H*x:FN.N
> 0.696 129 31 1086725653 anti
> [...]
>
>
> Looks good!
>
> Arun
>
>
> "Zachary C. Miller" writes:
> > Another unexpected victim of the Perl upgrade was Spamassassin's
> > bayesian filtering.
> >
> > Perl 5.6.1 used libdb2.
> > Perl 5.8.4 uses libdb4.0.
> >
> > Files created by these versions are completely incompatible.
> >
> > If you find that you have some problems with Perl and any other old
> > DB_File files please let me know and I'll see what I can do about
> > fixing them.
> >
> > I made some attempt to use db4.0_upgrade or db_dump/db4.0_load but
> > neither worked. So in the end I had to just delete the
> > /home/spamfilt/.spamassassin/bayes_* files and let the autolearner
> > start from scratch.
> >
> > This is why there has been increased spam volume over the past several
> > days (since I upgraded SA). Spam volume should slowly but surely go
> > down over the next few weeks as the bayesian filter relearns about all
> > the variations of spelling for viagra and such.
> >
> > Unrelatedly, since yesterday there was a typo in
> > /etc/spamassassin/local.cf which was my fault and which I have now
> > fixed. That typo invalidated some (but not all) of SA's rules, and the
> > errors it caused filled up /var. Doh!
> >
> > --
> > Zachary C. Miller - @= - http://wolfgang.groogroo.com/
> > IMSA 1995 - UIUC 2000 - Just Another Leftist Muppet - Ya Basta!
> > Social Justice, Community, Nonviolence, Decentralization, Feminism,
> > Sustainability, Responsibility, Diversity, Democracy, Ecology
> > _______________________________________________
> > Imc-tech mailing list
> > Imc-tech at urbana.indymedia.org
> > http://lists.cu.groogroo.com/cgi-bin/listinfo/imc-tech
> >
>
> --
> Arun Bhalla
>
--
Arun Bhalla