[CUWiN-Dev] hslsd updates coming

David Young dyoung at pobox.com
Thu Sep 22 17:26:44 CDT 2005


On Thu, Sep 22, 2005 at 04:17:20PM -0500, Bill Comisky wrote:
> On Wed, 21 Sep 2005, David Young wrote:
> 
> >I have found some hslsd bugs by watching the Race Street network, which 
> >keeps growing with Tom Wiltzius' help, and by watching the indoor 
> >testbed. I have some fixes under development.
> >
> >Dave
> 
> I've seen recently a few occasions where a node will reboot frequently, 
> though the intervals vary; sometimes hours between reboots and sometimes 
> minutes.  The ETX metric and beacon strength to the gateway from the node 
> in question looks like a solid link, and there is typically a fair amount 
> of traffic on the wireless network at the time... I have some cron jobs 
> fetching files on a few nodes, including the rebooting one.
> 
> Once the node has rebooted, the evidence for what happened is gone, but I 
> have seen an hslsd segfault before, in dmesg output and /var/core/hslsd.* 
> files.  Is this symptomatic of the bugs you've found?

I know of a rare condition where hslsd will segfault.  I'm working on
a fix in the ls-refcnt-hsls branch.  There may be other conditions, too.

If hellowdog finds that hslsd isn't running, it should not stop the
watchdog tickle, but it should restart hslsd.  I guess it's possible hslsd
will fail to restart if, say, the memory disk is full of core files....

> I could tweak hellowdog to scp over some information before rebooting 
> (core files, dmesg output, etc); I guess you'd need the unstripped 
> binaries too.  Let me know if this would be useful.

That would be very useful.

Now and again I have mentioned adding build stages that tar up the object
directories ($BUILDDIR/O/, $BUILDDIR/Z/) for debugging later.

Dave

-- 
David Young             OJC Technologies
dyoung at ojctech.com      Urbana, IL * (217) 278-3933


More information about the CU-Wireless-Dev mailing list