[CUWiN-Dev] hslsd updates coming
David Young
dyoung at pobox.com
Thu Sep 22 17:26:44 CDT 2005
On Thu, Sep 22, 2005 at 04:17:20PM -0500, Bill Comisky wrote:
> On Wed, 21 Sep 2005, David Young wrote:
>
> >I have found some hslsd bugs by watching the Race Street network, which
> >keeps growing with Tom Wiltzius' help, and by watching the indoor
> >testbed. I have some fixes under development.
> >
> >Dave
>
> I've seen recently a few occasions where a node will reboot frequently,
> though the intervals vary; sometimes hours between reboots and sometimes
> minutes. The ETX metric and beacon strength to the gateway from the node
> in question looks like a solid link, and there is typically a fair amount
> of traffic on the wireless network at the time... I have some cron jobs
> fetching files on a few nodes, including the rebooting one.
>
> Once the node has rebooted, the evidence for what happened is gone, but I
> have seen an hslsd segfault before, in dmesg output and /var/core/hslsd.*
> files. Is this symptomatic of the bugs you've found?
I know of a rare condition where hslsd will segfault. I'm working on
a fix in the ls-refcnt-hsls branch. There may be other conditions, too.
If hellowdog finds that hslsd isn't running, it should not stop the
watchdog tickle, but it should restart hslsd. I guess it's possible hslsd
will fail to restart if, say, the memory disk is full of core files....
> I could tweak hellowdog to scp over some information before rebooting
> (core files, dmesg output, etc); I guess you'd need the unstripped
> binaries too. Let me know if this would be useful.
That would be very useful.
Now and again I have mentioned adding build stages that tar up the object
directories ($BUILDDIR/O/, $BUILDDIR/Z/) for debugging later.
Dave
--
David Young OJC Technologies
dyoung at ojctech.com Urbana, IL * (217) 278-3933
More information about the CU-Wireless-Dev
mailing list