[CUWiN-Dev] Existing CUWiN monitoring tools and proposed work for monitoring tools

Sascha Meinrath sascha at ucimc.org
Wed Aug 9 13:02:51 CDT 2006


Chase Phillips wrote:
> Nagios provides a nice extensible framework.  It won't assist you with
> incident cause and correlation, but if what you need is straight
> monitoring, it's your app.  Another monitoring tool I've been exposed
> to is Big Brother, but I wouldn't recommend that over Nagios.
> 
> I've heard mixed reviews of cfengine for configuration management.  It
> might be overpowered (or mispowered) for our purposes.  Depending on
> what features are needed, we could start just by scp'ing a master
> config file out to nodes.  Another idea is to have a scheduled process
> on the nodes retrieve their config file via HTTP on a regular basis.
> 
> WNMap or Funkfeuer Redeemer can provide a slick 'at-a-glance' view of
> the network.  Last I checked, Redeemer was further along in managing
> and visualizing current network information but was only available in
> German.  The Funkfeuer folks are proficient in English, so they may
> assist in the i18n effort if it meant CUWiN would like to use their stuff.
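
As a rough illustration of the "scheduled process retrieves its config
file via HTTP" idea quoted above, here is a minimal Python sketch.  The
function names, the URL and the destination path are all hypothetical and
not part of any CUWiN package; it only shows the fetch, compare and
atomic-replace steps such a job might perform.

```python
import hashlib
import os
import tempfile
import urllib.request


def fetch_config(url, timeout=10):
    """Download the config body as bytes (the URL is a hypothetical master-config address)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def update_config(new_bytes, dest_path):
    """Install new_bytes at dest_path only if the content changed.

    Returns True if the file was written, False if it was already current.
    """
    try:
        with open(dest_path, "rb") as f:
            old_bytes = f.read()
    except FileNotFoundError:
        old_bytes = None

    if old_bytes is not None:
        if hashlib.sha256(old_bytes).digest() == hashlib.sha256(new_bytes).digest():
            return False  # nothing to do

    # Write to a temp file in the same directory, then rename over the
    # destination so a reader never sees a half-written config.
    dest_dir = os.path.dirname(os.path.abspath(dest_path))
    fd, tmp_path = tempfile.mkstemp(dir=dest_dir)
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(new_bytes)
        os.replace(tmp_path, dest_path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
    return True
```

A cron job on each node could then call something like
update_config(fetch_config("http://config.example.net/node.conf"),
"/etc/node.conf") and restart the affected daemons only when it returns
True.  Both the URL and the path here are placeholders.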

I would just add that we have an open invitation to work with Redeemer 
developers (and the FunkFeuer crew).  This might be a good place to combine efforts.

--Sascha

> On 8/9/06, dan blah <dan.blah at gmail.com> wrote:
> 
>> what would really be cool would be porting over something like nagios
>> (http://www.nagios.org/) and cfengine (http://www.cfengine.org/) into
>> our package.  both are open source and are used widely for creating
>> 'self-healing' networks.
>>
>> On 8/9/06, dan blah <dan.blah at gmail.com> wrote:
>> > On 8/9/06, Ross Musselman <rgmussel at cuwireless.net> wrote:
>> > > Just to fill in the gap for those who want more context....
>> > >
>> > > This work is part of the CRI grant that we are working on with Jennifer
>> > > Hou of UIUC. The link for the CRI RFP is here:
>> > > http://www.nsf.gov/pubs/2004/nsf04588/nsf04588.htm
>> > > The idea is to create a suite of monitoring and administration tools
>> > > that can be used on a CUWiN network, and there is a particular emphasis
>> > > on the management of a test bed, since that is one of the big
>> > > deliverables on the grant.
>> > >
>> > > Dan, I don't think it's fair to say that there are no tools. It is
>> > as it is now, i don't know of any monitoring tools that are included
>> > with cuwin.  anyone can compile 3rd party software in on their own...
>> > but we don't now.
>> >
>> > > better to say that there are no integrated tools. What our suite needs
>> > > to do is integrate the tools to provide better monitoring and
>> > > administration, right? The proposal will need to flesh out the idea that
>> > > the suite is significantly better than the sum of its parts.
>> > >
>> > > As far as the actual functions of the suite, your list looks good to me
>> > > (but what do I know).
>> > >
>> > > Ross
>> > >
>> > > dan blah wrote:
>> > > > i have to have this *formalized* by tomorrow.  any additions,
>> > > > subtractions, edits let me know.
>> > > >
>> > > > existing tools...
>> > > > none <insert tool here>
>> > > >
>> > > > proposed work...
>> > > > monitoring of network service status (inetd, zebra, hslsd, dhcpselect,
>> > > > wdogctl, etc)
>> > > > monitoring of node resources (processor load, disk and memory usage,
>> > > > running processes, network load)
>> > > > monitoring of environmental factors such as node temperature
>> > > > monitoring of system state changes (network interface changes,
>> > > > configuration file add/delete/changes, file/drive permission changes)
>> > > > list of node clients with basic client information
>> > > > web interface for viewing current network status, problem history, log
>> > > > files (/var/log/daemon, /var/log/messages), application and kernel
>> > > > routes
>> > > > web interface for visible nodes (parsing of /var/db/linkstates, routeviz)
>> > > > web interface authorization
>> > > > ability to define event handlers to be run during service or host
>> > > > events for proactive problem resolution and self healing nodes
>> > > > contact notification via email when service or host problems occur and
>> > > > get resolved
>> > > > local node engine viewable from local httpd or console with the
>> > > > ability to be queried from a remote management engine
>> > >
>> > >
>> > >
>> >
>> >
>> > --
>> > Daniel
>> >
>>
>>
>> -- 
>> Daniel
>> _______________________________________________
>> CU-Wireless-Dev mailing list
>> CU-Wireless-Dev at lists.cuwireless.net
>> http://lists.chambana.net/cgi-bin/listinfo/cu-wireless-dev
>>
> 
> 
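
To make two items from the proposed work list quoted above more
concrete (monitoring of network service status, and event handlers that
run when a service goes down or comes back), here is a small Python
sketch.  ServiceMonitor, is_running and the handler arguments are
hypothetical names chosen for illustration, not existing CUWiN or Nagios
code, and the default probe assumes a pgrep command is installed on the
node.

```python
import subprocess


def is_running(name):
    """Return True if a process with exactly this name exists (assumes pgrep is available)."""
    result = subprocess.run(
        ["pgrep", "-x", name],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


class ServiceMonitor:
    """Poll a list of services and fire handlers on state changes only."""

    def __init__(self, services, probe=is_running, on_down=None, on_recover=None):
        self.services = list(services)
        self.probe = probe
        self.on_down = on_down or (lambda name: None)
        self.on_recover = on_recover or (lambda name: None)
        self.state = {}  # service name -> last observed status (True = up)

    def poll(self):
        """Probe every service once and return a copy of the current status map."""
        for name in self.services:
            up = bool(self.probe(name))
            was_up = self.state.get(name, True)  # treat unseen services as up
            if was_up and not up:
                self.on_down(name)       # e.g. restart the daemon, send email
            elif not was_up and up:
                self.on_recover(name)    # e.g. send a "resolved" notice
            self.state[name] = up
        return dict(self.state)
```

For example, ServiceMonitor(["inetd", "zebra", "hslsd"], on_down=restart)
could be polled from a loop or a cron job, where restart is whatever
recovery action a node needs.  Handlers fire only on transitions, so a
service that stays down does not generate a notification on every poll.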

-- 
Sascha Meinrath
Executive Director  *  Principal          *  President
CUWiN              *** The Ethos Group   *** Acorn Active Media
CUWireless.Net      *  EthosWireless.com  *  AcornActiveMedia.com

