CPR: Campus-Wide Network Performance Monitoring and Recovery.

Thursday, 02-Jul-2009 21:35:00 EDT

What is CPR?

Reliable and stable network performance is essential, but technologies are getting more complicated. Ability to detect, trouble-shoot and resolve problems before they become major campus issues should be an integral part of the service provided by the backbone team.

Current Status of the Project

The CPR team proposes to deploy hosts across campus to provide the access and information to dramatically improve the way network services does business. The hosts will probably run Linux, but the only requirement is that they have sshd for access, and can run the required measurement tools. Specific options for the CPR hosts are being evaluated. The proposal is to widely deploy hosts at all levels of the Georgia Tech network. A CPR host will be connected to each of the gateway-routers (e.g. cpr-gateway1.ns.gatech.edu), the Campus routers (e.g. cpr-campus1.ns.gatech.edu) and the core routers (e.g. cpr-ni.ns.gatech.edu). In addition, as many buildings and departments as possible should be monitored. In future, purchase of a CPR host should be among the standard equipment recommended by network services.

The hosts should be connected to the local router or switch and connected to a terminal server for access even if the device is isolated from the network. The CPR team recommends the purchase of a dedicated terminal server for this project.

A member of the CPR team will need to oversee maintenance and security of the boxes. Perhaps using student help to undertake the tasks. Automated patching, perhaps using YUM (The Yellow-dog Update manager) or a more sophisticated self-patching mechanism will be investigated as the project develops.

This page was last modified on Tuesday, 23-Jan-2007 13:40:47 EST .