A Large Federal Agency

“NetMRI allows the agency to be much more productive with the same staff. They ensure that everything stays up with best practices without needing a person to do that.”

—Marty Adkins, Senior Consultant, Chesapeake NetCraftsmen

NetMRI Monitors Network in Real Time across 800 Nationwide Sites

The communications team at a large federal agency has its work cut out for it. With a multibuilding campus in Washington, D.C. and 800 locations nationwide, network change is a constant challenge.

A 10-person group manages more than 2,200 network devices. In the past, as network complexity increased, so did the number of problems. Configuration errors disrupted traffic flow and application availability, which hurt employee productivity.

Over the years, the agency tried various network management solutions but still found itself in reactive mode much of the time. To complicate matters, as a government body it must also be prepared for internal audits by the Inspector General.

“The agency has good processes and a competent staff, but when we’re talking about that many devices and sites, there are plenty of opportunities for things to go awry,” said Marty Adkins, senior consultant, Chesapeake NetCraftsmen, a mid-Atlantic area network consulting firm that has been helping the agency cope with these problems. “They needed a way to audit continuously—not just when someone could catch their breath and search.”

Monitoring the Live Network 24/7

Working with Chesapeake NetCraftsmen, the agency was an early adopter of Infoblox’s NetMRI, which automates network management by correlating the impact of network configuration issues to network health, and identifying network problems early on. NetMRI enables organizations to take control of configurations and changes—making it easy to identify hard-to-find problems.

“I have not seen any other tools that analyze a live network, that actually monitor the live network 24/7,” Adkins said. “NetMRI finds things that are not yet service-impacting and alerts you to them. This solution allows us to continually monitor the impact of network changes on correctness, compliance and availability.”

Without any customization, NetMRI alerts the agency to issues that may impact network health and provides remediation options. The solution takes automation a step beyond just checking configurations; it actually logs into devices and issues scripted commands as a staff member would. “It’s quite extensible with the scripting capability,” Adkins added.

With NetMRI, the team flexibly groups devices into logical categories by geography, function and sphere of control, with overlaps, allowing someone to isolate specific devices quickly. The solution’s discovery engine intelligently and quickly assesses devices and the relationships between them.

NetMRI constantly monitors for change, alerting operators immediately to what changed, where and by whom. NetMRI assesses the resulting configurations to proactively identify inconsistent or incorrect settings, and facilitates remediation for fast issue resolution. When the team pushes changes out, it can do so quickly to hundreds of offices nationwide and verify that the changes are correct.
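To make the idea of change monitoring concrete, the sketch below compares two snapshots of a device configuration and reports only the lines that were removed or added. It is a minimal illustration built on Python's standard difflib module, not NetMRI's actual implementation or API; the function name `changed_lines` and the sample configuration text are assumptions made up for this example.

```python
import difflib


def changed_lines(before: str, after: str) -> list[str]:
    """Return the removed ("- ") and added ("+ ") lines between two snapshots.

    Hypothetical helper for illustration only; it does not reflect how
    NetMRI stores or compares configurations.
    """
    delta = difflib.ndiff(before.splitlines(), after.splitlines())
    # ndiff also emits unchanged lines ("  ") and hint lines ("? "); drop both.
    return [line for line in delta if line.startswith(("- ", "+ "))]


# Invented sample snapshots of one interface, before and after a change.
before = (
    "hostname branch-rtr-01\n"
    "interface Gi0/1\n"
    " description WAN uplink\n"
    " no shutdown\n"
)
after = (
    "hostname branch-rtr-01\n"
    "interface Gi0/1\n"
    " description WAN uplink\n"
    " shutdown\n"
)

changes = changed_lines(before, after)
for line in changes:
    print(line)
```

A real monitoring system would also record which device the snapshot came from and who made the change; this sketch covers only the "what changed" part.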

Finding Issues – Before they Become Problems

NetMRI serves as a constant monitor. In the first few hours of deployment at the agency, it found specific issues such as:

  • Configuration errors before going live
  • Over-temperature conditions
  • Redundant power-supply disconnects
  • Redundant link outages
  • Unstable or marginal WAN links and VPN connections
  • Spanning tree instability
  • Device crashes in remote offices

Not only does NetMRI find issues, but it also gives the communications staff “post-mortem” analysis to understand why the event might have happened.

NetMRI also simplifies annual Inspector General (IG) compliance audits for the communications team. In a recent audit, the IG team asked if they could find configuration changes on a specific device for the past year. NetMRI exceeded that with a history of configuration changes on a device going all the way back to the initial installation—impressing the IG team.

More Uptime, Same Size Staff

With NetMRI, the agency operates at a level that would otherwise require more staff. For code upgrades, NetMRI helped the communications team quickly identify what-ifs, determine space for new images and pinpoint what to delete, giving them confidence that everything was in place before reloading. Logs then showed any problems or reload needs.

“We used NetMRI to do code upgrades on Cisco devices in a rather bulletproof fashion—all at an extremely high rate,” Adkins said. “In just a few hours, we can do hundreds of devices and have complete details and logs.”

Ultimately, that means fiscal efficiency for the government agency and, with fewer outages, greater productivity for employees nationwide.

“NetMRI allows the agency to be much more productive with the same staff,” Adkins said. “They ensure that everything stays up with best practices without needing a person to do that.”

Profile

The Customer

A Large Federal Agency

Locations

800 offices nationwide

Solution

NetMRI

Partner

Chesapeake NetCraftsmen

Results

  • A more automated network management practice
  • Code upgrades on hundreds of devices in a few hours
  • Complete history of all device changes to support audits
  • Fewer outages with proactive alerts about issues
