Job Description:
• Provide technical direction and leadership to the 24x7 Application Assurance team, which monitors, responds to, and repairs data center, server system, and application issues.
• Operate and oversee a team of NOC Assurance technicians who monitor and respond to application, system, and data center anomalies and failures on a 24x7 basis.
• Oversee and support Metronet NOC technicians daily, providing direction on workload priorities, ticket queues, and technical assistance as required.
• Lead and manage a diverse team of technicians, providing mentorship, guidance, and support to ensure project, stakeholder, team, and individual contributor success.
• Collaborate with seasoned architects and engineers to solve problems, challenge assumptions, review the status quo, evaluate solutions, determine priorities, and swiftly drive advancements.
• Provide technical assistance and guidance with a high sense of urgency during system and application events.
Requirements:
• Bachelor’s degree or equivalent experience in server reliability engineering, computer science, information systems, or business.
• 5+ years of technical leadership in the data center, server reliability, IT infrastructure, virtualization, and/or application assurance space, supporting service provider and/or enterprise ecosystems.
• Experience with monitoring and observability software (Grafana, Dynatrace, SolarWinds, Zabbix, Nagios, etc.), ticketing systems, and other infrastructure and network management tools.
• A solid understanding of the physical data center, server and storage infrastructure, virtualization, operating system, and application stack.
• Must be legally authorized to work in the U.S.
Benefits:
• 80% of medical premiums paid by the company
• Company-paid disability and life insurance
• 401(k) company match with immediate vesting
• Discounted services within our coverage areas