Senior DevOps Engineer
What We Seek
We are looking for a Senior DevOps Engineer to help ensure that the systems our students and teachers rely on every day are reliable, secure, scalable, observable, and high performing. This is a hands-on engineering role for someone who enjoys improving systems, reducing toil through automation, enabling developers, and strengthening operational excellence. You will work across CI/CD, infrastructure as code, observability, incident response, and cloud reliability, while helping modernize legacy practices and improve the developer experience.
You should be a strong communicator who collaborates well in a distributed environment and is comfortable partnering across engineering, QA, support, and product teams. We value people who are practical, curious, accountable, and motivated by continuous improvement. Our environment includes AWS, Jenkins, CloudFormation, ECS/Fargate, GitHub, Jira, Splunk, New Relic, Cortex.io, Slack, Snowflake, Databricks, and other modern engineering tools and platforms.
Location is flexible! This role can be based in our waterfront office in St. John's, NL, or work remotely from anywhere within Canada. Candidates must already reside in Canada. No visa sponsorship is available.
What Your Day Will Look Like
DevOps, CI/CD and Developer Enablement
- Develop and maintain Jenkins shared libraries and Jenkins pipelines using Groovy
- Improve build, test, and deployment workflows to make software delivery more reliable and efficient
- Partner with development and QA teams to support internal development and test environments
- Help teams adopt better engineering practices around release quality, automation, and deployment confidence
- Support Carnegie Learning's evolution toward a platform engineering model that improves developer experience and operational excellence, using Cortex.io to centralize service cataloging, ownership, standards, and engineering insights
Infrastructure and Platform Engineering
- Build and manage AWS infrastructure using Infrastructure as Code, primarily CloudFormation
- Design, deploy, and improve secure, scalable cloud environments
- Troubleshoot infrastructure and platform issues independently and drive long-term fixes
- Help modernize legacy tooling and operational practices
Production Reliability and Operations
- Design and implement monitoring, alerting, trend analysis, and self-healing capabilities
- Support SLIs and SLOs and help teams use reliability metrics to improve service health
- Monitor and respond to alerts and production issues across applications and infrastructure
- Participate in incident response and post-incident reviews, identifying both technical and process improvements
- Assist support and engineering teams with log analysis, troubleshooting, and root cause investigation
Collaboration and Ownership
- Work effectively in a remote-first environment using tools like Slack, Jira, and shared documentation
- Keep tasks, documentation, and operational runbooks current
- Communicate clearly during both planned technical sessions and real-time incident situations
- Contribute to a strong culture of teamwork, accountability, and customer focus
- Participate in a monthly on-call rotation (after-hours work is light but occasionally needed)
What Should Be In Your Bookbag
- 5+ years of experience building and operating production-grade cloud solutions, preferably in AWS
- Cloud certification beyond Practitioner level, such as SysOps, DevOps, Solutions Architect, or Security
- Strong hands-on experience with Jenkins, including Jenkins DSL, plugin ecosystem, CI/CD pipelines, and Git-based workflows
- Strong scripting and automation skills, including Groovy and at least one additional language such as Python, Go, Java, or Bash
- Experience with web applications and modern frameworks/languages such as JavaScript, TypeScript, Angular, Node.js, Django, or Laravel
- Strong troubleshooting skills across the SDLC, including failed builds, pipeline issues, and infrastructure bottlenecks
- Experience designing and implementing secure AWS infrastructure using Infrastructure as Code, preferably CloudFormation
- Hands-on experience with Docker, containers, and container orchestration, especially ECS/Fargate
- Experience with high availability, load balancing, and content delivery platforms and practices
- Strong cloud security and networking experience, including least-privilege access models, IAM policies, and secure infrastructure design
- Experience with observability, logging, and performance monitoring tools for troubleshooting and capacity planning, preferably Splunk and New Relic
- Experience with production change management, including rollback planning and documentation
- Strong communication, presentation, and customer service skills, with the ability to work independently and solve complex technical problems
- Experience leveraging AI-powered tools or platforms to improve operational efficiency, troubleshooting, automation, developer experience, or service reliability
Bonus points for experience with:
- Databases (writing queries, troubleshooting, performance tuning, and monitoring)
- DevSecOps, including integrating code analysis and vulnerability scanning tools into CI/CD pipelines, and familiarity with cybersecurity and regulatory frameworks (e.g., NIST, SOC 2, ISO, COBIT)
What Gives Us Purpose
Others teach what; we teach why. For over 25 years, Carnegie Learning has developed solutions that help students achieve Aha moments and set them up for long-term learning success. We’re driven to find new, better ways to support teachers and reach more students, so we put our leading cognitive science research and real-world insights into solutions for how students think, learn, and do their best. This is a key part of our commitment to helping teachers spark student curiosity. As a Carnegie Learning team member, you’ll work alongside a team of passionate individuals dedicated to making a real difference in the lives of students and educators.
What We Provide
- Holistic Wellbeing
  - Cost-shared health and dental benefits plan
  - Competitive Retirement Savings Matching Program to plan for your future
- Fostering Joy
  - Flexible work arrangements with our Work From Anywhere Policy
  - Your Time, Your Way: paid time off that you can use as you see fit to recharge and nurture your personal life
- Empowering Parenthood
  - Top-Up Parental Leave
  - Reduced working hours on full pay for new parents
  - Free access to CL products for employees and their children
- A Place for Connection
  - Quarterly Wellness Incentives
  - Monthly employee activities and recognition program
  - Employee Allyship Groups (EAGs)
What We Believe
We celebrate the unique attributes, characteristics, and perspectives that define each person's individuality. This fusion of perspectives enriches our collective knowledge, fosters innovation, and empowers us all. Together, we can more effectively address the issues that face our business and industry.
If our commitment to building an inclusive workplace resonates with you, we invite you to join our mission and apply. Carnegie Learning is an Equal Opportunity Employer, including veterans and individuals with disabilities.