Senior Site Reliability Engineer
Loblaw Companies Limited
Date: 3 days ago
City: Brampton, ON
Contract type: Full time

Come make your difference in communities across Canada, where authenticity, trust and making connections is valued – as we shape the future of Canadian retail, together. Our unique position as one of the country's largest employers, coupled with our commitment to positively impact the lives of all Canadians, provides our colleagues a range of opportunities and experiences to help Canadians Live Life Well.
At Loblaw Companies Limited, we succeed through collaboration and commitment and set a high bar for ourselves and those around us. Whether you are just starting your career, re-entering the workforce, or looking for a new job, this is where you belong.
Senior Site Reliability Engineer, Brampton, ON
As a Senior SRE, you will lead efforts to ensure the reliability, availability, and efficiency of our infrastructure and services. You’ll drive proactive monitoring, incident response, and root cause analysis, while partnering with cross-functional teams to design systems that are fault-tolerant and self-healing.
In this role, you’ll play a key part in shaping our reliability culture, leading initiatives to improve availability, observability, automation, and deployment pipelines. Your experience will help mentor junior engineers and establish best practices that scale with our growth.
What You’ll Do:
If you are unsure whether your experience matches every requirement above, we encourage you to apply anyway. We are looking for varied perspectives which include diverse experiences that we can add to our team.
We have a long-standing focus on diversity, equity and inclusion because we know it will make our company a better place to work and shop. We are committed to creating accessible environments for our colleagues, candidates and customers. Requests for accommodation due to a disability (which may be visible or invisible, temporary or permanent) can be made at any stage of application and employment. We encourage candidates to make their accommodation needs known so that we can provide equitable opportunities.
Please Note:
Candidates who are 18 years or older are required to complete a criminal background check. Details will be provided through the application process.
#EN
#SS #LTnA #ON
At Loblaw Companies Limited, we succeed through collaboration and commitment and set a high bar for ourselves and those around us. Whether you are just starting your career, re-entering the workforce, or looking for a new job, this is where you belong.
Senior Site Reliability Engineer, Brampton, ON
As a Senior SRE, you will lead efforts to ensure the reliability, availability, and efficiency of our infrastructure and services. You’ll drive proactive monitoring, incident response, and root cause analysis, while partnering with cross-functional teams to design systems that are fault-tolerant and self-healing.
In this role, you’ll play a key part in shaping our reliability culture, leading initiatives to improve availability, observability, automation, and deployment pipelines. Your experience will help mentor junior engineers and establish best practices that scale with our growth.
What You’ll Do:
- Champion System Reliability – Lead efforts to monitor, maintain, and enhance the availability, performance, and scalability of complex distributed systems in production.
- Own Incident Response & Resilience Engineering – Drive incident management processes, participate in and lead on-call rotations, perform root cause analyses, and design long-term solutions to prevent recurrence.
- Automate at Scale – Design and implement advanced automation frameworks, tools, and self-healing mechanisms to reduce toil and increase operational efficiency.
- Infrastructure as Code (IaC) Leadership – Architect and maintain infrastructure using tools like Terraform, Ansible, or equivalents, applying version control, modularization, and automation best practices.
- Optimize CI/CD Pipelines – Improve and maintain robust CI/CD workflows, enabling fast, secure, and reliable application delivery with minimal risk.
- Cross-Functional Collaboration & Mentorship – Partner with engineering, platform, and support teams to improve system design and reliability; mentor junior engineers and contribute to a culture of operational excellence.
- Deep understanding of SRE principles and distributed system reliability
- Strong scripting skills (e.g., Python, Bash, Go) for automation and tooling
- Hands-on experience with AWS, GCP, or Azure and IaC tools like Terraform, Ansible
- Proven ability to design and improve CI/CD, observability, and alerting systems
- Experience leading incident response, root cause analysis, and postmortems
- Excellent problem-solving and communication skills; mentorship mindset
If you are unsure whether your experience matches every requirement above, we encourage you to apply anyway. We are looking for varied perspectives which include diverse experiences that we can add to our team.
We have a long-standing focus on diversity, equity and inclusion because we know it will make our company a better place to work and shop. We are committed to creating accessible environments for our colleagues, candidates and customers. Requests for accommodation due to a disability (which may be visible or invisible, temporary or permanent) can be made at any stage of application and employment. We encourage candidates to make their accommodation needs known so that we can provide equitable opportunities.
Please Note:
Candidates who are 18 years or older are required to complete a criminal background check. Details will be provided through the application process.
#EN
#SS #LTnA #ON
How to apply
To apply for this job you need to authorize on our website. If you don't have an account yet, please register.
Post a resume