SRE Lead
Infosys
Date: 1 week ago
City: Calgary, AB
Contract type: Full time

Job Description
Infosys is seeking a SRE Lead. This position will interface with key stakeholders and apply technical proficiency across different stages of the Software Development Life Cycle including Requirements Elicitation, Application Architecture definition and Design; play an important role in creating the high-level design artifacts; deliver high quality code deliverables for a module, lead validation for all types of testing and support activities related to implementation, transition and warranty. This is an opportunity to be part of a learning culture, where teamwork and collaboration are encouraged, excellence is rewarded, and diversity is respected and valued.
Required Qualifications
Estimated annual compensation range for the candidate based in the below location will be:
British Columbia: $ 81575 to $ 116670
Ontario: $ 89004 to $ 115491
About Us
Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems and workings of global enterprises, we expertly steer our clients through their digital journey. We do it by enabling the enterprise with an AI-powered core that helps prioritize the execution of change. We also empower the business with agile digital at scale to deliver unprecedented levels of performance and customer delight. Our always-on learning agenda drives their continuous improvement through building and transferring digital skills, expertise, and ideas from our innovation ecosystem.
Infosys is seeking a SRE Lead. This position will interface with key stakeholders and apply technical proficiency across different stages of the Software Development Life Cycle including Requirements Elicitation, Application Architecture definition and Design; play an important role in creating the high-level design artifacts; deliver high quality code deliverables for a module, lead validation for all types of testing and support activities related to implementation, transition and warranty. This is an opportunity to be part of a learning culture, where teamwork and collaboration are encouraged, excellence is rewarded, and diversity is respected and valued.
Required Qualifications
- Candidate must be located within commuting distance of Calgary, AB or Mississauga, ON or Vancouver, BC or be willing to relocate to the area. This position may require travel.
- Bachelor’s degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
- At least 4 years of Information Technology experience.
- Candidates authorized to work for any employer in Canada without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role at this time.
- SRE Mindset in Production support: Proactive issue identification using observability tools.
- Skilled in using different monitoring & observability tools to track system performance
- Incident commander: Ability to diagnose complex issues and actively drive incident calls working with technical, product SMEs, and Tier 2 SREs.
- Experience in Splunk (including Splunk APM and Splunk O11y), AppDynamics,
- Experience in DB, Network, Linux / Unix, Kubernetes
- Experience in APM, NMON , Wireshark usage and analysis
- Knowledge of Grafana, RedMetrics, 1000Eyes
- Knowledge of VMs, Load balancers, Firewalls, API Gateways,
- Knowledge of Containerization, Docker, AWS, PCF, GCP, ServiceNow (including AIOps, tools for Self-Heal and automated playbooks)
- Experience in UEM and synthetic monitoring tools
- System Administration: Strong knowledge of infrastructure, including command-line tools and system internals. (Kubernetes triage, linux administration)
- Networking: Understanding of network protocols, configurations, and troubleshooting. (nmon, Wireshark)
- Cloud Computing: Experience with cloud understanding, including cloud architecture (on-perm and public) and services. (AWS and Azure)
- Application Management: Familiarity with continuous integration and continuous deployment processes and tools.
- Advanced programming knowledge: Experience with triaging issues with application code. (Java, Python)
- DB troubleshooting: Familiarity in troubleshooting issues with traditional and NoSQL databases (eg: Oracle, SQL Server, MySQL, MongoDB, Cassandra)
- Monitoring and Observability: Skills in using monitoring tools to track system performance and detect issues including all the backend systems, database, and API's (Splunk, AppDynamics, Splunk o11y, Open Telemetry)
- Problem-Solving: Ability to diagnose and resolve complex issues quickly and efficiently
- Strong communication skills to work effectively with cross-functional teams
- Adaptability: Flexibility to handle changing priorities and technologies
- Attention to Detail: Precision in managing configurations and deployments to avoid errors
- Excellent communicator who could interact with Director/Sr. Director and above.
- Production support activities including proactive identification of issues leveraging observability tools with the aim of reducing MTTD and MTTR
- Coordinate all activities required to lead incident triage in compliance with SLAs and OLAs. Corelating inputs from various dashboards & tools to drive resolution.
- Flexibility to work in 24 X 7 environment
Estimated annual compensation range for the candidate based in the below location will be:
British Columbia: $ 81575 to $ 116670
Ontario: $ 89004 to $ 115491
About Us
Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems and workings of global enterprises, we expertly steer our clients through their digital journey. We do it by enabling the enterprise with an AI-powered core that helps prioritize the execution of change. We also empower the business with agile digital at scale to deliver unprecedented levels of performance and customer delight. Our always-on learning agenda drives their continuous improvement through building and transferring digital skills, expertise, and ideas from our innovation ecosystem.
How to apply
To apply for this job you need to authorize on our website. If you don't have an account yet, please register.
Post a resume