Technical Site Reliability Engineering (SRE) Lead

Ubisoft
Montreal, Québec, Canada
Full-time

Job Description

As a Technical Site Reliability Engineering (SRE) Lead within Ubisoft’s IT department, you will manage a team of SREs to ensure the reliability, scalability, and performance of our IT platform.

You will play a pivotal role in shaping the architecture and operations of our cloud-native infrastructure, with a strong focus on automation and large-scale system management.

Responsibilities:

  • Leadership: Manage and mentor a team of SREs, fostering a culture of continuous learning and improvement.
  • Design and Development: Oversee the design and development of tools and solutions for the smooth operation of the Kubernetes environments.
  • Maintenance and Operation: Ensure the maintenance and operation of the various components of the Ubisoft IT Platform, emphasizing documented and automated installation and support procedures.
  • Continuous Improvement: Drive enhancements in continuous integration and delivery systems, ensuring they meet the highest standards of reliability and performance.
  • Collaboration: Work closely with developer teams to assess their needs and ensure the platform is designed for operability and ease of use.
  • Advocacy: Promote the use of Kubernetes and other cloud-native technologies within Ubisoft.
  • Evaluation: Steer the evaluation of new requirements, technical designs, and standards to ensure they align with best practices and organizational goals.
  • Strategic Planning: Contribute to strategic planning and decision-making processes to guide the future direction of the platform.

This role involves on-call duty.

Qualifications

  • Expertise in cloud-native architectures, Kubernetes (e.g., CRD, CNI, admission controllers), and Linux systems.
  • Strong CI/CD capabilities with tools like GitLab CI and ArgoCD, plus experience with public cloud providers (Azure, AWS, GCP).
  • Proficient in scripting or development (preferably Go and/or Python) and infrastructure automation with Terraform.
  • Advanced understanding of Linux networking, system configuration, and network administration.
  • Effective collaboration skills, including experience working with remote teams.

Bonus:

  • Familiarity with OpenStack, Docker, Flask, OPA, and other DevOps tools.
  • Previous leadership experience managing large-scale production systems.

Additional Information

Just a heads up: if you require a work permit, your eligibility may depend on your education and years of relevant work experience, as required by the government.

Skills and competencies show up in different forms and can be based on different experiences. That is why we strongly encourage you to apply even if you do not meet all the requirements listed above.

At Ubisoft, we embrace diversity in all its forms. We’re committed to fostering an inclusive and respectful work environment for all.

We know the importance of providing a pleasant interview experience. If you need any accommodation, please let us know so that we can facilitate the interview process.

Posted 10 hours ago
Related jobs
Pratt & Whitney Canada
Longueuil, Québec

As a Site Reliability Engineer, you will be responsible for streamlining our software development lifecycle by integrating and automating various processes, from development to operations.

NBC
Montréal, Québec

As a Systems Reliability Developer, you will help all IT teams put in place the necessary mechanisms to improve and maintain the highest standards of resilience and availability of IT services. ...

Morgan Stanley
Montréal, Québec

We're seeking someone to join the Enterprise Computing team as a Site Reliability Engineering Lead to manage the operations, reliability and services for Morgan Stanley's suite of Software Distribution ecosystem products. Site Reliability Engineering Lead (Hybrid). The multi-faceted and hig...

S.i. Systems
Montréal, Québec

Senior Site Reliability Engineer (SRE). Local candidates with the ability to work on-site three days/week in a hybrid model will be prioritized however % remote options will be available. As the successful candidate, you will work with other application and operational experts to ensure the highest ...

Alltech Consulting Services
Montréal, Québec

The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Company’s ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead. Successful cand...

Ubisoft
Montréal, Québec

Ensure that technical decisions align with our quality, performance, scalability, reliability, and security goals while promoting engineering excellence. You will lead the Core Infrastructure team, leveraging your deep technical knowledge to scale, optimize and operate the transversal infrastructure...

Genpact
Montréal, Québec

Inviting applications for the role of Principal Consultant, Site Reliability Engineering. Lead a team of SREs in the implementation and operation of the IT infrastructure. Powered by our purpose – the relentless pursuit of a world that works better for people – we serve and transform leading enter...

Bourse de Montreal Inc.
Montréal, Québec

Previous experience as a Site Reliability Engineer (SRE). The TMX group of companies includes leading global exchanges such as the Toronto Stock Exchange, Montreal Exchange, and numerous innovative organizations enhancing capital markets. The DevOps Engineering team is responsible for working closel...

Genpact Limited
Montréal, Québec

Inviting applications for the role of Principal Consultant, Site Reliability Engineering. Lead a team of SREs in the implementation and operation of the IT infrastructure. Powered by our purpose - the relentless pursuit of a world that works better for people - we serve and transform leading enterp...