Site Reliability Engineer

Lyft

Montreal, Canada

$100K-$175K a year (estimated)

Full-time

At Lyft, our mission is to improve people’s lives with the world’s best transportation. Imagine cities where streets are safe, communities thrive, and personal cars are a thing of the past.

We envision a future where shared and active transportation modes are the norm, fostering vibrant, connected neighborhoods.

As a leader in micromobility, Lyft powers millions of rides daily across over 200 cities with our cutting-edge ride-sharing, bike-sharing, and scooter-sharing technologies.

Our Montreal office is the birthplace of North America's first automated bike-share system, Bixi, which has since revolutionized urban mobility.

Today, our pioneering system is operational in more than 50 cities worldwide, including Barcelona, Bogota, Boston, Buenos Aires, Chicago, Dubai, London, Madrid, Mexico City, Montreal, New York, Rio de Janeiro, San Francisco, and Washington DC, to name just a few.

Join us and be part of the team behind some of the world's largest and most successful bike-share systems!

The Transit, Bikes, and Scooters (TBS) infrastructure team at Lyft in Montreal is growing, and we are looking for a Site Reliability Engineer to support our production systems, platforms, and the tools our developers use, while ensuring the reliability of our systems.

Every engineering team at Lyft is responsible for running and operating the software that they build. The Infrastructure team works towards standardizing and supporting all the rapidly evolving teams throughout our organization, assessing their architecture, helping them design scalable services, and fostering excellent operational practices.

It's a mission-critical role of ensuring that our systems are always healthy, monitored, automated, and designed to scale.

The nature of work is interdisciplinary, and our teammates come from varying backgrounds e.g. (Site Reliability Engineer (SRE), Systems Engineer, Software Engineer, DevOps Engineer, Infrastructure Engineer, Production Engineer).

We urge you to apply even if you feel uncertain that you have the exact background.

Technical interviews and interactions with the other offices in the company will be mainly in English; however, the working environment in Montreal is bilingual.

Responsibilities :

Help define the team’s roadmap and architecture based on technology and business needs
Design and implement effective infrastructure abstractions that increase velocity of our application teams
Be responsible for, design, develop, deploy, monitor, operate and maintain existing or new elements of our systems infrastructure.
Build holistic visibility into SLIs, SLOs, SLAs, dependency graphs, past performance of software, network, and system to ensure that we can continue to scale without increasing operational burden or toil
Use the core Site Reliability Engineering principles of change management, monitoring, emergency response, capacity planning, and production readiness reviews to run the platform
Step back to observe patterns and develop innovative tools and automation to minimize toil. Use those learnings to drive the best operational practices.
Partner with the broader Lyft organization to build a culture of rigorously learning from incidents
Unblock, support, and effectively communicate across teams to achieve results
Have a good grasp and ability to explain the various tradeoffs made in decisions
Share your knowledge by giving brown bags, tech talks, and evangelizing appropriate tech and engineering best practices.

Experience :

5+ years of software engineering / production infrastructure industry experience
Experience designing, debugging and running fault-tolerant large-scale distributed systems
Experience with high level programming languages (Python, Go, Java, etc.)
Experience working with public cloud platforms (e.g., AWS, Google Cloud Platform, Microsoft Azure, etc.)
Experience bringing software to production at high scale
Experience with common CI tools (Jenkins, Buildkite, CircleCI, TeamCity), and proficiency in at least one of those tools an asset
Experience working with databases, relational or NoSQL an asset
Experience in Linux system administration, or familiarity with managing a fleet of Linux servers an asset
Must be fluent in spoken and written English and minimally be willing to learn French if required

Benefits :

Comprehensive health, dental, and vision insurance plans, including family coverage
Life insurance and disability benefits
Mental health support programs
Healthcare Spending Account (HSA)
Fertility and family-building support
Complimentary lunch, snacks, beverages, coffee, and tea in our offices
Additional holidays (13 in 2024, 5 more than the legal requirement)
15 days of paid time off, with an extra day for each year of service, up to a maximum of 25 days
4 floating holidays per year
10 paid sick days annually
Occasional company-wide recharge days (5 in 2024)
Up to 18 weeks of fully paid parental leave, subject to certain conditions, for biological, adoptive, and foster parents
And other special benefits related to our services

Lyft proudly pursues and hires a diverse workforce. Lyft believes that every person has a right to equal employment opportunities without discrimination because of race, ancestry, place of origin, colour, ethnic origin, citizenship, creed, sex, sexual orientation, gender identity, gender expression, age, marital status, family status, disability, pardoned record of offences, or any other basis protected by applicable law or by Company policy.

Lyft also strives for a healthy and safe workplace and strictly prohibits harassment of any kind. Accommodation for persons with disabilities will be provided upon request in accordance with applicable law during the application and hiring process.

Please contact your recruiter now if you wish to make such a request.

This role will be in-office on a hybrid schedule Team Members will be expected to work in the office 3 days per week on Mondays, Thursdays and a team-specific third day.

Additionally, hybrid roles have the flexibility to work from anywhere for up to 4 weeks per year. #Hybrid

30+ days ago

Related jobs

Site Reliability Engineer (Hybrid)

National Bank

Montreal, Quebec

Information technology As a Specialist in site reliability engineering on the National Bank Data Protection team, you will ensure the operational reliability of data protection assets. ...

Site Reliability Engineer (SRE)

Bourse de Montreal Inc.

Montréal, Quebec

Previous experience as a Site Reliability Engineer (SRE). The Devops Engineering team is responsible for working closely with various business units and stakeholders to solve complex problems using innovative solutions, quickly and effectively using agile, lean and devops methodologies, while ensuri...

Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure

Axelon Services Corporation

Montreal, Quebec

The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for ***'s ServiceNow SaaS implementation. Job Title: Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure...

Senior DevOps Engineer / Site Reliability

Leica Geosystems

Canada

Senior DevOps Engineer / Site Reliability. DevOps &/or Site Reliability Engineering principles. Senior DevOps Engineer / Site Reliability | Hexagon Geosystems. As a Senior DevOps/SRE Engineer, you will help build solutions that allow our cloud-based platform, HxDR, to continue to evolve and grow thr...

Site Reliability Engineer

ALTEN Canada

Montreal, Quebec

The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Morgan Stanley's ServiceNow SaaS implementation. Le département Application Infrastructure (AI) recherche un Site Reliab...

Site Reliability Engineer - Kubernetes

Okta, Inc.

Canada

Triaging and troubleshooting complex production issues to ensure reliability and performance. Are passionate about encouraging the development of engineering peers and leading by example. A proven track record of successful SRE engagements and collaborating closely with engineering teams. ...

Site Reliability Engineer (SRE)

Alltech Consulting Services

Montreal, Quebec

The Application Infrastructure (AI) department is seeking a Site Reliability Engineer (SRE) to help drive the reliability engineering, operations and customer support services for Company’s ServiceNow SaaS implementation. Reporting to a Site Reliability Engineering & Operations Lead. Successful cand...

Site Reliability Engineer 3

Behavox

Canada

As a Site Reliability Engineer, you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of all production systems and services. You will work together with other DevOps, Product, and Engineering teams to...

Site Reliability Engineer

Great Canadian Gaming Corp.

Canada, Canada

Mindwire is currently looking for a Site Reliability Engineer to work for our valued public sector client. The position is located in Ottawa, Ontario, 3 days onsite preferred, but would be open to remote for the right candidate. ...

Senior Site Reliability Engineer (SRE) to support the installation & configuration of Dynatrace to ensure seamless integration with existing systems & infrastructure for a crown corporation client

S.i. Systems

Montreal, Quebec

Senior Site Reliability Engineer (SRE). Local candidates with the ability to work on-site three days/week in a hybrid model will be prioritized however % remote options will be available. As the successful candidate, you will work with other application and operational experts to ensure the highest ...

Site Reliability Engineer

Site Reliability Engineer (Hybrid)

Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure

Senior DevOps Engineer / Site Reliability

Site Reliability Engineer

Site Reliability Engineer - Kubernetes

Site Reliability Engineer (SRE)

Site Reliability Engineer 3

Site Reliability Engineer

Senior Site Reliability Engineer (SRE) to support the installation & configuration of Dynatrace to ensure seamless integration with existing systems & infrastructure for a crown corporation client

Popular searches