Site Reliability Engineer

SGS
Toronto, ON
Remote
Full-time

Job Description

The Site Reliability Engineer will play a critical part in ensuring the reliability, supportability, scalability, and performance of our .NET stack applications built with ASP.NET MVC, Angular, and Web API.

  • Partner with developers and product operations teams to understand application requirements and translate them into operational practices.
  • Design, implement, and maintain infrastructure automation tools using Infrastructure as Code (IaC) methodologies.
  • Monitor application health and performance metrics, proactively identifying and resolving potential issues.
  • Implement incident response procedures to ensure timely resolution of outages and service disruptions.
  • Establish and improve best practices for product solution design, architecture, and development.
  • Participate in peer and team code reviews, and develop comprehensive coding standards and guidelines to ensure consistency, maintainability, and quality in software development. Clear protocols for code formatting, naming conventions, error handling, testing, and documentation enhance code readability, reduce defects, and facilitate knowledge sharing among team members.
  • Collaborate with engineers to develop and implement disaster recovery plans.
  • Continuously improve monitoring and alerting processes to ensure efficient problem identification and resolution.
  • Stay up-to-date on the latest advancements in .NET infrastructure and SRE best practices.

Qualifications

  • Bachelor's degree required
  • Minimum of 3 years of experience in a related technical role (e.g., Systems Administrator, Network Engineer) required
  • Experience with configuration management tools like Ansible, Puppet, or Chef preferred
  • Azure experience required
  • Familiarity with monitoring and alerting tools (.NET performance counters, Azure Application Insights, Prometheus, Grafana) preferred
  • Ability to manage and coordinate multiple projects in a fast-paced, highly professional environment
  • While coding proficiency is not required, a strong understanding of the .NET ecosystem and a desire to delve into infrastructure and automation will be essential for success.
  • Strong understanding of system administration principles, including operating systems (Windows Server preferred) and networking concepts.
  • Ability to work independently and as part of a team

Additional Information

SGS is an Equal Opportunity Employer, and as such we recruit, hire, train, and promote persons in all job classifications without regard to race, color, religion, sex, national origin, disability, age, marital status, sexual orientation, gender identity or expression, Indigenous status, or any other characteristic protected by law.

To perform this job successfully, an individual must be able to perform each essential duty satisfactorily with or without reasonable accommodations.

The requirements listed above are representative of the knowledge, skills, and/or abilities required.

This job description should not be construed as an exhaustive statement of duties, responsibilities, or requirements, but a general description of the job.

Nothing contained herein restricts the company's rights to assign or reassign duties and responsibilities to this job at any time.

Accommodations are available on request for qualified candidates during each stage of the recruitment process.

Please note that candidates applying for Canadian job openings should be authorized to work in Canada.

28 days ago