theScore, a wholly-owned subsidiary of PENN Entertainment, empowers millions of sports fans through its digital media and sports betting products.
Its media app 'theScore' is one of the most popular in North America, delivering fans highly personalized live scores, news, stats, and betting information from their favorite teams, leagues, and players.
theScore's sports betting app 'theScore Bet Sportsbook & Casino' delivers an immersive and holistic mobile sports betting and iCasino experience.
theScore Bet is currently live in the Company's home province of Ontario. theScore also creates and distributes innovative digital content through its web, social and esports platforms.
About the Role & Team
As part of the theScore team, you will be working with a team of smart, friendly, and dedicated Engineers, Product Managers and Designers determined to deliver some of the best apps the market has to offer.
We want you to be challenged and to get the full experience of what it's like to work at theScore! We are looking for an Incident Commander to join our site reliability team, work cross-functionally across engineering, serve as the front line for incidents, and work with Release Engineering to help prevent new events.
This is a management position responsible for all SRE incidents, from P1 through P4. You will classify and document all incidents, and support and drive each one through investigation, hierarchical and technical escalation, diagnosis, recovery, and root cause analysis.
You will also drive improvements to our service delivery and release processes based on disruption reports.
About the work
- Drive and enhance collaboration with other Command Support members and Commanders, Customer Support, Application teams, Release Engineering leader and cross-functional teams to lead real-time incident management.
- Provide leadership in developing practices, frameworks, process flows, templates, and process guides.
- Continuously improve and enhance the internal framework, methodology, processes, and tools.
- Develop and maintain key practice capabilities.
- Collaborate with SRE and Infrastructure teams to identify requirements.
- Recommend innovative solutions that enable the organization to deliver on its objectives and goals.
- Promote opportunities for continuous service improvement.
- Manage and update Root Cause Analysis documentation.
- Lead SRE communications to stakeholders via email, Slack, Google Meet, and Teams in a timely manner.
- Lead initiatives to promote JIRA release ticket management, quality, and alignment with incident management communication in support of SLAs.
- Other duties as required.
About You
- Experience in a similar role or incident management role.
- Experience with and understanding of containerization (Docker and Kubernetes preferred).
- Automation: an understanding of configuration management and infrastructure-as-code tools is a must (Terraform preferred; Ansible, Helm, etc.).
- Experience with a programming language (Python preferred).
- Comfortable working in Linux environments.
- Experience working with AWS, GCP, and/or on-premises environments.
- Ability to work independently and learn quickly with little supervision.
- Ability to handle multiple projects simultaneously.
- Willingness to drop everything and take on an ad-hoc task.
- You're the type of individual who is extremely tech-savvy and passionate about learning new technologies and tools.
- A bachelor's degree in computer science or engineering, and/or similar experience.
- Nice to have: Postgres, MySQL, Elasticsearch, Kafka, Redis, Helmfile, Terragrunt, Prometheus, and any web programming.
What We Offer
- Competitive compensation package.
- Comprehensive benefits package.
- Fun, relaxed work environment.
- Education and conference reimbursements.
theScore is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability or age.