Position : Senior Data Engineer, Baseball Systems
Overview :
The Toronto Blue Jays are seeking an experienced Data Engineer to join their Baseball Systems team to help build their next generation of databases and data processing systems.
In this role you will work on challenging and compelling problems with some of the largest and most exciting datasets in sports.
Your work will impact the entire organization from Research and Development through to Scouting, Player Development, and the Major League team by giving staff and players access to the full range of available information.
Primary Focus :
- To build and maintain databases and other systems for storing baseball data.
- To build processing pipelines to transform data and extract useful information.
- Optimize new and existing data infrastructure to improve performance and scalability.
Responsibilities and Duties :
- Design and build systems to store a growing volume of statistical, ball tracking, player tracking, and biomechanical data.
- Build and maintain data pipelines for processing, transforming, and integrating large volumes of baseball data from a variety of sources and tracking systems.
- Develop systems for reporting, monitoring, and quality control of incoming data to ensure that systems are performing as expected and the data conforms to required standards.
- Understand and document database structures, data formats, definitions, and the limitations of current systems and establish future requirements.
- Oversee the evaluation, selection, and rollout of new software and data infrastructure.
- Complete ad-hoc database queries and analysis as required by the circumstances.
- Provide technical assistance to other Baseball Operations staff and collaborate with the Research team to ensure they have the data and resources required for their work.
- Recommend new data sources for purchase and new techniques to gather data.
Experience and Job Requirements :
- Demonstrated experience building and maintaining relational databases and data warehouses.
- Strong working knowledge of SQL and database programming.
- Highly proficient with Python (preferred) or another general-purpose programming language for manipulating data.
- Knowledge of database architecture, design patterns, and best practices to ensure system performance and long-term maintainability.
- Exceptional problem-solving skills finds pragmatic solutions to data and engineering challenges within time or technology constraints.
- High attention to detail anticipates and identifies potential issues early and strives to produce reliable and high-quality work.
- Excellent interpersonal skills and the ability to effectively explain complex technical concepts to a wide range of individuals.
- Ability to read, speak and comprehend English effectively.
- The ability to work with multicultural populations and a commitment to fairness and equality.
- Ability to work evenings, weekends, and holiday hours as dictated by the baseball calendar.
Beneficial Experience :
- Bachelor’s degree in computer science, computer engineering, or other educational qualifications. Higher level academic qualifications are a plus.
- Experience with a task orchestration framework such as Airflow, Luigi, or Dagster.
- Experience with cloud services and infrastructure (Azure, AWS, GCP).
- Experience with R programming and data visualization.
- Experience working with baseball, sports, or biomechanics datasets is beneficial but not required.
We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex, sexual orientation, age, disability, gender identity, marital or veteran status, or any other protected class.