Our valued client is seeking a Lead High Performance Computing (HPC) Architect to support the development of new high-performance computing (HPC) and Artificial Intelligence (AI) solutions!
Open to all of Canada. Work is to be performed remotely with semi-regular travel required throughout the year in various cities across the country (at least once a quarter).
As the successful candidate you will provide architectural leadership & best practices for new solutions with a strong focus on HPC, supercomputing, HTC (High through-put computing), and associated technologies such as Quantum and AI computing.
This is an exciting opportunity to serve as a thought leader on an ambitious team aiming to implement some of Canada's first quantum computers!
Responsibilities :
- Lead working groups and committees in HPC, HTC, AI, and other technology streams.
- Keep up with HPC / supercomputing emerging trends and market insights, both in academia and in industry.
- Lead experimental and proof-of-concept projects to test feasibility and value of initiatives.
- Coach, mentor and guide staff and community members on matters related to HPC architecture and architecture documentation.
- Work with the vendor and stakeholder community to understand the latest HPC developments, and how they might be incorporated into enterprise services and offerings.
- Analyze requirements for new advanced research computing solutions from diverse stakeholder groups, and transform those into scalable, flexible, and resilient technical architectures.
- Perform architecture options and feasibility analysis, proactively debate alternatives with subject matter experts, and build consensus on recommended architecture within the SME community.
- Communicate technical information to both technical and non-technical staff and stakeholders and participate in enterprise training initiatives.
- Participate in a range of national and international committees and working groups, and occasional speaking engagements to provide architectural and technical expertise.
Must Have Skills :
- 10+ years experience working with complex High Performance Computing (HPC) and High Thorough-put Computing (HTC) , with a strong knowledge of similar infrastructure
- Experience researching and evaluating new technology and solutions within the realm of high performance computing, supercomputing , quantum computing , Artificial Intelligence (AI) computing , storage systems, high-performance file systems, parallel workflows, networking at scale, and edge computing.
- Advanced knowledge of HPC middleware stacks including cluster management tools, job schedulers, and resources managers;
such as HTCondor, Maui, Onesis, Slurm, PBS (or derivatives), OpenHPC, Rocks, etc.
- Demonstrated experience working with enterprise architecture frameworks and methodologies, such as TOGAF or ITIL.
- Demonstrated experience with the research, design, modification, implementation, and deployment of HPC applications and tools
Nice-to-have Skills :
- Experience in virtualization, containerization, and public and private cloud technologies and associated management and orchestration tools.
- Cisco, Cray, Dell, HPE, or IBM training.
- Exposure to Quantum and AI technologies and workloads.
- TOGAF, ITIL, or other industry certifications.
- Bilingualism in English & French