Are you a passionate Data Engineer with expertise in building and maintaining large-scale data pipelines? Do you thrive in a cloud environment and enjoy tackling complex data challenges?
If so, we want you on our team!We are seeking a talented Staff Software Engineer to join our growing Data Infrastructure team.
In this role, you will leverage your past experiences and deep understanding of data warehousing and data lake concepts to help shape and execute Achievers’s roadmap technologies to design, develop, and maintain our data pipelines on Google Cloud Platform (GCP).
You will collaborate closely with Data Architects and Data Scientists to translate business needs into robust and scalable data solutions.
Responsibilities :
- Design, develop, and implement scalable and reliable data pipelines using Dataflow, Airflow, and Apache Beam on GCP.
- Work with Data Architects to define data requirements and translate them into technical specifications.
- Write high-quality, maintainable, and efficient code using Python, PHP, and Go.
- Collaborate with data engineers and data scientists to troubleshoot data pipelines and ensure data quality.
- Automate data pipeline deployment and monitoring processes.
- Stay up-to-date on the latest trends and technologies in the big data landscape.
- Document data pipelines and code for maintainability.
- Identify and fill gaps in the team and create the processes necessary for the team's success.
Qualifications :
- Minimum 5+ years of experience as a Software Engineer with a focus on data infrastructure.
- Proven experience in designing and developing data pipelines on cloud platforms (preferably GCP).
- Strong expertise in software development best practices (clean code, version control, testing).
- In-depth knowledge of distributed processing frameworks like Apache Beam.
- Experience with containerization technologies like Kubernetes.
- Proficiency in programming languages like Python, PHP, and Go.
- You're comfortable with the building blocks of modern back-end systems, such as horizontally scalable data infrastructure, event-driven architecture, and beyond and can clearly articulate the pros / cons of different approaches, while also providing a recommended solution based on the current context.
- You have familiarity with databases and analytics technologies in the industry, including Data Warehousing, Data Lakes, ETL and Relational Databases.
- Excellent communication, collaboration, and problem-solving skills.
30+ days ago