Software Engineer, Data Transformation Movement

Stripe
Canada
$134.4K-$258K a year
Remote
Full-time

Who we are

About Stripe

Stripe is a financial infrastructure platform for businesses. Millions of companies from the world’s largest enterprises to the most ambitious startups use Stripe to accept payments, grow their revenue, and accelerate new business opportunities.

Our mission is to increase the GDP of the internet, and we have a staggering amount of work ahead. That means you have an unprecedented opportunity to put the global economy within everyone’s reach while doing the most important work of your career.

About the team

The Data Transformation and Movement team operates the critical infrastructure that powers near-realtime and batch data processing at Stripe.

The team supports a variety of use cases, including Payment, Ledger, ML, Fraud Detection, Product Analytics, Regulatory Reporting, Financial Data Reconciliation, and externally facing products like and .

As an example of the scale, the team’s systems serve hundreds of teams, thousands of workflows, 100,000+ task executions, O(billion) streaming transformations, and moving terabytes of data processing over 1 GB / second every day.

Our users inside Stripe include other engineering teams, Data Scientists, Sales & Operations, Finance, etc.

This role could be on any one of the following sub-teams :

Data Movement builds and operates a constellation of multi-region, high scale ingestion systems that moves data from all online sources into Iceberg, with sub-minute latency.

On the cusp of innovation, we're pushing the boundaries of open-source Iceberg and Spark for real-time ingestion.

Data Orchestration builds and operates the time-based and event-based orchestration infrastructure that powers and accelerates batch data pipelines.

Data Transformation builds and operates the transformation abstractions and infrastructure that support frictionless data development across the board, sub-minute event data to enormous daily partitions - or even for-all-time snapshots.

Our team operates on a wide range of tech stacks including Kafka, Event Bus, Change Data Capture, Flink, Spark, Airflow, Hive MetaStore, Trino, Pinot, SQL, Python, Java, Scala, S3, and Iceberg.

What you’ll do

As a Software Engineer on our team, you will do the following :

  • Design, build, and maintain innovative next-generation or first-generation versions of key Data Platform products, with an emphasis on usability, reliability, security, and efficiency.
  • Design ergonomic APIs and abstractions that build a great customer experience for internal Stripes, that will in turn enhance the experience of millions of Stripe users.
  • Ensure operational excellence and enable a highly available & reliable Data Transformation & Movement platform across streaming and batch workloads.
  • Collaborate nimbly with high-visibility teams and their stakeholders to support their key initiatives - while building a robust platform that benefits all of Stripe in the long term.
  • Plan for the growth of Stripe’s infrastructure by unblocking, supporting, and communicating proactively with internal partners to achieve results.
  • Connect your work with improvements in the usability and reliability of Open Source Software (OSS) like Apache Airflow, Iceberg, Spark and contribute back to the OSS community.

Who you are

Minimum requirements

  • 2-5 years of professional experience writing high quality production level code or software programs
  • Has experience operating or enabling large-scale, high-availability data pipelines from design, to execution and safe change management.

Expertise in Spark, Flink, Spark, Airflow, Python, Java, SQL, and API design is a plus.

  • Has experience developing, maintaining, and debugging distributed systems built with open source tools
  • Has experience building infrastructure-as-a-product with a strong focus on users needs
  • Has strong collaboration and communication skills, and can comfortably interact with both technical and non-technical participants.
  • Has the curiosity to continuously learn about new technologies and business processes.
  • Is energized by delivering effective, user-first solutions through creative problem-solving and collaboration.

Preferred qualifications

  • Has experience writing production-level code in Expertise in Scala, Spark, Flink, Spark, Airflow, Python, Java, and SQL is a plus.
  • Experience packaging and deploying code into cloud-based environments (AWS, GCP, Azure) with tools including Bazel, Docker Containers, etc
  • Has experience designing APIs or building developer platforms
  • Has experience optimizing the end to end performance of distributed systems
  • Has experience with scaling distributed systems in a rapidly moving environment
  • BS or MS in Computer Science or equivalent field and interest in data
  • Has experience working with data pipelines
  • Genuine enjoyment of innovation and a deep interest in understanding how things work

Hybrid work at Stripe

This role is available either in an office or a remote location (typically, 35+ miles or 56+ km from a Stripe office).

Office-assigned Stripes spend at least 50% of the time in a given month in their local office or with users. This hits a balance between bringing people together for in-person collaboration and learning from each other, while supporting flexibility about how to do this in a way that makes sense for individuals and their teams.

A remote location, in most cases, is defined as being 35 miles (56 kilometers) or more from one of our offices. While you would be welcome to come into the office for team / business meetings, on-sites, meet-ups, and events, our expectation is you would regularly work from home rather than a Stripe office.

Stripe does not cover the cost of relocating to a remote location. We encourage you to apply for roles that match the location where you currently or plan to live.

Pay and benefits

The annual salary range for this role in the primary location is C$134,400 - C$258,000. This range may change if you are hired in another location.

For sales roles, the range provided is the role’s On Target Earnings ( OTE ) range, meaning that the range includes both the sales commissions / sales bonuses target and annual base salary for the role.

This salary range may be inclusive of several career levels at Stripe and will be narrowed during the interview process based on a number of factors, including the candidate’s experience, qualifications, and specific location.

Applicants interested in this role and who are not located in the primary location may request the annual salary range for their location during the interview process.

Specific benefits and details about what compensation is included in the salary range listed above will vary depending on the applicant’s location and can be discussed in more detail during the interview process.

Benefits / additional compensation for this role may include : equity, company bonus or sales commissions / bonuses; retirement plans;

health benefits; and wellness stipends.

30+ days ago
Related jobs
Stripe
Canada
Remote

Data Transformation builds and operates the transformation abstractions and infrastructure that support frictionless data development across the board, sub-minute event data to enormous daily partitions - or even for-all-time snapshots. The Data Transformation and Movement team operates the critical...

Lime
Canada

The Data Engineering team at Lime is responsible for ingesting, transforming and making available timely, high-quality data that powers analytics, bookkeeping and visibility for a wide range of customers. Implement data governance policies and ensure data security and compliance. You have a strong d...

Doximity
Remote, Canada
Remote

Collaborate with product managers, data analysts, and other data engineers to develop data pipelines and ETL tasks in order to facilitate the extraction of insights. You have developed maintainable data pipelines with these languages. You strive for high code quality, create automated testing, apply...

StackAdapt
Canada

Working with large data sets and various databases including Aerospike, Elasticsearch, Redis, ScyllaDB, Redshift, TiDB, MariaDB. Our real-time advertising bidding system handles over 3,000,000 requests per second and stores several terabytes of data every day. Build software that utilize messaging q...

Doximity
Remote, Canada
Remote

Collaborate with product managers, data analysts, and machine learning engineers to develop pipelines and ETL tasks in order to facilitate the extraction of insights. You have developed maintainable data pipelines with them. You are experienced in creating automated testing, applying design patterns...

StackAdapt
Canada

We're seeking a Staff Software Engineer to help lead our growing backend engineering team. Integrate data into StackAdapt’s Customer Data Platform (CDP). At least 5 years experience of software development in distributed systems, architecting scalable microservices and data pipelines in a successful...

Stripe
Canada
Remote

With all this data, the Growth Data Engineering team is looking for talented data-minded engineers to help us manage business critical data leveraged across the entire organization. Data Engineering or Software Engineering role, with a focus on building data pipelines, or applications powered by big...

Promoted
Accelerize 360
Canada

You will work closely with clients to understand their data needs, ensure data governance and security compliance, optimize costs, design efficient data pipelines, provide expert guidance on best practices, collaborate with cross-functional teams to ensure seamless integration, perform data migratio...

Promoted
Myticas Consulting
Canada

The ideal candidate will be experienced in building and optimizing data pipelines on Microsoft Azure, with expertise in Azure Data Fabric and other Azure data services. Proven experience as a Data Engineer, with a focus on Microsoft Azure and Azure Data Fabric. Seeking a highly skilled Data Engineer...

Promoted
Capgemini Engineering
Canada

As a Senior Engineer, you will build distributed data processing solution and highly loaded database solutions for various cases including reporting, product analytics, marketing optimization and financial reporting. Chip in as part of self-organized team of data engineers working in an innovative e...