[Remote] Director, Data Reliability Engineering
Note: The job is a remote job and is open to candidates in USA. Rocket is seeking a Director, Data Reliability Engineering to lead the reliability, observability, operational maturity, and trustworthiness of enterprise data platforms. This leader will define how data infrastructure is operated, measured, supported, and improved across a complex enterprise environment, with an initial focus on affiliated business platforms and the opportunity to help shape the broader future-state Data Infrastructure organization.
Responsibilities
- Lead Engineering teams responsible for improving the reliability, observability, recoverability, and operational maturity of enterprise data platforms
- Define reliability standards for databases, data warehouses, pipelines, jobs, storage, access patterns, and supporting infrastructure
- Establish operating expectations for monitoring, alerting, logging, incident response, change management, backup/recovery, disaster recovery, patching, access controls, service ownership, and operational readiness
- Create metrics that measure platform health, data freshness, data quality, recovery readiness, incident trends, operational risk, compliance alignment, and business impact
- Lead current-state assessments of systems, data flows, operational processes, observability, access patterns, and reliability gaps
- Convert assessment findings into executable roadmaps that improve platform stability, data trust, security alignment, and operational predictability
- Support migration and modernization programs involving on-premise platforms, AWS, Snowflake, and related enterprise data systems
- Build durable operating mechanisms, including reliability reviews, service health reviews, incident reviews, operational readiness reviews, risk reviews, roadmap reviews, and executive reporting
- Develop senior technical talent and create the leadership structure required to scale Data Reliability Engineering over time
Skills
- 10+ years of experience in data infrastructure, database engineering, data platform engineering, cloud infrastructure, site reliability engineering, or related technical disciplines
- 5+ years of experience leading engineering teams responsible for production systems, databases, data platforms, infrastructure platforms, or reliability engineering
- Strong understanding of enterprise data infrastructure, including databases, data warehouses, pipelines, storage, compute, backup/recovery, resiliency, and production operations
- Experience improving reliability practices across complex production environments, including observability, monitoring, incident response, change management, disaster recovery, and lifecycle management
- Experience establishing service health metrics, data reliability metrics, operational maturity indicators, and executive-level reporting
- Strong understanding of enterprise security, compliance, access management, auditability, operational controls, and infrastructure standards
- Proven ability to create structure in ambiguous environments, set clear priorities, influence across teams, and translate technical reliability work into business outcomes
- Experience in financial services, mortgage, banking, lending, insurance, or other regulated enterprise environments
- Experience leading teams through mergers, acquisitions, integrations, or large-scale enterprise transformation
- Experience with AWS, Snowflake, Microsoft SQL Server, Postgres, Redshift, Aurora or similar data platform technologies
- Experience supporting on-premise to cloud migrations, data platform modernization, or large-scale infrastructure transformation
- Experience defining data reliability practices such as data freshness, data quality, lineage, reconciliation, pipeline observability, and data incident management
- Experience leading senior technical talent, including Staff or Principal Engineers
Benefits
- Annual bonus
- Incentives
- Medical benefits
- Dental benefits
- Vision benefits
- 401K retirement plan
- Paid-time off
Company Overview
Company H1B Sponsorship