← all jobs

[Remote] Site Reliability Engineering Tech Lead

Work from home Full-time role Hiring

Note: The job is a remote job and is open to candidates in USA. DataHub is an AI & Data Context Platform adopted by over 3,000 enterprises, including major companies like Apple and Netflix. They are seeking an experienced Site Reliability Engineering (SRE) Tech Lead to drive the reliability, scalability, and operational excellence of their platform offerings, focusing on technical leadership and architecture, enterprise platform development, and platform reliability operations.

Responsibilities

  • Design and implement robust, scalable infrastructure solutions for DataHub Cloud and enterprise deployments
  • Lead the technical vision for multi-cloud deployment strategies and distributed system integrations
  • Architect monitoring, observability, and alerting systems across diverse environments
  • Drive best practices for infrastructure as code, configuration management, and deployment automation
  • Partner with product and engineering teams to influence the development of advanced deployment capabilities
  • Collaborate with cross-functional teams to help build systems for seamless installation, upgrade, and rollback processes across various environments
  • Influence the design and help implement comprehensive monitoring and health check systems for distributed deployments
  • Partner with engineering teams to help develop self-healing and automated remediation capabilities
  • Establish and maintain SLAs/SLOs for both cloud and enterprise offerings
  • Lead incident response and post-mortem processes to drive continuous improvement
  • Implement chaos engineering practices to proactively identify system weaknesses
  • Optimize system performance, capacity planning, and cost efficiency
  • Mentor and guide a team of SRE engineers and collaborate with platform engineering teams
  • Work closely with product, engineering, and customer success teams to ensure reliable product delivery
  • Improve on-call practices, runbooks, and knowledge sharing processes
  • Drive cross-functional initiatives to improve overall system reliability

Skills

  • 8+ years of experience in Site Reliability Engineering, Platform Engineering, or DevOps roles
  • 3+ years of technical leadership experience managing engineering teams
  • Strong expertise with cloud platforms (AWS, GCP, Azure) and infrastructure automation tools
  • Proficiency in containerization technologies (Docker, Kubernetes) and orchestration
  • Experience with infrastructure as code tools (Terraform, CloudFormation, Pulumi)
  • Strong programming skills in Python, Java, or similar languages
  • Deep understanding of monitoring and observability tools (Prometheus, Grafana, Datadog, etc.)
  • Experience with CI/CD pipelines and deployment automation
  • Strong knowledge of networking, security, and database operations in cloud environments
  • Experience building and operating multi-tenant SaaS platforms
  • Background in developing customer-facing deployment and management tools
  • Knowledge of data infrastructure and metadata management systems
  • Experience with service mesh technologies and microservices architectures
  • Previous experience in a customer-facing technical role or working with enterprise clients
  • Experience with data governance or data catalog platforms

Benefits

  • Competitive compensation
  • Equity for everyone
  • Remote Work
  • Location flexibility
  • You’ll receive a monthly coworking stipend to use whenever you need a change of pace or in-person collaboration time.
  • Comprehensive health coverage
  • We cover 99% of medical, dental, and vision premiums employees, and 65% for dependents.
  • Flexible savings accounts
  • We offer FSAs to help cover planned or unexpected healthcare costs.
  • You can also opt into a Dependent Care FSA to support family needs.
  • Support for every path to parenthood
  • Through Carrot Fertility, we provide inclusive fertility benefits and family-forming support.
  • All U.S. employees have access, regardless of age, gender identity, or family structure.
  • Time off that works for you
  • Our unlimited PTO and sick leave policy is designed for flexibility, rest, and real life.

Company Overview

  • DataHub is an open-source metadata platform that unifies data discovery, observability, and governance for AI and data ecosystems. It was founded in 2021, and is headquartered in Palo Alto, California, USA, with a workforce of 51-200 employees. Its website is https://datahub.com.
  • Company H1B Sponsorship

  • DataHub has a track record of offering H1B sponsorships, with 3 in 2025, 1 in 2024, 2 in 2021. Please note that this does not guarantee sponsorship for this specific role.
  • More open positions

    [Remote] SAP MDG (Master Data Governance) Business Partner Analyst

    Work from home Full-time role

    [Remote] Administrative Coordinator

    Work from home Full-time role

    [Remote] Sr. Client Partner – Healthcare (Provider, Payer & HealthcareTech)

    Work from home Full-time role

    [Remote] Sr. Client Partner – Financial Services

    Work from home Full-time role

    [Remote] Senior Technical Recruiter

    Work from home Full-time role

    Senior Medicaid Program and Policy Consultant

    Work from home Full-time role

    Healthcare Recruiter- Remote- unlimited opportunity

    Work from home Full-time role

    [Remote] Part-Time Intake Specialist (Personal Injury)

    Work from home Full-time role

    Non-Dispensing Pharmacy Technician - Remote, TN

    Work from home Full-time role

    Payroll Specialist

    Work from home Full-time role

    Compliance and Contracts Administrator

    Work from home Full-time role

    Account Executive, Public Relations (B2B Technology)

    Work from home Full-time role

    Experienced Customer Support Specialist - Virtual/Remote: Join careerzynith's Innovative Team

    Work from home Full-time role

    Senior Regulatory Compliance Analyst

    Work from home Full-time role

    Head of Creative Production, Paid Social & Native

    Work from home Full-time role

    AML Specialist

    Work from home Full-time role

    Behavior Tech - Center and Home Based

    Work from home Full-time role

    SEO Content Editor

    Work from home Full-time role

    [Remote] AI FinOps Engineer

    Work from home Full-time role

    Associate Specialist, Talent Partnership, Enterprise

    Work from home Full-time role

    Architect II

    Work from home Full-time role