Back to positions

Site Reliability Engineer

Remote role Full-time Open position

Thales Cybersecurity Products is a leader in digital security solutions, providing critical services to businesses and governments. They are seeking a Site Reliability Engineer to ensure high service levels and operational excellence for their Telecommunication solution deployed in the public cloud, focusing on automation, reliability, and incident management.

Responsibilities

  • Design, build, and maintain scalable infrastructure using tools such as Terraform, Ansible, and Kubernetes
  • Develop automated CI/CD pipelines via GitLab to reduce manual toil
  • Define and monitor Service Level Objectives (SLOs) and Service Level Indicators (SLIs)
  • Manage 'Error Budgets' to balance the velocity of new features with the stability of the platform
  • Participate in 24/7 on-call rotations to provide emergency response and perform deep-dive troubleshooting for production issues
  • Conduct system performance analysis, identify bottlenecks, and perform capacity planning to ensure the infrastructure can handle growth and peak loads
  • Implement and refine symptom-based alerting and comprehensive monitoring strategies using platforms like Datadog to ensure high visibility into system health
  • Lead blameless postmortems after incidents to identify root causes and implement long-term technical fixes to prevent recurrence
  • Partner with Cloud Security teams to implement security best practices, manage access controls, and respond to security breaches or vulnerabilities
  • Support customer relationship
  • Interface with other stakeholders to define solution improvement plan
  • You will have the ownership of solution service availability

Skills

  • Engineer or equivalent
  • At least 1 year experience
  • Java development skill is required
  • You are familiar with Public Cloud (GCP, AWS), containers and microservices (Docker, Kubernetes, Java), CI/CD and automation (Jenkins, Gitlab, Helm), NoSQL database
  • Must have U.S. or Dual Citizenship and be able to obtain post-hire clearance from the Committee on Foreign Investments in the U.S. (CFIUS) and Department of Treasury
  • You have already set up product monitoring and the underlying infrastructure
  • You have development experience in a distributed systems and/or high availability context
  • You are familiar with microservices development
  • You participated in the definition of architectures, data structures, algorithms with performance, security, reliability constraints, etc
  • Public cloud architect certification
  • You are interested in aspects of Site Reliability Engineer: CI/CD, automation, monitoring and observability, and continuous improvement
  • You are an accomplished, versatile and multi-tasking developer engineer

Benefits

  • Elective Health, Dental, Vision, FSA/HSA, Voluntary Life and AD&D, Whole Group Life w/LTC, Critical Illness, Hospital Indemnity, Accident Insurance, Legal Plan, Identity Theft, and Pet Insurance
  • Retirement Savings Plan after 30 days of employment with a company contribution and a match, and with no vesting period
  • Company paid holidays and Paid Time Off
  • Company provided Life Insurance, AD&D, Disability, Employee Assistance Plan, and Well-being Program

Company Overview

  • Thales is a global leader in cybersecurity, helping the most trusted organizations in the world protect their most critical applications, data, and identities anywhere at scale. It was founded in 2001, and is headquartered in Austin, Texas, USA, with a workforce of 1001-5000 employees. Its website is https://cpl.thalesgroup.com/.
  • Apply To This Job

    Further positions

    [Remote] Customer Service Rep I

    Remote role Full-time

    [Remote] Customer Service Representative-I (Escalations & Self-Pay) - PFS (Remote)

    Remote role Full-time

    Digital Asset Coordinator

    Remote role Full-time

    [Remote] Customer Support Specialist

    Remote role Full-time

    [Remote] Work from Home Bilingual Collections Account Representative

    Remote role Full-time

    [Remote] Material Damage Specialist

    Remote role Full-time

    Collections Support Specialist (Hybrid)

    Remote role Full-time

    [Remote] Sales Development Representative

    Remote role Full-time

    Development Associate Communications and Engagement

    Remote role Full-time

    Program Support Representative - Customer Service & Data Management

    Remote role Full-time

    Customer Care Representative I - $19 to start plus incentives - Hybrid (Partially Work From Home) - Now Hiring

    Remote role Full-time

    Experienced Data Entry Specialist – Remote Opportunity with arenaflex

    Remote role Full-time

    Incident Responders

    Remote role Full-time

    Experienced Full Stack Customer Support Specialist – Remote Live Chat Support

    Remote role Full-time

    Experienced Customer Support Specialist (Remote) - arenaflex

    Remote role Full-time

    VIRTUAL Hiring Event - Behavioral Health Specialist (Direct Care) - North Los Angeles (Camarillo, Moorpark, & Santa Rosa Valley) Tuesday, 12/2, 10AM-2PM

    Remote role Full-time

    Experienced Remote Data Entry Specialist – Home-Based Opportunity for Detail-Oriented Individuals with Strong Typing Skills

    Remote role Full-time

    Customer Specialist (Project-based)

    Remote role Full-time

    Account Manager, Business Sales - Harlingen, Brownsville

    Remote role Full-time

    Telepharmacist - Part Time/Per Diem (Overnight & Weekend Hours)

    Remote role Full-time