SRE

full timedevopsengineeringremote FROM 🇦🇷 🇧🇷 🇨🇴 🇨🇷 🇵🇪
Open to candidates in: Argentina, Brazil, Colombia, Costa Rica, Peru
Resilient Co
🏭 Not specified
📍 N/A
👤 Not specified

Summary

The Purple Platform Engineer – SRE is a hybrid engineering role combining Site Reliability Engineering excellence, cloudnative software engineering expertise, and deep knowledge of our internal Purple Platform, HealthEquity’s cloudnative application delivery ecosystem.

Responsibilities

  • You will design, build, and operate highly reliable systems while enabling product teams to selfserve, deploy, and operate applications securely and efficiently—aligned with the platform’s core tenets: GitOps integration, and cloudnative operational excellence.
  • This role requires an engineer who thrives in modern DevOps environments, understands distributed systems deeply, writes high-quality code, and can translate platform guardrails and policies into a world class developer experience.

Requirements

  • CloudNative Ecosystem
  • Strong Kubernetes expertise (workloads, scaling, networking, operators, CRDs).
  • Advanced containerization practices (Docker multi-stage, security hardening).
  • Hands On Experience implementing service mesh (ISTIO) and API gateways
  • Infrastructure-as-Code (Terraform).
  • Understanding and ability to configure and troubleshoot MongoDB collections,
  • Redis Cache, Azure Service Bus, Azure Document Storage etc.

Software Engineering Core

  • Strong background in C# , Python and/or Node.js.
  • Ability to build highly reliable distributed applications and automation tools.
  • Building CI/CD pipelines.
  • Experience with AI Assisted development to improve quality and productivity

GitOps & Platform Delivery

  • Deep understanding of declarative deployment workflows (Argo CD, Flux).
  • Expertise in Helm, Kustomize, deployment manifests, and environment modeling.
  • Experience integrating automated tests, scans, and policy controls into Git workflows—supporting the platform’s “shift-left feedback and shift-right enforcement” model.

Observability & Monitoring

  • Strong experience with configuring and using Dynatrace for observability, setting up OpenTelemetry integrations, App Insights
  • Competence using Kusto (KQL), analyzing logs, distributed traces, and performance metrics.
  • Incident response leadership, postmortem writing, error budget management.

Security & Governance

  • Familiarity with container scanning, supply chain security, SBOM tools.
  • Experience applying and troubleshooting policies for security and using secure
  • secret management (Vault/KMS).
  • Configuring and implementing Managed Identities for secure authentication
  • Understanding of compliance frameworks relevant to healthcare systems.

Developer Tooling & Automation

  • Building internal tools, CLIs, templates, plug-ins that improve velocity.
  • Knowledge of Backstage or internal developer portals is a plus.
  • Strong scripting skills (Bash, PowerShell, Python, Go utilities).

Preferred Qualifications

  • 3+ Years Experience in large-scale, enterprise-grade cloud native platforms.
  • Previous work in SRE, Platform Engineering, DevOps, or Production Engineering roles.
  • Experience with self-service portals and cloud resource orchestration.
  • Familiarity with classification-driven policy models and governance automation.

Selection Process

  1. Screening with Resilient Co. team. 
  2. First technical interview.
  3. Client interview.
  4. Manager interview.
Resilient Co
🏭 Not specified
📍 N/A
👤 Not specified