Senior Platform Engineer / SRE

full timeengineeringdevopsremote FROM πŸ‡§πŸ‡·
Open to candidates in: Brazil
Jobgether
🏭 Not specified
πŸ“ N/A
πŸ‘€ Not specified

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Platform Engineer / SRE in Brazil.

In this role, you will lead the evolution of scalable and highly automated cloud infrastructure in a fast-paced, engineering-driven environment. You will work on complex platform challenges involving Kubernetes, Infrastructure as Code, GitOps, and reliability engineering, helping shape the long-term architecture of critical production systems. This opportunity is ideal for senior engineers who thrive in solving large-scale operational problems while driving automation, resiliency, and platform self-service capabilities. You will collaborate closely with engineering, security, and leadership teams to define technical standards, improve developer experience, and enhance operational excellence. The position offers strong ownership, architectural influence, and the opportunity to work with modern infrastructure technologies in a fully remote environment aligned with North American time zones.


Accountabilities:

  • Design and evolve Infrastructure as Code architectures using Terraform, including module strategy, state management, and multi-account configurations.
  • Drive GitOps implementation and operational excellence using tools such as ArgoCD, including deployment workflows, progressive delivery, and environment promotion strategies.
  • Architect, manage, and optimize multi-tenant Kubernetes infrastructure on AWS EKS with focus on scalability, isolation, reliability, and performance.
  • Build and maintain self-service infrastructure platforms and automation pipelines that reduce manual operational effort for engineering teams.
  • Develop infrastructure automation standards and leverage AI-assisted or agentic coding tools to improve engineering productivity and scalability.
  • Define and manage reliability engineering practices, including SLOs, error budgets, incident management, and post-mortem processes.
  • Improve observability standards through monitoring, tracing, alerting, and operational runbook best practices.
  • Collaborate with security teams on infrastructure hardening, secrets management, and zero-trust architecture initiatives.
  • Contribute to technical roadmap planning, platform strategy, and engineering prioritization discussions.
  • Mentor engineers through code reviews, technical guidance, design feedback, and operational best practices.
  • Requirements:

    • 6+ years of experience in Platform Engineering, Site Reliability Engineering, DevOps, or infrastructure-focused roles supporting production-scale systems.
    • Deep expertise in Infrastructure as Code and Terraform architecture design, including complex state management and multi-account cloud environments.
    • Strong GitOps background with hands-on experience implementing declarative infrastructure and deployment management practices.
    • Advanced Kubernetes knowledge with proven experience operating production clusters and troubleshooting distributed system failures.
    • Solid AWS expertise covering networking, compute, IAM, storage, and scalable cloud architecture patterns.
    • Experience managing multi-tenant infrastructure environments and implementing workload isolation strategies.
    • Strong automation mindset with the ability to design scalable systems that eliminate repetitive operational tasks.
    • Familiarity with AI-assisted or agentic coding tools for infrastructure and automation workflows.
    • Proven experience implementing reliability engineering principles such as SLOs, incident response, and operational improvement initiatives.
    • Excellent communication and collaboration skills with the ability to explain technical concepts to both engineering and leadership stakeholders.
    • Experience with Karpenter, FinOps practices, data infrastructure, AI/ML workloads, or enterprise-scale startup environments is considered a plus.
    • Benefits:

      • Full-time remote work opportunity based in Brazil.
      • Flexible work environment aligned with EST or PST collaboration hours.
      • Opportunity to work on cutting-edge cloud infrastructure and automation initiatives.
      • High-impact engineering culture with strong ownership and technical autonomy.
      • Exposure to modern technologies including Kubernetes, GitOps, AWS, and AI-assisted infrastructure tooling.
      • Collaborative and innovation-driven environment focused on continuous improvement and scalability.
      • Career growth opportunities through technical leadership, architecture ownership, and mentorship responsibilities.

How Jobgether works: We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best!  Why Apply Through Jobgether?    Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.     #LI-CL1
Jobgether
🏭 Not specified
πŸ“ N/A
πŸ‘€ Not specified