Mid Site Reliability Engineer

Brazil, Remote

Zipdev is looking to add a remote Site Reliability Engineer to its team of LatAm developers! As a Site Reliability Engineer, you will work as an integrated member of product teams to help build, deploy, and reliably monitor cloud services. You will work on complex software development projects to keep important, revenue-critical services up. You will actively develop code and build frameworks that monitor services deployed in production, driving reliability and performance at massive scale.

We're looking for a talented Site Reliability Engineer who can work under minimal supervision, define test procedures, and collaborate closely with Developers, Designers, Customer Support, and Engineering Leadership.

What you will do:

  • Build systems and infrastructure to monitor complex, large-scale distributed systems.
  • Identify stability/performance issues and collaborate with developers to triage critical issues in production systems.
  • Represent the SRE organization in design reviews and operational readiness exercises for new and existing services.
  • Devise ways to actively monitor system throughput, capacity and reliability.
  • Debug complex systems and evolve a running environment without downtime.
  • Engage in service capacity planning and demand forecasting, software performance analysis and system tuning.
  • Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization.

Zipdev
