> HackerTyper Jobs

Site Reliability Engineer

United States, Remote

***This role is 100% Remote***

As a part of a reliability-focused team, you will help exemplify, measure and raise the reliability of products and operational capabilities by creating tools, collaborating with teams, evangelizing best practices, and encouraging learning across the organization.

You'll work closely with engineers across many disciplines to advocate for sensible, scalable, systems design and share responsibility with them in diagnosing, resolving, and preventing production issues - we believe strongly that engineering teams take operational responsibility for their products and work hard to support them in this.

Some things you will do:

  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Work closely with product squads to assist them in adopting and improving their use of existing tools and practices such as SLOs, alerting, runbooks, synthetic testing, and general observability.
  • Monitor user-facing systems using best practices, with reliability and scalability in mind.
  • Work with engineering teams to debug and fix issues.
  • Be an evangelist for SRE best practices throughout the product and engineering organization.
  • Lead and participate in performance tests; identify bottlenecks, opportunities for optimization, and capacity demands.
  • Participate in an on-call rotation alongside the engineers who build our products.