About Guidewire
At Guidewire, we deliver the software that Property and Casualty (P&C) insurance companies rely on to protect their customers during crises, natural disasters, accidents, and cyber risks. Our core applications enable insurers to sell and underwrite policies, settle claims, and bill their customers. We also offer a suite of innovative products for data management, digital portals, and predictive analytics. Hundreds of insurers worldwide use Guidewire's products, running on our cutting-edge Guidewire Cloud Platform, to handle billions of dollars in business. We are dedicated to providing the tools and technology that help insurers protect and support their customers when they need it most.
The Opportunity
We are seeking a Site Reliability Engineer III who is eager to contribute to the transformation of the insurance industry with our leading cloud platform. As a member of the SRE-Application team, you'll play a critical role in ensuring the reliability, performance, and scalability of applications running on our Guidewire Cloud Platform. This position offers a unique opportunity to apply your skills in automation, software engineering, and operational discipline to support our cloud-based solutions.
What you'll do
- Assist in troubleshooting and resolving issues in collaboration with development teams, reducing customer impact.
- Develop and maintain automated runbooks to address common issues proactively.
- Apply engineering principles and basic automation to enhance our operating environments.
- Monitor applications and help improve their reliability and performance on the Guidewire Cloud Platform.
- Use your software engineering skills to optimize systems and reduce manual tasks.
- Document incidents and assist in refining processes to prevent future occurrences.
- Stay informed about industry trends, tools, and best practices in site reliability engineering.
- Contribute to a culture of innovation, learning, and continuous improvement.
- Participate in on-call rotations to ensure the availability and reliability of our services.
What you'll bring:
- Strong interest in pursuing a career as an SRE or similar role, focusing on improving system reliability
- Eagerness to learn and develop problem-solving skills to assist in analyzing complex systems and devising effective solutions
- Ability to collaborate and communicate effectively with team members and other stakeholders
- Familiarity with or desire to learn automation, monitoring, and performance optimization tools and techniques
- Commitment to maximizing uptime, scalability, and delivering an exceptional end-user experience
- Passion for technology and a strong desire to continuously learn and grow your skills
- Alignment with Guidewire's mission to leverage technology to help protect and support others
Required skills:
- Enrolled in or recently graduated from a Bachelor's or Master's degree program in Computer Science, Engineering, or a related field
- Interest in learning about SRE and DevOps practices, such as monitoring, automation, and infrastructure management
- Familiarity with basic concepts of cloud computing, particularly AWS
- Basic understanding of Linux system administration
- Ability to program/script using Python, Go, Java, shell, or equivalent
- Strong problem-solving skills and ability to learn quickly
- Excellent communication and collaboration skills
Preferred skills:
- Coursework or projects related to distributed systems, cloud computing, or infrastructure management
- Familiarity with version control systems such as Git
- Exposure to containerization technologies such as Docker or Kubernetes
- Familiarity with infrastructure as code (IaC) concepts and tools such as Terraform
- Basic understanding of networking concepts
- Familiarity with agile development methodologies
Why Guidewire
This is an opportunity to join a mission-driven company and make a real impact in the lives of people facing challenges. You'll work with cutting-edge technology, collaborate with talented peers, and grow your skills in a culture that values innovation, teamwork, and work-life balance. We offer competitive compensation, comprehensive benefits, and opportunities for career development.
If you're a Senior SRE who combines deep technical expertise with a passion for problem-solving and a commitment to reliability, we'd love to hear from you. Join us in building the software that helps insurers care for their customers when they need it most.
This position requires participation in mandatory on-call rotations to ensure the availability and reliability of our services. This includes responding to incidents and alerts outside of regular business hours, on weekends, and during holidays, as per the established on-call schedule. Candidates must be willing and able to fulfill this critical responsibility.