UWorld is a worldwide leader in online test prep for college entrance, undergraduate, graduate, and professional licensing exams throughout the United States. Since 2003, over 2 million students have trusted us to help them prepare for high-stakes examinations.
We are seeking a highly skilled and experienced Site Reliability Engineer - Azure to join our dynamic team. As a site reliability engineer, you will play a critical role in designing, implementing, and maintaining the reliability, availability, and performance of our Azure-based infrastructure. You will be responsible for ensuring the smooth operation of our cloud-based systems and resolving any incidents or issues that may arise.
Responsibilities:
- Design, deploy, and manage Azure infrastructure components, including virtual networks, subnets, load balancers, and application gateways.
- Implement and maintain monitoring systems for Azure resources, utilizing tools such as New Relic, Log Analytics Workspace, and other relevant monitoring solutions.
- Perform incident management and troubleshooting, identifying and resolving issues to ensure maximum uptime and optimal performance.
- Configure and manage networking components, including VPN gateways, DNS, and ExpressRoute connections.
- Collaborate with cross-functional teams to implement security best practices and ensure the protection of our Azure resources.
- Perform database administration tasks for both SQL and NoSQL databases, including performance tuning, backup and recovery, and data replication.
- Collaborate with developers and other stakeholders to ensure efficient deployment and scaling of applications on Azure.
- Stay up-to-date with the latest Azure features, services, and best practices and evaluate their potential impact on our infrastructure and operations.
- Document processes, procedures, and configurations to ensure knowledge transfer and maintain a reliable knowledge base.
Requirements:
- Bachelor's degree in computer science, information technology, or a related field (or equivalent work experience).
- Minimum of 5 years of hands-on experience in Azure infrastructure management and administration.
- Experince in managing Azure services, including IIS, Front Door, application gateways, virtual machine management (including snapshots) and Apache servers administration.
- Proficiency in incident management and troubleshooting in Azure environments.
- Experience with load balancers and network configurations in Azure.
- Experience in Databricks Administration
- Experience in Linux and Windows Servers Administration
- Familiarity with monitoring tools such as New Relic, Log Analytics Workspace, and other relevant monitoring solutions.
- Experience in network administration and deep understanding of networking concepts and protocols, including VPNs, DNS, ExpressRoute, and virtual networks.
- Strong understanding of cloud security principles and best practices for securing Azure resources.
- Experience in managing and optimizing both SQL and NoSQL databases (MSSQL, MySQL, and MongoDB) in Azure.
- Excellent problem-solving and analytical skills, with the ability to quickly diagnose and resolve complex technical issues.
- Strong communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams.
Preferred Qualifications:
- Azure certifications, such as Azure Solutions Architect Expert or Azure Administrator Associate.
- Experience with automation and configuration management tools such as Azure DevOps, Terraform, or PowerShell.
Soft Skills:
- Working proficiency and communication skills in verbal and written English
- Excellent attention to detail and organization skills and ability to articulate ideas clearly and concisely
- Ability to work effectively within a changing environment that is going through high growth
- Exceptional follow-through, personal drive, and ability to understand direction and feedback