
Site Reliability Engineer II - CTJ - Top Secret

Microsoft
United States, Nevada, Reno
6840 Sierra Center Parkway
Sep 05, 2025
Overview

Are you interested in working on cutting-edge cloud security products? Would you like to be part of one of the world's most advanced cyber-security solutions and protect millions of computers from thousands of active attack attempts every month? Look no further than the Microsoft Defender engineering team. We are looking for a Site Reliability Engineer II who will build and deliver cloud solutions at a scale that few companies in the industry are required to support. Leveraging state-of-the-art technologies, you will be instrumental in delivering holistic protection within highly sensitive and secure government environments. The Microsoft Defender team is responsible for delivering a constantly evolving set of services and solutions to keep pace with a challenging landscape of ever-evolving attackers. This team provides on-call operational support for the Microsoft Defender products within US Government clouds and improves their operational posture. You will operate our production services and work closely with other engineering teams to ensure that services and systems are highly stable, meet performance SLAs, and meet the expectations of internal and external customers and users.
Responsibilities

Live Site Operations: Serve as a Designated Responsible Individual (DRI) in a 24x7 on-call rotation, monitoring service health and responding to incidents within SLA timelines.
Automation & Deployment: Contribute to automation efforts and validate code functionality in non-production environments to ensure smooth deployments.
Compliance & Security: Support compliance processes by verifying security, privacy, and accessibility standards during onboarding of new technologies.
Continuous Learning: Stay current with industry trends and internal tools to improve reliability, performance, and observability at scale.
Engineering Best Practices: Apply proven development and scaling practices to meet performance and customer requirements.
Cross-Team Collaboration: Communicate effectively with engineering partners to align on goals and deliver user-centric solutions.
Incident Response & Postmortems: Address complex live site issues, implement mitigations, and document learnings through postmortems.