Software Engineer
Microsoft
Software Engineer
Prague (CEE HQ), Prague, Czech Republic
Save
Overview
Security represents the most critical priorities for our customers in a world awash in digital threats, regulatory scrutiny, and estate complexity. Microsoft Security aspires to make the world a safer place for all. We want to reshape security and empower every user, customer, and developer with a security cloud that protects them with end to end, simplified solutions. The Microsoft Security organization accelerates Microsoft’s mission and bold ambitions to ensure that our company and industry is securing digital technology platforms, devices, and clouds in our customers’ heterogeneous environments, as well as ensuring the security of our own internal estate. Our culture is centered on embracing a growth mindset, a theme of inspiring excellence, and encouraging teams and leaders to bring their best each day. In doing so, we create life-changing innovations that impact billions of lives around the world.
The PALM Team (Proactive Alert Lifecycle Management) is part of Microsoft Security’s Identity division, focused on reducing time to detect (TTD) and time to notify (TTN) for service incidents across critical identity platforms. We build intelligent systems that proactively detect, triage, and communicate outages—empowering service owners and incident managers to resolve issues faster and more transparently. Our team leads the development of tools that streamlines incident investigation and response. We leverage machine learning to help on call engineers for better incident mitigation. We are deeply embedded in Microsoft’s AI transformation, applying large language models to simplify incident workflows and reduce cognitive load during live site events. Our work spans across public, sovereign, and government clouds, ensuring resiliency and compliance in diverse environments. PALM engineers operate at the intersection of reliability, security, and customer trust. We collaborate across disciplines to deliver scalable, resilient, and secure solutions that protect billions of users. Our culture values curiosity, ownership, and continuous improvement—whether we’re refining disaster recovery drills, enhancing telemetry, or designing the next generation of incident response tooling. If you’re passionate about building systems that make security simpler and more effective, and want to work on high-impact problems that shape the future of identity and reliability at Microsoft, PALM is the place for you.
Qualifications
Required Qualifications
- Bachelor's Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Master's Degree in Computer Science or related technical field AND 1+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Other Requirements
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft background and Microsoft Cloud background check upon hire/transfer and every two years thereafter.
#EIP
Responsibilities
We are looking for passionate people with experiences working with all service aspects of high throughput and multi-tenant services, ability to design components carefully, properly handle errors, write clean and well-factored code with good tests and good maintainability.
Responsibilities include:
- Design and implement scalable backend services and automation workflows.
- Build and maintain AI-integrated tools.
- Own end-to-end development of features, ensuring high reliability and customer transparency.
- Collaborate across engineering, product, and incident management teams to drive improvements in service health, observability, and operational readiness.
- Participate in on-call rotations and live site investigations, applying structured debugging and telemetry analysis to resolve issues efficiently.