Software Engineer II - XStore on Pilotfish
Microsoft
Software Engineer II - XStore on Pilotfish
Multiple Locations, United States
Save
Overview
We are seeking a Software Engineer II - XStore on Pilotfish to join our team working on the XPF (XStore on Pilotfish) platform.
This role is ideal for individuals who are passionate about building reliable, diagnosable, and scalable systems. As a developer in the team, you will play a key role in ensuring the operational readiness and long-term sustainability of existing platform and future projects.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Qualifications
- Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- 2+ years of experience with data structure, distributed systems and cloud computing
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
- Bachelor's Degree in Computer Science, or related technical field and 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
Product Management IC3-The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until September 13, 2025.
#azurecorejobs
Responsibilities
- Works with appropriate stakeholders to determine user requirements for a set of features. Investigate and resolve complex issues across hardware, firmware, and software layers. Drive root cause analysis and implement durable fixes to improve system reliability.
- Design and develop tools that reduce manual interventions in diagnostics, repair workflows, and node lifecycle management. Focus on automation to streamline operations. Contributes to the identification of dependencies, and the development of design documents for a product area with little oversight. Creates and implements code for a product, service, or feature, reusing code as applicable. Contributes to efforts to break down larger work items into smaller work items and provides estimation.
- Build and enhance diagnostics workflows to detect and isolate hardware and software faults. Ensure comprehensive diagnostics coverage for new hardware Stock Keeping Unit (SKUs) and evolving platform requirements.
- Identify and close operational and security gaps in the platform. Ensure systems are resilient, secure, and compliant with internal standards by working with partner teams as needed.
- Collaborate with engineering and support teams to ensure Storage’s requirements are met across all phases of the platform lifecycle. Drive alignment on diagnostics, telemetry, and repair strategies.
- Acts as a Designated Responsible Individual (DRI) working on-call to monitor system/product feature/service for degradation, downtime, or interruptions and gains approval to restore system/product/service for simple problems.
- Remains current in skills by investing time and effort into staying abreast of current developments that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations.