Senior Software Engineer (Open‑Source AI Platforms)
Microsoft
Build the future of cloud‑native AI with us!
We’re looking for a Senior Engineer to design and build open‑source, cloud‑native AI platforms that power large‑scale training, fine‑tuning, inference, and agentic workloads.
In this role, you’ll create Kubernetes‑native abstractions and operators that make advanced AI workloads reliable, scalable, and easy for developers to use across cloud and hybrid environments. You’ll work at the intersection of infrastructure and AI—turning complex systems into elegant, production‑grade platforms.
You’ll also help lead and contribute to upstream open‑source communities while shaping AI platforms used by internal teams and external customers at global scale. This is a hands‑on role for engineers who enjoy solving hard problems, influencing architecture, and collaborabation.
At Microsoft, we build with a growth mindset and a deep sense of purpose—empowering people and organizations worldwide through technology, while fostering a culture of inclusion, trust, and technical excellence.
Responsibilities
- Design, implement, and maintain Kubernetes operators and controllers for AI/ML workloads
- Partner with product managers, business stakeholders, and users to understand user pain points deeply and create innovative solutions that delight your customers in an agile development environment.
- Contribute to applicable upstream open-source projects
- Write technical design documents and participate in architecture reviews
- Mentor team members and external contributors through code reviews
- Debug and optimize distributed AI systems running at scale
- Strive for excellence in everything you do: culture, collaboration, process, tools, design, engineering practices, customer experience, performance, security etc.
Qualifications
Required Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, or Python
- OR equivalent experience.
- Master's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, or Python
- Hands‑on experience building or operating AI/ML training, fine-tuning, and inference platforms in cloud‑native environments.
- Proficiency with Go and/or Python for building platform components, Kubernetes operators/controllers, and integrations in production environments.
- Demonstrated experience contributing to or maintaining open‑source software, especially in the Kubernetes, AI/ML, or cloud‑native ecosystem.
Other Requirements:
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
#azurecorejobs
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.