Senior Software Engineer
Microsoft
Bangalore, Karnataka, India
Overview
The Foundry Agent Platform organization within Azure AI Platform is at the forefront of building the next generation of AI-driven solutions that power Microsoft’s most innovative products and services. Our work underpins key Azure AI offerings, enabling intelligent, scalable, and reliable AI experiences across the Microsoft ecosystem.
Within Foundry Agent Platform, our team develops advanced tools and services to build and host self-improving agents on AKS. We are seeking a Senior Software Engineer to join the Agent Foundry Infrastructure team and help build a best-in-class agent platform.
The infrastructure team in Agents Foundry is looking for a senior software engineer who loves to build scalable, highly available, and secure microservices that run in Kubernetes. The team manages a large fleet of Azure Kubernetes Service (AKS) clusters that run the control and data plane services for AI Platform.
Qualifications
Required Qualifications:
- Deep experience in K8s platform design and implementation at scale.
- Bachelor’s degree in computer science.
- 5+ years of coding experience in at least one of C#, Python, Go, Rust, Java, C, or C++.
- Experience improving service operations or engineering fundamentals.
- Excellent collaboration skills; a team player who strives to make a difference.
- Excellent architecture and design skills.
Preferred Qualifications:
- Understanding of the observability stack in the K8s ecosystem.
- Live-site experience debugging and analyzing microservices.
Responsibilities
The team focuses on:
- Managing Kubernetes cluster deployments at scale.
- Securing control and data plane assets against malicious attacks and unauthorized access using industry-standard tools and frameworks.
- Automating monitors and critical alerts using best-in-class observability tools such as Azure Monitor, Prometheus, Azure Data Explorer, and Grafana.
- Automating CI/CD deployments using YAML-defined builds and releases.
We build tools that increase the observability of applications running in our Kubernetes clusters, improve the speed, security, and reliability of our deployments, secure our supply chain and services, and make production debugging easier. We use the best of open source, such as Prometheus, Grafana, KEDA, and Istio, and build solutions that enable AI Platform to deliver a global service handling large-scale ML training and inference workloads.