Software Engineer II - Azure Kubernetes Fleet Manager
Microsoft
Software Engineer II - Azure Kubernetes Fleet Manager
Multiple Locations, United States
Save
Overview
The Azure Kubernetes Fleet Manager (https://azure.microsoft.com/en-us/products/kubernetes-fleet-manager/) team is creating a world-class container management and orchestration services for the cloud and beyond. We are the team working on multi-cluster/multi-cloud solutions behind the CNCF (Cloud Native Computing Foundation) project. Our charter is to define the next generation of cloud-native infrastructure for customers to manage their fleet of Kubernetes clusters and the applications running on top of it.
We are looking for a Software Engineer II - Azure Kubernetes Fleet Manager who is excited about container orchestration with Kubernetes, and cloud native eco-systems overall. As a Software Engineer II, you will create and implements code for both the Azure Kubernetes Fleet Manager and the open-sourced solutions, take the ownership as applicable. Writes and learns to create code that is correct, extensible, and maintainable. Considers diagnosability, reliability, and maintainability with few defects, and understands when the code is ready to be shared and delivered. Creates a clear and articulated plan for testing and assuring quality of solutions, and defines success for outcomes of tests (e.g., unit tests). Helps to drive efforts for augmenting test cases and ensures that the solution area has good test coverage.
You will also support efforts to apply debugging tools and examines logs, telemetry, and other methods to verify assumptions proactively before issues occur and reactively as issues occur for product features. Maintains operations of live service as issues arise on a rotational, on-call basis. Identifies solutions and mitigations issues when applicable impacting performance or functionality of Live Site services and escalates, as necessary.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Qualifications
Required Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, Go, Java, or C++
- OR equivalent experience.
- 2+ years of experience developing Kubernetes or CNCF projects
- 2+ years of experience running large scale online services (global)
Other Requirements:
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, Go, Java, or C++
- OR Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, Go, Java, or C++
- OR equivalent experience.
- 1+ year(s) experience working within a globally distributed team.
#azurecorejobs
Responsibilities
- Works with appropriate stakeholders to determine user requirements for a set of features.
- Contributes to the identification of dependencies, and the development of design documents for a product area with little oversight.
- Creates and implements code for a product, service, or feature, reusing code as applicable.
- Contributes to efforts to break down larger work items into smaller work items and provides estimation.
- Acts as a Designated Responsible Individual (DRI) working on-call to monitor system/product feature/service for degradation, downtime, or interruptions and gains approval to restore system/product/service for simple problems.
- Remains current in skills by investing time and effort into staying abreast of current developments that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.