Senior Service Engineer - CTJ - Poly
Microsoft
Senior Service Engineer - CTJ - Poly
Reston, Virginia, United States
Save
Overview
Microsoft has an exciting opportunity for a Senior Service Engineer to join the Azure Silver and Sovereign Team as part of the Cloud Transfer Service (CTS) team. The Azure Cloud Transfer Service enables secure access and transfer between enclaves and supports other transfer and access types enabling a wide set of capabilities within highly regulated industries. We welcome you to meet the team and learn about the complex challenges you can solve with us!
We are looking for engineers to join a fast-paced team and solve complex problems in the domain of mission-critical distributed systems spanning data transmission across clouds. Our team works across all facets of isolated system engineering but is deeply involved in the following areas: service automation and reliability improvements, systemic latency reduction, data validation and transformation, and throughput optimization. We need you to help us overcome these challenges. In this role, you will have the opportunity to build, deploy and support systems which enable a broad set of Azure services to be consumed by customers in highly secured and regulated industries. The systems you support will be required to meet the security policy and assurance requirements of both public and private sector customers.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Qualifications
Required / Minimum Qualifications
- Bachelor's Degree in Computer Science, Information Technology, Mechanical Engineering, Electrical Engineering, Aerospace Engineering, Data Science, Cybersecurity, or related field AND 3+ years technical experience in software engineering, network engineering, service engineering, systems engineering, or industrial controls OR equivalent experience.
Security Clearance Requirements: Candidates must be able to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
- The successful candidate must have an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph. Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. Failure to maintain or obtain the appropriate U.S. Government clearance and/or customer screening requirements may result in employment action up to and including termination.
- Clearance Verification: This position requires successful verification of the stated security clearance to meet federal government customer requirements. You will be asked to provide clearance verification information prior to an offer of employment.
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
- Citizenship & Citizenship Verification: This position requires verification of U.S. citizenship due to citizenship-based legal restrictions. Specifically, this position supports United States federal, state, and/or local United States government agency customer and is subject to certain citizenship-based restrictions where required or permitted by applicable law. To meet this legal requirement, citizenship will be verified via a valid passport, or other approved documents, or verified US government Clearance
Preferred Qualifications:
- Bachelor's Degree in Computer Science, Information Technology, or related field AND 8+ years technical experience in software engineering, network engineering, service engineering, or systems engineering
- OR equivalent experience.
- 3+ years technical experience working with large-scale cloud or distributed systems
- Expertise in problem solving and analyzing distributed systems and critical production service environments
- Expertise in Ansible, Linux, specifically CentOS 7, Redhat, Mariner or similar in throughput management, troubleshooting and security hardening
Service Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until October 31, 2025
#Silver
Responsibilities
- Develops end-to-end expertise in service and/or system design, interactions between technology layers and components, functions of infrastructure, and dependencies at scale. Takes ownership of service design by driving efforts within an organization to identify, define, recommend, and build optimal configurations of technology solutions with considerations for cost management. Independently adjusts configurations and defines infrastructures to improve the availability, reliability, efficiency, observability, and/or performance of supported products and services. Drives reviews with the engineering teams that develop and/or manage services, identifying opportunities for efficiencies in operations and sharing learnings and recommendations across engineering teams working on related services within their organization.
• Stays current in knowledge and expertise as technology landscape evolves, maintaining awareness of industry norms. Uses knowledge to drive the adoption of new solutions across engineering teams working with related products within an organization. Provides guidance to others through sharing, coaching, conferences, and other means to drive improvements across teams.
Operational Excellence
- Independently implements reliable, scalable, and high-performance solutions across teams. Contributes to design documents. Owns implementation and rollback plans. Maintains quality checklist and related documentation.
- Leverages end-to-end technical expertise and telemetry analysis to identify patterns and opportunities to implement configuration and data changes for related sets of platforms, systems, or products in production using code, tooling, and automation; identifies cases where teams lack the tools and/or capability to manage platforms, systems, or products using code and drives efforts within an organization to expand capabilities and/or tooling accordingly.
- Creates, monitors, and acts on telemetry data and influences telemetry analytics to better identify patterns that reveal errors and unexpected problems that are affecting the system’s availability, reliability, performance, and/or efficiency. Develops scripts and/or automation and leverages an understanding of solutions to define, develop, measure, track, change, and improve the quality of telemetry pipelines that support automated monitoring and incident response.
- Responds to incidents during regular on-call rotations, including complex issues with major customer or business impact, by identifying the level of impact, troubleshooting, contributing to difficult decisions based on business impact, deploying appropriate fixes to resolve root cause(s), and implementing automations for prevention of recurring issues through coordinating resources required for incident resolution, which may include product teams, owners, leadership, other engineering teams, and/or subject matter experts. Escalates resolution of highly complex, ambiguous, and impactful issues as needed. Contributes to postmortems and shares details related to incidents and their resolution through post-mortem reports and regular review meetings. Provides expert incident response assistance to other Service Engineers as needed and develops incident response and resolution guidance.
- Adheres to prescriptive guidance for security, privacy, and compliance standards in alignment with direction from the business and technical experts. Works with security, privacy, and compliance teams to identify and address issues relevant to their services. Identifies patterns of violations and implements automations for prevention. Aids other Service Engineers as needed.
Collaboration and Knowledge Sharing
- Collaborates within and across teams by proactively and systematically sharing information with an appropriate level of detail for their audience. Overcomes obstacles by resolving conflicts and issues across interdependent teams and engages with partners and stakeholders so issues can be resolved and mutual objectives are met. Develops, leverages, and drives sharing of information and knowledge base (e.g., customer, product, industry, troubleshooting guides) across teams.