Principal Software Engineer
Microsoft
Principal Software Engineer
Mountain View, California, United States
Save
Overview
The Artificial Intelligence (AI) Frameworks team at Microsoft develops the AI software used to train and deploy the world’s most advanced AI models. We collaborate with our hardware teams and partners to build the software stacks for Microsoft’s next-generation supercomputers and the Maia AI accelerators. We work closely with ML researchers and developers to optimize and scale out model training and inference. We work with OpenAI on the models hosted on the Azure OpenAI service.
The team operates at the intersection of AI algorithmic innovation, purpose-built AI hardware, systems, and software. We are a cross-discipline team of highly capable and motivated people with a collaborative and inclusive culture and with a shared mission of supporting and driving our AI future.
As a member of this team, you will have the opportunity to work on developing and evaluating core algorithmic and hardware technologies to enable large scale inferencing and training of the most advanced AI models on novel AI hardware.
This is a technical role: it requires hands-on software design and development skills. We’re looking for someone who has a demonstrated history of solving hard technical problems and is motivated to learn new areas and tackle the hardest problems in building efficient AI systems. An entrepreneurial approach and ability to take initiative and move fast are essential.
#AIFX
Qualifications
Required Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- 4+ years’ practical experience related to machine learning, including modeling, application, or system/kernel optimization.
Preferred Qualifications:
- Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- B.S. or advanced degree in computer engineering, computer science, or related fields, and 10+ years of software development experience
- Experience in training or serving Deep Neural Network models. Experience with Language Models and ML system optimization are a plus.
- A strong technical background and solid foundation in algorithms, software engineering principles, and computer systems.
- Great cross-team collaboration skills and the desire to collaborate in a team of researchers and developers.
- Strong communication skills (both written and oral)
Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year. Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
• Single reqs: Microsoft will accept applications for the role until Month Day, Year.
• Multi reqs: Microsoft will accept applications and processes offers for these roles on an ongoing basis.
#CoreAI
#AIFX
Responsibilities
- Collaborate broadly with ML researchers, system engineers, and production engineers.
- Engage with key partners to understand and evaluate performance and quality for state-of-the-art LLMs at different scales.
- Build software tools to support validation and exploration of LLM optimization technologies.
- Perform software development in model scripting and/or kernel languages, such as Python, C/C++, CUDA.
- Identify requirements, scope solutions, estimate work, schedule deliverables.