My job alerts

Senior Machine Learning Engineer - Action Models and Generative AI - Microsoft Research AI Frontiers

Microsoft

Software Engineering, Data Science

USD 119,800-234,700 / year

Posted on Sep 16, 2025

Apply now

Senior Machine Learning Engineer – Action Models and Generative AI – Microsoft Research AI Frontiers

New York City, New York, United States

Save

Share job

Date posted

Sep 15, 2025

Job number

1877898

Work site

3 days / week in-office

Travel

None

Role type

Individual Contributor

Profession

Software Engineering

Discipline

Software Engineering

Employment type

Full-Time

Overview

Microsoft Research AI Frontiers lab is seeking applications for the position of Senior Machine Learning Engineer – Action Models and Generative AI to join their team in New York, NY.

The mission of the AI Frontiers lab is to expand the pareto frontier of Artificial Intelligence (AI) capabilities, efficiency, and safety through innovations in foundation models, generative AI, and learning agent platforms. Some of our projects include work on language models (e.g., Phi, Orca), action models (e.g., for racing, gaming, and office productivity), multi-agent systems (e.g. AutoGen), and evaluation (e.g., Eureka).

We are seeking a Senior Machine Learning Engineer to join our team and advance the state-of-the-art in foundation models and Generative AI for actions and behavior. Our domains of interest include: web/OS agents, gaming agents, and real-time control for racing. Our ongoing research areas include but are not limited to:

Novel architectures and representations for efficient and generalizable learning of action models
Multi-scale temporal reasoning and planning in transformer models
Pre-training methods for video generation and world modeling
Continual learning and adaptation at human speed
Specializing models to capture individual human behavior and style
Steering models towards specific behaviors via natural language instruction

As a Senior Machine Learning Engineer on our team, you will work hands-on on challenging and impactful projects. You will apply your engineering skills and expertise to a range of problems including rapid prototyping and testing of research hypotheses, contributing to, releasing, and maintaining high-quality open-source models and frameworks, and building and deploying robust practical solutions to real world problems. Additionally, you will collaborate closely with other researchers, engineers, and product teams across the company to amplify your impact and grow your career in a supportive and stimulating environment.

Microsoft Research offers a vibrant environment for cutting-edge, multidisciplinary research, including access to diverse, real-world problems and data, opportunities for experimentation and real-world impact, an open publication policy, and close links to top academic institutions around the world.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

In alignment with our Microsoft values, we are committed to cultivating an inclusive work environment for all employees to positively impact our culture every day.

Qualifications

Required Qualifications

Bachelors in Computer Science or relevant field AND 4+ years technical engineering experience with coding in languages including, but not limited to, Python, C, C++, C#, or Java
- OR equivalent experience.
4+ years of industry or academic experience in applying, implementing, and/or developing algorithms for machine learning, artificial intelligence, or statistics.
2+ years of industry or academic experience with common ML engineering programming languages and platforms such as Python, Python numerical libraries, PyTorch, TensorFlow and/or Hugging Face.
2+ years of industry or academic experience with building, debugging and optimizing large, high performance ML training pipelines.

Preferred Qualifications

Masters in Computer Science or relevant field AND 6+ years technical engineering experience with coding in languages including, but not limited to, Python, C, C++, C#, or Java
- OR Doctorate (or currently pursuing) in Computer Science or relevant field AND 2+ years technical engineering experience as above
- OR equivalent experience.
Demonstrated software engineering excellence building and deploying prototypes, applications, or open-source (OSS) technologies. Providing a link to a GitHub profile (if available) and/or code samples, is highly encouraged.
Deep understanding of deep learning model architectures, especially transformers and state space models.
Hands-on experience building and working with large foundation models such as Large Language Models (LLMs), Large action models (LAMs), or multimodal models, including pre-training, fine-tuning, and inference.
Experience using and supporting ML infrastructure including job schedulers, big data storage, low latency interconnects, and large clusters of Graphics Processing Units (GPUs).
Experience with high-frequency simulation environments such as for gaming or racing.
Experience publishing academic papers at relevant conferences and journals (such as NeurIPS, ICML, ICLR, AAAI, ACL).
Ability to work independently and ramp-up quickly on complex, unfamiliar code.
Ability to collaborate, communicate effectively, and work as part of a multi-disciplinary team.
Keen interest in real-world applications and impact, and/or experience working in a commercial software product team.

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft will accept applications for the role until September 29, 2025.

#Research

Responsibilities

Apply Machine Learning (ML) engineering skills to build, test, and deploy robust and scalable AI-based technologies, libraries, and applications.
Work closely with researchers and engineers to rapidly prototype and test new AI research ideas, driving a high-impact agenda and publishing results where appropriate.
Collaborate hands-on with other researchers, engineers, and internal and external product group partners on high-impact projects that deliver real-world impact to people and society.
Embody our culture and values.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

Industry leading healthcare

Educational resources

Discounts on products and services

Savings and investments

Maternity and paternity leave

Generous time away

Giving programs

Opportunities to network and connect

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Apply now

See more open positions at Microsoft

Connecting people I'd hire with companies I'd work at