Research Scientist II
Microsoft
Research Scientist II
Burlington, Massachusetts, United States
Save
Overview
Microsoft’s Health and Life Sciences team is dedicated to empowering healthcare organizations to achieve their goals and improve patient care. The HLS Platform team aims to create an efficient and connected healthcare ecosystem built on the Microsoft Cloud, empowering everyone across the healthcare journey to collaborate, communicate, and innovate together to provide better experiences for clinicians, staff, and patients.
Our team has an exciting opportunity for a research engineer working on voice-driven AI capabilities of Dragon Copilot. You will be working on taking our users on a journey that connects present-day speech recognition functionality in dictation and ambient conversation to the future of agentic speech-to-speech experiences. You will support the growth of Dragon Copilot to markets across the globe. In doing so, you apply your background in speech technology and software engineering to engage in all phases of the development lifecycle, taking new features from conception to production and playing an active role in delivery and live site maintenance. This role provides the opportunity to follow the latest developments in academia and the industry alike. You will identify innovative developments that optimize user experience and keep system complexity in check, enabling our scaling to millions of healthcare professionals and dozens of markets.
We are looking for a Research Scientist II to join our team.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Qualifications
Minimum Qualifications:
- Master's Degree in relevant field AND 1+ year(s) related-research experience
- OR equivalent experience.
- 2+ years experience developing and maintaining code running in production cloud systems.
- Close familiarity with speech technology and its applications.
Other Requirements:
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to, the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- Doctorate in relevant field
- OR equivalent experience.
- Experience participating in a top conference in relevant research domain.
- Experience developing research code for the cloud system.
Research Sciences IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: US corporate pay information | Microsoft Careers
Microsoft will accept applications for the role until October 23rd, 2025.
#Health&LifeScience #hlsdp #hls
Responsibilities
Responsibilities include:
- Develop the next-generation speech-to-text stack driving Dragon Copilot user experience across different modalities (dictation, conversation, NLI).
- Research into speech-to-speech experiences and build technologies that disrupt the current UX, leveraging state-of-the-art foundation models.
- Establish collaborative relationships with relevant product or business groups (engineering, product management) and provide expertise to them.
- Contribute to smooth operation of low-level technology services, exposing the technology you developed to other groups.
- Identify pathways towards modernization of existing systems, ensuring a seamless transition towards an agentic speech-to-speech experience.
- Develop best practices in scaling our systems to dozens of markets, playing an active role in building out the deployments.
- Embody our culture and values.