Microsoft Azure AI Inference platform is the next generation cloud business positioned to address the growing AI market. We are on the verge of an AI revolution and have a tremendous opportunity to empower our partners and customers to harness the full power of AI responsibly. We offer a fully managed AI Inference platform to accelerate the research, development, and operations of AI powered intelligent solutions at scale. This team owns the hosting, optimiz ation , and scaling the inference stack for all the Azure AI Foundary models including the latest and greatest from OpenAI, Grok, DeepSeek , and other OSS models .
Do you want to join a team entrusted with serving all internal and external ML workloads, solve real world inference problems for state-of-the-art l arge l anguage (LLM) and multi-modal Gen AI models from OpenAI and other model providers ? We are already serving billions of inferences per day on the most cutting-edge AI scenarios across the industry . You will be joining the CoreAI Inferencing team , influencing the overall product, driving new features and platform capabilities from preview to General Availability, and many exciting problems on the intersection of AI and C loud.
We’re looking for a Principal Software Engineer - Azure AI Inferencing to drive the design, optimization, and scaling of our inference systems. In this role, you’ll lead engineering efforts to ensure our largest models run with exceptional efficiency in high-throughput, low-latency environments. You will get to work on and influence multiple levels of the AI Inference data plane s tack.
We do not just value differences or different perspectives. We seek them out and invite them in so we can tap into the collective power of everyone in the company. As a result, our customers are better served .
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Required/Minimum Qualifications
Other Requirements:
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
Preferred/Additional Qualifications
Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:Microsoft posts positions for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
#AIPlatform
#CoreAI
# azureai
# coreai
# genai
#aiinference
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
...offerings. The exact amount of base salary may vary based on experience and skills brought to the role. What Youll Do As a... ...unload trucks and move boxes/material in a safe manner using a forklift, pallet jack, handcart, cherry pickers, walking riders, reach...
...About Us: Brighten Brewing is a community-focused craft brewery known for its innovative beer and welcoming atmosphere. Our kitchen serves up high-quality, beer-friendly food designed to complement our brews. We take pride in creating a fun, fast-paced environment where...
...As a Medical Interpreter II, you will: Serve as a medical interpreter for Limited English Proficiency (LEP) patients, family members and health care providers in the consecutive and simultaneous modes. Relay medical information between speakers of two different languages...
...Join the 26FIVE team as our newest Graphic Design Intern. We are a focused, passionate, driven, fun-loving, supportive, and successful NYC-based team looking for a Graphic Design Intern who shares our outlook and wants to work and learn in a growth-driven environment...
#128205; Location: Remote (Worldwide)#128188; Type: Volunteer Time Commitment: Flexible You Set the Schedule Share Your... ...from all fields business, IT, healthcare, marketing, finance, HR, and more. #128313; Individuals who are good listeners and...