
Principal Software Engineer

Microsoft
United States, Nevada, Reno
6840 Sierra Center Parkway
Jan 08, 2025
Overview

The Azure Compute team builds a fault-tolerant distributed system on top of commodity datacenter hardware to deliver infrastructure for hosting cloud applications in virtual machines (VMs). The team creates the illusion that resources are limitless, infinitely elastic, and always available.

This role is on the Availability Platform team within Azure Compute, which focuses primarily on making sure every Azure VM is available with a Service Level Agreement (SLA) of 99.99+%. Getting to that target and beyond requires out-of-the-box thinking backed by sound, data-driven decisions. The team owns the services that measure the health of millions of Azure machines, as well as the control plane services that make all repair decisions in Azure. The team also collaborates closely with data scientists to build predictive failure models that live migrate VMs off machines before a failure occurs. VM availability is one of the top KPIs for Azure and Microsoft - come be part of the team driving the platform forward on this front.

As a Principal Software Engineer, you will join a talented team that invests in its people and technology for the long term. We emphasize comprehensive designs, incremental development with high quality, shipping frequently, and adapting quickly to customer feedback. Join us in pushing the boundaries of scale, reliability, availability, and efficiency. If you want hands-on services architecture experience at large scale, this is the role for you.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities

Partners with appropriate stakeholders across teams and orgs to determine project requirements.
Leads the design and architecture of change management features and services in Azure Compute.
Identifies dependencies and authors design documents for features and services.
Leverages expertise with appropriate stakeholders to develop project plans, release plans, and work items.
Develops high-quality, extensible, maintainable code and coaches others to do the same.
Supports live site as Designated Responsible Individual (DRI), mentoring engineers across products/solutions and working on-call to monitor the system/product/service for degradation, downtime, or interruptions.
Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products, while also driving consistency in monitoring and operations at scale and sharing knowledge with other engineers.