Senior Software Engineer - Azure Kubernetes Service (AKS)
United States, Texas, Irving
7000 State Highway 161
Overview

Imagine being at the forefront of transformative cloud technology. The Azure Kubernetes Service (AKS) team is pioneering the management of Kubernetes clusters at hyperscale, building efficient, safe, and scalable tools to manage millions of servers that power AKS. As a Senior Software Engineer on the Azure Kubernetes Service Infrastructure team, you'll dive deep into automated infrastructure management and server orchestration at a scale few companies ever reach. You'll be responsible for building and maintaining the compute infrastructure that powers AKS, enabling it to be the most performant and reliable managed Kubernetes service in the world.

This is a unique opportunity to accelerate your career, develop deep cloud infrastructure expertise, and shape the future of hyperscale Kubernetes.

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities

* Create and maintain tools that manage hundreds of thousands of virtual machines powering Azure Kubernetes Service.
* Expand AKS's global footprint by automating buildouts in new regions and sovereign clouds.
* Coordinate region-agnostic buildout architecture, design, and execution across multiple AKS and Azure teams.
* Automate the build and release stack to enable engineers to manage dozens of microservices safely, efficiently, and in compliance with standards.
* Build tools, automation, and safety mechanisms to prevent infrastructure problems from becoming production incidents.
* Act as a Designated Responsible Individual (DRI), participating in on-call rotations to monitor system health and restore services during incidents.
* Balance pragmatism with vision and creativity; deliver continuous improvements to the team's process and codebase.
* Collaborate across teams to deliver scalable, resilient, and secure infrastructure solutions.