Principal Cloud Network Engineer
![]() | |
![]() United States, Texas, Irving | |
![]() 7000 State Highway 161 (Show on map) | |
![]() | |
OverviewThe Azure Dedicated team is a tightly integrated vertical team who have an exciting charter to deploy any workload anywhere in the world to Azure. Workloads span industries from Fintech and general purpose scenarios like Storage all the way to gigantic scale AI training workloads. We have a multi-disciplinary team working cohesively to innovate at bleeding edge and deliver for our customers, some of the most well known names in the world.As a Principal Cloud Network Engineer in the Azure Dedicated Team, you will design Network Fabrics for both traditional ethernet networks and back-end networks such as RDMA and Infiniband to allow our organization to build and integrate the next generation of AI workloads to Azure. You will seek to create highly reliable and resilient designs and seek to learn and grow at the forefront of one of the hottest spaces in the tech industry for the foreseeable future. You will interface with customers, industry vendors, 3rd party providers and fellow Microsoft colleagues daily producing world-class designs to manage Pbps level of traffic. This opportunity will allow you to build your skill set with real scale out networks, develop a bigger and wider network in the industry and satisfy some of the most demanding customer needs.Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
ResponsibilitiesYou drive deep and complex partnerships with 3rd parties to satisfy seamless integrations of various network stacks and topologies to interface cleanly with Azure's planetary-scale networks through well understood and well documented designs.You drive the adoption of practices across organizations to improve capabilities to identify risks and prevent classes of bugs. You define telemetry analytics and quality metrics to drive new data collection instrumentation that can improve the detection and troubleshooting of problems. You troubleshoot and repair complex multi-layer live site issues. You participate in on-call duties to manage and resolve incidents in production and advise engineers on best practices for repairing issues.You develop the new and emerging technology solutions within your area of expertise to proactively resolve fundamental flaws. You demonstrate deep knowledge of data (i.e., know what data is needed, find new and missing data, and coordinate the development of pipelines). You analyze traffic patterns across complete network infrastructures to identify needs to modify capacity. You drive innovation and cost management across organizations by critically evaluating existing practices. You engage as needed when our customers need it, as DRI or customer advocate for quality and service uptime at all times.You proactively seek mentorship and feedback from others, and provide feedback to others. |