Senior Software Engineer, AI Data Platform (CoreAI)
![]() | |
![]() United States, Washington, Redmond | |
![]() | |
OverviewJoin Microsoft's CoreAI team to build the AI Data Platform, the foundation for secure, scalable, reusable datasets that power model development. We seek Software Engineers passionate about large-scale data infrastructure, automation tools, and intelligence services to transform how Microsoft collects, generates, manages, and shares AI training data. The AI Data Platform team's mission is to build a central AI data platform that breaks down Microsoft's data silos and manages the full lifecycle of first-party, third-party, synthetic, and human-labeled data, accelerating AI model development with secure, reusable, and compliant datasets. The AI Data Platform team is responsible for large-scale data infrastructure, automation tools, and intelligence services to transform how Microsoft collects, generates, manages, and shares AI training data.
Responsibilities Design and build scalable data pipelines and services to automate the dataset lifecycle (ingestion, registration, validation, PII handling, discovery, sharing, lineage), including intelligent agent-driven automation for key stages. Develop secure and reliable infrastructurefor data access, entitlement management, and operational support across global time zones. Implement governance and compliance toolingto ensure data integrity, auditability, and adherence to regulatory standards. Create user-facing tools and APIsthat make datasets easily discoverable and reusable. Contribute to strategic extensionssuch as continuous feedback loops, human-in-the-loop workflows, and data intelligence services for internal and external stakeholders. Collaborate with cross-org partners(CoreAI, MAI, M365, GitHub, MSR, OCTO, and more) to align priorities and deliver company-wide impact. |