company logo

Senior Applied Scientist, AI Data Platform (CoreAI)

Microsoft.com

120k - 258k USD/year

Office

Redmond, Washington, United States

Full Time

Join Microsoft’s CoreAI team to build the AI Data Platform, the foundation for secure, scalable, reusable datasets that power model development

The AI Data Platform team's mission is to build a central AI data platform that breaks down Microsoft’s data silos and manages the full lifecycle of first-party, third-party, synthetic, and human-labeled data, accelerating AI model development with secure, reusable, and compliant datasets. 

The AI Data Platform team is responsible for large-scale data infrastructure, automation tools, and intelligence services to transform how Microsoft collects, generates, manages, and shares AI training data. 

We are seeking Applied Scientists to drive scientific innovation in data generation, validation, evaluation, and automation. You will set the vision for intelligent, ML-driven services that manage the end-to-end data lifecycle, and partner with leaders across Microsoft to ensure Microsoft’s data investments deliver maximum AI impact.

Responsibilities

Responsibilities

  • Advancing machine learning and data science to improve data quality, automate dataset generation, and design intelligent agent-driven services that manage the end-to-end data lifecycle. 
  • Develop ML-based pipelinesfor data generation, validation, augmentation, and discovery (e.g., synthetic data, human-in-the-loop workflows). 
  • Design and train intelligent agentsto automate key parts of the dataset lifecycle, including ingestion, validation, PII detection and handling, governance, discovery, and feedback loops. 
  • Build evaluation methodsto measure dataset quality, coverage, and usefulness for large-scale model training. 
  • Leverage AI/ML techniques(e.g., classification, clustering, anomaly detection, embeddings, LLM-based evaluation) to improve data discovery, curation, and governance. 
  • Collaborate with engineersto integrate scientific methods and models into scalable pipelines and platform services. 
  • Partner with AI product and research teams(CoreAI, MAI, M365, GitHub, MSR, and more) to align datasets with model training needs and identify new opportunities. 
  • Contribute thought leadershipby publishing or sharing insights internally and externally to shape Microsoft’s data-centric AI practices. 

Qualifications

Required Qualifications

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • 2+ years of experience applying machine learning or data science in practical settings. 
  • Programming skills in Python and ML frameworks (e.g., PyTorch, TensorFlow, Scikit-learn). 
  • Experience with data analysis, dataset design, or evaluation methodologies. 
  • OR equivalent experience.

Other Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years

Preferred Qualifications

  • Master’s degree or PhD in Computer Science, Machine Learning, Statistics, or related field, or equivalent experience. 
  • 4+ years of experience applying machine learning or data science in practical settings. 
  • Experience with LLM training pipelines, synthetic data generation, or data-centric AI approaches. 
  • Knowledge of PII detection, data privacy, fairness, or compliance in AI systems. 
  • Familiarity with distributed data systems (e.g., Spark, Databricks, Azure Data Lake). 
  • Strong collaboration skills with engineers, TPMs, and product partners across multiple orgs. 

Software Engineering IC4 - The typical base pay range for this role across the U.S. is USD $119,800 - $234,700 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $158,400 - $258,000 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay


Microsoft posts positions for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.  We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

#DataPlatform, #AIJobs, #MachineLearning, #DataScience #CoreAI 

Senior Applied Scientist, AI Data Platform (CoreAI)

Office

Redmond, Washington, United States

Full Time

120k - 258k USD/year

September 19, 2025

company logo

Microsoft

Microsoft