Posted on: May 4, 2026 | Job#: R211500-CARO444

Sr. ML Engineer – ML & Applied AI

Full time
4440 Rosewood Drive, Bldg 4, Pleasanton, CA, US 94588

About Gap Inc.

Our brands bridge the gaps we see in the world. Old Navy democratizes style to ensure everyone has access to quality fashion at every price point. Athleta unleashes the potential of every woman, regardless of body size, age or ethnicity. Banana Republic believes in sustainable luxury for all. And Gap inspires the world to bring individuality to modern, responsibly made essentials.

The simple idea that we all deserve to belong, and on our own terms, is core to who we are as a company and how we make decisions. Our team is made up of thousands of people across the globe who take risks, think big, and do good for our customers, communities, and the planet. Ready to learn fast, create with audacity and lead boldly? Join our team.

About the Role

Gap Inc. is seeking a Senior Machine Learning Engineer with 10+ years of experience to design, build, and scale production-grade machine learning and AI systems that power data-driven decision making across the enterprise.

This role is focused on end-to-end ML system ownership, including data pipelines, feature engineering, model training, deployment, monitoring, and continuous optimization. You will lead the development of scalable ML platforms, drive best practices in MLOps, and enable reliable, high-performance model inference in both batch and real-time environments.

The ideal candidate combines strong software engineering expertise with deep ML knowledge and has experience building robust, scalable ML systems in production, including modern applications involving large language models (LLMs) and agent-based AI systems.

What You'll Do

  • Architect and build scalable, production-grade ML systems from experimentation to deployment and lifecycle management
  • Design and implement end-to-end ML pipelines, including data ingestion, feature engineering, training, validation, and inference
  • Develop and maintain high-performance model serving systems using API frameworks (e.g., FastAPI) for real-time and batch inference
  • Lead the design and implementation of feature stores and reusable feature pipelines across teams
  • Build and optimize distributed data processing workflows using Spark, Databricks, or similar platforms
  • Implement and enforce MLOps best practices, including CI/CD pipelines, automated retraining, model versioning, and experiment tracking
  • Design and manage model monitoring and observability frameworks to track performance, drift, latency, and system health
  • Drive strategies for model retraining, drift detection, and continuous improvement
  • Collaborate closely with data engineers, platform teams, and product stakeholders to integrate ML solutions into production systems
  • Contribute to the adoption of modern AI capabilities, including LLMs, vector databases, retrieval-augmented generation (RAG), and agentic workflows
  • Ensure high standards of code quality, testing, documentation, and reproducibility

Who You Are

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
  • 10+ years of experience in machine learning, software engineering, or related roles, with significant experience in production ML systems
  • Strong programming expertise in Python and solid software engineering fundamentals (data structures, system design, APIs)
  • Extensive experience with ML frameworks such as scikit-learn, XGBoost, PyTorch, or TensorFlow
  • Proven experience designing and deploying scalable ML pipelines and services in production
  • Hands-on experience with model serving frameworks and API development (e.g., FastAPI, Flask)
  • Strong experience with containerization (Docker) and orchestration platforms such as Kubernetes
  • Experience working with cloud platforms (GCP, AWS, or Azure) and building cloud-native ML solutions
  • Deep understanding of ML lifecycle management, including training, evaluation, deployment, monitoring, and retraining
  • Experience implementing CI/CD pipelines for ML workflows and managing version control systems (Git)
  • Strong experience with SQL and distributed data processing frameworks (e.g., Spark, PySpark)
  • Excellent problem-solving skills and ability to design scalable, maintainable systems

Gap Inc. is an equal-opportunity employer and is committed to providing a workplace free from harassment and discrimination. We are committed to recruiting, hiring, training and promoting qualified people of all backgrounds, and make all employment decisions without regard to any protected status. We have received numerous awards for our long-held commitment to equality and will continue to foster a diverse and inclusive environment of belonging. This year, we’ve been named as one of the Best Places to Work by the Human Rights Campaign for the seventeenth consecutive year and have been included in the 2021 Bloomberg Gender-Equality Index for the fourth year in a row.

Salary Range: $181,400 - $235,800 USD
Employee pay will vary based on factors such as qualifications, experience, skill level, competencies and work location. We will meet minimum wage or the minimum of the pay range (whichever is higher) based on city, county and state requirements.
