Machine Learning Modeling Pipelines in Production

Description

In the third course of Machine Learning Engineering for Production Specialization, you will build models for different serving environments; implement tools and techniques to effectively manage your modeling resources and best serve offline and online inference requests; and use analytics tools and performance metrics to address model fairness, explainability issues, and mitigate bottlenecks.

Understanding machine learning and deep learning concepts is essential, but if you’re looking to build an effective AI career, you need production engineering capabilities as well. Machine learning engineering for production combines the foundational concepts of machine learning with the functional expertise of modern software development and engineering roles to help you develop production-ready skills.
Week 1: Neural Architecture Search
Week 2: Model Resource Management Techniques
Week 3: High-Performance Modeling
Week 4: Model Analysis
Week 5: Interpretability

What you will learn

Week 1: Neural Architecture Search

Learn how to effectively search for the best model that will scale for various serving needs while constraining model complexity and hardware requirements.

Week 2: Model Resource Management Techniques

Learn how to optimize and manage the compute, storage, and I/O resources your model needs in production environments during its entire lifecycle.

Week 3: High-Performance Modeling

Implement distributed processing and parallelism techniques to make the most of your computational resources for training your models efficiently.

Week 4: Model Analysis

Use model performance analysis to debug and remediate your model and measure robustness, fairness, and stability.

What’s included