Productionizing Agentic Systems: MLOps, Monitoring & Cost Control Training Course

This course addresses the scaling, operationalization, and management of agentic AI systems within production environments, with a strong emphasis on reliability, observability, and cost efficiency.

Delivered as instructor-led live training (available online or onsite), this programme is designed for advanced-level professionals seeking to construct resilient, observable, and cost-optimized pipelines for large-scale agentic systems.

Upon completion of this training, participants will be equipped to:

Design scalable architectures suited for agentic AI workloads.
Implement observability and monitoring frameworks specifically tailored to agent behaviour and interactions.
Apply performance tuning and resource optimization techniques for long-running agent processes.
Manage costs and mitigate 'agent sprawl' through effective policy, orchestration, and automation.
Integrate MLOps best practices for the continuous deployment, versioning, and rollback of agentic services.

Course Format

Hands-on, engineering-focused sessions supported by live infrastructure examples.
Interactive discussions on architectural trade-offs and observability challenges.
Capstone exercise: deploying and monitoring a cost-controlled, production-grade agentic system.

Course Customization Options

To request customized training for this course, please contact us to make arrangements.

This course is available as onsite live training in Portugal or online live training.

Course Outline

Foundations of Agentic Systems in Production

Agentic architectures: loops, tools, memory, and orchestration layers.
The lifecycle of agents: development, deployment, and continuous operation.
Challenges associated with production-scale agent management.

Infrastructure and Deployment Models

Deploying agents in containerized and cloud environments.
Scaling patterns: horizontal versus vertical scaling, concurrency, and throttling.
Multi-agent orchestration and workload balancing.
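
To illustrate the concurrency and throttling topic in this module, here is a minimal sketch that caps how many agent runs execute at once using `asyncio.Semaphore`. The names `run_agent`, `run_batch`, and `MAX_CONCURRENT_AGENTS` are hypothetical, and the sleep stands in for real model and tool calls; it is not taken from the course material.

```python
import asyncio

MAX_CONCURRENT_AGENTS = 3  # hypothetical limit; tune per deployment


async def run_agent(task_id: int, limiter: asyncio.Semaphore, stats: dict) -> int:
    """Stand-in for one agent run; the sleep simulates model and tool calls."""
    async with limiter:  # waits here while the concurrency limit is reached
        stats["active"] += 1
        stats["peak"] = max(stats["peak"], stats["active"])
        await asyncio.sleep(0.01)
        stats["active"] -= 1
    return task_id


async def run_batch(n_tasks: int) -> tuple[list[int], int]:
    """Run n_tasks agents and report the peak number running concurrently."""
    limiter = asyncio.Semaphore(MAX_CONCURRENT_AGENTS)
    stats = {"active": 0, "peak": 0}
    results = await asyncio.gather(
        *(run_agent(i, limiter, stats) for i in range(n_tasks))
    )
    return list(results), stats["peak"]
```

Calling `asyncio.run(run_batch(10))` returns the task results in order together with the observed peak concurrency, which should never exceed the configured limit.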

Monitoring and Observability

Key metrics: latency, success rate, memory usage, and agent call depth.
Tracing agent activity and call graphs.
Instrumenting observability using Prometheus, OpenTelemetry, and Grafana.
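
As a rough illustration of the key metrics listed above, the following sketch aggregates latency, success rate, and call depth in plain Python. The `AgentMetrics` class is a hypothetical helper written for this example; in a real deployment these values would more likely be exported through a Prometheus or OpenTelemetry client, and memory usage is left out here for brevity.

```python
from dataclasses import dataclass, field
from statistics import mean


@dataclass
class AgentMetrics:
    """In-memory aggregate of per-run agent metrics (illustrative only)."""
    latencies_s: list = field(default_factory=list)
    successes: int = 0
    failures: int = 0
    max_call_depth: int = 0

    def record(self, latency_s: float, ok: bool, call_depth: int) -> None:
        """Record one agent run."""
        self.latencies_s.append(latency_s)
        if ok:
            self.successes += 1
        else:
            self.failures += 1
        self.max_call_depth = max(self.max_call_depth, call_depth)

    def summary(self) -> dict:
        """Return the aggregates a dashboard panel might display."""
        total = self.successes + self.failures
        return {
            "runs": total,
            "success_rate": self.successes / total if total else 0.0,
            "mean_latency_s": mean(self.latencies_s) if self.latencies_s else 0.0,
            "max_call_depth": self.max_call_depth,
        }
```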

Logging, Auditing, and Compliance

Centralized logging and structured event collection.
Compliance and auditability within agentic workflows.
Designing audit trails and replay mechanisms for debugging purposes.
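
One possible shape for an audit trail with replay is an append-only log of JSON lines, sketched below. The event fields (`agent_id`, `step`, `action`, `payload`) and the function names are assumptions made for this example, not a format prescribed by the course.

```python
import json
import time


def append_event(stream, agent_id: str, step: int, action: str, payload: dict) -> None:
    """Append one structured event as a single JSON line."""
    event = {
        "ts": time.time(),
        "agent_id": agent_id,
        "step": step,
        "action": action,
        "payload": payload,
    }
    stream.write(json.dumps(event, sort_keys=True) + "\n")


def replay(stream, agent_id: str) -> list:
    """Read the log from the start and return one agent's events in step order."""
    stream.seek(0)
    events = [json.loads(line) for line in stream if line.strip()]
    return sorted(
        (e for e in events if e["agent_id"] == agent_id),
        key=lambda e: e["step"],
    )
```

Any seekable text stream works here, for example an `io.StringIO` buffer in tests or a file opened in `"a+"` mode.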

Performance Tuning and Resource Optimization

Reducing inference overhead and optimizing agent orchestration cycles.
Model caching and lightweight embeddings for faster retrieval.
Load testing and stress scenarios for AI pipelines.
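
The caching topic in this module can be sketched with the standard library's `functools.lru_cache`. The `embed` function below is a placeholder that derives a few numbers from a hash; it is an assumption for illustration only, and a real system would call an embedding model at that point.

```python
import hashlib
from functools import lru_cache


@lru_cache(maxsize=1024)
def embed(text: str) -> tuple:
    """Placeholder embedding: four floats derived from a SHA-256 digest.

    Repeated calls with the same text are served from the cache
    instead of being recomputed.
    """
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    return tuple(b / 255 for b in digest[:4])
```

`embed.cache_info()` reports hits and misses, which gives a quick way to check whether the cache is actually reducing repeated work.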

Cost Control and Governance

Understanding agent cost drivers: API calls, memory, compute, and external integrations.
Tracking agent-level costs and implementing chargeback models.
Automation policies to prevent agent sprawl and idle resource consumption.
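
As a sketch of agent-level cost tracking, the example below keeps a per-agent ledger and flags agents that exceed a budget. The token prices, the `CostLedger` class, and the agent names are all hypothetical; actual rates depend on the model provider, and a full chargeback model would also cover compute, memory, and external integrations.

```python
from collections import defaultdict

# Hypothetical prices in EUR per 1,000 tokens; real rates depend on the provider.
PRICE_PER_1K_TOKENS = {"input": 0.002, "output": 0.006}


class CostLedger:
    """Tracks spend per agent against a shared per-agent budget."""

    def __init__(self, budget_eur: float) -> None:
        self.budget_eur = budget_eur
        self.spend_eur = defaultdict(float)

    def record_call(self, agent_id: str, input_tokens: int, output_tokens: int) -> float:
        """Add the cost of one model call to the agent's running total."""
        cost = (
            input_tokens / 1000 * PRICE_PER_1K_TOKENS["input"]
            + output_tokens / 1000 * PRICE_PER_1K_TOKENS["output"]
        )
        self.spend_eur[agent_id] += cost
        return cost

    def over_budget(self) -> list:
        """Return the agents whose spend exceeds the budget, sorted by name."""
        return sorted(a for a, s in self.spend_eur.items() if s > self.budget_eur)
```

An automation policy could poll `over_budget()` and pause or scale down the listed agents.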

CI/CD and Rollout Strategies for Agents

Integrating agent pipelines into CI/CD systems.
Testing, versioning, and rollback strategies for iterative agent updates.
Progressive rollouts and safe deployment mechanisms.
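
A progressive rollout can be approximated with deterministic, hash-based routing, as in the sketch below. The version labels `agent-v1` and `agent-v2` and both function names are assumptions for this example; production systems often delegate this to a service mesh or feature-flag service instead.

```python
import hashlib


def in_canary(request_key: str, canary_percent: int) -> bool:
    """Map a key to a stable bucket in 0..99 and compare it to the rollout percentage."""
    digest = hashlib.sha256(request_key.encode("utf-8")).hexdigest()
    bucket = int(digest, 16) % 100
    return bucket < canary_percent


def pick_version(
    request_key: str,
    canary_percent: int,
    stable: str = "agent-v1",
    canary: str = "agent-v2",
) -> str:
    """Route a request to the canary or the stable agent version."""
    return canary if in_canary(request_key, canary_percent) else stable
```

Because the bucket depends only on the key, a given user stays on the same version as the percentage increases, and setting the percentage back to 0 acts as a rollback.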

Failure Recovery and Reliability Engineering

Designing for fault tolerance and graceful degradation.
Retry, timeout, and circuit breaker patterns for agent reliability.
Incident response and post-mortem frameworks for AI operations.
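
The retry and circuit breaker patterns named above can be sketched as follows. `CircuitBreaker`, `CircuitOpenError`, and `call_with_retry` are hypothetical names, the thresholds are arbitrary defaults, and timeouts are left out to keep the example short.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when a call is rejected because the breaker is open."""


class CircuitBreaker:
    def __init__(self, failure_threshold: int = 3, reset_after_s: float = 30.0) -> None:
        self.failure_threshold = failure_threshold
        self.reset_after_s = reset_after_s
        self.failures = 0
        self.opened_at = None  # monotonic timestamp while the breaker is open

    def call(self, fn, *args, **kwargs):
        if self.opened_at is not None:
            if time.monotonic() - self.opened_at < self.reset_after_s:
                raise CircuitOpenError("circuit open; skipping call")
            self.opened_at = None  # half-open: allow one trial call
        try:
            result = fn(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.failure_threshold:
                self.opened_at = time.monotonic()
            raise
        self.failures = 0  # a success closes the breaker fully
        return result


def call_with_retry(breaker, fn, attempts: int = 3, base_delay_s: float = 0.01):
    """Retry fn through the breaker with exponential backoff between attempts."""
    for attempt in range(attempts):
        try:
            return breaker.call(fn)
        except CircuitOpenError:
            raise  # do not retry while the breaker is open
        except Exception:
            if attempt == attempts - 1:
                raise
            time.sleep(base_delay_s * 2 ** attempt)
```

Wrapping each downstream dependency (model API, tool, vector store) in its own breaker keeps one failing service from consuming the retry budget of every agent.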

Capstone Project

Building and deploying an agentic AI system with full monitoring and cost tracking.
Simulating load, measuring performance, and optimizing resource usage.
Presenting the final architecture and monitoring dashboard to peers.

Summary and Next Steps

Requirements

A robust understanding of MLOps and production machine learning systems.
Experience with containerized deployments (Docker/Kubernetes).
Familiarity with cloud cost optimization and observability tools.

Audience

MLOps engineers.
Site Reliability Engineers (SREs).
Engineering managers responsible for AI infrastructure.

Duration: 21 Hours

Custom Corporate Training

Training solutions designed exclusively for businesses.

Customized Content: We adapt the syllabus and practical exercises to the real goals and needs of your project.
Flexible Schedule: Dates and times adapted to your team's agenda.
Format: Online (live), In-company (at your offices), or Hybrid.

Investment

Price per private group, online live training, starting from 3900 € + VAT*

(*The final price may vary depending on the technical specialization of the course, the level of customization, the method of delivery and the number of learners)

Need help picking the right course?
info@nobleprog.pt or +351 30 050 9666

Testimonials (3)

The trainer is patient and very helpful. He knows the topic well.

CLIFFORD TABARES - Universal Leaf Philippines, Inc.

Course - Agentic AI for Business Automation: Use Cases & Integration

Good mix of knowledge and practice

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Agentic AI for Enterprise Applications

The mix of theory and practice and of high level and low level perspectives

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Autonomous Decision-Making with Agentic AI
