AI in the Cloud: Architecture, MLOps & Deployment

Design, secure, and scale end-to-end cloud AI applications

This two-day intermediate training course explores the architectures, services, and best practices for designing and operating AI solutions across major cloud platforms (Azure, AWS, GCP). Participants will learn to analyze use cases, implement data and inference pipelines, and configure governance, security, and monitoring. Guided hands-on labs enable the integration of managed AI services, the automation of deployment, and the industrialization of production environments.

Is it for you ?

Data/AI engineers, cloud developers, and technical architects in an enterprise context. Project teams looking to industrialize AI use cases on Azure/AWS/GCP within a governed IT environment. Expected experience: basic programming and cloud concepts; familiarity with ML/AI principles.

Prerequisites

Basic knowledge of Python or equivalent. Understanding of cloud architecture (networking, storage, compute). Familiarity with fundamental ML concepts (training sets, metrics, overfitting).

What You'll Walk Away With

✓ Analyze AI use cases and choose the appropriate cloud services (compute, data, managed AI)
✓ Implement an end-to-end inference pipeline (ingestion, preprocessing, model, exposure)
✓ Configure security, data governance, and costs for AI workloads
✓ Integrate managed AI services (vision, language, search) into an application
✓ Automate deployment and monitoring (CI/CD, MLOps, surveillance and logs)

Training content

1 Day 1 – Fundamentals of AI in the Cloud and End-to-End Architecture

Overview of cloud AI services: compute (CPU/GPU/Serverless), data (lakehouse), managed AI
Reference architecture for an inference pipeline: data flow, preprocessing, model, API, scaling
Technological choices and criteria: latency, costs, compliance, data residency
Security and governance: identities, keys/secrets, network segmentation, sensitive data management
Observability and costs: logs, metrics, traces, sizing and optimization
Operating model: roles, runbooks, SLA/SLOs, and responsibilities

2 Lab / Exercise:

Design and deploy a minimal inference pipeline in a specific cloud environment: creating necessary resources, deploying a pre-trained model via a managed service, exposing an API, and end-to-end testing. Deliverable: architecture diagram, endpoint URL, test sets, and measured latency/estimated cost.

3 Key points & takeaways:

Typical architecture of a cloud inference service and its components
Initial best practices for security, observability, and cost control

4 Day 2 – Integration of Managed AI Services and Industrialization (MLOps)

Integrating managed AI services (vision, NLP, vector search, embeddings) into an application
Architectural patterns for RAG and semantic search; managing prompts and contexts
Securing AI endpoints and protection against abuse (quotas, rate limiting, filtering)
CI/CD and MLOps automation: packaging, model registries, controlled deployments, rollback
Monitoring drift, data quality, alerting, and continuous improvement loops
Ongoing case study: service selection, cost/performance/compliance trade-offs, and operations plan

5 Lab / Exercise:

Integrate a managed AI service (e.g., NLP or vision) into an existing API and set up a simple CI/CD pipeline with basic monitoring (logs/metrics) and functional tests. Deliverable: code repository, operational CI/CD pipeline, monitoring dashboard, and runbook.

6 Key points & takeaways:

Secure and efficient integration of managed AI services
First milestones in industrialization with CI/CD and MLOps
Final assessment of skills: short case study presentation and skills checklist

📌 Practical information

Our training sessions are offered in Montreal or Quebec City, in person or in a virtual classroom. Dates and locations are specified when you select your session below. If you have any questions, check out our FAQ.