Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs Training Course

Ollama is an open-source utility designed to run large language models locally on both consumer and enterprise-grade hardware. By condensing model quantization, GPU resource allocation, and API serving into a single command-line interface, it empowers organisations to host LLMs such as Llama, Mistral, and Qwen independently, without transmitting prompts or sensitive data to OpenAI, Anthropic, or Google.

This instructor-led, live training (available online or on-site) targets intermediate AI engineers and platform operators seeking to substitute cloud-based LLM APIs with self-hosted, sovereign language model inference using Ollama.

Upon completion of this training, participants will be able to:

Install Ollama on Linux, macOS, and Windows systems with GPU support.
Download, quantize, and serve models from the Ollama registry and HuggingFace.
Construct custom Modelfiles incorporating system prompts and parameter tuning.
Connect local LLMs to applications via the OpenAI-compatible API.
Enhance inference performance for CPU-only and multi-GPU configurations.

Course Format

Interactive lectures and discussions.
Numerous exercises and practical practice sessions.
Hands-on implementation within a live-lab environment.

Course Customization Options

For customized training requests, please contact us to arrange.

This course is available as onsite live training in Portugal or online live training.

Thank you for sending your enquiry! One of our team members will contact you shortly.

Thank you for sending your booking! One of our team members will contact you shortly.

Course Outline

AI Sovereignty and LLM Local Deployment

Risks associated with cloud LLMs: data retention, training on inputs, and foreign jurisdiction.
Ollama architecture: model server, registry, and OpenAI-compatible API.
Comparison with vLLM, llama.cpp, and Text Generation Inference.
Model licensing: terms for Llama, Mistral, Qwen, and Gemma.

Installation and Hardware Setup

Installing Ollama on Linux with CUDA and ROCm support.
CPU-only fallback and AVX/AVX2 optimization.
Docker deployment and persistent volume mapping.
Multi-GPU setup and VRAM allocation strategies.

Model Management

Downloading models from the Ollama registry: ollama pull llama3.
Importing GGUF models from HuggingFace and TheBloke.
Quantization levels: trade-offs between Q4_K_M, Q5_K_M, and Q8_0.
Model switching and limits on concurrent model loading.

Custom Modelfiles

Writing Modelfile syntax: FROM, PARAMETER, SYSTEM, TEMPLATE.
Tuning temperature, top_p, and repeat_penalty.
System prompt engineering for role-specific behaviour.
Creating and publishing custom models to the local registry.

API Integration

OpenAI-compatible /v1/chat/completions endpoint.
Streaming responses and JSON mode.
Integrating with LangChain, LlamaIndex, and custom applications.
Authentication and rate limiting using a reverse proxy.

Performance Optimization

Context window sizing and KV cache management.
Batch inference and parallel request handling.
CPU thread allocation and NUMA awareness.
Monitoring GPU utilization and memory pressure.

Security and Compliance

Network isolation for model serving endpoints.
Input filtering and output moderation pipelines.
Audit logging of prompts and completions.
Model provenance and hash verification.

Requirements

Intermediate knowledge of Linux and container administration.
High-level understanding of machine learning and transformer models.
Familiarity with REST APIs and JSON.

Audience

AI engineers and developers looking to replace cloud LLM APIs.
Organisations with data sensitivity constraints that prohibit cloud model usage.
Government and defence teams requiring air-gapped language models.

14 Hours

Custom Corporate Training

Training solutions designed exclusively for businesses.

Customized Content: We adapt the syllabus and practical exercises to the real goals and needs of your project.
Flexible Schedule: Dates and times adapted to your team's agenda.
Format: Online (live), In-company (at your offices), or Hybrid.

Investment

Price per private group, online live training, starting from 2600 € + VAT*

(*The final price may vary depending on the technical specialization of the course, the level of customization, the method of delivery and the number of learners)

Need help picking the right course?
info@nobleprog.pt or +351 30 050 9666

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs Training Course

Course Outline

Requirements

Custom Corporate Training

Provisional Upcoming Courses (Contact Us For More Information)

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs Training Course

Course Outline

Requirements

Custom Corporate Training

Provisional Upcoming Courses (Contact Us For More Information)

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs

Ollama: Self-Hosted Large Language Models Replacing OpenAI and Claude APIs

Related Courses

Advanced Ollama Model Debugging & Evaluation

Building Private AI Workflows with Ollama

Deploying and Optimizing LLMs with Ollama

Fine-Tuning and Customizing AI Models on Ollama

Multimodal Applications with Ollama

Getting Started with Ollama: Running Local AI Models

Ollama & Data Privacy: Secure Deployment Patterns

Ollama Applications in Finance

Ollama Applications in Healthcare

Ollama for Responsible AI and Governance

Ollama Scaling & Infrastructure Optimization

Prompt Engineering Mastery with Ollama

Related Categories

Ollama

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites