Table of Contents
2.5.0

Home

  • Lightning in 15 minutes
  • Install
  • 2.0 Upgrade Guide

Level Up

  • Basic skills
  • Intermediate skills
  • Advanced skills
  • Expert skills

Core API

  • LightningModule
  • Trainer

Optional API

  • accelerators
  • callbacks
  • cli
  • core
  • loggers
  • profiler
  • trainer
  • strategies
  • tuner
  • utilities

More

  • Community
  • Glossary
  • How-to Guides
  • Overview
  • Team management
  • Production
  • Security
  • Open source
    • Overview
    • PyTorch Lightning
    • Fabric
    • Lit-GPT
    • Torchmetrics
    • Litdata
    • Lit LLaMA
    • Litserve
  • Examples
  • Glossary
  • FAQ
  • Docs >
  • Level 13: Run on a multi-node cluster
Shortcuts

Level 13: Run on a multi-node cluster¶

In this level you’ll learn to run on cloud or on-prem clusters.


Run single or multi-node on Lightning Studios

The easiest way to scale models in the cloud. No infrastructure setup required.

basic

Run on an on-prem cluster

Learn to train models on a general compute cluster.

intermediate

Run on a SLURM cluster

Run models on a SLURM-managed cluster

intermediate

Run with Torch Distributed

Run models on a cluster with torch distributed.

intermediate

  • Level 13: Run on a multi-node cluster

To analyze traffic and optimize your experience, we serve cookies on this site. By clicking or navigating, you agree to allow our usage of cookies. Read PyTorch Lightning's Privacy Policy.