Modal | AI Workshack

Modal is a serverless cloud platform designed for AI and ML workloads. It lets you run GPU inference, batch processing, and scheduled jobs in Python without managing any infrastructure — defining compute requirements as decorators on plain Python functions.

GPU inference with @app.function(gpu=...) — A10G, A100, H100 options
Modal.cls Model class for cached model weights across calls
Volumes for persistent model weight storage across cold starts
Parallel batch processing via .map() and .starmap()
Web endpoints, cron scheduling, and Secrets management

Visit official site →

Modal April 10, 2026 1 min

Batch AI Processing with Modal: Parallel Execution and Cost Control

How to process thousands of documents, images, or API calls in parallel without managing workers

Read guide →

Modal April 10, 2026 1 min

Modal for AI Engineers: Run GPU Inference Without Managing Infrastructure

How to deploy embedding models, LLMs, and batch jobs on serverless GPU with Modal

Read guide →

⚡ Modal

Batch AI Processing with Modal: Parallel Execution and Cost Control

Modal for AI Engineers: Run GPU Inference Without Managing Infrastructure

Stay sharp as AI tools evolve