FastAPI is a high-performance Python web framework for building APIs. It uses standard Python type hints to give you request validation, serialization, and automatic interactive API docs, and it runs on the ASGI standard, so your endpoints can handle requests asynchronously.
On Vercel, you can deploy a FastAPI app with zero configuration: your app becomes a Vercel Function running on Fluid compute, and you get response streaming, preview deployments, and observability without extra setup.
This guide walks you through deploying a FastAPI app to Vercel from a template, the Vercel CLI, or a Git repository, then configuring features such as streaming, middleware, cron jobs, the Python version, and observability.
Before you begin, make sure you have:
- A Vercel account
- Python 3.12 or later and a package manager (e.g., pip or uv)
- An existing FastAPI project, or a new one created from a FastAPI template
- A Git repository on GitHub, GitLab, or Bitbucket (if you want Git-based deployments)
- Vercel CLI installed (
npm i -g vercel)
When you deploy a FastAPI app, Vercel detects the framework from your dependencies and builds it for the Python runtime. Vercel looks for a FastAPI instance named app at a supported entrypoint and serves your whole app as a single Vercel Function, which runs on Fluid compute by default. Your app scales with traffic, and you pay only for the compute your function uses, not for idle time.
Because Vercel ships zero-configuration detection for FastAPI, you don't set a build command or output directory. Vercel reads your project, finds the file that exports your FastAPI app, and applies the correct build settings.
You can ship a FastAPI app to Vercel in three ways. Choose the one that fits where your code lives today.
The fastest way to ship a FastAPI app is to start from a template. Pick a starter, then deploy it. Vercel clones the template to your Git provider, creates a project, and deploys it with zero configuration.
Templates to start from include:
- FastAPI Boilerplate: A minimal FastAPI app that runs on the Python runtime and deploys with zero configuration. Start here if you're new to FastAPI on Vercel.
- AI SDK Python Streaming: Streams chat completions from a FastAPI function and renders them with AI SDK
useChathook in a Next.js frontend. - Next.js FastAPI Starter: An AI chatbot with a Next.js frontend and a FastAPI backend.
- OpenAI Agents SDK with FastAPI: A FastAPI app that runs the OpenAI Agents SDK with Vercel Sandbox, spinning up an isolated microVM per request to give the agent shell access.
- Full-stack FastAPI template with Next.js: A full-stack starter that pairs a FastAPI backend with a Next.js frontend, with cookie-based auth and a Postgres database.
To scaffold a new FastAPI project locally, use the Vercel CLI init command. It clones Vercel's FastAPI example into a folder named fastapi.
- Create the project:
- Set up a virtual environment and install dependencies:
- Develop locally at
http://localhost:3000. Use the Vercel CLI, so your app runs with the sameappinstance it uses in production: - Create a preview deployment. The first run creates a Vercel project link:
- Promote your deployment to production:
If you already have a FastAPI app, deploy it from Git or from the command line.
From Git: Push your project to GitHub, GitLab, or Bitbucket, then import it at vercel.com/new. Vercel detects FastAPI automatically and deploys it with zero configuration.
From the CLI: From your project's root directory, run vercel to create a preview deployment, then vercel --prod to go live. To pull project settings and environment variables for local development, run:
For Vercel to detect your app, export a FastAPI instance named app from one of the recognized entry files, such as app.py, index.py, server.py, main.py, wsgi.py, or asgi.py at your project root or under src/, app/, or api/:
To point Vercel to a FastAPI app in a custom module, set tool.vercel.entrypoint in pyproject.toml:
This tells Vercel to load the app variable from ./backend/server.py.
After your app is deployed, you can layer Vercel features onto it. Some work automatically, and others take a few lines of configuration in vercel.json.
Vercel serves your FastAPI app as a single Vercel Function. This function uses Fluid compute by default, which runs multiple requests concurrently within a single instance to reduce cold starts and the cost of I/O-bound work such as API calls and database queries. You don't configure anything to get this behavior.
Because FastAPI exports a single app, Vercel sends every incoming request to that app and lets FastAPI's router match the path. Your route handlers, dependencies, middleware, and error handling all run inside the function.
Vercel Functions on the Python runtime support streaming, so you can send data to the client as you produce it instead of waiting for the full response. Use FastAPI's StreamingResponse to stream text, server-sent events, or AI model output:
Streaming pairs well with Fluid compute: while your function waits between chunks, the same instance can serve other requests. To stream model output to a frontend, pair your FastAPI endpoint with AI SDK, as the AI SDK Python Streaming template shows.
FastAPI and Vercel each have a middleware layer, and they solve different problems. FastAPI middleware runs inside your function, after the request reaches it. Use it for app-level concerns such as logging, CORS, and authentication:
Vercel Routing Middleware runs at the edge, before the request reaches your FastAPI app, and works with any framework. Use it for rewrites, redirects, and header changes that should happen before any function runs. Add a middleware.ts file at your project root:
The two layers work together, with Routing Middleware shaping the request at the edge and FastAPI middleware handling it inside your app.
To serve static files such as images, fonts, or a favicon, place them in the public/** directory. Vercel serves them through its CDN using default headers, which you can override in vercel.json. FastAPI's own app.mount("/public", ...) is not needed on Vercel, so rely on the public directory instead.
You can still define routes that point at those files. For example, redirect /favicon.ico to an asset in public:
Vercel Cron Jobs trigger a route on a schedule by sending an HTTP GET request to it. Define a route in your FastAPI app for the task, then register the schedule in vercel.json.
Define the route:
Register the schedule:
Vercel runs cron jobs only on production deployments. To stop anyone else from calling the route, set a CRON_SECRET environment variable in your project settings. Vercel sends it as a Bearer token in the Authorization header on every cron invocation, and your handler compares it before running the task.
FastAPI runs on Vercel's Python runtime, which defaults to Python 3.12. The available versions are 3.12 (default), 3.13, and 3.14. To pin a version, add a .python-version file at your project root:
You can also set the version in pyproject.toml or Pipfile.lock. If you don't define a supported version, Vercel uses the default.
Use FastAPI lifespan events to run setup and teardown logic, such as opening and closing database connections. Vercel runs the startup logic when your function starts and the shutdown logic when it stops:
Cleanup during shutdown is limited to 500 ms after your function receives the SIGTERM signal, and logs printed during shutdown don't appear in the Vercel dashboard. Keep teardown work short, and move anything longer to a cron job or Vercel Workflow.
Vercel Observability tracks your deployed function automatically, with no setup. Open the Observability page in your project to see invocation counts, error rates, and duration for your FastAPI app, along with the requests your function makes to external APIs. On Observability Plus, you also get longer retention and a latency breakdown by path.
Vercel finds your FastAPI app by looking for an app instance at a fixed set of locations: app, index, server, main, wsgi, or asgi (with a .py extension) at your project root or under src/, app/, or api/. Put your app at one of these paths so Vercel detects and deploys it correctly, or set tool.vercel.entrypoint in pyproject.toml to point at a custom module:
Run vercel dev for local development instead of running Uvicorn directly. It serves your app the same way production does, using your app instance, so the behavior you test locally matches what you deploy. This also lets you exercise features such as cron routes before shipping.
Python functions are not tree-shaken, so Vercel bundles every file reachable at build time, up to a 500 MB limit. List only the packages you need at runtime in pyproject.toml or requirements.txt, and exclude tests, fixtures, and other development files with excludeFiles in vercel.json:
The pattern is a glob relative to your project root, so adjust it to match where your Python files live.
- Read the full FastAPI on Vercel documentation
- Browse backend templates you can deploy in one step
- Learn how the Python runtime runs your code
- Learn how Vercel Functions run your server code
- Understand pricing and scaling with Fluid compute
- Run code before requests with Routing Middleware
- Defer background work from your routes with Vercel Queues, or orchestrate multi-step tasks with Vercel Workflows