
How to Implement Rate Limiting in Express for Node.js

Antonello Zanini


Rate limiting is a fundamental mechanism for controlling the number of requests a client can make to a server in a given time frame. In a world where more than 30% of web traffic comes from malicious bots, that proactive strategy is critical to protect servers from abuse.

In this tutorial, you'll delve into the concept of rate limiting and understand why you need it in your Node.js backend. Then, you'll learn how to implement it in Express to block or slow down incoming excess requests.

It's time to become a Node.js rate limiting expert!

What Is Rate Limiting?

Rate limiting is a strategy for limiting network traffic by placing a cap on how often an actor can call the same API in a given time frame. That's essential for controlling the volume of requests a client can make to a server in a certain amount of time.

To better understand how this mechanism works, consider a scenario where you have a Node.js backend that exposes some public endpoints. Suppose a malicious user targets your server and writes a simple script to overload it with automated requests. The performance of your server will degrade as a result, hindering the experience of all other users. In an extreme situation, your server may even go offline. By limiting the number of requests the same IP can make in a given time frame, you can avoid all that!

Specifically, there are two approaches to rate limiting:

  • Blocking incoming requests: When a client exceeds the defined limits, deny its additional requests.
  • Slowing down requests: Introduce a delay for requests beyond the limits, making the caller wait longer and longer for a response.

In this guide, you'll see how to implement both approaches!

Why You Need API Rate Limiting in Node.js

Implementing API rate limiting is crucial for maintaining stability, security, and fair usage of your Node.js application. In particular, controlling the rate at which requests are processed helps enforce usage limits, prevents server overloads, and safeguards against malicious attacks.

Here's a closer look at scenarios where limiting requests in Node.js is especially useful:

  • Anti-bot and anti-scraping measures: Limiting the frequency of requests from a single source helps mitigate bot activity. Automated scripts might overload your public endpoints with a high volume of traffic, but rate limiting the incoming requests is an effective defense against that malicious behavior.
  • DoS attack prevention: Rate limiting is a key strategy to protect your backend against DoS (Denial of Service) attacks. By restricting the rate at which requests are accepted, you can reduce the impact of large-scale, coordinated attacks that attempt to overwhelm your server infrastructure.
  • Implementing limitations for paid plans: For applications offering different subscription tiers, rate limiting enables you to enforce usage limits based on a user's plan. This ensures that users can access resources only as frequently as determined by their plan.
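
To make that last point concrete, here is a minimal sketch of plan-based limits built with express-rate-limit, the library covered in the next sections. The req.user.plan field is a hypothetical stand-in for however your application stores a user's subscription tier:

javascript
const { rateLimit } = require("express-rate-limit");

// hypothetical tiers: paying users get a larger request budget than free users
const planLimiter = rateLimit({
  windowMs: 60 * 60 * 1000, // 1 hour
  // `limit` also accepts a function, so the cap can depend on the request
  limit: (req) => (req.user?.plan === "pro" ? 1000 : 100),
});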

Prerequisites

To follow this tutorial, you need a Node.js 18+ application. For example, the following basic Express server will be enough:

javascript
const express = require("express");

const port = 3000;

// initialize an Express server
const app = express();

// define a sample endpoint
app.get("/hello-world", (req, res) => {
  res.send("Hello, World!");
});

// start the server
app.listen(port, () => {
  console.log(`Server listening at http://localhost:${port}`);
});

This exposes a single /hello-world endpoint that returns “Hello, World!”

For a faster setup, clone the GitHub repository that supports this guide:

bash
git clone https://github.com/Tonel/nodejs-rate-limiting-demo

You'll find the Express server above and further implementations in dedicated branches.

In the following two sections, you'll learn how to integrate rate limiting behavior into a Node.js application using the following libraries:

  • express-rate-limit: to block requests that exceed the defined limits.
  • express-slow-down: to slow down excess requests instead of rejecting them outright.

Let's dive in!

Blocking Requests With express-rate-limit in Node.js

Follow the steps below to block excess requests in your Node.js backend using express-rate-limit.

Getting Started

express-rate-limit is an npm library that provides a rate limiting middleware for Express, making it easy to limit repeated requests to all APIs or only to specific endpoints. The middleware allows you to control how many requests the same user can make to the same endpoints before the application starts returning 429 Too Many Requests errors.

Add the express-rate-limit npm package to your project's dependencies with:

bash
npm install express-rate-limit

Implement the Rate Limiting Blocking Logic in Node.js

First, import the rateLimit function exposed by express-rate-limit:

javascript
const { rateLimit } = require("express-rate-limit");
// for ESM users -> import { rateLimit } from "express-rate-limit";

Use it to define your rate limiting middleware as below:

javascript
const limiter = rateLimit({
  windowMs: 5 * 60 * 1000, // 5 minutes
  limit: 10, // each IP can make up to 10 requests per `windowMs` (5 minutes)
  standardHeaders: true, // add the `RateLimit-*` headers to the response
  legacyHeaders: false, // remove the `X-RateLimit-*` headers from the response
});

Note that rateLimit accepts an options object and returns the rate limiting middleware. The options used in the example above are:

  • windowMs: The time frame where requests are checked for rate limiting. The default value is 60000 (1 minute).
  • limit: The maximum number of connections to allow during the windowMs time span. By default, it's 5.
  • standardHeaders: To enable support for the RateLimit headers recommended by the IETF. The default value is false.
  • legacyHeaders: To send the legacy rate limit X-RateLimit-* headers in the error responses. The value is true by default.

Other useful parameters are:

  • message: The response body to return when a request is rate limited. The default message is “Too many requests, please try again later.”
  • statusCode: The HTTP status code to set in the rate limiting error responses. The default value is 429.
  • skipFailedRequests: To avoid counting failed requests in limit. Set to false by default.
  • skipSuccessfulRequests: To avoid counting successful requests in limit. The value is false by default.
  • keyGenerator: The function that contains the logic used to uniquely identify users. By default, it checks the IP address of incoming requests.

Check out the express-rate-limit documentation for a complete list of all the available options.
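
As a rough sketch of how those extra options fit together (the X-Api-Key header below is just an illustrative assumption, not something your clients necessarily send):

javascript
const { rateLimit } = require("express-rate-limit");

const apiLimiter = rateLimit({
  windowMs: 5 * 60 * 1000, // 5 minutes
  limit: 10,
  statusCode: 429, // status code returned for rate-limited requests
  message: { error: "Rate limit exceeded. Try again in 5 minutes." }, // custom response body
  skipFailedRequests: true, // requests that end in an error response don't count toward the limit
  // identify callers by a hypothetical API key header, falling back to the IP address
  keyGenerator: (req) => req.get("X-Api-Key") ?? req.ip,
});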

You now know what express-rate-limit has to offer. What's left is to add its rate limiting capabilities to your application. Register the limiter middleware to all endpoints with:

javascript
app.use(limiter);

If you instead want to rate limit APIs under a specific path, use the limiter middleware as below:

javascript
app.use("/public", limiter);

To protect only a certain endpoint, pass limiter as a parameter in the endpoint definition:

javascript
app.get("/hello-world", limiter, (req, res) => { // ... });

Great! You just implemented rate limiting in Node.js!

Testing Rate Limiting

Launch your Node.js application and verify that the rate limiting logic works as expected. Otherwise, check out the express-rate-limit branch from the repository supporting this article:

bash
git checkout express-rate-limit

Enter the project folder, install the dependencies, and start the Express development server:

bash
cd nodejs-rate-limiting-demo
npm install
npm run start

Your rate limiting Node.js application should be listening locally on port 3000. Open your favorite HTTP client and call the /hello-world endpoint:

hello world endpoint

The first ten times, the API will return the message “Hello, World!”

Inspect the response headers:

Response headers

Note the RateLimit-* headers added to the responses by express-rate-limit. These are useful for the caller to understand how many more requests it can make before getting blocked.

On the eleventh API call within a 5-minute time frame, the request will fail with a 429 HTTP error: “Too many requests, please try again later.”

Request error
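
If you prefer to script this check instead of clicking through an HTTP client, here's a quick sketch that uses Node 18's built-in fetch against the server above (the RateLimit-Remaining header is available because standardHeaders was enabled earlier):

javascript
// fire 11 requests in a row: the 11th should come back with a 429
(async () => {
  for (let i = 1; i <= 11; i++) {
    const res = await fetch("http://localhost:3000/hello-world");
    const remaining = res.headers.get("ratelimit-remaining");
    console.log(`request #${i}: status ${res.status}, remaining: ${remaining}`);
  }
})();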

Well done! The Express rate limiting functionality has been implemented correctly!

Slowing Down Requests With express-slow-down

This step-by-step section will teach you how to use express-slow-down to slow down requests and avoid overloading your Node.js server.

Getting Started

express-slow-down provides a rate limiting middleware built on top of express-rate-limit. Instead of blocking requests outright, express-slow-down slows down the processing of incoming requests that exceed defined limits.

Install it with:

bash
npm install express-slow-down

Implement the Rate Limiting Slow-Down Logic in Node.js

Import the slowDown function from express-slow-down in your Express application:

javascript
const { slowDown } = require("express-slow-down");
// for ESM users -> import { slowDown } from "express-slow-down";

Use it to set up your slowing down rate limiting middleware:

javascript
const limiter = slowDown({
  windowMs: 5 * 60 * 1000, // 5 minutes
  delayAfter: 10, // allow 10 requests per `windowMs` (5 minutes) without slowing them down
  delayMs: (hits) => hits * 200, // add 200 ms of delay to every request after the 10th
  maxDelayMs: 5000, // max global delay of 5 seconds
});

The slowDown function provided by express-slow-down works just like rateLimit from express-rate-limit. The main difference is that it supports the following additional options:

  • delayAfter: The maximum number of incoming requests allowed during windowMs before the middleware starts delaying their processing. The default value is 1.
  • delayMs: The requests-to-rate-limit delay. It can be the delay itself in milliseconds or a function that defines custom behavior. By default, it increases the delay by 1 second for every request over the limit.
  • maxDelayMs: The absolute maximum value that delayMs can reach. It defaults to Infinity.

Again, you can apply the limiter middleware to all endpoints or only to some APIs. To add it to all requests, use:

javascript
app.use(limiter);
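
A pattern worth knowing about, though not part of the demo repository: since express-slow-down is built on top of express-rate-limit, you can chain the two, slowing excess requests down first and blocking them outright past a harder cap. A sketch:

javascript
const { rateLimit } = require("express-rate-limit");
const { slowDown } = require("express-slow-down");

// slow requests down after the 10th in the window...
const speedLimiter = slowDown({
  windowMs: 5 * 60 * 1000, // 5 minutes
  delayAfter: 10,
  delayMs: (hits) => hits * 200,
});

// ...and reject them entirely after the 30th
const hardLimiter = rateLimit({
  windowMs: 5 * 60 * 1000, // 5 minutes
  limit: 30,
  standardHeaders: true,
  legacyHeaders: false,
});

// order matters: the slow-down middleware runs before the hard limit
app.use(speedLimiter, hardLimiter);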

Fantastic! You just implemented a rate limiter that slows down excess incoming requests!

Testing express-slow-down

You now have to ensure that the Node.js server slows down incoming requests as desired. Start your local server by following the steps above, or check out the express-slow-down branch of the GitHub repository that backs this tutorial:

bash
git checkout express-slow-down

Access the project directory, install the dependencies, and launch the development server:

bash
cd nodejs-rate-limiting-demo
npm install
npm run start

The local Express application will listen on port 3000. Open your HTTP client and call the /hello-world endpoint:

Call endpoint

For the first ten calls, the server will return the response “Hello, World!” immediately. Then, the responses will start to become slower and slower:

Slower responses

Take a look at the response time. It went from 33ms to 2.23 seconds. This makes sense considering the slowing down logic:

text
(hits * 200ms)

hits is 11, so 11 * 200ms = 2.2 seconds of delay, plus the ~33ms baseline, which brings us to roughly 2.23 seconds.

Et voilà! Now you understand rate limiting!

Wrapping Up: Protect Your Node.js Server From Malicious Behavior With Rate Limiting

In this blog post, we took a look at what rate limiting is and what benefits it introduces when applied to a Node.js application using Express.

You now know:

  • The definition of rate limiting
  • Why you should limit the number of requests users can make to your application
  • How to implement the two most popular approaches to rate limiting in Node.js: express-rate-limit and express-slow-down

Thanks for reading!

P.S. If you liked this post, subscribe to our JavaScript Sorcery list for a monthly deep dive into more magical JavaScript tips and tricks.

P.P.S. If you need an APM for your Node.js app, go and check out the AppSignal APM for Node.js.

Antonello Zanini

Guest author Antonello is a software engineer, but prefers to call himself a Technology Bishop. Spreading knowledge through writing is his mission.
