A simple log-drain you can deploy to export log messages from Vercel to one or more destinations!
Available with the `cloudwatch` feature (enabled by default).
To use the CloudWatch driver, you'll need to either:

- set the environment variable `VERCEL_LOG_DRAIN_ENABLE_CLOUDWATCH=true`, or
- pass the `--enable-cloudwatch` CLI flag
The log drain will create new log groups and log streams if they are not already present. Log groups follow the scheme `/vercel/{project_name}/{vercel_source}`, where `project_name` is the Vercel project name and `vercel_source` is one of `build`, `edge`, `external`, `lambda`, or `static`. The log stream name is the Vercel deployment ID.
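The naming scheme above can be sketched as follows (the project name and helper function are hypothetical, for illustration only):

```python
VALID_SOURCES = {"build", "edge", "external", "lambda", "static"}

def log_group_name(project_name: str, vercel_source: str) -> str:
    # Log groups follow the /vercel/{project_name}/{vercel_source} scheme.
    if vercel_source not in VALID_SOURCES:
        raise ValueError(f"unknown vercel_source: {vercel_source!r}")
    return f"/vercel/{project_name}/{vercel_source}"

# e.g. lambda logs for a project called "my-app":
print(log_group_name("my-app", "lambda"))  # → /vercel/my-app/lambda
```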
AWS permissions used:

- `logs:DescribeLogGroups`
- `logs:DescribeLogStreams`
- `logs:CreateLogGroup`
- `logs:CreateLogStream`
- `logs:PutLogEvents`
- `logs:PutRetentionPolicy`
Below are `aws_iam_role`, `aws_iam_role_policy`, and `aws_iam_policy_document` definitions which grant a minimal set of permissions required to push logs to CloudWatch:
### Simple
```hcl
resource "aws_iam_role" "vercel_log_drain" {
  name               = "vercel-log-drain"
  description        = "Role to be used by the vercel log drain deployment"
  assume_role_policy = data.aws_iam_policy_document.vercel_log_drain_assume.json
}

data "aws_iam_policy_document" "vercel_log_drain_assume" {
  # An sts:AssumeRole policy for the service. This varies depending on how
  # you intend to deploy/run the service.
}

resource "aws_iam_role_policy" "vercel_log_drain_policy" {
  name   = "vercel-log-drain-policy"
  role   = aws_iam_role.vercel_log_drain.id
  policy = data.aws_iam_policy_document.vercel_log_drain_permissions.json
}

data "aws_iam_policy_document" "vercel_log_drain_permissions" {
  statement {
    actions = [
      "logs:DescribeLogGroups",
      "logs:DescribeLogStreams",
      "logs:CreateLogGroup",
      "logs:CreateLogStream",
      "logs:PutLogEvents",
      "logs:PutRetentionPolicy",
    ]
    resources = ["*"]
  }
}
```
### Advanced
```hcl
data "aws_caller_identity" "current" {}

variable "aws_region" {
  type        = string
  description = "AWS region to run in"
}

resource "aws_iam_role" "vercel_log_drain" {
  name               = "vercel-log-drain"
  description        = "Role to be used by the vercel log drain deployment"
  assume_role_policy = data.aws_iam_policy_document.vercel_log_drain_assume.json
}

data "aws_iam_policy_document" "vercel_log_drain_assume" {
  # An sts:AssumeRole policy for the service. This varies depending on how
  # you intend to deploy/run the service.
}

resource "aws_iam_role_policy" "vercel_log_drain_policy" {
  name   = "vercel-log-drain-policy"
  role   = aws_iam_role.vercel_log_drain.id
  policy = data.aws_iam_policy_document.vercel_log_drain_permissions.json
}

data "aws_iam_policy_document" "vercel_log_drain_permissions" {
  statement {
    actions = [
      "logs:CreateLogGroup",
      "logs:PutRetentionPolicy",
    ]
    resources = [
      provider::aws::arn_build(
        "aws",
        "logs",
        var.aws_region,
        data.aws_caller_identity.current.account_id,
        "log-group:/vercel/*"
      )
    ]
  }

  statement {
    actions = [
      "logs:CreateLogStream",
      "logs:DescribeLogStreams",
      "logs:PutLogEvents",
    ]
    resources = [
      provider::aws::arn_build(
        "aws",
        "logs",
        var.aws_region,
        data.aws_caller_identity.current.account_id,
        "log-group:/vercel/*:*"
      )
    ]
  }

  statement {
    actions   = ["logs:DescribeLogGroups"]
    resources = ["*"]
  }
}
```
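The `arn_build` calls above just assemble standard CloudWatch Logs ARNs from their components. A sketch of the resulting strings, using placeholder region and account values:

```python
def logs_arn(partition: str, region: str, account_id: str, resource: str) -> str:
    # Mirrors the shape of the ARNs produced above:
    # arn:{partition}:logs:{region}:{account_id}:{resource}
    return f"arn:{partition}:logs:{region}:{account_id}:{resource}"

# The two scoped statements would target ARNs shaped like:
print(logs_arn("aws", "us-east-1", "123456789012", "log-group:/vercel/*"))
# → arn:aws:logs:us-east-1:123456789012:log-group:/vercel/*
print(logs_arn("aws", "us-east-1", "123456789012", "log-group:/vercel/*:*"))
# → arn:aws:logs:us-east-1:123456789012:log-group:/vercel/*:*
```

The `log-group:/vercel/*:*` form (with the trailing `:*`) covers the log streams inside the groups, which is why the stream-level actions use it.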
Available with the `loki` feature (enabled by default).
To use the Loki driver, you'll need to set:

- `--enable-loki` (or the env var `VERCEL_LOG_DRAIN_ENABLE_LOKI=true`)
- `--loki-url` (or the env var `VERCEL_LOG_DRAIN_LOKI_URL`)
- (optional, if you have basic auth) `--loki-basic-auth-user` and `--loki-basic-auth-pass` (or the corresponding env vars `VERCEL_LOG_DRAIN_LOKI_USER` and `VERCEL_LOG_DRAIN_LOKI_PASS`)
| CLI Flag | Environment Variable | Default Value | Description |
|---|---|---|---|
| `-l, --log` | `VERCEL_LOG_DRAIN_LOG_LEVEL` | `INFO` | Log level |
| `-i, --ip` | `VERCEL_LOG_DRAIN_IP` | `"0.0.0.0"` | IP address to bind to |
| `-p, --port` | `VERCEL_LOG_DRAIN_PORT` | `8000` | Port number |
| `--vercel-verify` | `VERCEL_VERIFY` | - | Vercel verification token |
| `--vercel-secret` | `VERCEL_SECRET` | - | Vercel secret |
| `--enable-metrics` | `VERCEL_LOG_DRAIN_ENABLE_METRICS` | - | Enable Prometheus metrics endpoint |
| `--metrics-prefix` | `VERCEL_LOG_DRAIN_METRICS_PREFIX` | `"drain"` | The shared prefix to use for all metrics |
| `--enable-cloudwatch` | `VERCEL_LOG_DRAIN_ENABLE_CLOUDWATCH` | - | Enable CloudWatch integration |
| `--enable-loki` | `VERCEL_LOG_DRAIN_ENABLE_LOKI` | - | Enable Loki integration |
| `--loki-url` | `VERCEL_LOG_DRAIN_LOKI_URL` | `""` | Loki URL |
| `--loki-basic-auth-user` | `VERCEL_LOG_DRAIN_LOKI_USER` | `""` | Loki basic auth username |
| `--loki-basic-auth-pass` | `VERCEL_LOG_DRAIN_LOKI_PASS` | `""` | Loki basic auth password |
Vercel requires that you host the application over HTTP or HTTPS, and that it be accessible from the public internet. `vercel-log-drain` itself only supports HTTP, so you should put an HTTPS load balancer in front of it.
To add new log drains in Vercel:

- Go to the Vercel account dashboard.
- Find the team to configure, and click `...` → Manage.
- Select Log Drains.
The parameters you'll need for `vercel-log-drain` are:

- Delivery format: JSON
- Custom secret: the Vercel secret you set with `--vercel-secret` or `VERCEL_SECRET`
- Endpoint: `https://${your_hostname}/vercel`
- (optional) Custom headers: add a random secret header value, and configure your load balancer to require that header (to help filter out bot noise)
Pass the value of the `x-vercel-verify` header (provided by Vercel) to `vercel-log-drain` with the `--vercel-verify` argument or `VERCEL_VERIFY` environment variable.
Note: Vercel does not sign the initial verification request, and expects the endpoint to return HTTP 200 OK with the `x-vercel-verify` token in response to that request. Configuring Vercel to send an extra custom header, and requiring it in your HTTPS load balancer, should let the load balancer drop the majority of bot traffic without it ever touching `vercel-log-drain` or exposing your account's `x-vercel-verify` token.
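After verification, Vercel signs each delivery by HMACing the raw request body with your secret and sending the digest in the `x-vercel-signature` header. A minimal verification sketch in Python, assuming the HMAC-SHA1 hex-digest scheme described in Vercel's log drain documentation (the secret and body below are made up):

```python
import hashlib
import hmac

def verify_signature(secret: str, body: bytes, signature: str) -> bool:
    # Recompute the HMAC-SHA1 hex digest of the raw request body and
    # compare it to the x-vercel-signature header value in constant time.
    expected = hmac.new(secret.encode(), body, hashlib.sha1).hexdigest()
    return hmac.compare_digest(expected, signature)

# Hypothetical values for illustration:
body = b'{"logs": []}'
sig = hmac.new(b"my-secret", body, hashlib.sha1).hexdigest()
print(verify_signature("my-secret", body, sig))  # → True
```

Always verify against the raw bytes of the body, before any JSON parsing or re-serialization, since any re-encoding can change the digest.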
In my deployment this handled about ~8M requests per month, with an average response time (LB → target) of 1-1.5ms and an average memory usage of ~5MB. A 3-node deployment (for redundancy) with `100m` CPU and `128MB` memory reservations should be able to go quite far.

The response times above are a bit unfair, because the system is designed to always respond to Vercel as fast as possible: messages are added to an internal queue and processed asynchronously from the POST request they were received in.
If you click Vercel's test log drain button when setting up your deployment, you may see some messages fail to parse. This is because a few of the test messages don't fully follow their documented structure (some fields are missing).
No real effort has been made yet to optimize the code; it is still performant enough to handle the load described above, but feel free to contribute optimizations or idiomatic code corrections, as I wrote this in a vacuum.
If you have structured JSON logging, i.e. the contents of `message` is a JSON string, the service attempts to parse it as JSON so that a fully-JSON message can be passed downstream, rather than a string containing JSON.

Example: `{ "message": { "method": "GET" } }` vs `{ "message": "{ \"method\": \"GET\" }" }`

This helps with log queries in CloudWatch, or in a downstream system you've modified to search or filter on data not just provided by Vercel, but also by your own JSON logging in the deployed application.
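The unwrapping behavior can be sketched like this (field names follow the example above; this is an illustration, not the actual Rust implementation):

```python
import json

def unwrap_message(event: dict) -> dict:
    # If the "message" field holds a JSON-object string, replace it with
    # the parsed object so downstream systems receive structured data.
    message = event.get("message")
    if isinstance(message, str):
        try:
            parsed = json.loads(message)
        except json.JSONDecodeError:
            return event  # plain-text messages pass through untouched
        if isinstance(parsed, dict):
            return {**event, "message": parsed}
    return event

print(unwrap_message({"message": "{ \"method\": \"GET\" }"}))
# → {'message': {'method': 'GET'}}
```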
`cargo` will build `vercel-log-drain` with all features by default:

| Feature | Description |
|---|---|
| `cloudwatch` | AWS CloudWatch driver |
| `loki` | Grafana Loki driver |

If you want a smaller binary, you can disable all of them with `--no-default-features`, and then re-enable only the features you use.
For example, to build `vercel-log-drain` with only AWS CloudWatch support:

```shell
cargo build --release --no-default-features --features cloudwatch
```

This can also be used when building the Docker image:

```shell
docker build -t vercel-log-drain --build-arg 'BUILD_ARGS=--no-default-features --features cloudwatch' .
```
```shell
# build
cargo build
# run the server
./target/debug/vercel-log-drain --enable-metrics --vercel-secret "deadbeefdeadbeefdeadbeefdeadbeefdeadbeefdeadbeefdeadbeefdeadbeef" --vercel-verify verify --log DEBUG
```
If you're using the Nix environment, there are some helpful scripts for running the server and sending test payloads to it!
```shell
# build
cargo build
# run the server -- you'll need to add env vars or your options here!
# example (with an HTTP sink server on :8080 logging all incoming requests):
run --enable-loki --loki-url http://localhost:8080/ingest
# send test payloads
test_drain ./src/fixtures/sample_1.json
```