fix: add missing postgres indexes #3910

jessicamcinchak · 2024-11-04T18:19:57Z

Follows on from publishing debugging today: https://opensystemslab.slack.com/archives/C01E3AC0C03/p1730715895034839

Approached this from two directions:

Which indexes are we missing? Where is postgres doing more sequential scans than index scans?

This blog has a useful sample query: https://erikrw.hashnode.dev/how-to-identify-missing-indexes-in-postgresql
From these results, I then cross-checked against how we commonly query those tables (eg which columns appear in where params)
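For reference, a minimal sketch of this kind of check (not necessarily the exact query from the blog post, which isn't reproduced here). It assumes the standard pg_stat_user_tables statistics view and simply lists tables in the public schema that have been sequentially scanned more often than index scanned:

```sql
-- Sketch only: tables where sequential scans outnumber index scans.
-- idx_scan is NULL for tables with no indexes at all, hence the coalesce.
SELECT
  relname                AS table_name,
  seq_scan,
  coalesce(idx_scan, 0)  AS idx_scan,
  seq_tup_read,          -- rows read by sequential scans; high values hurt most
  n_live_tup             AS approx_row_count
FROM pg_stat_user_tables
WHERE schemaname = 'public'
  AND seq_scan > coalesce(idx_scan, 0)
ORDER BY seq_tup_read DESC;
```

Small tables will often show up here legitimately, since the planner may prefer a sequential scan when a table only has a few rows, so the results still need cross-checking against real query patterns.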

Which existing indexes are unused? Can any be removed?

Ran select * from pg_stat_user_indexes where schemaname = 'public' order by idx_scan asc; for this one
We do have multiple cases of existing indexes that have never been hit, but they're all direct remnants of "Primary keys" or "Unique keys" created via Hasura and therefore don't make sense to drop
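A possible refinement of the query above, sketched on the assumption that joining to the pg_index catalog is acceptable here: it flags which never-used indexes back a primary key or unique constraint, so those can be excluded from any "safe to drop" shortlist at a glance.

```sql
-- Sketch only: indexes in the public schema that have never been scanned,
-- annotated with whether they enforce a primary key or unique constraint.
SELECT
  s.relname       AS table_name,
  s.indexrelname  AS index_name,
  s.idx_scan,
  i.indisprimary  AS is_primary_key,
  i.indisunique   AS is_unique,
  pg_size_pretty(pg_relation_size(s.indexrelid)) AS index_size
FROM pg_stat_user_indexes s
JOIN pg_index i ON i.indexrelid = s.indexrelid
WHERE s.schemaname = 'public'
  AND s.idx_scan = 0
ORDER BY pg_relation_size(s.indexrelid) DESC;
```

Any row where both is_primary_key and is_unique are false would be a candidate for a closer look; per the results described above, there were no such cases.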

I'd hoped that published_flows & flows would come up higher on the "need to improve" checks, but the truth is they didn't!

jessicamcinchak · 2024-11-04T18:25:45Z

hasura.planx.uk/migrations/1730742914548_create_index_bops_applications_session_id/up.sql

+CREATE  INDEX "reconciliation_requests_session_id_idx" on
+  "public"."reconciliation_requests" using hash ("session_id");
+CREATE  INDEX "published_flows_created_at_idx" on
+  "public"."published_flows" using btree ("created_at");


A couple notes about this migration:

Why a mix of hash versus btree index types?

I'm using hash indexes when a column is exclusively queried via an equality check (eg session id), and btree indexes when a) a column is queried via equality or range checks or b) multiple columns are combined into the same index

Postgres docs on this are helpful https://www.postgresql.org/docs/current/indexes-types.html
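To illustrate the distinction, here is a rough sketch of the two query shapes. The literal values are placeholders, and it assumes session_id is a uuid column, which isn't shown in this diff. Running these with EXPLAIN after the migration should show whether each index is eligible:

```sql
-- Equality check: the only operator a hash index supports (=).
-- Placeholder uuid; assumes session_id is of type uuid.
EXPLAIN
SELECT *
FROM public.reconciliation_requests
WHERE session_id = '00000000-0000-0000-0000-000000000000';

-- Range check: needs a btree index, a hash index cannot serve this.
EXPLAIN
SELECT *
FROM public.published_flows
WHERE created_at >= now() - interval '7 days';
```

Whether the planner actually picks the index still depends on table size and statistics, so a sequential scan in the plan for a small table is not necessarily a sign that the index is wrong.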

I'm adding an index to published_flows.created_at because that is how all of our current queries are ordering, but I have not actually checked/compared yet if there's a difference to sort by published flow id (or jogged my memory about why we don't do this already). Even if we swap how we order in the future, an index on created_at should still be useful/hit here when we do date comparisons for reconciliation

It may also be worth adding a DESC in here as we're always searching in this order (docs).

Cool good shout - that's now updated 👍 Worth noting order isn't possible to specify in Hasura GUI for adding indexes, but worked no problem when added via "SQL" panel directly
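The updated statement isn't shown in the diff above, so the following is only a sketch of what a descending index on this column would look like, alongside a hypothetical "latest first" query of the sort it is meant to serve:

```sql
-- Sketch only: same index name and column as the diff above, with DESC added.
CREATE INDEX "published_flows_created_at_idx" on
  "public"."published_flows" using btree ("created_at" DESC);

-- Hypothetical query shape: fetch the most recently created row first.
SELECT *
FROM public.published_flows
ORDER BY created_at DESC
LIMIT 1;
```

Postgres can also scan a btree index backwards, so for a single-column index the gain from DESC may be modest; it mainly makes the index order match how the queries read.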

jessicamcinchak · 2024-11-04T18:26:49Z

hasura.planx.uk/metadata/tables.yaml

@@ -157,7 +157,7 @@
      definition:
        enable_manual: false
        insert:
-          columns: "*"
+          columns: '*'


This file is only formatting changes 🌀

jessicamcinchak · 2024-11-04T18:27:41Z

Publishing timeout errors

github-actions · 2024-11-04T18:41:47Z

Removed vultr server and associated DNS entries

DafyddLlyr · 2024-11-04T20:23:08Z

hasura.planx.uk/migrations/1730742914548_create_index_bops_applications_session_id/up.sql

It may also be worth adding a DESC in here as we're always searching in this order (docs).

add missing indexes

1a6ed0f

jessicamcinchak marked this pull request as ready for review November 4, 2024 18:26

jessicamcinchak requested a review from a team November 4, 2024 18:28

DafyddLlyr approved these changes Nov 4, 2024

add DESC order to published flows date index

e40a3d1

jessicamcinchak merged commit aa80b32 into main Nov 5, 2024
12 checks passed

jessicamcinchak deleted the jess/pg-indexes-maintenance branch November 5, 2024 08:30
