Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Experimental POC: Materialized Views #16051

Closed

Conversation

rohit-nayak-ps
Copy link
Contributor

@rohit-nayak-ps rohit-nayak-ps commented Jun 4, 2024

Description

This PR is an experiment in materializing joined tables to determine if it can be reliably built, estimate the complexity of implementing it with the current VReplication framework, and identify inherent limitations.

Current Implementation

User View

  • Utilize Materialize with a single table-setting where the query is a view. The view should include a directive like select /*vt+ view=orders_view */.
  • The target and source keyspace must be the same.
  • The filter generated by Materialize has the materialized view's table name as the match, and the filter is the view query.
  • The view must contain the primary key columns for each participating table, each part of an index. This ensures that updates and deletes to lookup tables, which result in bulk DMLs, are more efficient.
  • Initially, only integer primary keys are supported until we better understand the consequences of bulk DMLs and supporting views.

Todos

  • Remove hardcoded column name for base table!
  • Handle case where column names are repeated across joined tables
  • More unit tests in vplayer for join plans
  • More unit tests in vstreamer for join plans
  • Validation for parameters and view query: all PKs to be specified, required directive,
  • Remove excess logs
  • Allow target keyspace to be different: in that case we need to also replicate the lookup tables since future inserts will otherwise fail, as we exec the view query to get the new row whenever there is an insert into the base row.!

Related Issue(s)

RFC todo

Checklist

  • "Backport to:" labels have been added if this change should be back-ported to release branches
  • If this change is to be back-ported to previous releases, a justification is included in the PR description
  • Tests were added or are not required
  • Did the new or modified tests pass consistently locally and on CI?
  • Documentation was added or is not required

Deployment Notes

Copy link
Contributor

vitess-bot bot commented Jun 4, 2024

Review Checklist

Hello reviewers! 👋 Please follow this checklist when reviewing this Pull Request.

General

  • Ensure that the Pull Request has a descriptive title.
  • Ensure there is a link to an issue (except for internal cleanup and flaky test fixes), new features should have an RFC that documents use cases and test cases.

Tests

  • Bug fixes should have at least one unit or end-to-end test, enhancement and new features should have a sufficient number of tests.

Documentation

  • Apply the release notes (needs details) label if users need to know about this change.
  • New features should be documented.
  • There should be some code comments as to why things are implemented the way they are.
  • There should be a comment at the top of each new or modified test to explain what the test does.

New flags

  • Is this flag really necessary?
  • Flag names must be clear and intuitive, use dashes (-), and have a clear help text.

If a workflow is added or modified:

  • Each item in Jobs should be named in order to mark it as required.
  • If the workflow needs to be marked as required, the maintainer team must be notified.

Backward compatibility

  • Protobuf changes should be wire-compatible.
  • Changes to _vt tables and RPCs need to be backward compatible.
  • RPC changes should be compatible with vitess-operator
  • If a flag is removed, then it should also be removed from vitess-operator and arewefastyet, if used there.
  • vtctl command output order should be stable and awk-able.

@vitess-bot vitess-bot bot added NeedsBackportReason If backport labels have been applied to a PR, a justification is required NeedsDescriptionUpdate The description is not clear or comprehensive enough, and needs work NeedsIssue A linked issue is missing for this Pull Request NeedsWebsiteDocsUpdate What it says labels Jun 4, 2024
@github-actions github-actions bot added this to the v21.0.0 milestone Jun 4, 2024
@github-actions github-actions bot added the Stale Marks PRs as stale after a period of inactivity, which are then closed after a grace period. label Jul 5, 2024
@rohit-nayak-ps rohit-nayak-ps removed the Stale Marks PRs as stale after a period of inactivity, which are then closed after a grace period. label Jul 5, 2024
@github-actions github-actions bot added the Stale Marks PRs as stale after a period of inactivity, which are then closed after a grace period. label Aug 5, 2024
@github-actions github-actions bot closed this Aug 12, 2024
@vitessio vitessio deleted a comment from github-actions bot Aug 18, 2024
@vitessio vitessio deleted a comment from github-actions bot Aug 18, 2024
@vitessio vitessio deleted a comment from github-actions bot Aug 18, 2024
@github-actions github-actions bot removed the Stale Marks PRs as stale after a period of inactivity, which are then closed after a grace period. label Aug 19, 2024
@rohit-nayak-ps rohit-nayak-ps force-pushed the rohit/materialized-views branch from c9baeb6 to 98c83c7 Compare August 21, 2024 09:50
@rohit-nayak-ps rohit-nayak-ps removed NeedsDescriptionUpdate The description is not clear or comprehensive enough, and needs work NeedsBackportReason If backport labels have been applied to a PR, a justification is required labels Sep 2, 2024
Copy link
Contributor

github-actions bot commented Oct 3, 2024

This PR is being marked as stale because it has been open for 30 days with no activity. To rectify, you may do any of the following:

  • Push additional commits to the associated branch.
  • Remove the stale label.
  • Add a comment indicating why it is not stale.

If no action is taken within 7 days, this PR will be closed.

@github-actions github-actions bot added the Stale Marks PRs as stale after a period of inactivity, which are then closed after a grace period. label Oct 3, 2024
@vitess-bot vitess-bot modified the milestones: v21.0.0, v22.0.0 Oct 8, 2024
@github-actions github-actions bot removed the Stale Marks PRs as stale after a period of inactivity, which are then closed after a grace period. label Oct 9, 2024
Copy link
Contributor

github-actions bot commented Nov 9, 2024

This PR is being marked as stale because it has been open for 30 days with no activity. To rectify, you may do any of the following:

  • Push additional commits to the associated branch.
  • Remove the stale label.
  • Add a comment indicating why it is not stale.

If no action is taken within 7 days, this PR will be closed.

@github-actions github-actions bot added the Stale Marks PRs as stale after a period of inactivity, which are then closed after a grace period. label Nov 9, 2024
Copy link
Contributor

This PR was closed because it has been stale for 7 days with no activity.

@github-actions github-actions bot closed this Nov 16, 2024
@rohit-nayak-ps
Copy link
Contributor Author

One challenge we encountered while implementing this PR is the potential exponential increase in write QPS due to updates in the lookup tables. For instance, if a country name is modified, the change would require updating all rows referencing that country in the view table. In cases where millions of rows are affected, this would result in a large-scale table update, significantly increasing write latency and potentially causing application unavailability.

Additionally, such a large update would lead to a spike in replica lags, leaving the view table with stale data during the update period, as subsequent updates would be queued behind the large update.

Given these challenges, we’ve decided to temporarily put this implementation on hold until we identify a more efficient way to address the issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Component: VReplication Do Not Merge NeedsIssue A linked issue is missing for this Pull Request NeedsWebsiteDocsUpdate What it says Stale Marks PRs as stale after a period of inactivity, which are then closed after a grace period. Type: Feature
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants