Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Mirror Gateway document #2207

Merged
Merged
Show file tree
Hide file tree
Changes from 9 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
78 changes: 1 addition & 77 deletions docs/api/mirror-gateway.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,13 +2,11 @@

## Overview

Mirror Service is responsible for providing the `Register` and `Advertise` interface for the Vald Mirror Gateway.
Mirror Service is responsible for providing the `Register` interface for the Vald Mirror Gateway.

```rpc
service Mirror {
rpc Register(payload.v1.Mirror.Targets) returns (payload.v1.Mirror.Targets) {}

rpc Advertise(payload.v1.Mirror.Targets) returns (payload.v1.Mirror.Targets) {}
}
```

Expand Down Expand Up @@ -85,77 +83,3 @@ Register RPC is the method to register other Vald Mirror Gateway targets.
| 3 | INVALID_ARGUMENT |
| 4 | DEADLINE_EXCEEDED |
| 13 | INTERNAL |

## Advertise RPC

Advertise RPC is the method to advertise Vald Mirror Gateway targets.

### Input

- the scheme of `payload.v1.Mirror.Targets`.

```rpc
message Mirror {
message Target {
string host = 1;
uint32 port = 2;
}

message Targets {
repeated Target targets = 1;
}
}
```

- Mirror.Targets

| field | type | label | required | desc. |
| :-----: | :------------ | :----------------------------- | :------: | :------------------------------- |
| targets | Mirror.Target | repeated(Array[Mirror.Target]) | \* | The multiple target information. |

- Mirror.Target

| field | type | label | required | desc. |
| :---: | :----- | :---- | :------: | :------------------- |
| host | string | | \* | The target hostname. |
| port | uint32 | | \* | The target port. |

### Output

- the scheme of `payload.v1.Mirror.Targets`.

```rpc
message Mirror {
message Target {
string host = 1;
uint32 port = 2;
}

message Targets {
repeated Target targets = 1;
}
}
```

- Mirror.Targets

| field | type | label | required | desc. |
| :-----: | :------------ | :----------------------------- | :------: | :------------------------------- |
| targets | Mirror.Target | repeated(Array[Mirror.Target]) | | The multiple target information. |

- Mirror.Target

| field | type | label | required | desc. |
| :---: | :----- | :---- | :------: | :------------------- |
| host | string | | \* | The target hostname. |
| port | uint32 | | \* | The target port. |

### Status Code

| code | desc. |
| :--: | :---------------- |
| 0 | OK |
| 1 | CANCELLED |
| 3 | INVALID_ARGUMENT |
| 4 | DEADLINE_EXCEEDED |
| 13 | INTERNAL |
52 changes: 28 additions & 24 deletions docs/overview/component/mirror-gateway.md
Original file line number Diff line number Diff line change
Expand Up @@ -44,35 +44,39 @@ Then, while forwarding the user request, the Vald Mirror Gateway bypasses the in

On the other hand, if the incoming user request is an [Object API](https://vald.vdaas.org/docs/api/object/) or [Search API](https://vald.vdaas.org/docs/api/search/), it is bypassed to only a Vald LB Gateway in its own cluster without forwarding it to other Vald Mirror Gateways.

### Automatic rollback on failure
### Continuous processing on failure

The request may fail at the forwarding destination or the bypass destination.

If some requests fail, the vector data will not be consistent across Vald clusters.
If some of the requests fails, the processing continues based on their status code.

To keep index state consistency, the Vald Mirror Gateway will send the rollback request for the failed request. After the rollback request succeeds, the index state will be the same as before requesting.

The following is the list of rollback types.
The following is an overview of the process for each request.

- Insert Request
- Rollback Condition/Trigger
- Status code other than `ALREADY_EXISTS` exists
- Rollback request to the successful request
- REMOVE request
- Remove Request
- Rollback Condition/Trigger
- Status code other than `NOT_FOUND` exists
- Rollback request to the successful request
- UPSERT Request with old vector

- If the target host returns a status of `ALREADY_EXISTS`, the Update request is sent to this host.
- If all target hosts return a status of `ALREADY_EXISTS`, the Mirror Gateway returns `ALREADY_EXISTS`.
- If all target hosts return a status of `OK` or `ALREADY_EXISTS`, the Mirror Gateway returns `OK`.

- Update Request
- Rollback Condition/Trigger
- Status code other than `ALREADY_EXISTS` exists
- Rollback request to the successful request
- REMOVE Request if there is no old vector data
- UPDATE Request if there is old vector data

- If the target host returns a status of `NOT_FOUND`, the Insert request is sent to this host.
- If all target hosts return a status of `ALREADY_EXISTS`, the Mirror Gateway returns `ALREADY_EXISTS`.
- If all target hosts return a status of `OK` or `ALREADY_EXISTS`, the Mirror Gateway returns `OK`.
- If all target hosts return a status of `OK` or `NOT_FOUND`, the Mirror Gateway returns `OK`.

- Upsert Request
- Rollback Condition/Trigger
- Status code other than `ALREADY_EXISTS` exists
- Rollback request to the successful request
- REMOVE Request if there is no old vector data
- UPDATE Request if there is old vector data

- If all target hosts return a status of `ALREADY_EXISTS`, the Mirror Gateway returns `ALREADY_EXISTS`.
- If all target hosts return a status of `OK` or `ALREADY_EXISTS`, the Mirror Gateway returns `OK`.

- Remove Request

- If all target hosts return a status of `NOT_FOUND`, the Mirror Gateway returns `NOT_FOUND`.
- If all target hosts return a status of `OK` or `NOT_FOUND`, the Mirror Gateway returns `OK`.

- RemoveByTimestamp Request

- Same as `Remove Request`.

For more information, please refer to [Mirror Gateway Troubleshooting](https://vald.vdaas.org/docs/troubleshooting/mirror-gateway/).
94 changes: 94 additions & 0 deletions docs/troubleshooting/mirror-gateway.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,94 @@
# Mirror Gateway Troubleshooting

This page introduces the popular troubleshooting for Mirror Gateway.

Additionally, if you encounter some errors when using API, the [API status code](../api/status.md) helps you, too.

## Insert Operation

Mirror Gateway sends an Update request to its host if some requests are `ALREADY_EXISTS`.

Therefore, in addition to the [Insert API status code](../api/insert.md#status-code), the [Update API status code](../api/update.md#status-code) may also be returned to the user.

Here are some common reasons of error.

| name | common reason |
| :---------------- | :-------------------------------------------------------------------------------------------------------------------------------------------------- |
| CANCELLED | Executed cancel() of rpc from client/server-side or network problems between client and server. |
| INVALID_ARGUMENT | The Dimension of the request vector is NOT the same as Vald Agent's config, the requested vector's ID is empty, or some request payload is invalid. |
| DEADLINE_EXCEEDED | The RPC timeout setting is too short on the client/server side. |
| ALREADY_EXISTS | Request ID is already inserted. This status code is returned when all hosts return `ALREADY_EXISTS`. |
| NOT_FOUND | Requested ID is NOT inserted. This is the status code of the Update request. |
| INTERNAL | Target Vald cluster or network route has some critical error. |

`0 (OK)` is also returned when all hosts return `OK` and `ALREADY_EXISTS`.

## Update Operation

Mirror Gateway sends an Update request to its host if some requests are `NOT_FOUND`.

Therefore, in addition to the [Update API status code](../api/update.md#status-code), the [Insert API status code](../api/insert.md#status-code) may also be returned to the user.

Here are some common reasons of error.

| name | common reason |
| :---------------- | :-------------------------------------------------------------------------------------------------------------------------------------------------- |
| CANCELLED | Executed cancel() of rpc from client/server-side or network problems between client and server. |
| INVALID_ARGUMENT | The Dimension of the request vector is NOT the same as Vald Agent's config, the requested vector's ID is empty, or some request payload is invalid. |
| DEADLINE_EXCEEDED | The RPC timeout setting is too short on the client/server side. |
| NOT_FOUND | Requested ID is NOT inserted. This status code is returned when all hosts return `NOT_FOUND`. |
| ALREADY_EXISTS | Request pair of ID and vector is already inserted. This status code is returned when all hosts return `ALREADY_EXISTS`. |
hlts2 marked this conversation as resolved.
Show resolved Hide resolved
| INTERNAL | Target Vald cluster or network route has some critical error. |

`0 (OK)` is also returned in the following cases.
hlts2 marked this conversation as resolved.
Show resolved Hide resolved

- All hosts return `OK` and `NOT_FOUND`.
- All hosts return `OK` and `ALREADY_EXISTS`.

## Upsert Operation

The request process may not be completed when the response code is NOT `0 (OK)`.

Here are some common reasons of error.

| name | common reason |
| :---------------- | :-------------------------------------------------------------------------------------------------------------------------------------------------- |
| CANCELLED | Executed cancel() of rpc from client/server-side or network problems between client and server. |
| INVALID_ARGUMENT | The Dimension of the request vector is NOT the same as Vald Agent's config, the requested vector's ID is empty, or some request payload is invalid. |
| DEADLINE_EXCEEDED | The RPC timeout setting is too short on the client/server side. |
| ALREADY_EXISTS | Requested pair of ID and vector is already inserted. This status code is returned when all hosts return `ALREADY_EXISTS`. |
| INTERNAL | Target Vald cluster or network route has some critical error. |

`0 (OK)` is also returned when all hosts return `OK` and `ALREADY_EXISTS`.

## Remove Operation

The request process may not be completed when the response code is NOT `0 (OK)`.

Here are some common reasons of error.

| name | common reason |
| :---------------- | :---------------------------------------------------------------------------------------------- |
| CANCELLED | Executed cancel() of rpc from client/server-side or network problems between client and server. |
| INVALID_ARGUMENT | The Requested vector's ID is empty, or some request payload is invalid. |
| DEADLINE_EXCEEDED | The RPC timeout setting is too short on the client/server side. |
| NOT_FOUND | Requested ID is NOT inserted. This status code is returned when all hosts return `NOT_FOUND`. |
| INTERNAL | Target Vald cluster or network route has some critical error. |

`0 (OK)` is also returned when all hosts return `OK` and `NOT_FOUND`.

## RemoveByTimestamp Operation

The request process may not be completed when the response code is NOT `0 (OK)`.

Here are some common reasons of error.

| name | common reason |
| :---------------- | :---------------------------------------------------------------------------------------------- |
| CANCELLED | Executed cancel() of rpc from client/server-side or network problems between client and server. |
| INVALID_ARGUMENT | The Requested vector's ID is empty, or some request payload is invalid. |
| DEADLINE_EXCEEDED | The RPC timeout setting is too short on the client/server side. |
| NOT_FOUND | Requested ID is NOT inserted. This status code is returned when all hosts return `NOT_FOUND`. |
| INTERNAL | Target Vald cluster or network route has some critical error. |

`0 (OK)` is also returned when all hosts return `OK` and `NOT_FOUND`.
4 changes: 2 additions & 2 deletions docs/user-guides/mirroring-configuration.md
Original file line number Diff line number Diff line change
Expand Up @@ -41,8 +41,8 @@ gateway:
...
# gRPC client configuration (overrides defaults.grpc.client)
client: {}
# The interval to advertise addresses of Mirror Gateway to other Mirror Gateway.
advertise_interval: "1s"
# The duration to register other Mirror Gateways.
register_duration: "1s"
hlts2 marked this conversation as resolved.
Show resolved Hide resolved
# The target namespace to discover ValdMirrorTarget (CR) resource.
# The default value is its own namespace.
namespace: "vald"
Expand Down