Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

V0.8.1 unhandled S3 SlowDown error #5448

Closed
tuziben opened this issue Sep 25, 2024 · 2 comments
Closed

V0.8.1 unhandled S3 SlowDown error #5448

tuziben opened this issue Sep 25, 2024 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@tuziben
Copy link
Collaborator

tuziben commented Sep 25, 2024

Describe the bug
A clear and concise description of what the bug is.

version: V0.8.1

As the doc from AWS said: your application can achieve at least 3,500 PUT/COPY/POST/DELETE or 5,500 GET/HEAD requests per second per partitioned Amazon S3 prefix.

https://docs.aws.amazon.com/AmazonS3/latest/userguide/optimizing-performance.html

error log from qw indexer

storage error(kind=Service, source=service error: unhandled error: unhandled error: Error { code: "SlowDown", message: "Please reduce your request rate.", s3_extended_request_id: "*******+QcTMf+==", aws_request_id: "***********" } 

(ServiceError(ServiceError { source: Unhandled(Unhandled { source: ErrorMetadata { code: Some("SlowDown"), message: Some("Please reduce your request rate."), extras: Some({"s3_extended_request_id": "*********************+QcTMf+==", "aws_request_id": "**************"}) }, meta: ErrorMetadata { code: Some("SlowDown"), 

message: Some("Please reduce your request rate."), extras: Some({"s3_extended_request_id": "**************+QcTMf+**************==", "aws_request_id": "**************"}) } }), 

raw: Response { inner: Response { status: 503, version: HTTP/1.1, headers: {"x-amz-request-id": "**************", "x-amz-id-2": "**************+QcTMf+**************==", "content-type": "application/xml", "transfer-encoding": "chunked", "date": "Tue, 24 Sep 2024 15:12:39 GMT", "server": "AmazonS3", "connection": "close"}, body: SdkBody { inner: Once(Some(b"<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n<Error><Code>SlowDown</Code><Message>Please reduce your request rate.</Message><RequestId>**************</RequestId><HostId>**************+QcTMf+**************==</HostId></Error>")), retryable: true } }, 

properties: SharedPropertyBag(Mutex { data: PropertyBag { contents: ["aws_types::SigningService", "alloc::vec::Vec<http::version::Version>", "aws_smithy_http::operation::Metadata", "aws_smithy_http::connection::CaptureSmithyConnection", "aws_credential_types::credentials_impl::Credentials", "aws_http::user_agent::AwsUserAgent", "aws_sig_auth::signer::OperationSigningConfig", "aws_types::region::Region", "aws_smithy_types::endpoint::Endpoint", "aws_sig_auth::middleware::Signature", "aws_credential_types::cache::SharedCredentialsCache", "aws_sdk_s3::endpoint::Params", "aws_types::region::SigningRegion"] }, poisoned: false, .. }) } })))

After this error occurred, the Quickwit cluster became very unstable. Kafka consumption kept rebalancing continuously and impossible to perform the merge operation.

How to fix? According to AWS's recommendation, S3 prefixes need to be subdivided to improve performance.

@tuziben tuziben added the bug Something isn't working label Sep 25, 2024
@fulmicoton fulmicoton self-assigned this Sep 25, 2024
@fulmicoton
Copy link
Contributor

Closing as it is a duplicate of #4824

@trinity-1686a
Copy link
Contributor

should be a lot better since #5384

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants