[for analysis] Not working airmail #5217

fulmicoton · 2024-07-12T09:42:48Z

No description provided.

* also remove hits that are too many when removing skiped hits * add mock-test

…ith Glasskube (#5071) Signed-off-by: Idriss Neumann <[email protected]>

* Further optimization of validation. This uses serde_json_borrow to avoid most allocation, copying, and inserting in hashmap as we deserialize documents. Before: validation is taking 10.25% of the CPU After validation is taking 5.9% of the CPU. * CR comment. changed error message

includes cardinality aggregation and term aggregation perf improvement for large "size" parameters

The piece that estimates whether the next request is likely to fail is extremely simplistic for the moment. It simply counter the number of errors (not taking in account successes) that happened in a given time window. The reason is that for the moment, we want to use it for persist requests when the WAL is full. On airmail, the aggressive retry logic of the client was causing a massive grpc storm on the faulty indexer node, taking all of its CPU and preventing it from getting out of that state. In this case, the error estimation logic is very simple, a full WAL guarantees that no further persist request will be successful for a little while.

* docs: using-vector.md: Adjust Vector remap configuration to silence errors/warnings * docs: using-vector.md: Provide a link to the index configuration code so it doesn't go out of sync

…available (#5155)" (#5191) This reverts commit de2e150.

This reverts commit 9fddb68.

* optimize topn requests add logic to detect which splits will deliver the top n results for requests. This is only supported for match_all requests, with optional sort_by on timestamp sorting. start_timestamp, end_timestamp as well as a filter on the timestamp field is not supported currently but could be. * move to function, refactor

controller.

* Using the shard throughput information in the scheduling logic. * added cli flags

Reverts 3d6543b

…5198) Bumps [certifi](https://github.com/certifi/python-certifi) from 2024.2.2 to 2024.7.4. - [Commits](certifi/python-certifi@2024.02.02...2024.07.04) --- updated-dependencies: - dependency-name: certifi dependency-type: indirect ... Signed-off-by: dependabot[bot] <[email protected]> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>

throughput. Scaling up relies on the short term average in order to rapidly react to a change in throughput, while scaling down and the indexing scheduler relies on the long term average.

fulmicoton · 2024-07-12T09:43:31Z

quickwit/Cargo.lock

@@ -8112,7 +8134,7 @@ dependencies = [
 [[package]]
 name = "tantivy"
 version = "0.23.0"
-source = "git+https://github.com/quickwit-oss/tantivy/?rev=08b9fc0#08b9fc0b3114640ad06c2358c404c474a9eea3c1"
+source = "git+https://github.com/quickwit-oss/tantivy/?rev=13e9885#13e9885dfda8cebf4bfef72f53bf811da8549445"


tantivy difference?

fulmicoton · 2024-07-12T09:45:41Z

quickwit/Cargo.toml

@@ -324,7 +325,7 @@ quickwit-serve = { path = "quickwit-serve" }
 quickwit-storage = { path = "quickwit-storage" }
 quickwit-telemetry = { path = "quickwit-telemetry" }

-tantivy = { git = "https://github.com/quickwit-oss/tantivy/", rev = "08b9fc0", default-features = false, features = [
+tantivy = { git = "https://github.com/quickwit-oss/tantivy/", rev = "13e9885", default-features = false, features = [


tantivy version is changed

guilload and others added 24 commits June 28, 2024 10:18

Add Dockerfile for creating Ubuntu image

ca69835

remove hits beyond max requested hit (#5180)

49c566d

* also remove hits that are too many when removing skiped hits * add mock-test

feat(glasskube-doc): write the documentation about quickwit install w…

5eb270d

…ith Glasskube (#5071) Signed-off-by: Idriss Neumann <[email protected]>

Fix k8s deployment docs title.

694cd5c

Making shard throughput configurable (#5183)

be20923

Making all metastore tests serial (#5185)

84572ea

Retry on metastore timeout or unavailable error (#5182)

a209b6d

Fix: Remove / when calling search endpoint (#5187)

31ff364

Add missing gs:// protocol to storage config doc

c7abc50

Add Google Cloud Storage to list of supported storage providers

b4bf457

update tantivy (#5188)

b373552

includes cardinality aggregation and term aggregation perf improvement for large "size" parameters

docs: Vector fixes (#5094)

622a12f

* docs: using-vector.md: Adjust Vector remap configuration to silence errors/warnings * docs: using-vector.md: Provide a link to the index configuration code so it doesn't go out of sync

Revert "Consider ingesters returning ResourceExhausted temporarily un…

d1022d6

…available (#5155)" (#5191) This reverts commit de2e150.

Revert "Added rebuild plan rest debug handler. (#5150)" (#5192)

9e1476a

This reverts commit 9fddb68.

Fixing rest api tests (#5194)

cb63d32

Plugging the shard throughput limit configuration to the ingest (#5193)

b600b4a

controller.

Using the shard throughput information in the scheduling logic. (#5196)

43e5ced

* Using the shard throughput information in the scheduling logic. * added cli flags

Reverting the change in the shard opening rate limits. (#5197)

97889f6

Reverts 3d6543b

Emitting both a short term average and a long term average of shard

075be7a

throughput. Scaling up relies on the short term average in order to rapidly react to a change in throughput, while scaling down and the indexing scheduler relies on the long term average.

better support for hex in code tokenizer

b8e996f

fulmicoton commented Jul 12, 2024

View reviewed changes

fulmicoton closed this Jul 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[for analysis] Not working airmail #5217

[for analysis] Not working airmail #5217

fulmicoton commented Jul 12, 2024

fulmicoton Jul 12, 2024

fulmicoton Jul 12, 2024

[for analysis] Not working airmail #5217

[for analysis] Not working airmail #5217

Conversation

fulmicoton commented Jul 12, 2024

fulmicoton Jul 12, 2024

Choose a reason for hiding this comment

fulmicoton Jul 12, 2024

Choose a reason for hiding this comment