Vectorize text equality and LIKE #6189

akuzm · 2023-10-12T11:53:20Z

This PR adds vectorized computation of text equality for deterministic collations, and case-sensitive LIKE for UTF-8 database encoding.

Benchmarking shows up to 3x speedups: https://grafana.ops.savannah-dev.timescale.com/d/fasYic_4z/compare-akuzm?orgId=1&var-branch=All&var-run1=3171&var-run2=3178&var-threshold=0&var-use_historical_thresholds=true&var-threshold_expression=2.5%20%2A%20percentile_cont%280.90%29&var-exact_suite_version=false

Prerequisites:

Disable-check: force-changelog-file

codecov · 2023-10-24T08:45:08Z

Codecov Report

Attention: Patch coverage is 87.22222% with 23 lines in your changes are missing coverage. Please review.

Project coverage is 80.95%. Comparing base (59f50f2) to head (c355c25).
Report is 87 commits behind head on main.

Files	Patch %	Lines
tsl/src/nodes/decompress_chunk/pred_text.c	76.00%	0 Missing and 12 partials ⚠️
tsl/src/nodes/decompress_chunk/compressed_batch.c	91.80%	2 Missing and 3 partials ⚠️
tsl/src/import/ts_like_match.c	96.07%	0 Missing and 2 partials ⚠️
tsl/src/nodes/decompress_chunk/planner.c	0.00%	1 Missing and 1 partial ⚠️
tsl/src/nodes/decompress_chunk/pred_vector_array.c	66.66%	0 Missing and 1 partial ⚠️
tsl/src/nodes/decompress_chunk/vector_predicates.c	90.00%	0 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #6189      +/-   ##
==========================================
+ Coverage   80.06%   80.95%   +0.88%     
==========================================
  Files         190      193       +3     
  Lines       37181    36664     -517     
  Branches     9450     9583     +133     
==========================================
- Hits        29770    29680      -90     
- Misses       2997     3168     +171     
+ Partials     4414     3816     -598

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

svenklemm

LGTM

tsl/src/nodes/decompress_chunk/compressed_batch.c

tsl/src/import/ts_like_match.c

tsl/src/nodes/decompress_chunk/compressed_batch.c

jnidzwetzki · 2024-03-27T08:11:59Z

tsl/src/nodes/decompress_chunk/pred_vector_array.c

@@ -47,7 +47,7 @@ vector_array_predicate_impl(VectorPredicate *vector_const_predicate, bool is_or,
 	char typalign;
 	get_typlenbyvalalign(ARR_ELEMTYPE(arr), &typlen, &typbyval, &typalign);

-	const char *array_data = (const char *) ARR_DATA_PTR(arr);
+	const char *restrict array_data = (const char *) ARR_DATA_PTR(arr);


Could you add a comment on which compiler optimizations we are getting from the restrict type qualifier?

I just add restrict to every piece of data that is used in hot loops and lives on the heap. The general idea is to tell the compiler that they point to different chunks of memory, so that more efficient code can be generated. It's important for the data you write, otherwise technically everything has to be re-read for a new loop iteration, because it might have been modified by a write. I think it's not really important for read-only variables, but I find it simpler to stick it on everything to avoid the need of deeper analysis which can be error-prone. So this is the general idea, I think spelling this out in the code comments each time would be too verbose and not very useful.

I think it's not really important for read-only variables, but I find it simpler to stick it on everything to avoid the need of deeper analysis which can be error-prone.

Maybe I should use const for read-only objects and restrict for read/write.

jnidzwetzki · 2024-03-27T08:21:01Z

tsl/src/nodes/decompress_chunk/compressed_batch.c

 static ArrowArray *
-make_single_value_arrow(Oid pgtype, Datum datum, bool isnull)
+make_single_value_arrow_arithmetic(Oid arithmetic_type, Datum datum, bool isnull)


Could you add a comment about the high-level difference between make_single_value_arrow_arithmetic and make_single_value_arrow_text? It might help understand the code better without comparing the details for the functions.

They are different just because one creates arrow array for arithmetic types, and the other for text types, and the layouts of these arrays are different. I can write "this function creates arrow array for arithmetic types", but this is already clear from the name, so tell me if you have better ideas for a comment.

I guess if you add a comment and just mention that the array layouts are different would be enough for the reader to know at this point.

tsl/src/nodes/decompress_chunk/pred_text.c

jnidzwetzki · 2024-03-27T08:46:51Z

tsl/src/nodes/decompress_chunk/pred_text.c

+	{                                                                                              \
+		(p)++;                                                                                     \
+		(plen)--;                                                                                  \
+	} while ((plen) > 0 && (*(p) &0xC0) == 0x80)


Could you add a comment on why the *(p) &0xC0) == 0x80 part is needed and what the 0x80 constant represents?

This follows the PG code as well, I'll add more details to the comment above.

I think it's just skipping the UTF8-encoded characters, and this check is checking for the UTF8 prefix. I try to avoid adding comments to the copied PG code, because it makes comparing with the original more difficult.

That is true. I compared it with the PG code and my function looked a bit different. So, I was not sure if you modified it or if this is upstream code from another version. If we have a comment about the PG version of the function, this should be fine.

github-actions bot assigned akuzm Oct 12, 2023

akuzm force-pushed the bulk-text branch from 8f8d17d to 3aaf81b Compare October 17, 2023 11:01

akuzm added 27 commits December 12, 2023 18:39

reference

b3318b1

fixup

768daed

fix

7618723

Merge remote-tracking branch 'origin/main' into HEAD

71fc792

remove unused variable

2c43267

cleanup

2268052

Merge remote-tracking branch 'origin/main' into HEAD

1f7460a

format

1bbc455

tojson

41fb360

cleanups

e3e6a1b

fix?

20d8561

fix

44de281

fix

e091b35

this is so tiresome

2422e82

directory

d5df530

path

2d7a60c

switch

7b0d878

split out to files

f996f80

fix

df52a4d

headers

5d41308

headers?

af61eaa

headers...

5281c9c

ts format

2cafdff

cleanup

b1e5dea

cleanup

79db818

yaml

4bdd452

dash

610c6c2

review fixes

9cd735e

svenklemm approved these changes Mar 3, 2024

View reviewed changes

akuzm added 2 commits March 8, 2024 10:54

Merge remote-tracking branch 'origin/main' into HEAD

311fadd

move the recursion check later

e4d2e5d

akuzm mentioned this pull request Mar 12, 2024

Allow secondary indexes on compressed chunks #2418

Open

akuzm added 4 commits March 18, 2024 17:48

Merge remote-tracking branch 'origin/main' into HEAD

0e50ba5

benchmark vectorized text (2024-03-18 no. 1)

8f58261

Merge remote-tracking branch 'origin/main' into HEAD

15cfdaf

benchmark vectorized text (2024-03-25 no. 1)

ebde574