HSEARCH-3319 WIP: Type-safe field references v2 #4156

Open · wants to merge 13 commits into base: main

Conversation


@marko-bekhta marko-bekhta force-pushed the feat/HSEARCH-3319-static-metamodel-field-references-v2 branch from de353a5 to aaf4fff on May 15, 2024 09:35
Comment on lines 383 to 386
@Incubating
default SearchPredicateFactory<SR> withRoot(ObjectFieldReference<? super SR> objectFieldReference) {
	return withRoot( objectFieldReference.absolutePath() );
}
marko-bekhta (Member, Author):

Even though I've added these withRoot(..) methods to the factories, I am not sure they make much sense with the approach the references are taking...

Assuming we somehow address the paths, the factory would still point to the same <SR> root type, since the properties in the embedded objects of the "generated" metamodel will use it:

public static class EntityA_ {
	public static ValueFieldReference1<EntityA_, String, String, String> stringA;

	public static SomeNestedObject_ someNestedObject;

	public static RootReferenceScope<EntityA_, EntityA> scope;

	static {
		stringA = ValueFieldReference1.of( "stringA", EntityA_.class, String.class, String.class, String.class );
		someNestedObject = new SomeNestedObject_();

		scope = RootReferenceScopeImpl.of( EntityA_.class, EntityA.class );
	}

	private static class SomeNestedObject_ implements ObjectFieldReference<EntityA_> {

		public final ValueFieldReference1<EntityA_, String, String, String> nestedStringA;

		public SomeNestedObject_() {
			this.nestedStringA = ValueFieldReference1.of( "someNestedObject.stringA", EntityA_.class, String.class, String.class, String.class );
		}

		@Override
		public String absolutePath() {
			return "someNestedObject";
		}

		@Override
		public Class<EntityA_> scopeRootType() {
			return EntityA_.class;
		}
	}
}

If we go with a different <SR> for nestedStringA, then we wouldn't be able to use it in the regular case, e.g.:

...
.where( f -> f.match().field( EntityA_.someNestedObject.nestedStringA ).matching( "a" ) )

Also, if we change the SR on the withRoot call, the current idea of creating a class-per-field for these embedded objects would not fly 😔.

If we ignore the <SR> type change, then any property can be passed to the factory methods, which makes the static metamodel pointless (in this case) 😔

yrodiere (Member):

Ideally, I'd expect after using withRoot(), people using the factory would use references such as SomeNestedObject_.nestedStringA, with nestedStringA being a static field. Just like they would on root types.

So you'd have two classes per object field:

  • SomeNestedObject$Absolute (the name doesn't matter much, but must not conflict with other classes), which would be the type of EntityA_.someNestedObject, and which would expose non-static "sub" fields of type ValueFieldReference<EntityA_, ...>.
  • SomeNestedObject_ which would be the type users of withRoot() would rely on, and which would expose non-static "sub" fields of type ValueFieldReference<SomeNestedObject_, ...>.

I don't think you need an inheritance relationship between these two classes, but you may need to redefine ObjectFieldReference to have two generic type parameters: the first being its "holding" type, and the second being the SR of factories obtained by calling withRoot.
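
A minimal sketch of that two-classes-per-object-field layout could look as follows. Apart from ObjectFieldReference, ValueFieldReference1, withRoot and the field/path names quoted from the snippets above, everything here (the class names, the exact shape of the redefined interface) is an assumption:

// sketch only: ObjectFieldReference redefined with two type parameters, as suggested above
public interface ObjectFieldReference<SR, R> { // SR = "holding" root type, R = scope root of factories returned by withRoot()
	String absolutePath();

	Class<SR> scopeRootType();
}

// the type of EntityA_.someNestedObject, exposing non-static "sub" fields rooted at EntityA_:
public static class SomeNestedObjectAbsolute_ implements ObjectFieldReference<EntityA_, SomeNestedObject_> {
	public final ValueFieldReference1<EntityA_, String, String, String> nestedStringA =
			ValueFieldReference1.of( "someNestedObject.stringA", EntityA_.class, String.class, String.class, String.class );

	@Override
	public String absolutePath() {
		return "someNestedObject";
	}

	@Override
	public Class<EntityA_> scopeRootType() {
		return EntityA_.class;
	}
}

// the type users of withRoot( EntityA_.someNestedObject ) would rely on, with static relative references:
public static class SomeNestedObject_ {
	public static final ValueFieldReference1<SomeNestedObject_, String, String, String> nestedStringA =
			ValueFieldReference1.of( "stringA", SomeNestedObject_.class, String.class, String.class, String.class );
}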

That being said... Since SomeNestedObject_ is completely specific to the nestedString field of EntityA, I'm wondering how useful withRoot() would be. The main point of withRoot is to obtain a factory that can be passed to some util method, which is then able to build a predicate. That method is supposed to be used with different object fields that are assumed to have a similar schema.
If that method works for a specific object field only (with a specific metamodel type, e.g. SomeNestedObject_), then... what's the point?

I suppose it all comes down to the "mapped superclass"/inheritance support I mentioned the other time. If we could make sure that object field metamodel uses inheritance, i.e. the metamodel type EntityB_.SomeNestedObject_ extends EntityA_.SomeNestedObject_ as soon as EntityB extends EntityA, then you'd at least be able to create a root from EntityB_.someNestedObject and then use EntityA_.SomeNestedObject_.nestedStringA on the resulting factory.

Union types at the object field level could be a thing too, but that seems... hard. We'd essentially need SomeNestedObject_ to extend all its union types. That means multiple inheritance, which means SomeNestedObject_ must be an interface. Possible, since interfaces can have static fields, but needs investigation.

marko-bekhta (Member, Author):

Hmm, ok, I've come up with the following example. Assuming we have a model like:

@Indexed
	public static class EntityA {

		@IndexedEmbedded(includePaths = { "a", "b" })
		public EmbeddedThing embA;

		@IndexedEmbedded(includePaths = { "b", "c" })
		public EmbeddedThing embB;
	}

	public static class EmbeddedThing {
		@FullTextField
		String a;
		@FullTextField
		String b;
		@FullTextField
		String c;
	}

and we want to have a util to filter by smth.b using this withRoot approach...

we can generate the metamodel as:

	public static class EntityA_ {

		public static a_ embA;
		public static b_ embB;

		// These two are for the absolute paths: 
		public static class a_ extends ObjectFieldReference<EntityA_, EntityA_a_>{
			public static ValueFieldReference<EntityA_, String, String, String> a; // path = embA.a
			public static ValueFieldReference<EntityA_, String, String, String> b; // path = embA.b
		}

		public static class b_ extends ObjectFieldReference<EntityA_, EntityA_b_>{
			public static ValueFieldReference<EntityA_, String, String, String> b; // path = embB.b
			public static ValueFieldReference<EntityA_, String, String, String> c; // path = embB.c
		}
	}

	// These two are for "withRoot"/  relative paths 
	public static class EntityA_a_ {
		public static ValueFieldReference<EntityA_a_, String, String, String> a; // path = a
		public static ValueFieldReference<EntityA_a_, String, String, String> b; // path = b
	}

	public static class EntityA_b_ {
		public static ValueFieldReference<EntityA_b_, String, String, String> b; // path = b
		public static ValueFieldReference<EntityA_b_, String, String, String> c; // path = c
	}

That won't help much with withRoot though... so we can try:

	// ideally I guess we'd want to generate:
	public static class EntityA_a_b_common_ {
		public static ValueFieldReference<EntityA_a_b_common_, String, String, String> b; // path = b
	}

	public static class EntityA_a_ extends EntityA_a_b_common_ {
		public static ValueFieldReference<EntityA_a_, String, String, String> b; // path = b
	}
	
	public static class EntityA_b_ extends EntityA_a_b_common_ {
		public static ValueFieldReference<EntityA_b_, String, String, String> c; // path = c
	}
	
	// and then the util method could use  EntityA_a_b_common_ with access to `b` 
	//   and we pass withRoot(EntityA_.embA) withRoot(EntityA_.embB) to the util method ... 
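
To complete the thought in the comment above: the util could then accept a factory typed to the common supertype. A hedged sketch, assuming withRoot(...) could hand back a factory rooted at EntityA_a_b_common_ (which is exactly the open question here):

// hedged sketch; assumes a withRoot(...) variant that re-roots the factory at EntityA_a_b_common_
private SearchPredicate matchCommonB(SearchPredicateFactory<? extends EntityA_a_b_common_> f, String value) {
	return f.match().field( EntityA_a_b_common_.b ).matching( value ).toPredicate();
}

// usage: ... .where( f -> matchCommonB( f.withRoot( EntityA_.embA ), "x" ) )
//        ... .where( f -> matchCommonB( f.withRoot( EntityA_.embB ), "x" ) )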

That could probably work... but if I understand it correctly, currently that withRoot is not tied to a particular embedded type, meaning we can have:

public static class EntityA {
	@IndexedEmbedded
	EmbeddedThing1 e1;
	@IndexedEmbedded
	EmbeddedThing1 e2;
}

public static class EmbeddedThing1 {
	@FullTextField
	String a;
	// some other fields
}

public static class EmbeddedThing2 {
	@FullTextField
	String a;
	// some other fields maybe different from EmbeddedThing1
}

public static class EntityB {
	@IndexedEmbedded
	EmbeddedThing3 e3;
}

public static class EmbeddedThing3 {
	@FullTextField
	String a;
	// some other fields maybe different from EmbeddedThing1/EmbeddedThing2
}

// util method
private SearchPredicate matchFieldA(SearchPredicateFactory f, String aValue) {
	return f.match().field( "a" ).matching( aValue ).toPredicate();
}

it is ok now to pass any of

withRoot("e1");
withRoot("e2");
withRoot("e3");

but I'm not sure how we'd tie them all to something in the metamodel to be used in that util... 😕

yrodiere (Member):

but I'm not sure how we'd tie them all to something in the metamodel to be used in that util... 😕

Essentially, to go beyond simple entity inheritance, we'll need to introduce union types at the object field level.

That's what I was talking about here:

Union types at the object field level could be a thing too, but that seems... hard. We'd essentially need SomeNestedObject_ to extend all its union types. That means multiple inheritance, which means SomeNestedObject_ must be an interface. Possible, since interfaces can have static fields, but needs investigation.

marko-bekhta (Member, Author):

ok next iteration 🙈 I've tried to look at the interfaces and whether we can somehow use them to our advantage. This time, the model is:

@MappedSuperclass
public static class MappedSuperclassThing {
	@Id
	Long id;

	@FullTextField
	String a;
}

@Indexed
@Entity
public static class ContainingA extends MappedSuperclassThing {
	@IndexedEmbedded
	EmbeddedThing1 e1;
	@IndexedEmbedded
	EmbeddedThing1 e2;
}

public static class EmbeddedThing1 {
	@FullTextField
	String a;
	@FullTextField
	String b;
	// some other fields
}

public static class EmbeddedThing2 {
	@FullTextField
	String a;
	// some other fields maybe different from EmbeddedThing1
}

@Entity
@Indexed
public static class ContainingB extends MappedSuperclassThing{
	@IndexedEmbedded
	EmbeddedThing3 e3;
}

public static class EmbeddedThing3 {
	@GenericField
	Integer a;
	// some other fields maybe different from EmbeddedThing1/EmbeddedThing2
}

There are string/int fields named a and some embeddables. Ultimately, we'd want to be able to have a util method like:

public static SearchPredicate utilMethodForPredicate(SearchPredicateFactory<? extends Property_String_a_> factory) {
	return factory.match().field( Property_String_a_.a ).matching( "a" ).toPredicate();
}

where we want to accept factories for "scope roots" that have a string property a in them, which can be based on ContainingA/ContainingB/EmbeddedThing1/EmbeddedThing2. For the util to work, a should be part of Property_String_a_:

interface Property_String_a_ {
	ValueFieldReference1<Property_String_a_, String, String, String> a = ....;
}

then we can have the metamodel as:

public static class MappedSuperclassThing_ implements Property_String_a_ {

	public static HibernateOrmRootReferenceScope<MappedSuperclassThing_, MappedSuperclassThing> scope;

	static {
		//a = ValueFieldReference1.of( "a", MappedSuperclassThing_.class, String.class, String.class, String.class );

		scope = RootReferenceScopeImpl.of( MappedSuperclassThing_.class, MappedSuperclassThing.class );
	}
}

public static class ContainingA_ extends MappedSuperclassThing_ {
	public static e1_.Absolute e1;
	public static e2_.Absolute e2;

	public static HibernateOrmRootReferenceScope<ContainingA_, ContainingA> scope;

	static {
		e1 = new e1_.Absolute();
		e2 = new e2_.Absolute();

		scope = RootReferenceScopeImpl.of( ContainingA_.class, ContainingA.class );
	}

	public static class e1_ implements ObjectFieldReference<ContainingA_>, Property_String_a_, Property_String_b_ {

		@Override
		public String absolutePath() {
			return "e1";
		}

		@Override
		public Class<ContainingA_> scopeRootType() {
			return ContainingA_.class;
		}

		static class Absolute extends e1_ {
			public static ValueFieldReference1<ContainingA_, String, String, String> a;
			public static ValueFieldReference1<ContainingA_, String, String, String> b;

			static {
				a = ValueFieldReference1.of( "e1.a", ContainingA_.class, String.class, String.class, String.class );
				b = ValueFieldReference1.of( "e1.b", ContainingA_.class, String.class, String.class, String.class );
			}
		}
	}

	public static class e2_ implements ObjectFieldReference<ContainingA_>, Property_String_a_, Property_String_b_ {

		@Override
		public String absolutePath() {
			return "e2";
		}

		@Override
		public Class<ContainingA_> scopeRootType() {
			return ContainingA_.class;
		}

		static class Absolute extends e2_ {
			public static ValueFieldReference1<ContainingA_, String, String, String> a;
			public static ValueFieldReference1<ContainingA_, String, String, String> b;

			static {
				a = ValueFieldReference1.of( "e2.a", ContainingA_.class, String.class, String.class, String.class );
				b = ValueFieldReference1.of( "e2.b", ContainingA_.class, String.class, String.class, String.class );
			}
		}
	}
}

Since we use ? super SR we can do

SearchScope<ContainingA_, ContainingA> scope = ContainingA_.scope.create( searchSession );

searchSession.search( scope )
		.select( f -> f.field( ContainingA_.a ) )
		.where( f -> utilMethodForPredicate( f ) )
		.fetchHits( 20 )

As ContainingA_ is a Property_String_a_, this works both for the projection and the util.
Hence, I'm thinking we can create an interface-per-field, where we reuse interfaces with the same name and capabilities (traits)...

@@ -38,7 +38,8 @@
  *
  * @param <E> A supertype of all types in this scope.
  */
-public interface SearchScope<E> {
+@SuppressWarnings("deprecation")
+public interface SearchScope<E> extends org.hibernate.search.engine.mapper.scope.SearchScope<E, org.hibernate.search.mapper.orm.common.EntityReference> {
marko-bekhta (Member, Author):

Obviously, JQAssistant does not like these changes 😔 😃. However, I couldn't come up with an alternative that would allow a scope to be created from the "generated" static metamodel classes. Unless we generate a model specific to the mapper 😖 😕

@yrodiere if you can take a quick look at the approach in this PR whenever you have some free time 😃

yrodiere (Member):

Unless we generate a model specific to the mapper 😖 😕

That seems like a very valid solution to me?

Not for every Hibernate Search type in the metamodel, of course -- i.e. I wouldn't make ValueFieldReference specific to each mapper. But it seems perfectly fine to me to give EntityA_.scope a mapper-specific type.

marko-bekhta (Member, Author):

But it seems perfectly fine to me to give EntityA_.scope a mapper-specific type.

Mm ok, we'd need the plugin generating these to know which mappers are expected, and then have

EntityA_.ormScope
EntityA_.standaloneScope

We can probably detect that from the dependencies (which mappers are present), and have an explicit property if needed, I suppose 🤔
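
For illustration, the generated root class might then expose one scope field per detected mapper. A rough sketch: HibernateOrmRootReferenceScope appears earlier in this thread, while the standalone-mapper scope type name below is purely an assumption:

public static class EntityA_ {
	// ORM mapper scope:
	public static HibernateOrmRootReferenceScope<EntityA_, EntityA> ormScope;
	// Standalone POJO mapper scope; the exact reference-scope type name is a guess:
	public static StandalonePojoRootReferenceScope<EntityA_, EntityA> standaloneScope;
}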

yrodiere (Member):

Mm ok, we'd need the plugin generating these to know which mappers are expected, and then have

The plugin would already need to know, since it needs to start Hibernate Search in offline mode to create backends and their index descriptors.

yrodiere (Member), May 15, 2024:

(Because, yeah, annotation parsing is not enough: there are binders to take into account, and programmatic mapping :x)

@marko-bekhta marko-bekhta force-pushed the feat/HSEARCH-3319-static-metamodel-field-references-v2 branch 3 times, most recently from 8555146 to 296c173 on May 20, 2024 14:48

Quality Gate failed

Failed conditions
78.2% Coverage on New Code (required ≥ 80%)

See analysis details on SonarCloud

marko-bekhta (Member, Author) left a comment:

Hey @yrodiere 😃 The approach we discussed last Friday seems to work well so far (see the new inline comments, or just take a look at FieldReferenceIT).

One thing I'm not sure how to deal with is named predicates. They are defined in binders, where there's no notion of a search scope. Since binders define the index structure, does it make sense to use the generated metamodel in them, given that its generation depends on the binder? ... 🤔 🙈

// IMPL_NOTE: we cannot use the EntityClassName_ pattern since ORM picks it up and tries to do its thing...
// so we'd need to come up with a different naming strategy...
public static class ContainingA__
marko-bekhta (Member, Author):

The entityName-underscore pattern for these "generated" classes won't work because of ORM, even if that model is not generated... so, should we prepend the underscore, use some prefix/suffix, or do something else?

yrodiere (Member), May 23, 2024:

I'd suggest looking at what's been done for Jakarta Data, because we need to avoid conflict with that as well.

Maybe SearchContainingA? ContainingA_Search?

// IMPL_NOTE: Maybe let's use INDEX as the name?
// Also, I'm thinking we can make it configurable and let the user decide what to call this variable...
public static final ContainingA__ INDEX = new ContainingA__();
marko-bekhta (Member, Author):

I thought about the naming for this constant, and INDEX seemed like an ok alternative to INSTANCE, and as the comment suggests, we can make this configurable and let the user pick the name.

yrodiere (Member):

But then for union scopes, you'd have... INDEXES?

Wouldn't SCOPE be a better alternative? Then you'd have ContainingA__ implement SearchScopeReferenceWhatever.

yrodiere (Member), May 23, 2024:

Or SCOPE_REF, or a static scopeRef(), or...
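
A rough sketch of the scopeRef() variant, reusing the placeholder SearchScopeReferenceWhatever name from the comment above; the rest of the shape is an assumption:

public final class ContainingA__ implements SearchScopeReferenceWhatever<ContainingA__, ContainingA> {
	private static final ContainingA__ SCOPE_REF = new ContainingA__();

	private ContainingA__() {
	}

	// users would write ContainingA__.scopeRef() instead of referring to an INDEX/INSTANCE constant
	public static ContainingA__ scopeRef() {
		return SCOPE_REF;
	}
}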

Comment on lines 396 to 426
public interface ContainingA_e1_e2_e3_Intersection<SR> {

	TraitsIntersection<SR, String> a();

	interface TraitsIntersection<SR, T>
			extends MatchPredicateFieldReference<SR, T>,
			ExistsPredicateFieldReference<SR>,
			QueryStringPredicateFieldReference<SR, T> {

		@Override
		default ValueConvert valueConvert() {
			return ValueConvert.YES;
		}
	}
}

// the same field with the same traits:
public interface ContainingA_e1_e2_Intersection<SR> {

	TraitsIntersection<SR, String, String> a();

	interface TraitsIntersection<SR, T, P>
			extends FieldProjectionFieldReference<SR, P>,
			MatchPredicateFieldReference<SR, T>,
			ExistsPredicateFieldReference<SR>,
			QueryStringPredicateFieldReference<SR, T> {

		@Override
		default ValueConvert valueConvert() {
			return ValueConvert.YES;
		}
	}
}
marko-bekhta (Member, Author):

These are examples of intersection interfaces. I'm thinking that it makes sense to keep the trait intersection interface as an inner class; then, if there are multiple intersection interfaces per scope (ContainingA_), there's no ambiguity.

yrodiere (Member):

Are you sure there's a need to have special handling for the TraitsIntersection interface?

I mean, we already need to generate interfaces/classes for combinations of traits, so we already assume something will be generating a FieldProjectionAndMatchPredicateAndExistsPredicateAndQueryStringPredicateFieldReference; can't we use that here, too?

Unrelated note: naming these "trait combination" interfaces will be a challenge in itself.

marko-bekhta (Member, Author):

Are you sure there's a need to have special handling for the TraitsIntersection interface?

thinking more about it, yeah we can probably reuse the interfaces... I was thinking about a scenario:

// one field:
@KeywordField(projectable = Projectable.YES, sortable = Sortable.NO)
private String title;

// another field with different capabilities:
@KeywordField(projectable = Projectable.NO, sortable = Sortable.YES)
private String title;

where the intersection would be just predicates, and we wouldn't have that combination since it isn't needed for an actual field. But we could just keep creating these combinations and add them to a lookup for reuse, so +1 😃

Unrelated note: naming these "trait combination" interfaces will be a challenge in itself.

yeah... At some point I was thinking of having a sorted list of trait interfaces and then "mapping" them to numbered keys, e.g. T1, T2... Tn, so the name of a combination would be based on these keys, e.g. T1T5T24FieldReference
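
As a toy illustration of that numbered-key idea (the trait names and the registry below are placeholders, not the real generator):

import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

class TraitCombinationNaming {
	// a stable, sorted registry of known trait interfaces; the position defines the numbered key (T1, T2, ...)
	private static final List<String> SORTED_TRAITS = List.of(
			"ExistsPredicateFieldReference",
			"FieldProjectionFieldReference",
			"MatchPredicateFieldReference",
			"QueryStringPredicateFieldReference" );

	static String nameFor(Set<String> traitsOfField) {
		return SORTED_TRAITS.stream()
				.filter( traitsOfField::contains )
				.map( trait -> "T" + ( SORTED_TRAITS.indexOf( trait ) + 1 ) )
				.collect( Collectors.joining() )
				+ "FieldReference";
	}
}

// e.g. nameFor( Set.of( "MatchPredicateFieldReference", "ExistsPredicateFieldReference" ) ) returns "T1T3FieldReference"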

marko-bekhta (Member, Author):

So, in the new commit, I've tried adding a Maven plugin that would generate things. The approach I've taken here is to have the model + search config classes in their own jar, which is passed as a dependency to the Maven plugin. The plugin then starts Hibernate Search. I've used some of our test helpers to start things since, well, it's just to test how it'll work.

It generates something like:

package org.hibernate.search.metamodel.generator.model;

class MyEntity__ {

	public String text;	
}
package org.hibernate.search.metamodel.generator.model;

class MyProgrammaticEntity__ {

	public String number;
	public String text;	
}

And since this happens in the generate-sources phase, these classes get compiled automatically by Maven itself.

As for the plugin configuration:

<configuration>
    <annotatedTypes>
        <type>org.hibernate.search.metamodel.generator.model.MyEntity</type>
        <type>org.hibernate.search.metamodel.generator.model.MyProgrammaticEntity</type>
    </annotatedTypes>
    <properties>
        <hibernate.search.mapping.configurer>org.hibernate.search.metamodel.generator.model.MyConfigurer</hibernate.search.mapping.configurer>
    </properties>
</configuration>

There's a way to pass any properties that should be "forwarded" to Hibernate Search through <properties>. In this case, I've tried passing a configurer to apply some programmatic mapping 🙈.

Now... the next thing I'd want to try is to run the plugin in the same module as the model. From what I've seen so far, it is easy to get the source root and the dependencies (things that were missing in the annotation processor approach).
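
For reference, getting at those pieces from a Maven mojo is indeed straightforward; a hedged sketch using the standard Maven plugin API (the goal and class names here are made up):

import org.apache.maven.plugin.AbstractMojo;
import org.apache.maven.plugins.annotations.Mojo;
import org.apache.maven.plugins.annotations.Parameter;
import org.apache.maven.plugins.annotations.ResolutionScope;
import org.apache.maven.project.MavenProject;

@Mojo(name = "generate-metamodel", requiresDependencyResolution = ResolutionScope.COMPILE)
public class GenerateMetamodelMojo extends AbstractMojo {

	@Parameter(defaultValue = "${project}", readonly = true, required = true)
	private MavenProject project;

	@Override
	public void execute() {
		// source roots of the module the plugin runs in:
		project.getCompileSourceRoots().forEach( root -> getLog().info( "source root: " + root ) );
		// resolved dependencies (the part that was missing in the annotation-processor approach):
		project.getArtifacts().forEach( artifact -> getLog().info( "dependency: " + artifact ) );
	}
}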

@marko-bekhta marko-bekhta force-pushed the feat/HSEARCH-3319-static-metamodel-field-references-v2 branch from 0c8366e to 30f9d6a on May 28, 2024 16:15
@marko-bekhta marko-bekhta force-pushed the feat/HSEARCH-3319-static-metamodel-field-references-v2 branch 2 times, most recently from 5789bc7 to 159347c on August 21, 2024 13:19
@marko-bekhta marko-bekhta force-pushed the feat/HSEARCH-3319-static-metamodel-field-references-v2 branch 2 times, most recently from baae623 to b8ad37a on December 5, 2024 17:42
@marko-bekhta marko-bekhta force-pushed the feat/HSEARCH-3319-static-metamodel-field-references-v2 branch from b8ad37a to b5a16ea on December 12, 2024 21:26
hibernate-github-bot commented Dec 12, 2024:

Thanks for your pull request!

This pull request does not follow the contribution rules. Could you have a look?

❌ The PR title or body should list the keys of all JIRA issues mentioned in the commits
    ↳ Issue keys mentioned in commits but missing from the PR title or body: [HSEARCH-5300]
❌ The pull request description must contain the license agreement text.
    ↳ The description of this pull request must contain the following license agreement text:

----------------------
By submitting this pull request, I confirm that my contribution is made under the terms of the [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0.txt)
and can be relicensed under the terms of the [LGPL v2.1 license](https://www.gnu.org/licenses/old-licenses/lgpl-2.1.txt) in the future at the maintainers' discretion.
For more information on licensing, please check [here](https://github.com/hibernate/hibernate-search/blob/main/CONTRIBUTING.md#legal).

----------------------

This message was automatically generated.

@marko-bekhta marko-bekhta force-pushed the feat/HSEARCH-3319-static-metamodel-field-references-v2 branch from b5a16ea to cbfcec9 on December 13, 2024 12:38

Quality Gate failed

Failed conditions
72.8% Coverage on New Code (required ≥ 80%)

See analysis details on SonarQube Cloud

@marko-bekhta marko-bekhta force-pushed the feat/HSEARCH-3319-static-metamodel-field-references-v2 branch from cbfcec9 to f04af13 on February 20, 2025 10:34
@marko-bekhta marko-bekhta force-pushed the feat/HSEARCH-3319-static-metamodel-field-references-v2 branch from f04af13 to e3a9d5a on February 20, 2025 10:54
marko-bekhta (Member, Author) left a comment:

Hey @yrodiere,
I've added the annotation processor experiments to this branch, as in the end we'd want to use the interfaces introduced here to generate that model... as if the PR wasn't big enough already 😄. The experiments are all in a single commit, and I've added comments to the places I think are of interest. There's a lot of boring code similar to what we have in the annotation processors, as these just work with a different model.

The next steps, as I see them, are:

  • keep adding the missing bits (e.g. type-level binders etc.). As I encounter a new code example from the docs, or just remember some missing part, I add it to the test and then see what has to be added to the processor to get it working.
  • write out a simple class from the model.
  • take care of "various backends". I think we'd just add all backends (Lucene, Elasticsearch) and have a config property for the annotation processor to let the user specify the backend to use (or even both).
  • deal with the ORM mapper. I don't think it would be feasible to make ORM somehow generate the model from javax.lang.model.element types. Hence, my thinking here is to add a specific processor that reads ORM's @Id and treats it as @DocumentId, ignoring the access type for now 🙈

Comment on lines +54 to +101
.property(
		StandalonePojoMapperSettings.MAPPING_CONFIGURER,
		BeanReference.ofInstance( (StandalonePojoMappingConfigurer) configurationContext -> {
			// disable any discovery by annotations -- that won't work as we do not implement annotations in the pojo model
			configurationContext.annotationMapping()
					.discoverAnnotatedTypesFromRootMappingAnnotations( false )
					.discoverAnnotationsFromReferencedTypes( false )
					.discoverJandexIndexesFromAddedTypes( false )
					.buildMissingDiscoveredJandexIndexes( false );

			ProgrammaticMappingConfigurationContext programmaticMapping =
					configurationContext.programmaticMapping();

			ProcessorAnnotationProcessorContext ctx = new ProcessorAnnotationProcessorContext(
					context.elementUtils(),
					context.typeUtils(), context.messager(), programmaticMapping
			);

			for ( Element el : indexedEntities ) {
				if ( el.getKind().isClass() || el.getKind().isInterface() ) {
					TypeElement indexedEntityType = (TypeElement) el;

					String typeName = indexedEntityType.getQualifiedName().toString();

					introspectorContext.typeElementsByName().clear();
					introspectorContext.typeElementsByName().put( typeName, indexedEntityType );

					TypeMappingStep typeMappingContext = programmaticMapping.type( typeName );
					typeMappingContext.indexed().enabled( true );
					typeMappingContext.searchEntity().name( typeName );

					processTypeAndProperties( indexedEntityType, typeMappingContext, ctx );
				}
				else {
					// TODO: generate message bundle with JBoss Logging ?
					context.messager()
							.printMessage(
									Diagnostic.Kind.ERROR,
									"Unexpected location of the " + indexedAnnotation.getQualifiedName()
											+ ". Expected to be placed on an indexed type.",
									el
							);
				}
			}

		} )
).build()
		.boot() ) {
marko-bekhta (Member, Author):

The main idea behind this approach is that we start a standalone mapper with all possible "annotation discovery" disabled, since that won't play nice inside an annotation processor 😃.
Then there is a mapping configurer that iterates through the @Indexed entities and uses programmatic mapping to build the model.

Comment on lines +102 to +106
for ( SearchIndexedEntity<?> entity : searchMapping.allIndexedEntities() ) {
	context.messager().printMessage( Diagnostic.Kind.NOTE, entity.name() );
	entity.indexManager().descriptor().staticFields()
			.forEach( f -> context.messager().printMessage( Diagnostic.Kind.NOTE, f.toString() ) );
}
marko-bekhta (Member, Author):

For now I just print the fields out so it's easy to see what is produced, but I've already started looking into how to generate classes. From what I see, Hibernate ORM/Tools write the classes with custom code. I also found https://github.com/palantir/javapoet and was considering whether it would make sense to pull in a dependency, or just write yet another custom class writer 😄
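
For what it's worth, a JavaPoet-style writer would be fairly compact. A hedged sketch: it assumes the TypeSpec/FieldSpec/JavaFile API known from square/javapoet (of which the palantir project is a fork), so package names and details may differ:

// sketch only; API assumed from the square/javapoet lineage
import java.io.IOException;

import javax.annotation.processing.Filer;
import javax.lang.model.element.Modifier;

import com.palantir.javapoet.FieldSpec;
import com.palantir.javapoet.JavaFile;
import com.palantir.javapoet.TypeSpec;

class MetamodelClassWriter {

	void writeSimpleMetamodel(Filer filer) throws IOException {
		// corresponds to the generated MyEntity__ shown in an earlier comment
		TypeSpec myEntity = TypeSpec.classBuilder( "MyEntity__" )
				.addField( FieldSpec.builder( String.class, "text", Modifier.PUBLIC ).build() )
				.build();

		JavaFile.builder( "org.hibernate.search.metamodel.generator.model", myEntity )
				.build()
				.writeTo( filer );
	}
}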

Comment on lines +41 to +65
private static Stream<? extends AnnotationMirror> flattened(Types types, AnnotationMirror annotationMirror) {
	for ( Map.Entry<? extends ExecutableElement, ? extends AnnotationValue> entry : annotationMirror.getElementValues()
			.entrySet() ) {
		if ( entry.getKey().getSimpleName().contentEquals( "value" ) ) {
			if ( entry.getKey().getReturnType() instanceof ArrayType arrayType ) {
				Optional<? extends AnnotationMirror> repeatable = types.asElement( arrayType.getComponentType() )
						.getAnnotationMirrors()
						.stream()
						.filter( a -> ( (TypeElement) a.getAnnotationType().asElement() ).getQualifiedName()
								.contentEquals( "java.lang.annotation.Repeatable" ) )
						.findAny();
				if ( repeatable.isPresent() ) {
					TypeMirror returnType = (TypeMirror) repeatable.get().getElementValues().entrySet().iterator().next()
							.getValue().getValue();
					TypeMirror annotationType = annotationMirror.getAnnotationType();
					if ( types.isSameType( returnType, annotationType ) ) {
						return ( (List<? extends AnnotationMirror>) entry.getValue().getValue() ).stream();
					}
				}
			}

		}
	}
	return Stream.of( annotationMirror );
}
marko-bekhta (Member, Author):

had to "reinvent" the wheel 😄 as these repeatable annotations are wrapped in the list one, even if the user didn't do that explicitly 🫨 (just adapted the code used in the mapper to expend these repeatable annotations)

Comment on lines +36 to +66
public <T> PojoRawTypeModel<T> typeModel(Class<T> clazz) {
	return delegate.typeModel( clazz );
}

@Override
public PojoRawTypeModel<?> typeModel(String name) {
	TypeElement typeElement = context.typeElementsByName().get( name );
	if ( typeElement == null ) {
		try {
			typeElement = context.processorContext().elementUtils().getTypeElement( name );
		}
		catch (Exception e) {
			throw new IllegalArgumentException( name + " not found", e );
		}
	}
	return typeModel( typeElement );
}

public PojoRawTypeModel<?> typeModel(TypeElement typeElement) {
	Name qualifiedName = typeElement.getQualifiedName();
	return elementTypeModelCache.computeIfAbsent( qualifiedName,
			k -> {
				Optional<Class<?>> loadableType = BuiltInBridgeResolverTypes.loadableType( typeElement.asType() );
				if ( loadableType.isPresent() ) {
					return typeModel( loadableType.get() );
				}
				else {
					return new ProcessorPojoRawTypeModel<>( typeElement, context.processorContext(), this );
				}
			} );
}
marko-bekhta (Member, Author):

You'll see a few options for getting types from this introspector...
To leverage all the built-in bridges etc., we have to pass them the Java types they use in a regular run, rather than TypeElements. There's a list of "loadable" types (I'm planning to add a test to make sure that it is kept in sync with what is actually present). When we go through the model and encounter a property of such a loadable type, we end up loading a class and creating a regular type model.
For other cases, when we are working with the type elements, there are ProcessorPojoRawTypeModel and ProcessorPojoPropertyModel. These types implement most of the API, except the annotations, as those aren't available and we are working with annotation mirrors...

Comment on lines +21 to +61
public class ProcessorKeywordFieldProcessor extends AbstractProcessorNonFullTextFieldAnnotationProcessor {
	@Override
	PropertyMappingNonFullTextFieldOptionsStep<?> initSortableFieldMappingContext(PropertyMappingStep mappingContext,
			AnnotationMirror annotation, String fieldName) {
		PropertyMappingKeywordFieldOptionsStep fieldContext = mappingContext.keywordField( fieldName );

		String normalizer = getAnnotationValueAsString( annotation, "normalizer", "" );
		if ( !normalizer.isEmpty() ) {
			fieldContext.normalizer( normalizer );
		}

		Norms norms = getNorms( annotation );
		if ( !Norms.DEFAULT.equals( norms ) ) {
			fieldContext.norms( norms );
		}

		return fieldContext;
	}

	@Override
	protected Optional<IndexFieldTypeFinalStep<?>> configureField(PropertyBindingContext bindingContext,
			AnnotationMirror annotation, ProcessorAnnotationProcessorContext context, Element element, TypeMirror fieldType) {
		StringIndexFieldTypeOptionsStep<?> optionsStep = bindingContext.typeFactory().asString()
				.projectable( getProjectable( annotation ) )
				.sortable( getSortable( annotation ) )
				.searchable( getSearchable( annotation ) )
				.aggregable( getAggregable( annotation ) )
				.norms( getNorms( annotation ) )
				.indexNullAs( getIndexNullAs( annotation ) );

		String normalizer = getAnnotationValueAsString( annotation, "normalizer", "" );
		if ( !normalizer.isEmpty() ) {
			optionsStep.normalizer( normalizer );
		}
		return Optional.of( optionsStep );
	}

	protected Norms getNorms(AnnotationMirror annotation) {
		return Norms.valueOf( getAnnotationValueAsString( annotation, "norms", "DEFAULT" ) );
	}
}
marko-bekhta (Member, Author):

In general all these processors are the same as the ones in the mapper, but they are based on the javax.lang.model.* types. The other difference is that they have two process methods, one for the regular use of the search annotations and the other one for "injectable fields" in the binders.

Comment on lines +30 to +37
if ( propertyBinder != null ) {
	AnnotationValue type = getAnnotationValue( propertyBinder, "type" );
	if ( type != null ) {
		TypeMirror binder = (TypeMirror) type.getValue();

		mapping.binder( new ProcessorPropertyBinder( context, binder ) );
	}
}
marko-bekhta (Member, Author):

at first I thought we could just continue using the programmatic mapping to build the model from the "injected" fields, but that didn't work. So instead, I've added this binder ^

Comment on lines +59 to +77
bindingContext.dependencies().useRootOnly();

for ( Element member : context.elements()
		.getAllMembers( (TypeElement) context.types().asElement( binder ) ) ) {
	if ( member.getKind() == ElementKind.FIELD ) {
		// we only care about fields in the "injectable binders" (at least for now):
		for ( AnnotationMirror annotationMirror : member.getAnnotationMirrors() ) {
			ProcessorPropertyMappingAnnotationProcessor.processor( annotationMirror )
					.ifPresent( p -> p.process(
							bindingContext,
							annotationMirror,
							context,
							member
					) );
		}
	}
}

bindingContext.bridge( DoNothingPropertyBridge.INSTANCE );
marko-bekhta (Member, Author):

The binder takes the binder type from the annotation attribute and then goes through the fields of that type. That's where the "second" processing method for the annotations is used: at this point we are working with the index elements and not the programmatic API, hence the "original" processing method doesn't apply here...

Comment on lines +83 to +86
VectorFieldTypeOptionsStep<?, ?> optionsStep = bindingContext.typeFactory().asVector( vectorType )
		// it doesn't really matter since we won't be actually using this value anywhere,
		// but since it's "optional" let's just keep it simple for now here:
		.dimension( 10 );
marko-bekhta (Member, Author):

This is just an interesting thing to point out: since we don't really care about some things when building this model, we can skip reading them and just hardcode some values instead.
