You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The current version does not allow for custom tokenizers to be passed to the SCA class. As semantic_components.representation.CTFIDFRepresenter it should take tokenizer as an argument (and ignore the language argument which is used to infer the tokenizer right now).
The tokenizer that can be passed should operate like semantic_components.representation.GenericTokenizer and minimally implement a tokenize and __call__ method.
The text was updated successfully, but these errors were encountered:
decomposition.ClusterDecomposer.get_component_repr
method to avoid errors when no components are found.
Adressing issue #2: We've added functionalityto the
`SCA` initialization method to allow for custom tokenizers.
We've alos added respective test cases and notes in the README.
You can now pass a tokenizer argument to the initialization method of SCA. Let me know if there are any problems with it. It seems to be fine for the test case I created.
The current version does not allow for custom tokenizers to be passed to the
SCA
class. Assemantic_components.representation.CTFIDFRepresenter
it should taketokenizer
as an argument (and ignore thelanguage
argument which is used to infer the tokenizer right now).The tokenizer that can be passed should operate like
semantic_components.representation.GenericTokenizer
and minimally implement atokenize
and__call__
method.The text was updated successfully, but these errors were encountered: