Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add capacity to link integers with category numbers for categorical targets #39

Closed
nickreich opened this issue Apr 6, 2023 · 3 comments

Comments

@nickreich
Copy link
Contributor

nickreich commented Apr 6, 2023

In a categorical target definition, could we use what is currently stored in
model_tasks > output_type > categorical > type_id
to encode a mapping between integers and categories, so that we could have sample-based representations of categories? E.g. a table like

output_type | type id | value 
----------- | ------- | ------
"sample"    | 1       | 4
"sample"    | 2       | 3
"sample"    | 3       | 7
"sample"    | 4       | 4

where type_id corresponds to the index of the sample and value corresponds to the number corresponding to the category. Noting that categorical targets are weird because the categories can show up as the type_id for pmf output_type but also as the value for the sample output_type.

Possibly related to the question about whether output_type should be a property of a target specifically or not.

@nickreich
Copy link
Contributor Author

@elray1 notes that this then makes it so you can't see what the file is explicitly, you need additional information to make the file readable.

@annakrystalli
Copy link
Member

I'm guessing we will not be pursuing this idea anymore?

@nickreich
Copy link
Contributor Author

Yeah, I think we should close for now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Development

No branches or pull requests

2 participants