Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Test a "floor number" feature based on unit numbers #90

Open
jeancochrane opened this issue Jan 22, 2025 · 0 comments
Open

Test a "floor number" feature based on unit numbers #90

jeancochrane opened this issue Jan 22, 2025 · 0 comments
Labels
new data/feature Create or edit a column/feature or collect new data

Comments

@jeancochrane
Copy link
Contributor

jeancochrane commented Jan 22, 2025

Roughly 50% of condo PINs have unit identifiers that start with a number:

select numeric_unitno, count(*) as count
from (
    select
        case
            when
                unitno is not null
                and regexp_like(unitno, '^[0-9].*')
            then 'yes'
            else 'no'
        end as numeric_unitno
    from iasworld.pardat
    where cur = 'Y'
        and deactivat is null
        and class = '299'
) as unit_counts
group by numeric_unitno
order by numeric_unitno desc
#	numeric_unitno	count
1	yes	        6937526
2	no	        3501778

It's likely that for many buildings, particularly high rises, these numeric unit identifiers may be structured and contain information about the floor that the unit is on. We should try making a "floor number" feature based on unit identifiers in cases where the identifier appears to be numeric and follows the common pattern of a floor prefix with a unit suffix.

@jeancochrane jeancochrane added the new data/feature Create or edit a column/feature or collect new data label Jan 22, 2025
@jeancochrane jeancochrane changed the title Try constructing a floor feature based on unit numbers Test a "floor number" feature based on unit numbers Jan 22, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new data/feature Create or edit a column/feature or collect new data
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

1 participant