Add capability to handle tmd input #2740
Codecov Report
All modified and coverable lines are covered by tests ✅
Additional details and impacted files
@@           Coverage Diff           @@
##           master    #2740   +/-   ##
=======================================
  Coverage   99.42%   99.42%
=======================================
  Files          13       13
  Lines        2594     2603    +9
=======================================
+ Hits         2579     2588    +9
  Misses         15       15
- "pandas>=1.2.0"
- "bokeh>=1.4.0, <3.0.0"
- "python>=3.9, <3.12"
- "numpy>=1.20, <2.0"
Is it important to provide an upper bound on the versions for certain packages, like numpy and pandas? Historically, Tax-Calculator has tried to remain compatible with the most recent versions of the dependencies.
There are several deprecation warnings in the code (mostly related to pandas) that say the current code will cause errors when pandas 3.0 is used. I checked the pandas website, which said 3.0 would be released any day now.
So, the limits are temporary. They will be removed when the deprecated code gets fixed.
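One way to locate the deprecated calls before lifting the bounds is to escalate deprecation warnings into errors during a test run. The following is a minimal sketch using only the standard library; `legacy_call` is a hypothetical stand-in for code that triggers a pandas deprecation warning and is not taken from Tax-Calculator:

```python
import warnings


def legacy_call():
    """Hypothetical stand-in for code that uses a deprecated feature."""
    warnings.warn("this behavior will change in a future version", FutureWarning)
    return 42


def fails_under_strict_warnings(func):
    """Return True if func emits a FutureWarning or DeprecationWarning."""
    with warnings.catch_warnings():
        # Escalate both deprecation categories to exceptions inside this block.
        warnings.simplefilter("error", FutureWarning)
        warnings.simplefilter("error", DeprecationWarning)
        try:
            func()
        except (FutureWarning, DeprecationWarning):
            return True
    return False
```

A similar effect for a whole test run can usually be had with Python's `-W error::FutureWarning` command-line option, assuming the warnings of interest are raised in that category.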
@martinholmer I've reviewed this PR and had one small question left in my review. But there is a larger issue that this PR touches on that might be worth discussion before merging. There has been work in the past (see PR #2538) to disentangle the tax calculator logic from the data used for specific calculations. Things like the
@jdebacker said in PR #2740:
There has been no work on PR #2538 in over three years. Seems like there is little or no momentum in the "direction that the project was hoping to go".
Thanks for the PR, @martinholmer. Merging.
Add code to handle new private tmd.csv in a manner like the private puf.csv is handled.

The private tmd.csv file contains 2021 data and has its own public tmd_weights.csv.gz file containing weights for 2021 through 2074. Also, the new public growfactors.csv file has factors extending through 2074.

These three files are created in the tax-microdata repository using the following scripts:
- create_taxcalc_input_variables.py produces the private tmd.csv file
- create_taxcalc_sampling_weights.py produces the public tmd_weights.csv.gz file
- create_taxcalc_growth_factors.py produces the public growfactors.csv file

The code in this pull request still specifies the Policy.LAST_BUDGET_YEAR to be 2034; a subsequent pull request will increase it to 2074 (which has already been tested).
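
Before raising the last budget year, it may help to confirm that a weights file really covers every year in the intended range. The sketch below uses only the standard library and assumes, as an illustration, that the weights columns are named `WT<year>` (for example `WT2021`); the in-memory sample is made-up data, not the contents of tmd_weights.csv.gz, and the helper names are hypothetical:

```python
import csv
import gzip
import io


def weight_years(csv_gz_bytes):
    """Return the sorted years found in WT<year> column names of a gzipped CSV."""
    with gzip.open(io.BytesIO(csv_gz_bytes), mode="rt", newline="") as handle:
        header = next(csv.reader(handle))
    return sorted(
        int(name[2:])
        for name in header
        if name.startswith("WT") and name[2:].isdigit()
    )


def covers_years(years, first_year, last_year):
    """Return True if every year from first_year through last_year is present."""
    return set(range(first_year, last_year + 1)).issubset(years)


# Tiny in-memory stand-in for a weights file (made-up values, assumed layout).
columns = [f"WT{year}" for year in range(2021, 2075)]
text = ",".join(columns) + "\n" + ",".join("100" for _ in columns) + "\n"
sample = gzip.compress(text.encode("utf-8"))
```

Calling `covers_years(weight_years(sample), 2021, 2074)` on the sample checks the full range described above; the same check could be pointed at the bytes of a real weights file if its column layout matches the assumption.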