feat: Support weight-stripped engine and REFIT_IDENTICAL flag #3167
Open: zewenli98 wants to merge 36 commits into main from weight_stripped_engine

Commits
40349a8 support weight-stripped engine and REFIT_IDENTICAL flag (zewenli98)
5d7c677 refactor with new design (zewenli98)
82b7ddc lint (zewenli98)
9f6a771 samll fix (zewenli98)
7ea3c0f remove make_refittable (zewenli98)
bf7553b fast refit -> slow refit (zewenli98)
46e9bc8 fix np.bool_, group_norm (zewenli98)
d783fdd add immutable_weights (zewenli98)
160588e skip engine caching for non-refittable engines, slow refit -> fast refit (zewenli98)
493f981 refactored, there are 3 types of engines (zewenli98)
f204104 fix and add tests (zewenli98)
4663c83 fix issues #3206 #3217 (zewenli98)
c57ab06 small fix (zewenli98)
402c9b0 resolve comments (zewenli98)
d8e59da WIP: cache weight-stripped engine (zewenli98)
e8811fd Merge branch 'main' into weight_stripped_engine (zewenli98)
f2e3f00 redesigned hash func and add constant mapping to fast refit (zewenli98)
31af308 refactor and add tests (zewenli98)
1ae33f4 Merge branch 'main' into weight_stripped_engine (zewenli98)
90bf679 update (zewenli98)
a8a34f6 increase ENGINE_CACHE_SIZE (zewenli98)
285bc90 skip some tests (zewenli98)
2d152cf fix tests (zewenli98)
d461608 try fixing cumsum (zewenli98)
d57b885 Merge branch 'main' into weight_stripped_engine (zewenli98)
23d68d5 fix windows cross compile, TODO: whether windows support stripping en… (zewenli98)
a928f67 CI debug test 1 (zewenli98)
02625ca CI debug test 2 (zewenli98)
c462e40 CI debug test 3 (zewenli98)
9ba33b5 Merge branch 'main' into weight_stripped_engine
3d68039 reduce -n to 4 for converter tests on CI (zewenli98)
2e7ef3b reduce -n to 4 for converter tests on CI (zewenli98)
9ff165c simplify test_different_args_dont_share_cached_engine (zewenli98)
8ca8e2d reduce -n to 2 (zewenli98)
f9f2a70 reduce -n to 1 (zewenli98)
c69c61a revert -n back to 4 and chunk converter (zewenli98)
Do we need both np.bool_ and np.bool?
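For context on this question: in NumPy, `np.bool` was a deprecated alias for the builtin `bool` from 1.20, was removed in 1.24, and returned in 2.0 as a name for the boolean scalar type, while `np.bool_` is available throughout. One way to avoid listing both names is to normalize through `np.dtype` before comparing. The sketch below illustrates that idea only; `is_bool_dtype` is a hypothetical helper and is not taken from this PR's diff.

```python
import numpy as np


def is_bool_dtype(dtype) -> bool:
    """Return True if `dtype` normalizes to NumPy's boolean dtype.

    Hypothetical helper for illustration; not part of the PR.
    np.dtype() maps the builtin bool and np.bool_ to the same dtype
    object, so the check does not need to reference np.bool at all.
    """
    return np.dtype(dtype) == np.dtype(np.bool_)


if __name__ == "__main__":
    print(is_bool_dtype(np.bool_))    # NumPy boolean scalar type
    print(is_bool_dtype(bool))        # builtin bool
    print(is_bool_dtype(np.float32))  # a non-boolean dtype
```

Comparing normalized dtypes keeps the check independent of which alias a given NumPy release exposes.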