Add shuffle kwarg to GroupBy.map #9706

Draft: 47 commits to merge into base: main

Commits on Aug 7, 2024

  1. Add GroupBy.shuffle()

    dcherian committed Aug 7, 2024
    3bc51bd
  2. Cleanup

    dcherian committed Aug 7, 2024
    60d7619
  3. Cleanup

    dcherian committed Aug 7, 2024
    d1429cd
  4. fix

    dcherian committed Aug 7, 2024
    31fc00e

Commits on Aug 13, 2024

  1. 4583853
  2. Fix nD by

    dcherian committed Aug 13, 2024
    abd9dd2

Commits on Aug 14, 2024

  1. 6b820aa
  2. Skip if no dask

    dcherian committed Aug 14, 2024
    0d70656
  3. fix tests

    dcherian committed Aug 14, 2024
    fafb937
  4. Merge branch 'main' into groupby-shuffle

    * main:
      Revise (pydata#9366)
      Fix rechunking to a frequency with empty bins. (pydata#9364)
      whats-new entry for dropping python 3.9 (pydata#9359)
      drop support for `python=3.9` (pydata#8937)
      Revise (pydata#9357)
      try to fix scheduled hypothesis test (pydata#9358)
    dcherian committed Aug 14, 2024
    939db9a
  5. Add chunks to signature

    dcherian committed Aug 14, 2024
    a08450e
  6. Fix self

    dcherian committed Aug 14, 2024
    d0cd218
  7. Another Self fix

    dcherian committed Aug 14, 2024
    4edc976
  8. Forward chunks too

    dcherian committed Aug 14, 2024
    0b42be4
  9. [revert]

    dcherian committed Aug 14, 2024
    c52734d
  10. undo flox limit

    dcherian committed Aug 14, 2024
    8180625
  11. [revert]

    dcherian committed Aug 14, 2024
    7897c91
  12. fix types

    dcherian committed Aug 14, 2024
    7773548

Commits on Aug 15, 2024

  1. 51a7723
  2. Add doctest

    dcherian committed Aug 15, 2024
    cc95513
  3. Refactor

    dcherian committed Aug 15, 2024
    18f4a40
  4. tweak docstrings

    dcherian committed Aug 15, 2024
    f489bcf
  5. fix typing

    dcherian committed Aug 15, 2024
    ead1bb4
  6. Fix

    dcherian committed Aug 15, 2024
    75115d0
  7. fix docstring

    dcherian committed Aug 15, 2024
    390863a

Commits on Aug 17, 2024

  1. a408cb0
  2. Merge branch 'main' into groupby-shuffle

    * main:
      Improve error message for missing coordinate index (pydata#9370)
      Add flaky to TestNetCDF4ViaDaskData (pydata#9373)
      Make chunk manager an option in `set_options` (pydata#9362)
      Revise (pydata#9371)
      Remove duplicate word from docs (pydata#9367)
      Adding open_groups to BackendEntryPointEngine, NetCDF4BackendEntrypoint, and H5netcdfBackendEntrypoint (pydata#9243)
    dcherian committed Aug 17, 2024
    7038f37
  3. Fix typing

    dcherian committed Aug 17, 2024
    05a0fb4
  4. Fix types

    dcherian committed Aug 17, 2024
    b8e7f62

Commits on Aug 22, 2024

  1. Merge branch 'main' into groupby-shuffle

    * main:
      Adds copy parameter to __array__ for numpy 2.0 (pydata#9393)
      `numpy 2` compatibility in the `pydap` backend (pydata#9391)
      pyarrow dependency added to doc environment (pydata#9394)
      Extend padding functionalities (pydata#9353)
      refactor GroupBy internals (pydata#9389)
      Combine `UnsignedIntegerCoder` and `CFMaskCoder` (pydata#9274)
      passing missing parameters to ZarrStore.open_store when opening a datatree (pydata#9377)
      Fix tests on big-endian systems (pydata#9380)
      Improve error message on `ds['x', 'y']` (pydata#9375)
    dcherian committed Aug 22, 2024
    6d9ed1c

Commits on Aug 30, 2024

  1. Merge branch 'main' into groupby-shuffle

    * main:
      Accessibility: Add keyboard handling for XArray HTML view (pydata#9412)
      [pre-commit.ci] pre-commit autoupdate (pydata#9316)
      [skip-ci] Speed up docs build by limiting toctrees (pydata#9395)
      fix the failing `pre-commit.ci` runs (pydata#9411)
      Update benchmarks.yml (pydata#9406)
      GroupBy(multiple groupers) (pydata#9372)
      Encode/decode property tests use variables() (pydata#9401)
    dcherian committed Aug 30, 2024
    20a8cd9
  2. remove shuffle_by for now.

    dcherian committed Aug 30, 2024
    7a99c8f
  3. Add tests

    dcherian committed Aug 30, 2024
    5e2fdfb
  4. a22c7ed

Commits on Sep 11, 2024

  1. Revert "remove shuffle_by for now."

    This reverts commit 7a99c8f.
    dcherian committed Sep 11, 2024
    2d48690

Commits on Sep 12, 2024

  1. Merge branch 'main' into groupby-shuffle

    * main: (29 commits)
      Release notes for v2024.09.0 (pydata#9480)
      Fix `DataTree.coords.__setitem__` by adding `DataTreeCoordinates` class (pydata#9451)
      Rename DataTree's "ds" and "data" to "dataset" (pydata#9476)
      Update DataTree repr to indicate inheritance (pydata#9470)
      Bump pypa/gh-action-pypi-publish in the actions group (pydata#9460)
      Repo checker (pydata#9450)
      Add days_in_year and decimal_year to dt accessor (pydata#9105)
      remove parent argument from DataTree.__init__ (pydata#9465)
      Fix inheritance in DataTree.copy() (pydata#9457)
      Implement `DataTree.__delitem__` (pydata#9453)
      Add ASV for datatree.from_dict (pydata#9459)
      Make the first argument in DataTree.from_dict positional only (pydata#9446)
      Fix typos across the code, doc and comments (pydata#9443)
      DataTree should not be "Generic" (pydata#9445)
      Disallow passing a DataArray as data into the DataTree constructor (pydata#9444)
      Support additional dtypes in `resample` (pydata#9413)
      Shallow copy parent and children in DataTree constructor (pydata#9297)
      Bump minimum versions for dependencies (pydata#9434)
      Always include at least one category in random test data (pydata#9436)
      Avoid deep-copy when constructing groupby codes (pydata#9429)
      ...
    dcherian committed Sep 12, 2024
    0679d2b

Commits on Sep 17, 2024

  1. Merge branch 'main' into groupby-shuffle

    * main:
      Opt out of floor division for float dtype time encoding (pydata#9497)
      fixed formatting for whats-new (pydata#9493)
      Forbid modifying names of DataTree objects with parents (pydata#9494)
      DAS-2155 - Merge datatree documentation into main docs. (pydata#9033)
      Make illegal path-like variable names when constructing a DataTree from a Dataset (pydata#9378)
      Ensure TreeNode doesn't copy in-place (pydata#9482)
      `open_groups` for zarr backends (pydata#9469)
      Update pyproject.toml (pydata#9484)
      New whatsnew section (pydata#9483)
    dcherian committed Sep 17, 2024
    63b3e77
  2. bad merge

    dcherian committed Sep 17, 2024
    7dc5dd1

Commits on Sep 18, 2024

  1. Merge branch 'main' into groupby-shuffle

    * main:
      Turn off survey banner (pydata#9512)
      Stateful test: silence DeprecationWarning from drop_dims (pydata#9508)
    dcherian committed Sep 18, 2024
    bad0744
  2. Add a test

    dcherian committed Sep 18, 2024
    91e4bd8

Commits on Nov 1, 2024

  1. Merge branch 'main' into groupby-shuffle

    * main: (85 commits)
      Refactor out utility functions from to_zarr (pydata#9695)
      Use the same function to floatize coords in polyfit and polyval (pydata#9691)
      Add `DataTree.persist` (pydata#9682)
      Typing annotations for arithmetic overrides (e.g., DataArray + Dataset) (pydata#9688)
      Raise `ValueError` for unmatching chunks length in `DataArray.chunk()` (pydata#9689)
      Fix inadvertent deep-copying of child data in DataTree (pydata#9684)
      new blank whatsnew (pydata#9679)
      v2024.10.0 release summary (pydata#9678)
      drop the length from `numpy`'s fixed-width string dtypes (pydata#9586)
      fixing behaviour for group parameter in `open_datatree` (pydata#9666)
      Use zarr v3 dimension_names (pydata#9669)
      fix(zarr): use inplace array.resize for zarr 2 and 3 (pydata#9673)
      implement `dask` methods on `DataTree` (pydata#9670)
      support `chunks` in `open_groups` and `open_datatree` (pydata#9660)
      Compatibility for zarr-python 3.x (pydata#9552)
      Update to_dataframe doc to match current behavior (pydata#9662)
      Reduce graph size through writing indexes directly into graph for ``map_blocks`` (pydata#9658)
      Add close() method to DataTree and use it to clean-up open files in tests (pydata#9651)
      Change URL for pydap test (pydata#9655)
      Fix multiple grouping with missing groups (pydata#9650)
      ...
    dcherian committed Nov 1, 2024
    0542944
  2. Add docs

    dcherian committed Nov 1, 2024
    1e4f805
  3. bugfix

    dcherian committed Nov 1, 2024
    ad502aa

Commits on Nov 3, 2024

  1. 4b0c143
  2. Merge branch 'main' into groupby-shuffle

    * main:
      update mypy to 1.13 (pydata#9687)
    dcherian committed Nov 3, 2024
    2b2c4ab
  3. fix types

    dcherian committed Nov 3, 2024
    f624c8f
  4. fa6311a