Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

V2 branch #547

Merged
merged 36 commits into from
Nov 16, 2024
Merged

V2 branch #547

merged 36 commits into from
Nov 16, 2024

Conversation

jonmmease
Copy link
Collaborator

merge v2 branch in main for 2.0.0-rc1 release

jonmmease and others added 30 commits September 28, 2024 17:29
* Remove vegafusion-jupyter package

* port altair mock tests to use JupyterChart and

* relock, update pixi version in GA

* update ci

* install chrome, install test deps

* Don't run selenium tests on osx arm

* debugging

* disable selenium tests for windows

* remove unneeded windows deps

* comment other jobs

* add ipykernel

* fix selection

* restore all tests

* restore all tests

* comment out test with slider position issue
* Remove altair functionality from vegafusion package

* remove altair dep

* Install altair when testing
* combine vegafusion and vegafusion-python-embed

* fix exclude

* action fixes

* exclude based on crate name not directory name

* Fix pip install paths

* don't try to import embed
* drop Java library

* update workspace
…Python logic (#513)

* Initial update to DataFusion 42 and Arrow 53.1

* Remove protobuf python code

* Remove python protobuf instructions

* fix jsonwriter tests

* fix select tests

* more sort test fixes

* more test fixes

* test / clippy fixes

* more test/lint fixes

* clippy/fmt fixes

* Remove order col after evaluation

* clippy fix

* fixes

* update baselines

* fix sql tests

* update baselines

* sort

* fmt

* use published arrow crates
* Add VegaFusionWidget based on AnyWidget

* remove comment

* add vegafusion_widget test

* Rename test file

* fix ignore path for rename
* ruff format

* Initial lint fix

* Initial ruff lint passing

* Add mypy type checking

* Add ci check

* Add py.typed

* ignore mypy errors in jupyter widget

* fix imports

* bump min version to Python 3.9

* numpy < 2
* Add get_column_usage utility function

* Add Python function for column usage
* Add initial Arrow PyCapsule support

* Bump python in actions to 3.11

* fmt

* update hang test to use polars with pycapsule path

* Add system python of 3.11

* Add more efficient hashing and rechunk for DataFusion

* fmt

* use narwhals to remove unused columns.

* toward removing arrow/pyarrow flag

* Remove arrow-rs pyarrow flag, use pyo3-arrow

* Rename pyarrow feature flag to `py`

* toward using narwhals to process transformed data

* Remove python datasource

* Handle dict and non-narwhals pycapsule types

* update default extraction to arro3

* fix type checking

* build py before type checking

* skip empty fields in window transform

* Try normalize category order

* lower min Python back to 3.9

* clear wheel build dir first

* try rename artifacts

* cache?
* remove dep on psutil, pandas, pyarrow

* Add test that polars usage doesn't import pandas/pyarrow

* Add pandas/pyarrow to test deps
* Move ChartState to vegafusion-core, add VegaFusionRuntimeTrait

* format

* fmt

* type fixes
* Reapply "Refactor to move ChartState to vegafusion core (#519)" (#520)

This reverts commit 713b159.

* Use ChartState in vegafusion-wasm

* format
* Add grpc VegaFusionRuntimeTraim implementation, use from Python

* include inline datasets in query_request

* fix tests

* fix tests

* Add unimplemented stubs in wasm implementation

* fmt

* Move methods up to VegaFusionRuntimeTrait

* fmt/fix

* handle inline datasets in vegafusion-wasm

* warning / format

* fix requested indices

* fix python lint

* fix expected error message

* relock, mypy fixes

* bring back ignores for CI?
* Add release-opt profile, remove unused flags/deps

* add profile for minimizing size

* Fix tests
* Remove Python sql connection / sql dataset

* Remove rust python connection logic

* Fix python tests

* fmt
…Fs (#525)

* wip refactor to use DataFusion's DataFrame

* Add aggregate support

* Port additional transforms

* Port additional transforms

* Port window transform

* Port fold transform

* Port impute transform

* Port pivot transform

* port timeunit

* get schema from first batch

* Don't require metadata match

* start porting stack

* finish stack transform port

* Use object-store with DataFusion to load from http

* wip time functions

* parse %Y-%m-%d in UTC like the browser

* Update timeunit transform to use datafusion operations

* remove unused UDFs

* json fallback to reqwest

* Fix timezone parsing

* Fix selection_test

* all custom spec tests passing

* get image_comparison tests passing

* Get all vegafusion-runtime tests passing

* fix

* fix

* remove more udfs

* remove vegafusion-datafusion-udfs, vegafusion-dataframe, and vegafusion-sql crates

* fix tests

* clippy fix

* format

* warnings / format

* python test updates

* Update to datafusion main

* fmt

* re-enable format millis test, fix substr args

* Support Utf8View in json writer

* fix remaining python tests

* fmt

* clippy fix

* fmt

* work around wasm-pack error

* add call to update-pkg.js

* remove some stale comments
…526)

* Use vega-embed in vegafusion-wasm, drop vegafusion-embed package

* expose view, remove unneeded methods
* Support compiling full runtime to wasm

* fix

* fmt

* fix imports

* update profile

* profile updates

* update release-dev
* use Narwhals stable api, use expressions

* just use .dt.convert_time_zone
* Add "get column usage" section to docs

* fmt
* transformed data cleanup and docs

* remove redundant grpc runtime implementations

* fmt

* fix tests

* clippy fixes

* python lint

* lint/types

* pandas first

* fix test
* Document pre_transform_spec

* Document pre_transform_extract

* Support maxbins property as expression

* Add chart state documentation

* fmt

* fix example

* fix test
* Add logical plan and substrait plan enums items for VegaFusionDataset

* flag proto/substrait

* Add simple inline_dataset examples

* Add logical plan dataset example

* Add inline datasets doc page

* add grpc page, better dark-mode logo.

* Add about and community docs
* use pydata-sphinx theme, add old blog posts

* switch to pydata sphinx theme

* remove unused file
* Add JavaScript embed docs

* doc review

* fmt
* bump version to 2.0.0-a0

* Add empty embed dep group for backward compat

* update README

* use release-opt profile for building wheels

* remove action pinning, update some versions

* update to upload artifact v4

* normalize alpha version to not have hyphen in Python

* fix deps

* put wasm package in a directory so macos doesn't unzip it.

* update action versions

* update versions

* release-opt

* fix dep

* to macos 13

* only upload test images on failures

* upload wasm package as directory
* update RELEASE, add sync_docs script

* pixi task
…ames (#544)

* Use select in narwhals to support duckdb/ibis

* drop patch

* add clear_cache runtime method

* fix
* move prost gen to default out dir location

* clippy fix

* fmt

* remove --no-verify

* fix warning
* standardize on "comm_plan"

* Add jupyter widget features section

* image

* Add 2.0 release post

* bump version to 2.0.0-rc1
@jonmmease jonmmease merged commit b148af5 into main Nov 16, 2024
16 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants