Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create 2024-08-12-hudi-acid-transactions.md #11760

Closed
wants to merge 5,246 commits into from
Closed

Create 2024-08-12-hudi-acid-transactions.md #11760

wants to merge 5,246 commits into from

Conversation

alberttwong
Copy link
Contributor

@alberttwong alberttwong commented Aug 12, 2024

Change Logs

adding article on hudi acid transactions

Impact

no impact

Risk level (write none, low medium or high below)

none

Documentation Update

none

Contributor's checklist

  • Read through contributor's guide
  • Change Logs and Impact were stated clearly
  • Adequate tests were added if applicable
  • CI passed

jonvex and others added 30 commits May 10, 2024 14:19
…and use BaseFileUtils (#11185)

Co-authored-by: Jonathan Vexler <=>
Co-authored-by: Y Ethan Guo <[email protected]>
…dieRecords and JsonKafkaSource mapPartitions (#10872)

Co-authored-by: Y Ethan Guo <[email protected]>
Co-authored-by: Jonathan Vexler <=>
Co-authored-by: Y Ethan Guo <[email protected]>
Co-authored-by: Jonathan Vexler <=>
Co-authored-by: Y Ethan Guo <[email protected]>
Co-authored-by: Jonathan Vexler <=>
* [HUDI-7749] Bump Spark version 3.3.1 to 3.3.4

* cdcFileReader should return batches for CDC reads only when batch read is supported for the schema
…ead of just base files while initializing (#11153)

Co-authored-by: Y Ethan Guo <[email protected]>
…10763)

Co-authored-by: Y Ethan Guo <[email protected]>
Co-authored-by: Jonathan Vexler <=>
…tDeltaCommitsSinceLatestCompaction (#10915)

* Fixing schedule compaction bug

* Addressing comments

* Fixing CDC tests
…ite keys (#11077)

Introduce a new abstract class `BaseHoodieMergedLogRecordScanner` which 
subclasses from `AbstractHoodieLogRecordReader`. The new abstract class 
holds the `records` map as `ExternalSpillableMap<K, HoodieRecord>` and exposes 
`public abstract Map<K, HoodieRecord> getRecords()` API. The existing 
`HoodieMergedLogRecordScanner` now derives from the new abstract class 
(instead of `AbstractHoodieLogRecordReader`) and uses String keys.
wombatu-kun and others added 21 commits July 29, 2024 17:30
1. Support MOR table type for HoodieJavaWriteClientExample
2. Typo Fix

Signed-off-by: ther1sing3un <[email protected]>
…get clustering instant contains insert overwrite (#11691)
…gs (#11638)

Store partition type as well in `hoodie.table.partition.fields`. 
Currently this config stores the commas separated partition fields 
without the partition type. Partition type here corresponds to the 
partition type supported in Custom key generators like `simple` 
and `timestamp`.

Co-authored-by: Sagar Sumit <[email protected]>
…partitioning options are not available (#11679)

Co-authored-by: Y Ethan Guo <[email protected]>
)

Updated AwsGlueCatalaogSyncClient createOrReplace method to drop and 
create the table, instead of updating the table. Updated the check to compare 
the base of the hudi table and metastore table location to ignore file schemas.

---------

Co-authored-by: Vamsi <[email protected]>
Co-authored-by: Sagar Sumit <[email protected]>
Co-authored-by: Jonathan Vexler <=>
* [HUDI-8012] Update checkstyle.xml based on the new release

* Upgrade checkstyle and plugin

* Fix checkstyle errors

* Suppress checkstyle error for sql
@github-actions github-actions bot added docs size:S PR with lines of changes in (10, 100] size:XL PR with lines of changes > 1000 and removed size:S PR with lines of changes in (10, 100] labels Aug 12, 2024
@alberttwong alberttwong deleted the patch-1 branch August 12, 2024 22:56
@alberttwong alberttwong restored the patch-1 branch August 12, 2024 23:10
@alberttwong alberttwong deleted the patch-1 branch August 12, 2024 23:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
docs size:XL PR with lines of changes > 1000
Projects
None yet
Development

Successfully merging this pull request may close these issues.