Releases: linkedin/iceberg

v1.0.0.1

17 Dec 01:58
1a5b94e

Changelog generated by Shipkit Changelog Gradle Plugin

1.0.0.1

  • 2022-12-17 - 1 commit(s) by Limian (Raymond) Zhang
  • Rebase LI-Iceberg changes on top of Apache Iceberg 1.0.0 release (#131)
  • Bug fix: MergeHiveSchemaWithAvro should retain avro properties for li… (#125)
  • Do not push down filter to ORC for union type schema (#123)
  • Add logic to derive partition column id from partition.column.ids pro… (#122)
  • Hive: Return null for currentSnapshot() (#121)
  • Fix MergeHiveSchemaWithAvro to make it copy full Avro schema attributes (#120)
  • Set ORC columns and fix case-sensitivity issue with schema check (#119)
  • ORC: Handle query where select and filter only uses default value col… (#118)
  • Check for schema corruption and fix it on commit (#117)
  • Support single type union for ORC-vectorization reader (#114)
  • Enhance the UT for testing required fields with default values (#113)
  • Fix and refactor schema parser (#112)
  • Add scm and developer info (#111)
  • [Hivelink] Refactor support hive non string partition cols to rid of … (#110)
  • Retain the type of field while copying the default values. (#109)
  • Handle single type union for deep nested schema (#107)
  • Disable default value preserving (#106)
  • Handle non-nullable union of single type for ORC spark non-vectorized reader (#104)
  • Remove activeSparkSession (#103)
  • Move 'Hive Metadata Scan: Support case insensitive name mapping' (PR 52) to hivelink-core (#102)
  • Release automation overhaul: Sonatype Nexus, Shipkit and GH Actions (#101)
  • Handle null default in nested type default value situations (#100)
  • Move custom hive catalog to hivelink-core (#99)
  • Handle non-nullable union of single type for Avro (#98)
  • Improve the logging when the deserialized index is invalid to read the symbol from enum (#96)
  • [LI][Spark][Avro] read avro union using decoder instead of directly returning v… (#94)
  • [LI][Avro] Do not reorder elements inside an Avro union schema (#93)
  • [LI] Align default value validation with Avro semantics in terms of nullable (nested) fields (#92)
  • Support timestamp as partition type (#91)
  • reverting commits 2c59857 and f362aed (#88)
  • Separate classes under hive legacy package to new hivelink module (#87)
  • Reorder hive table properties to align the avro.schema.literal place… (#86)
  • ORC: Change union read schema from hive to trino (#85)
  • Avro: Change union read schema from hive to trino (#84)
  • Fix spark avro reader reading union schema data (#83)
  • Fix ORC schema visitors to support reading ORC files with deeply nest… (#81)
  • Support avro.schema.literal/hive union types in Hive legacy table to … (#80)
  • Support hive non string partition cols (#78)
  • Support default value read for ORC format in spark (#76)
  • [#2039] Support default value semantic for AVRO (#75)
  • Support reading ORC complex union types (#74)
  • Support reading Avro complex union types (#73)
  • Change error handling to not delete data from HDFS when hive metaStore has been updated (#68)
  • Fix HasDuplicateLowercaseColumnNames's visit method to use a new visi… (#67)
  • Stop using serdeToFileFormat to unblock formats other than Avro or Orc (#64)
  • Fix support for Hive timestamp type (#61)
  • fix hive primitive typeinfo to avro logical type conversion. (#58)
  • Alternative implementation of #55 Hive Metadata Scan: Merge Hive and Avro schemas if they are inconsistent (#57)
  • ORC: Support NameMapping with row-level filtering (#53)
  • Hive Metadata Scan: Support case insensitive name mapping (#52)
  • Fix pushdown of non-partition predicates within NOT (#51)
  • Hive Table Operations: Do not throw an exception on dangling partitions; log warning message (#50)
  • Hive Metadata Scan: Return empty statistics (#49)
  • Rebase: Hive: Fix schema not forwarded to SerDe on MR jobs (#45) (#47)
  • Hive: Fix schema not forwarded to SerDe on MR jobs (#45)
  • Hive: Fix uppercase bug and determine catalog from table properties (#38)
  • Hive Metadata Scan: Do not use table snapshot summary for estimating statistics (#37)
  • Hive Metadata Scan: Notify ScanEvent listeners on planning (#35)
  • Make LegacyHiveTable extensible. (#26)
  • Support for Hive tables without avro.schema.literal (#25)
  • Support for non string partition columns (#24)
  • Support reading tables with only Hive metadata (#23)
  • Simplify CustomHiveCatalog (#22)
  • Support reading ORC files without Iceberg IDs (#16)
  • Add an iceberg-runtime shaded module (#12)
  • Add custom hive catalog to not override existing Hive metadata (#10)

v0.11.1.19

14 Dec 22:49
976277d

0.11.1.19

  • 2022-12-14 - 1 commit(s) by Limian (Raymond) Zhang
  • [LI] Add feature for Spark ORC reader to ignore field ids in files by using a new table property (#134)
  • Add logic to derive partition column id from partition.column.ids pro… (#122)

v0.14.1.4

03 Nov 00:59
a248fe4

0.14.1.4

  • 2022-11-03 - 1 commit(s) by Limian (Raymond) Zhang
  • Fix netty dependency and row-level filter issue in spark2 (#130)

v0.14.1.3

31 Oct 19:09
3740f36

0.14.1.3

  • 2022-10-31 - 1 commit(s) by Limian (Raymond) Zhang
  • Add a depend-all module for easy ELR integration (#129)

v0.14.1.2

26 Oct 04:06
6d9f637

0.14.1.2

  • 2022-10-26 - 1 commit(s) by Limian (Raymond) Zhang
  • Remove netty-common module from the spark-runtime fat jar (#127)

v0.11.1.18

26 Oct 03:43
00b370e

0.11.1.18

  • 2022-10-26 - 1 commit(s) by Limian (Raymond) Zhang
  • Let shipkit automatically generate change log for github release (#128)

v0.14.1.1

21 Oct 23:28
211a8c9

0.14.1.1

  • 2022-10-21 - 1 commit(s) by Limian (Raymond) Zhang
  • Rebase LI-Iceberg changes on top of Apache Iceberg 0.14.1 release (#126)
  • Bug fix: MergeHiveSchemaWithAvro should retain avro properties for li… (#125)
  • Do not push down filter to ORC for union type schema (#123)
  • Add logic to derive partition column id from partition.column.ids pro… (#122)
  • Hive: Return null for currentSnapshot() (#121)
  • Fix MergeHiveSchemaWithAvro to make it copy full Avro schema attributes (#120)
  • Set ORC columns and fix case-sensitivity issue with schema check (#119)
  • ORC: Handle query where select and filter only uses default value col… (#118)
  • Check for schema corruption and fix it on commit (#117)
  • Support single type union for ORC-vectorization reader (#114)
  • Enhance the UT for testing required fields with default values (#113)
  • Fix and refactor schema parser (#112)
  • Add scm and developer info (#111)
  • [Hivelink] Refactor support hive non string partition cols to rid of … (#110)
  • Retain the type of field while copying the default values. (#109)
  • Handle single type union for deep nested schema (#107)
  • Disable default value preserving (#106)
  • Handle non-nullable union of single type for ORC spark non-vectorized reader (#104)
  • Remove activeSparkSession (#103)
  • Move 'Hive Metadata Scan: Support case insensitive name mapping' (PR 52) to hivelink-core (#102)
  • Release automation overhaul: Sonatype Nexus, Shipkit and GH Actions (#101)
  • Handle null default in nested type default value situations (#100)
  • Move custom hive catalog to hivelink-core (#99)
  • Handle non-nullable union of single type for Avro (#98)
  • Improve the logging when the deserialized index is invalid to read the symbol from enum (#96)
  • [LI][Spark][Avro] read avro union using decoder instead of directly returning v… (#94)
  • [LI][Avro] Do not reorder elements inside an Avro union schema (#93)
  • [LI] Align default value validation with Avro semantics in terms of nullable (nested) fields (#92)
  • Support timestamp as partition type (#91)
  • reverting commits 2c59857 and f362aed (#88)
  • Separate classes under hive legacy package to new hivelink module (#87)
  • Reorder hive table properties to align the avro.schema.literal place… (#86)
  • ORC: Change union read schema from hive to trino (#85)
  • Avro: Change union read schema from hive to trino (#84)
  • Fix spark avro reader reading union schema data (#83)
  • Fix ORC schema visitors to support reading ORC files with deeply nest… (#81)
  • Support avro.schema.literal/hive union types in Hive legacy table to … (#80)
  • Support hive non string partition cols (#78)
  • Support default value read for ORC format in spark (#76)
  • [#2039] Support default value semantic for AVRO (#75)
  • Support reading ORC complex union types (#74)
  • Support reading Avro complex union types (#73)
  • Change error handling to not delete data from HDFS when hive metaStore has been updated (#68)
  • Fix HasDuplicateLowercaseColumnNames's visit method to use a new visi… (#67)
  • Stop using serdeToFileFormat to unblock formats other than Avro or Orc (#64)
  • Fix support for Hive timestamp type (#61)
  • fix hive primitive typeinfo to avro logical type conversion. (#58)
  • Alternative implementation of #55 Hive Metadata Scan: Merge Hive and Avro schemas if they are inconsistent (#57)
  • ORC: Support NameMapping with row-level filtering (#53)
  • Hive Metadata Scan: Support case insensitive name mapping (#52)
  • Fix pushdown of non-partition predicates within NOT (#51)
  • Hive Table Operations: Do not throw an exception on dangling partitions; log warning message (#50)
  • Hive Metadata Scan: Return empty statistics (#49)
  • Rebase: Hive: Fix schema not forwarded to SerDe on MR jobs (#45) (#47)
  • Hive: Fix schema not forwarded to SerDe on MR jobs (#45)
  • Hive: Fix uppercase bug and determine catalog from table properties (#38)
  • Hive Metadata Scan: Do not use table snapshot summary for estimating statistics (#37)
  • Hive Metadata Scan: Notify ScanEvent listeners on planning (#35)
  • Make LegacyHiveTable extensible. (#26)
  • Support for Hive tables without avro.schema.literal (#25)
  • Support for non string partition columns (#24)
  • Support reading tables with only Hive metadata (#23)
  • Simplify CustomHiveCatalog (#22)
  • Support reading ORC files without Iceberg IDs (#16)
  • Add an iceberg-runtime shaded module (#12)
  • Add custom hive catalog to not override existing Hive metadata (#10)

v0.11.1.17

22 Sep 23:27
fed806b

0.11.1.17

  • 2022-09-22 - 0 commit(s) by
  • No notable improvements. No pull requests (issues) were referenced from commits.

v0.11.1.16

26 Aug 21:34
5f214b1

0.11.1.16

  • 2022-08-26 - 0 commit(s) by
  • No notable improvements. No pull requests (issues) were referenced from commits.

v0.11.1.15

24 Aug 19:33
f2e5bd2

0.11.1.15

  • 2022-08-24 - 0 commit(s) by
  • No notable improvements. No pull requests (issues) were referenced from commits.