Replies: 12 comments 3 replies
-
Apache Paimon is an effort undergoing incubation at The Apache Software Foundation (ASF).
-
I'm just curious what you're expecting here. I saw your issue #189, but I'm still a little confused. Are you asking whether Hydra can consume data from Paimon, store data in its format, query it, or something else? I'm happy to advocate for something cool, I just need to understand what it is :)
-
In China, a large number of users use Paimon for real-time computing over massive data lakes. If Hydra provided an interface or driver for accessing Paimon, that would open up another broad use case for Hydra. Well-known real-time data warehouse systems such as ClickHouse, Doris, and StarRocks already support reading and computing over Paimon data.
-
Due to the popularity of Flink, many users in China use the Paimon Catalog to build data lakes. Columnar, vectorized computing software (such as ClickHouse and Hydra) then consumes the data lake data for real-time computing and stores the results in its own database (for example, ClickHouse), thereby integrating with the data lake. In this process there is only one copy of the data in the data lake, which reduces storage costs and data movement.
-
So this would be an external table source? Please forgive me, I'm very ignorant of Paimon. Is it similar to Parquet, where it could be considered a data source, or is it another database to query from? If it is similar to Parquet (a data source), then it should be possible to support, but it would be helpful if there were already an FDW for it.
-
It is indeed a data source. It is also a data lake: in China, the best practice is to store all of an enterprise's original business data together in what is called a data lake, so that all analysis software can then perform real-time or near-real-time calculations on that data according to business needs.
-
Does it work with Postgres?
-
I'm just trying to clarify your request: is it that we read a Paimon data source directly from a Paimon server (or from a file format, like Parquet), import the data over JDBC into a Hydra table, or export from Hydra into something that Paimon can deal with? And as @wuputah asked, does it already work directly with Postgres? If so, after a code review it might be something we could easily support.
-
Or are you asking whether we can export, as in the diagram you posted, from Hydra to Paimon?
-
Hydra is a columnar store for Postgres, so if you can connect to Postgres (e.g. with JDBC), then it will work.
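To make the JDBC route concrete, here is a minimal sketch of what it might look like from a client's side. It is only an illustration under several assumptions: the table name `lake_events`, its columns, the `Row` helper, and the connection arguments are all hypothetical, the rows are assumed to have been read from the lake by some other tool, and the PostgreSQL JDBC driver is assumed to be on the classpath. Nothing here is Paimon-specific.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.sql.Statement;
import java.util.List;

public class HydraJdbcSketch {

    // Hypothetical row shape, for illustration only.
    static final class Row {
        final long id;
        final String payload;
        Row(long id, String payload) { this.id = id; this.payload = payload; }
    }

    // Hydra is reached like any other Postgres server, so a standard
    // PostgreSQL JDBC URL is used.
    static String jdbcUrl(String host, int port, String database) {
        return "jdbc:postgresql://" + host + ":" + port + "/" + database;
    }

    // DDL for a hypothetical columnar table (table and column names are assumptions).
    static String createTableSql() {
        return "CREATE TABLE IF NOT EXISTS lake_events (id BIGINT, payload TEXT) USING columnar";
    }

    // Copies rows that were already read from the lake into the Hydra table.
    static void load(Connection conn, List<Row> rows) throws SQLException {
        try (Statement st = conn.createStatement()) {
            st.execute(createTableSql());
        }
        String insert = "INSERT INTO lake_events (id, payload) VALUES (?, ?)";
        try (PreparedStatement ps = conn.prepareStatement(insert)) {
            for (Row row : rows) {
                ps.setLong(1, row.id);
                ps.setString(2, row.payload);
                ps.addBatch();
            }
            ps.executeBatch();
        }
    }

    public static void main(String[] args) throws SQLException {
        if (args.length < 5) {
            System.out.println("usage: HydraJdbcSketch <host> <port> <database> <user> <password>");
            return;
        }
        String url = jdbcUrl(args[0], Integer.parseInt(args[1]), args[2]);
        // Requires the PostgreSQL JDBC driver on the classpath.
        try (Connection conn = DriverManager.getConnection(url, args[3], args[4])) {
            load(conn, List.of(new Row(1L, "example")));
        }
    }
}
```

Reading from Paimon itself is outside this sketch; the point is only that anything able to speak JDBC to Postgres can write into, or query, a Hydra table.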
-
Apache Paimon: Streaming data lake platform with high-speed data ingestion, changelog tracking, and efficient real-time analytics.