How can I create trino / hive tables on top of existing HDFS parquet #78
Unanswered
terranceuney asked this question in Q&A
Replies: 0 comments
Context
I have successfully run the demo to configure HDFS, Hive, Trino, Superset and Spark using Stackable. Details:
- `hdfs` cluster: up and running with Kerberos authentication
- `hive` cluster (metastore): up and running, connected to HDFS with Kerberos without problems
- `trino`, `superset`: also up and running

When I define a Trino catalog with the Iceberg connector, I am able to insert data into HDFS (using Spark SQL), and I am also able to query it via Superset and Trino using your demo. All of this works fine.
Problems
We have a lot of HDFS data (in parquet format). Before using Stackable, we were able to create Superset dashboards by defining a (`hive-metastore`, `trino`, `hive-server2`) stack in which:
- `hive-metastore` (with Postgres): provides the metastore for both Trino and `hive-server2`
- `hive-server2`: we use this to create EXTERNAL TABLEs with data on HDFS
- `trino` / `superset`: query the HDFS data using the tables defined by `hive-server2`

With Stackable, it seems we cannot migrate our existing HDFS parquet Hive tables because:
- Only `hive-metastore` is available, not `hive-server2` itself as a native query engine.
- The `iceberg` Trino catalog connector does not seem to allow creating a table over existing HDFS parquet.
- With the `hive` Trino catalog connector, querying in `superset` (through Trino) returns empty data.

Below are my questions:
- Is there any way to create Trino tables using the `iceberg` OR `hive` connector (both using the Hive metastore) on EXISTING HDFS parquet?
- Is there any way to integrate native Hive with `spark` / `trino` so that I can create Hive tables using native Hive queries?
- I saw that the current `spark-k8s-operator (24.11.1)` only supports `hive 2.3.9` jars, BUT the current Hive version is `3.1.2`. If using Spark + native Hive, how can we upgrade those jars to `3.1.2`?

Looking forward to your help on this, thanks a lot.
If possible, I can share my `trino.yaml`, `hdfs.yaml`, `hive.yaml` & `superset.yaml` files.
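To make the first question concrete, here is a minimal sketch of the kind of statement in question: a Trino `CREATE TABLE` for the Hive connector that registers an external table over files already on HDFS, using the connector's `format` and `external_location` table properties. The helper name `external_table_ddl`, the catalog `hive`, the schema `legacy`, the table `events`, the column list and the HDFS path are all hypothetical placeholders, not taken from the actual setup. Whether this avoids the empty-result problem described above is not confirmed here.

```python
def external_table_ddl(catalog, schema, table, columns, location):
    """Build a Trino CREATE TABLE statement (Hive connector) for an
    external table over an existing directory of parquet files.

    `columns` is a list of (name, trino_type) pairs; they are assumed to
    match the schema of the parquet files already stored at `location`.
    """
    cols = ",\n".join(f"    {name} {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE TABLE {catalog}.{schema}.{table} (\n"
        f"{cols}\n"
        ")\n"
        "WITH (\n"
        "    format = 'PARQUET',\n"
        # external_location should point at the directory, not a single file
        f"    external_location = '{location}'\n"
        ")"
    )


# Hypothetical example values; replace with the real catalog, schema and path.
ddl = external_table_ddl(
    catalog="hive",
    schema="legacy",
    table="events",
    columns=[
        ("event_id", "bigint"),
        ("event_time", "timestamp"),
        ("payload", "varchar"),
    ],
    location="hdfs://example-nameservice/data/events",
)
print(ddl)
```

The printed statement could then be run from the Trino CLI or any Trino client against the catalog that uses the Hive connector. The sketch only builds the SQL text and does not connect to a cluster.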