This is the Unity Catalog server implementation that can be run in the cloud or started locally.
To run against the latest main branch, start by cloning the open source Unity Catalog GitHub repository:
git clone [email protected]:unitycatalog/unitycatalog.git
To run Unity Catalog, you need Java 17 installed on your machine. You can always run the java --version
command
to verify that you have the right version of Java installed such as the following example output.
% java --version
openjdk 17.0.12 2024-07-16
OpenJDK Runtime Environment Homebrew (build 17.0.12+0)
OpenJDK 64-Bit Server VM Homebrew (build 17.0.12+0, mixed mode, sharing)
Change into the unitycatalog
directory and run bin/start-uc-server
to instantiate the server. Here is what you
should see:
###################################################################
# _ _ _ _ _____ _ _ #
# | | | | (_) | / ____| | | | | #
# | | | |_ __ _| |_ _ _ | | __ _| |_ __ _| | ___ __ _ #
# | | | | '_ \| | __| | | | | | / _` | __/ _` | |/ _ \ / _` | #
# | |__| | | | | | |_| |_| | | |___| (_| | || (_| | | (_) | (_| | #
# \____/|_| |_|_|\__|\__, | \_____\__,_|\__\__,_|_|\___/ \__, | #
# __/ | __/ | #
# |___/ v0.2.0 |___/ #
###################################################################
The server can be started by issuing the below command from the project root directory:
bin/start-uc-server
!!! note "Running Unity Catalog Server on a specific port"
To run the server on a specific port, use the -p
or --port
option followed by the port number:
```sh title="Use -p or --port to specify your port"
bin/start-uc-server -p <port_number>
bin/start-uc-server -port <port_number>
```
If no port is specified, the server defaults to port **8080**.
The server config file is at the location etc/conf/server.properties
(relative to the project root).
server.env
: The environment in which the server is running. This can be set todev
ortest
. When set totest
the server will instantiate an empty in-memory h2 database for storing metadata. If set todev
, the server will use the fileetc/db/h2db.mv.db
as the metadata store. Any changes made to the metadata will be persisted in this file.
For enabling server to vend AWS temporary credentials to access S3 buckets (for accessing External tables/volumes), the following parameters need to be set:
s3.bucketPath.i
: The S3 path of the bucket where the data is stored. Should be in the formats3://<bucket-name>
.s3.accessKey.i
: The AWS access key, an identifier of temp credentials.s3.secretKey.i
: The AWS secret key used to sign API requests to AWS.s3.sessionToken.i
: THE AWS session token, used to verify that the request is coming from a trusted source.
You can configure multiple buckets by incrementing the index i in the above parameters. The starting index should be 0.
All the above parameters are required for each index. For vending temporary credentials, the server matches the bucket path in the table/volume storage_location with the bucket path in the configuration and returns the corresponding access key, secret key, and session token.
Any params that are not required can be left empty.
The server logs are located at etc/logs/server.log
. The log level and log rolling policy can be set in log4j2 config
file: etc/conf/server.log4j2.properties
.
Please refer to Authentication and Authorization section for more information.