From b059b88082253aea6e2bb06b7f91418269b33eeb Mon Sep 17 00:00:00 2001 From: Phil Rzewski Date: Thu, 19 Oct 2023 16:40:29 -0700 Subject: [PATCH] Add docs related to zed/4758 changes --- cmd/zed/use/command.go | 6 +++--- docs/commands/zed.md | 40 ++++++++++++++++++++++++++++++---------- 2 files changed, 33 insertions(+), 13 deletions(-) diff --git a/cmd/zed/use/command.go b/cmd/zed/use/command.go index 6923aeecb7..424c498d78 100644 --- a/cmd/zed/use/command.go +++ b/cmd/zed/use/command.go @@ -14,7 +14,7 @@ import ( var Cmd = &charm.Spec{ Name: "use", Usage: "use [pool][@branch]", - Short: "use a branch", + Short: "use a branch, or print the branch and lake currently in use", Long: ` The use command prints or sets the working pool and branch. Setting these values allows commands like load, rebase, merge, etc. to function without @@ -22,7 +22,7 @@ having to specify the working branch. The branch specifier may also be a commit ID, in which case you enter a headless state and commands like load that require a branch will report an error. -The use command is like "git checkuout" but there is no local copy of +The use command is like "git checkout" but there is no local copy of the lake data. Rather, the local HEAD state influences commands as they access the lake. @@ -52,7 +52,7 @@ file ~/.zed_head. This file simply contains a pointer to the HEAD branch and thus provides the default for the -use option. This way, multiple working directories can contain different HEAD pointers (along with your local files) and you can easily switch between windows without having to continually -re-specify a new HEAD. Unlike Git, all the commited pool data remains +re-specify a new HEAD. Unlike Git, all the committed pool data remains in the lake and is not copied to this local directory. `, New: New, diff --git a/docs/commands/zed.md b/docs/commands/zed.md index c9f420f9ae..e73a5c43e0 100644 --- a/docs/commands/zed.md +++ b/docs/commands/zed.md @@ -74,7 +74,7 @@ components come together. While the CLI-first approach provides these benefits, all of the functionality is also exposed through [an API](../lake/api.md) to a Zed service. Many use cases involve an application like -[Zui](https://github.com/brimdata/zui) or a +[Zui](https://zui.brimdata.io/) or a programming environment like Python/Pandas interacting with the service API in place of direct use with the `zed` command. @@ -130,18 +130,17 @@ model onto a file system so that Zed lakes can also be deployed on standard file The `zed` command provides a single command-line interface to Zed lakes, but different personalities are taken on by `zed` depending on the particular -sub-command executed and the disposition of its `-lake` option -(which defaults to the value of `ZED_LAKE` environment variable or, -if `ZED_LAKE` is not set, to the client personality `https://localhost:9867`). +sub-command executed and where the [lake is located](#locating-the-lake). To this end, `zed` can take on one of three personalities: + * _Direct Access_ - When the lake is a storage path (`file` or `s3` URI), then the `zed` commands (except for `serve`) all operate directly on the lake located at that path. * _Client Personality_ - When the lake is an HTTP or HTTPS URL, then the lake is presumed to be a Zed lake service endpoint and the client commands are directed to the service managing the lake. -* _Server Personality_ - When the `zed serve` command is executed, then +* _Server Personality_ - When the [`zed serve`](#serve) command is executed, then the personality is always the server personality and the lake must be a storage path. This command initiates a continuous server process that serves client requests for the lake at the configured storage path. @@ -167,6 +166,25 @@ all adhere to the consistency semantics of the Zed lake. > a server instance of `zed` on the same file system and data consistency will > be maintained. +### Locating the Lake + +At times you may want the Zed CLI tools to access the same lake storage +used by other tools such as [Zui](https://zui.brimdata.io/). To help +enable this by default while allowing for separate lake storage when desired, +`zed` checks each of the following in order to attempt to locate an existing +lake. + +1. The contents of the `-lake` option (if specified) +2. The contents of the `ZED_LAKE` environment variable (if defined) +3. A `zed` subdirectory below a path in the + [`XDG_DATA_HOME`](https://specifications.freedesktop.org/basedir-spec/basedir-spec-latest.html) + environment variable (if defined) +4. A Zed lake service running locally at `http://localhost:9867` (if a socket + is listening at that port) +5. A default file system location based on detected OS platform: + - `%LOCALAPPDATA%\zed` on Windows + - `$HOME/.local/share/zed` on Linux/macOS + ### Data Pools A lake is made up of _data pools_, which are like "collections" in NoSQL @@ -415,8 +433,8 @@ without confirmation. zed init [path] ``` A new lake is initialized with the `init` command. The `path` argument -is a [storage path](#storage-layer) and is optional. If not present, -the path is taken from the `ZED_LAKE` environment variable, which must be defined. +is a [storage path](#storage-layer) and is optional. If not present, a path +is determined by attempting to automatically [locate the lake](#locating-the-lake). If the lake already exists, `init` reports an error and does nothing. @@ -712,7 +730,7 @@ pool ``, which may be referenced by its ID or its previous name. zed serve [options] ``` The `serve` command implements Zed's server personality to service requests -from instances of Zed's client personality. +from instances of Zed's client [personality](#zed-command-personalities). It listens for Zed lake API requests on the interface and port specified by the `-l` option, executes the requests, and returns results. @@ -727,8 +745,10 @@ is recommended. zed use [] ``` The `use` command sets the working branch to the indicated commitish. -When run without a commitish argument, it displays the current commitish -in use. +When run without a commitish argument, it displays + +1. The current commitish in use, and, +2. The current [lake location](#locating-the-lake). For example, ```