Property | Description |
---|---|
storage | The maximum storage space the table is allowed to use, before replication.
For example, in the above table, the storage is 140G and replication is
3. Therefore, the maximum storage the table is allowed to use is 140*3=420G.
The space used by the table is calculated by adding up the sizes of all
segments from every server hosting this table. Once this limit is reached,
offline segment push throws a 403 exception with message, Quota check failed for segment: segment_0 of table: pinotTable . |
maxQueriesPerSecond |
The maximum queries per second allowed to execute on this table. If query
volume exceeds this, a will be sent, and a BrokerMetric |
Property | Description |
---|---|
segmentPrunerTypes |
The list of segment pruners to be enabled. The segment pruner prunes the selected segments based on the query. Supported
values currently are
|
instanceSelectorType | The instance selector selects server instances to serve the query based
on selected segments. Supported values are
balanced - balances the number of segments served by each
selected instance. Default.
replicaGroup - instance selector for replica group routing
strategy.
For more details on how to configure this check out Querying All Servers |
Property | Description |
---|---|
schemaName | Name of the schema associated with the table |
timeColumnName | The name of the time column for this table. This must match with the time
column name in the schema. This is mandatory for tables with push type APPEND ,
optional for REFRESH. timeColumnName along with timeColumnType
is used to manage segment retention and time boundary for offline vs realtime. |
replication | Number of replicas |
retentionTimeUnit | Unit for the retention. e.g. HOURS, DAYS. This in combination with retentionTimeValue
decides the duration for which to retain the segments e.g. 365 DAYS in
the example means that segments containing data older than 365 days will
be deleted periodically. This is done by the RetentionManager Controller
periodic task. By default, no retention is set. |
retentionTimeValue | A numeric value for the retention. This in combination with retentionTimeUnit decides the duration for which to retain the segments |
segmentPushType
(Deprecated starting 0.7.0 or commit 9eaea9. Use IngestionConfig -> BatchIngestionConfig -> segmentPushType ) |
This can be either
APPEND - new data segments pushed periodically, to append
to the existing data eg. daily or hourly
REFRESH - the entire data is replaced every time during a
data push. Refresh tables have no retention. |
segmentPushFrequency
(Deprecated starting 0.7.0 or commit 9eaea9. Use IngestionConfig -> BatchIngestionConfig -> segmentPushFrequency ) |
The cadence at which segments are pushed eg. HOURLY , DAILY
|
Property | |
---|---|
name | name of the column |
encodingType | Should be one of RAW or DICTIONARY
|
indexType | index to create on this column. currently only TEXT is supported. |
properties |
JSON of key-value pairs containing additional properties associated with the index. The following properties are supported currently -
|
Property | Description |
---|---|
streamType | only kafka is supported at the moment |
stream.[streamType].consumer.type | should be one of lowLevel or highLevel . See Stream ingestion for
more details |
stream.[streamType].topic.name | topic or equivalent datasource from which to consume data |
stream[streamType].consumer.prop.auto.offset.reset | offset to start consuming data from. Should be one of smallest , largest or
a timestamp in millis |
(0.6.0 onwards) realtime.segment.flush.threshold.rows (0.5.0 and prior) (deprecated) |
Maximum number of rows to consume before persisting the consuming segment. Default is 5000000 |
realtime.segment.flush.threshold.time | Maximum elapsed time after which a consuming segment should be persisted.
The value can be set as a human readable string, such as 1d , 4h30m
Default is 6 hours. |
(0.6.0 onwards) realtime.segment.flush.threshold.segment.size (0.5.0 and prior) (deprecated)
|
Desired size of the completed segments. This value can be set as a human
readable string such as 150M , or 1.1G , etc. This
value is used when realtime.segment.flush.threshold.size is
set to 0. Default is 200M i.e. 200 MegaBytes |
realtime.segment.flush.autotune.initialRows |
Initial number of rows for learning. This value is used only if Default is |