Skip to main content

Other Aggregate Settings

You can view and modify the following model-level aggregate settings.

Aggregate partition for BigQuery

  • aggregate.partition.bigquery.range.end: The upper bound for integer value partitioning on BigQuery. The default value is 10000.
  • aggregate.partition.bigquery.range.interval: The interval for integer value partitioning on BigQuery. The default value is 10.
  • aggregate.partition.bigquery.range.start: The lower bound for integer value partitioning on BigQuery. The default value is 0.

Aggregate batch

  • aggregates.batch.buildFromExisting.enabled: Build aggs from aggs for aggregate batch rebuilds. The default value is true.

  • aggregates.batch.buildFromExisting.threshold: Use the optimized algorithm only when the number of aggregates in a batch are less than this threshold. The default value is 100.

  • aggregates.batch.cube.attributes.threshold:Use the optimized algorithm only when the number of attributes in a model are less than this threshold. The default value is 250.

  • aggregates.batch.cubeDataRequests.parallelism: Number of model data requests that can be made in parallel. The default value is 2.

  • aggregates.batch.retry.maxAttemptsPerAggregate: The maximum number of reattempts to build a single aggregate during a single batch build. This number cannot exceed the value of aggregates.batch.retry.maxattemptsperbatch. The default value is 3.

  • aggregates.batch.retry.maxAttemptsPerBatch: The maximum number of reattempts to build aggregates during a single batch build. The default value is 5.

  • aggregates.batch.reuseOrderingGraph.enabled: Use the existing ordering graph if available for rebuilding aggregates. The default value is true.

  • aggregates.batch.max.failures: Maximum number of failures for a batch build, before the whole batch fails. The default value is 0.

    Setting a value higher than 0 enables failure tolerance for aggregate batch builds. For example, assume you have set this number to 5, and 3 instances in a batch build fail. The rest of the instances would be built successfully. The 3 failed aggregate instances would be moved to a list. At the end of aggregate batch, the system would come back to the list of failed aggregates and would retry to build them, as configured in the retry settings (see above).

Large aggregate tables

  • aggregates.largeTableOptimization.distributionKeyColumn.minimumCardinality: The minimum cardinality required for the highest cardinality dimensional attribute to be used as a table distribution or clustering key. The default value is 30.
  • aggregates.largeTableOptimization.enabled: Enables consideration of optimizations for large aggregate tables, such as column-based clustering or distribution. The default value is False.
  • aggregates.largeTableOptimization.minimumEstimatedRows: The minimum estimated row count required for the aggregate system to consider applying optimizations, such as clustering or distribution. The default value is 100000.

Aggregate creation schedules

  • aggregates.new.build.scheduled: When true, new aggregate instance builds are postponed until the next scheduled batch build for the model. If false (default), new aggregate instances may be queued for building at any time.
  • aggregates.uda.build.scheduled: When false (default), new user-defined aggregate build instances are queued for building following the project publish event. If true, UDA builds are postponed until the next scheduled batch build for the model.
  • aggregates.predictionDefined.build.scheduled: When false (default), new prediction-defined aggregate build instances are queued for building following the project publish event. If true, PDA builds are postponed until the next scheduled batch build for the model.

Other aggregate settings

  • aggregates.dataWarehouseCacheTableRequests.enabled: Whether to cache data warehouse table requests. The default value is True.
  • aggregates.dataWarehouseCacheTableRequests.maximumRowCount: The maximum number of rows for a data warehouse table request to be cacheable, if caching is enabled. The default value is 50000.
  • aggregates.dimensional.allowJoinsToSecondaries.enabled: Whether or not to allow joins to secondary attributes to be filtered out for completeness testing. The default value is True.
  • aggregates.dimensional.build: Set to True (default) to allow the engine to create aggregates that contain dimensional attributes only. Such aggregates can be useful in Tableau for queries against fact tables that contain degenerate dimensions.
  • aggregates.slowBuild.cutoff: The duration cutoff for a completed aggregate build query to emit a SlowAggEvent. The default value is 4 seconds.
  • aggregates.smallTableReplication.enabled: Whether to consider applying data replication to system-defined aggregate tables. The default value is False.
  • aggregates.smallTableReplication.factBasedAggs.enabled: Whether to allow aggregate table replication on fact-based aggregates. The default value is False.
  • aggregates.smallTableReplication.maximumEstimatedRows: The maximum estimated number of rows that an aggregate table can have to be considered for replication. The default value is 1000.
  • aggregates.systemGenerated.activeInstance.extraAllowance: The maximum number of additional system-defined aggregates that are temporarily permitted to be active when a model has reached its retention limit. The default value is 2.
  • aggregates.systemGenerated.activeInstance.retentionLimit: The maximum number of active system-defined aggregates to retain per model. This is also known as the Retention Limit. The default value is 20.
  • aggregates.systemGenerated.withDistinctCountMeasure.retentionPercentage: The maximum number of active aggregates with distinct count metrics per model is calculated as this percentage of the value of aggregates.systemGenerated.activeInstance.retentionLimit for each model. This pool of system-defined aggregate tables is sized and counted separately from the regular system-defined aggregate table pool. The default value is 40.
  • aggregates.withDistinctCounts.widening.enabled: Allows widening of distinct count aggregates with other metrics, including distinct counts. This setting requires aggregates.create.widening.enabled to be set to True, for details see Settings that are Related to Narrowing and Widening in Global Settings for System-Defined Aggregates Only.

More information