docs: Recommend Overlord-based auto-compaction and mark useIncrementalCache production ready by cecemei · Pull Request #19252 · apache/druid

cecemei · 2026-04-01T19:56:48Z

Description

Rename CoordinatorRunStats to DruidRunStats
Some changes in automatic compaction documentation and default configuration settings.

Release note

Automatic Compaction

Overlord-based compaction supervisors are now the recommended and default approach for automatic compaction. This method provides better reactivity, MSQ task engine support, and easier management through the supervisor framework. Coordinator-based auto-compaction remains available as an alternative.

Incremental cache

Incremental segment metadata cache (useIncrementalCache) is no longer experimental and defaults to ifSynced.

This PR has:

capistrant · 2026-04-02T14:26:13Z

The default for useSupervisors should be true in the cluster compaction config if we are recommending it going forward. that way all new deploys will get the recommended config

cecemei · 2026-04-07T01:10:35Z

useSupervisors

updated default to true, PTAL!

clintropolis · 2026-04-08T18:12:42Z

docs/configuration/index.md

-|`druid.manager.rules.pollDuration`|The duration between polls the Coordinator does for updates to the set of active rules. Generally defines the amount of lag time it can take for the Coordinator to notice rules.|`PT1M`|
-|`druid.manager.rules.defaultRule`|The default rule for the cluster|`_default`|
-|`druid.manager.rules.alertThreshold`|The duration after a failed poll upon which an alert should be emitted.|`PT10M`|
+|Property|Description| Default    |


what's up with all the unrelated formatting changes?

clintropolis · 2026-04-08T19:02:04Z

server/src/main/java/org/apache/druid/server/coordinator/stats/DruidRunStats.java

 */
 @ThreadSafe
-public class CoordinatorRunStats
+public class DruidRunStats


I can't help but wonder if there is a better name for this, maybe DutyRunStats or something that means 'thing to collect stuff to emit later from regularly occurring internal chores'? I guess 'duty' isn't quite right because that is basically only used to refer to periodic coordinator tasks, not supervisor stuff. The javadoc still only mentions coordinator run/duties, which should be fixed.

Also, it is kind of weird having the name DruidRunStats but the thing it has is still called CoordinatorStat, it seems like that naming should be changed to reflect the change here.

Stepping back, what exactly is the motivation for renaming, i guess that compaction uses and runs as a supervisor now so it isn't really specific to the coordinator? While this is still used quite heavily by all of the things the coordinator does, it seems reasonable to give it a more generic name of some sort, I was just wondering if this one is a bit too generic, but maybe is fine too as long as the javadoc clarifies its purpose?

Some addtional thoughts: I believe some of us would like to eventually merge the coordinator and overlord into a single service. Since they both now basically need a heavy segment timeline and so have similar footprint requirements, there aren't a lot of compelling reasons to keep them separate anymore. In my mind 'coordinator' would be the remaining service, with all of the overlords functionality merged into it (though this hasn't really been discussed so maybe other people have other opinions), so if that were true then this would basically become something only used by the coordinator again heh. There is a lot of work to do for something like this, so it is not really a short term goal afaik and needs more official discussion at some point, just adding it here for additional stuff to think about.

clintropolis · 2026-04-08T19:27:43Z

docs/data-management/automatic-compaction.md

+* Can use either the native compaction engine or the [MSQ task engine](#use-msq-for-auto-compaction)
+* More reactive and submits tasks as soon as a compaction slot is available
+* Tracked compaction task status to avoid re-compacting an interval repeatedly
+* Uses new Indexing State Fingerprinting mechanisms to store less data per segment in metadata storage


i know this isn't new, but by default we still store compaction state afaict (ClusterCompactionConfig.storeCompactionStatePerSegment defaults to true), so this should be reworded to be like 'can be configured to store only fingerprints' or whatever

clintropolis · 2026-04-08T21:10:44Z

docs/data-management/cascading-reindexing.md

- **MSQ compaction engine**: Set `engine` to `msq` in the compaction dynamic config or in the supervisor spec.
- **Incremental segment metadata caching**: Set `druid.manager.segments.useIncrementalCache` to `always` or `ifSynced` in your Overlord and Coordinator runtime properties. See [Segment metadata caching](../configuration/index.md#metadata-retrieval).
- **At least two compaction task slots**: The MSQ task engine requires at least two tasks (one controller, one worker).
-


i think we need to leave part of this, no? the default engine is still native, so people still need to set engine to msq, and you need 2 compaction slots since its using msq engine

cecemei added 2 commits April 1, 2026 12:24

auto

0ef3d9d

DruidRunStats

d375136

github-actions bot added Area - Documentation Kubernetes Area - Ingestion labels Apr 1, 2026

cecemei changed the title ~~Auto~~ docs: Recommend Overlord-based auto-compaction and mark useIncrementalCache production ready Apr 1, 2026

cecemei added 2 commits April 1, 2026 14:11

doc

4b1b728

check

7323706

cecemei added the Release Notes label Apr 1, 2026

cecemei marked this pull request as ready for review April 1, 2026 23:52

Merge branch 'master' into auto

f7ea749

cecemei added 5 commits April 2, 2026 16:23

default

eefeeaf

legacy

d4e45c0

Merge remote-tracking branch 'origin/master' into auto

e072603

default-native

fb8e563

format

fef9255

Merge remote-tracking branch 'origin/master' into auto

cdd3dac

cecemei added this to the 37.0.0 milestone Apr 7, 2026

clintropolis reviewed Apr 8, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: Recommend Overlord-based auto-compaction and mark useIncrementalCache production ready#19252

docs: Recommend Overlord-based auto-compaction and mark useIncrementalCache production ready#19252
cecemei wants to merge 11 commits intoapache:masterfrom
cecemei:auto

cecemei commented Apr 1, 2026 •

edited

Loading

Uh oh!

capistrant commented Apr 2, 2026

Uh oh!

cecemei commented Apr 7, 2026

Uh oh!

clintropolis Apr 8, 2026

Uh oh!

clintropolis Apr 8, 2026

Uh oh!

clintropolis Apr 8, 2026

Uh oh!

clintropolis Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

cecemei commented Apr 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Release note

Automatic Compaction

Incremental cache

Uh oh!

capistrant commented Apr 2, 2026

Uh oh!

cecemei commented Apr 7, 2026

Uh oh!

clintropolis Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

clintropolis Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

clintropolis Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

clintropolis Apr 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cecemei commented Apr 1, 2026 •

edited

Loading