Adaptive Binning values.
Note that adaptive binning will take a set of events, create a histogram on every parameter being clustered for those events, and then examine if any of those histograms could be divided usefully. The following parameters control how this histogram examination is performed to divide events further and further into smaller and smaller bins.
Minimum Separation Channel:
![]()
Clustering will not divide any population below this fraction of the total scale of a parameter. Thus, a value of "0.25" means don't divide any subset within the first decade (on a 4-decade log scale). Setting this value too high will not allow the platform to resolve populations near the low end of the scale. Too low, and it may create "biologically-irrelevant" populations.
Range of values: 0 - 1
Default value: 0.03 (roughly, 1/8th of a decade in a 4-decade scale).
Other useful values: 0.06, 0.125, 0.25
Histogram Resolution:
![]()
The number of channels used for histogramming. Channels are equally spaced (linear parameter), or log-spaced (log parameters), exactly as for creating displays. Setting this value too high could cause subsets to be "orphaned" during the "shaving" of empty space off of histograms during adaptive binning... this could lead to clusters that cannot be joined. In addition, the clustering algorithm will take longer with larger values. Setting this value too low will lead to an inability to distinguish clusters.
Range of values: None (However, less than 16 is probably useless, as is more than 1024)
Default value: 256.
Other useful values: 128, 64.
Minimum Shave Fraction:
![]()
During adaptive binning, one major criterion for dividing a distribution into bins is whether or not there is "empty" space at one end of the histogram. Such empty space will be "shaved off" into a separate bin from the data. Since bins that are not physically adjacent can never be re-joined into a cluster, shaving too aggressively can lead to the inability to rejoin event-containing bins into real clusters. For this clustering algorithm, the shaving step is one of the most critical.
This value specifies how much of a distribution in a given parameter must be devoid of events before the platform considers "shaving" it off. Thus, a value of "0.1" means that 10% of the distribution's width (either upper or lower end) must be completely devoid of events before shaving is considered. A value that is too small will lead to too many orphaned clusters. A value that is too large will not allow sufficient resolution to segregate otherwise close clusters.
Range of values: 0 - 1
Default value: 0.1.
Minimum Shave Channels:
![]()
See above discussion on "Minimum Shave Fraction." In addition to that criterion, the platform will never shave off any empty space if it contains less than a fixed number of channels, which is this value times the histogram resolution. In other words, this value defines the smallest empty space that can be created during binning.
Range of values: 0 - 1
Default value: 0.063 (approximately 1/4th decade on a log scale)
Other useful values: Smaller, probably as low as 0.01 might be useful.
Maximum Value Height Ratio:
![]()
During adaptive binning, the other criterion for dividing a distribution that is always checked is whether or not there is a bimodal distribution. This value determines what kind of distribution is considered bimodal. If a valley exists between two peaks, then the distribution is bimodal. A valley must be no higher than this value times the lower of the two peaks. (In other words, a value of 0.5 means that the valley must be no higher than half of the height of the lower peak). If there is no point in the distribution where this exists, then the distribution is not considered bimodal. A value that is too small will prevent the platform from finding useful peaks in a distribution. A value that is too high will divide events seemingly randomly.
Range of values: 0 - 1
Default value: 0.5
Useful value: 0.75
Separation: Valley Weighting
![]()
Separation: Even Division Weighting
![]()
If, for any given set of events, there is more than one parameter that is bimodal, then these two values determine which one better separates the events. Since adaptive binning is iterative, it probably doesn't matter which parameter is first divided; the other will be divided shortly. Nonetheless, these two parameters provide the relative weighting to compare two such divisions. A larger value on valley weighting means that emphasis is place more on how deep a valley is between the two peaks. A larger value on the "Ev Division" means that emphasis is placed on divisions that more evenly divide the events by number.
Range of values: 0 - infinity
Default values: 1 and 10
Useful values: 10 and 1

