binny.api.nz_tomography module#

Tools for building tomographic redshift bins.

This module provides a simple interface for constructing tomographic redshift bins using either photometric or spectroscopic redshifts. Bins can be built from survey presets, configuration mappings, or directly from user-supplied redshift distributions.

Once bins are built, the results are kept internally so that related quantities such as population statistics or bin-to-bin comparisons can be computed without repeating the setup.

class binny.api.nz_tomography.NZTomography#

Bases: object

Builds and caches tomographic n(z) bins with a small public API.

An instance caches the most recent parent distribution (z, nz), the tomography settings used to build the bins, the constructed bin curves, and optional metadata. This lets follow-up calls (stats, comparisons, combo filtering) reuse the same build without re-parsing inputs.

Initialize an empty cache.

between_sample_stats(other: NZTomography, *, overlap: Mapping[str, Any] | None = None, pairs: Mapping[str, Any] | None = None, interval_mass: Mapping[str, Any] | None = None, pearson: Mapping[str, Any] | None = None) → dict[str, Any]#

Compute a bundle of between-sample comparison and diagnostic matrices.

This returns quantitative comparisons between bins from two different tomographic samples, such as lens and source bins used in galaxy-galaxy lensing. The returned quantities can be used to assess overlap, similarity, ordering, and interval-based contamination across sample families.

Parameters:

other – Second tomography instance to compare against. Both instances must be built on the same redshift grid.
overlap – Keyword arguments for between-sample overlap metrics, or None to skip.
pairs – Keyword arguments for between-sample pair summaries, or None to skip.
interval_mass – Keyword arguments for interval-mass computation, or None to skip. Must include "target_edges".
pearson – Keyword arguments for between-sample Pearson correlation computation, or None to skip.

Returns:

A dictionary containing a subset of keys "overlap", "correlations", "interval_mass", and "pearson".

Raises:

ValueError – If bins are not cached, if the two instances do not share the same redshift grid, or if interval-mass computation is requested without providing target_edges.

bin_combo_filter(spec: Mapping[str, Any], other: NZTomography | None = None) → list[tuple[int, ...]]#

Select bin-index combinations using a combo-filter specification.

This is a high-level entry point for “give me the bin pairs/tuples that satisfy these criteria.” It constructs a BinComboFilter on the cached bins (optionally using a second tomography instance for cross-sample comparisons), applies the provided selection spec, and returns the resulting index tuples.

Parameters:

spec – Selection specification for topology + ordered filters.
other – Optional second tomography instance for cross-sample selections.

Returns:

A list of index tuples representing the selected bin combinations.

Raises:

ValueError – If bins are not cached, or if cross-sample filtering is requested but the two instances do not share the same redshift grid.
KeyError – If the selection spec references an unknown topology, filter, or metric kernel.
TypeError – If the selection spec has an unexpected structure.

property bin_keys: list[int]#

Bin indices for the most recent build, in mapping insertion order.

Returns:: List of integer bin indices in the same order they appear in bins.
Raises:: ValueError – If bins have not been built yet (call build_bins()).

property bins: Mapping[int, ndarray]#

Mapping of bin index to bin curve from the most recent build.

Returns:: Mapping from integer bin index to the corresponding bin n(z) array.
Raises:: ValueError – If bins have not been built yet (call build_bins()).

Builds tomographic bins from a config, mapping, or explicit arrays.

Given either a survey configuration or direct (z, nz) arrays, this method constructs the corresponding tomographic bins and returns the resulting bin distributions along with the relevant inputs.

Exactly one of the following input modes must be used: - config_file=... (optionally with key=...), - cfg=... (an in-memory config mapping), - (z=..., nz=..., tomo_spec=...) (explicit arrays plus a spec mapping).

Parameters:

config_file – Path to a YAML config file. If provided, the tomography entry is selected from the file (optionally using key).
cfg – In-memory config mapping using the same schema as the YAML files.
z – True-redshift grid. Used only in the explicit arrays mode.
nz – Parent distribution evaluated on z. Used only in the explicit arrays mode.
tomo_spec – Tomography specification mapping for the explicit arrays mode.
key – Optional entry key when selecting an item from a config file.
role – Optional selector for a tomography entry (e.g., “lens” or “source”).
year – Optional selector for a tomography entry (e.g., “1”, “10”).
kind – Optional tomography kind override (“photoz” or “specz”).
scenario – Optional selector for a tomography entry, useful for survey variants such as “hls_optimistic”, “hls_conservative”, or “wide”.
sample – Optional selector for a tomography entry, useful for sample labels such as “lrg” or “elg”.
overrides – Optional mapping of values merged into the resolved tomography spec. Nested mappings are merged shallowly at the first level.
include_survey_metadata – Whether to include survey-level metadata in the output when building from a config file or mapping.
include_tomo_metadata – Whether to request tomography metadata from the builder.
include_photoz_diagnostics – Whether to include photo-z assignment probabilities and observed-bin diagnostic metadata in the output.

Returns:

A TomographyBins object containing the true-redshift grid, parent distribution, resolved tomography spec, constructed bins, and optional tomography and survey metadata.

Raises:

ValueError – If inputs are inconsistent, no unique entry can be selected, the tomography spec is invalid, or the requested kind is unsupported.

build_survey_bins(survey: str, *, role: str | None = None, year: Any | None = None, scenario: Any | None = None, sample: Any | None = None, z: Any | None = None, overrides: Mapping[str, Any] | None = None, include_survey_metadata: bool = False, include_tomo_metadata: bool = False, include_photoz_diagnostics: bool = False, config_file: str | Path | None = None) → TomographyBins#

Builds tomographic bins from a built-in survey preset.

This method loads the configuration for a given survey preset, builds the corresponding tomographic bins, and returns the resulting bin distributions.

Parameters:

survey – Survey preset name (case-insensitive), resolved to a shipped YAML file.
role – Optional selector for a tomography entry (e.g., “lens” or “source”).
year – Optional selector for a tomography entry (e.g., “1”, “10”).
z – Optional override for the config-defined redshift grid.
overrides – Optional mapping merged into the resolved tomography spec.
include_survey_metadata – Whether to include survey-level metadata in the output.
include_tomo_metadata – Whether to include tomography metadata in the output.
include_photoz_diagnostics – Whether to include photo-z assignment probabilities and observed-bin diagnostic metadata in the output.
config_file – Optional explicit path to a survey-spec YAML file. If provided, it is used instead of resolving a shipped preset.

Returns:

A TomographyBins object, which provides convenience methods such as shape_stats() and population_stats().

Raises:

FileNotFoundError – If the requested preset name does not resolve to a shipped file.
ValueError – If the selected entry is invalid or required metadata is unavailable.

static calibrate_psf_depth_from_mock(z_true: ndarray, mag: ndarray, r_gal: ndarray, *, maglims: ndarray, r_psf_values: ndarray, area_deg2: float, z_edges: ndarray, r_min: float = 0.3, selection_kind: Literal['hard', 'sigmoid'] = 'sigmoid', width: float = 0.05, normalize_nz: bool = True) → dict[str, Any]#

Run an end-to-end calibration of PSF-dependent source selection.

This routine estimates redshift distributions and effective source densities from a mock catalog after applying survey depth and PSF-dependent shear-selection cuts.

Parameters:

z_true – True redshifts of galaxies in the mock catalog.
mag – Apparent magnitudes of the same galaxies.
r_gal – Galaxy sizes of the same galaxies.
maglims – Limiting magnitudes to evaluate.
r_psf_values – PSF sizes to evaluate.
area_deg2 – Survey or mock area in square degrees.
z_edges – Redshift-bin edges used to histogram selected samples.
r_min – Minimum resolution factor required for shear selection.
selection_kind – Selection model used for PSF-dependent weights.
width – Width of the sigmoid transition for smooth selection.
normalize_nz – Whether to normalize each redshift distribution.

Returns:

Dictionary containing the PSF-depth calibration grid, redshift distributions, and effective source densities.

static calibrate_smail_from_mock(z_true: ndarray, mag: ndarray, *, maglims: ndarray, area_deg2: float, weights: ndarray | None = None, infer_alpha_beta_from: Literal['deep_cut', 'all_selected_at_maglim'] = 'deep_cut', alpha_beta_maglim: float | None = None, z_max: float | None = None) → dict[str, Any]#

Run an end-to-end calibration of survey depth scaling relations.

This routine performs a complete calibration of the relations linking survey limiting magnitude to both the redshift distribution and the galaxy number density of a sample.

The procedure estimates the shape parameters of the Smail redshift distribution from a representative galaxy sample, calibrates how the redshift scale parameter varies with magnitude limit, and measures the corresponding galaxy number density.

The resulting calibration can be used to construct realistic analytic redshift distributions and galaxy densities for survey forecasts.

Parameters:

z_true – True redshifts of galaxies in the mock catalog.
mag – Apparent magnitudes of the same galaxies.
maglims – Limiting magnitudes defining magnitude-limited samples.
area_deg2 – Survey area of the mock catalog in square degrees.
weights – Optional non-negative weights for fitting the Smail redshift distribution.
infer_alpha_beta_from – Strategy used to determine the shape parameters of the Smail distribution.
alpha_beta_maglim – Magnitude limit defining the deep sample used to infer the Smail shape parameters.
z_max – Maximum redshift included when fitting redshift distributions.

Returns:

Dictionary containing the calibrated Smail parameters, the fitted z0–magnitude relation, and the galaxy density calibration.

clear() → None#: Clears the cached parent distribution, spec, bins, and metadata.

static combine_parent_nz(samples: Sequence[Mapping[str, Any]], *, interpolate: bool = False, z_target: ndarray | None = None) → tuple[ndarray, ndarray]#: Combine parent redshift distributions into one parent sample.

static combine_tomography_bins(samples: Sequence[TomographyBins], *, interpolate: bool = False, z_target: ndarray | None = None) → TomographyBins#: Combine matching tomographic samples into one TomographyBins object.

cross_bin_stats(*, overlap: Mapping[str, Any] | None = None, pairs: Mapping[str, Any] | None = None, leakage: Mapping[str, Any] | None = None, pearson: Mapping[str, Any] | None = None) → dict[str, Any]#

Compute a bundle of cross-bin similarity and diagnostic matrices.

This returns quantitative comparisons between tomographic bins, such as how much two bins overlap in redshift, how strongly their shapes co-vary, and (optionally) how much probability mass leaks across a set of bin edges.

Parameters:

overlap – Keyword arguments for overlap metrics, or None to skip.
pairs – Keyword arguments for pairwise overlap summaries, or None to skip.
leakage – Keyword arguments for leakage matrix computation. Must include "bin_edges" (the bin edges corresponding to the bins), or None to skip.
pearson – Keyword arguments for Pearson correlation computation, or None to skip.

Returns:

A dictionary containing a subset of keys "overlap", "correlations", "leakage", and "pearson".

Raises:

ValueError – If bins are not cached, or if leakage is requested without providing bin_edges.

static fit_smail_from_mock(z_true: ndarray, *, weights: ndarray | None = None, z_max: float | None = None, min_n: int = 200) → dict[str, Any]#: Infer Smail parameters from a mock redshift sample.

static list_nz_models() → list[str]#

Lists available registered n(z) model names.

Returns the set of model identifiers currently registered with the n(z) registry. This is useful for discovery and for validating configuration inputs.

Returns:: A sorted list of registered n(z) model names.

static list_surveys() → list[str]#

List available built-in survey preset names.

Returns:: A sorted list of available survey preset names, such as "lsst", "desi", or "roman".

static load_survey_config(survey: str) → dict[str, Any]#

Load a built-in survey configuration by survey name.

Parameters:: survey – Survey preset name, such as "lsst", "desi", or "roman".
Returns:: Parsed survey configuration dictionary.

property nz: ndarray#

Parent n(z) from the most recent build.

Returns:: The cached parent distribution evaluated on z.
Raises:: ValueError – If no build has been performed yet.

static nz_model(name: str, z: Any, /, **params: Any) → ndarray#

Computes a registered parent n(z) model on a redshift grid.

This is a convenience wrapper around the n(z) registry. It computes the named model on the provided redshift grid and returns the resulting array.

Parameters:

name – Name of a registered n(z) model.
z – Redshift grid (array-like) to compute the model on.
**params – Model-specific parameters passed to the registered implementation.

Returns:

Model values on z.

Raises:

ValueError – If the model name is unknown or the provided parameters are invalid for the requested model.

population_stats(**kwargs: Any) → dict[str, Any]#

Compute population statistics for the cached bins.

Population statistics describe how galaxies are distributed across bins, including quantities related to bin occupancy and normalization relative to the parent redshift distribution.

Parameters:: **kwargs – Additional keyword arguments forwarded to the underlying stats function.
Returns:: A dictionary of population statistics computed from the cached bins and cached tomography metadata.
Raises:: ValueError – If bins have not been built, or if tomography metadata is not available (rebuild with include_tomo_metadata=True).

static sample_bin_labels(samples: Mapping[str, TomographyBins]) → list[tuple[str, int]]#: Return flattened (sample_name, bin_index) labels.

static sample_bins(samples: Mapping[str, TomographyBins]) → dict[tuple[str, int], ndarray]#: Return flattened bins keyed by (sample_name, bin_index).

static sample_combinations(*collections: Mapping[str, TomographyBins]) → list[tuple[tuple[str, int], ...]]#: Return all bin combinations across any number of sample collections.

shape_stats(**kwargs: Any) → dict[str, Any]#

Computes shape statistics for the cached bins.

Shape statistics summarize the structure of each bin’s redshift distribution (e.g. bin moments, centers, etc.) using the parent redshift grid and the most recently built bins.

Parameters:: **kwargs – Additional keyword arguments forwarded to the underlying stats function.
Returns:: A dictionary of shape statistics computed from the cached bins.
Raises:: ValueError – If bins have not been built yet.

static show_survey_config(survey: str, *, print_output: bool = True) → str#

Show a built-in survey configuration as YAML text.

Parameters:

survey – Survey preset name, such as "lsst", "desi", or "roman".
print_output – Whether to print the YAML text before returning it.

Returns:

The selected survey configuration formatted as YAML.

property z: ndarray#

True-redshift grid from the most recent build.

Returns:: The cached redshift grid used to build the parent n(z) and bins.
Raises:: ValueError – If no build has been performed yet.

binny.api.nz_tomography.available_metric_kernels() → list[str]#: List registered metric-kernel names for combo-filter specs.

binny.api.nz_tomography.list_survey_configs() → list[str]#: List built-in survey configuration names.

binny.api.nz_tomography.load_survey_config(survey: str) → dict[str, Any]#

Load a built-in survey configuration by survey name.

Parameters:: survey – Survey preset name, such as "lsst", "desi", or "roman".
Returns:: Parsed survey configuration dictionary.

binny.api.nz_tomography.register_metric_kernel(name: str, func: Callable[[...], float]) → None#

The name is referenced in selection specs via:

{"name": "metric", "metric": "<name>", "threshold": ..., "compare": ...}

Parameters:

name – Kernel identifier used in selection specs.
func – Callable that receives one curve per slot and returns a float.

Raises:

ValueError – If name is already registered.

binny.api.nz_tomography.show_survey_config(survey: str, *, print_output: bool = True) → str#

Show a built-in survey configuration as YAML text.

Parameters:

survey – Survey preset name, such as "lsst", "desi", or "roman".
print_output – Whether to print the YAML text before returning it.

Returns:

The selected survey configuration formatted as YAML.