langspace.metrics package

Submodules

langspace.metrics.disentanglement module

class langspace.metrics.disentanglement.DisentanglementMetric(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]

Bases: Enum

Metrics used to evaluate disentangled representations in latent variable models.

These metrics provide multiple perspectives on the quality of learned representations, by assessing how exclusively and robustly individual latent dimensions capture specific generative factors.

Z_DIFF: The “z-diff” metric assesses how sensitive each latent dimension is to changes in just one generative factor. It calculates the differences in latent activations when varying a single factor, thereby measuring the responsiveness of each dimension. Reference: Higgins, I., Matthey, L., Pal, A., et al. “beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework.” ICLR, 2017.

Z_MIN_VAR: The “z-min-var” metric quantifies the stability of latent dimensions by measuring the minimum variance observed across dimensions when a particular generative factor is fixed. Lower variance in the corresponding latent code indicates a consistent encoding of that factor. Reference: Kim, H., & Mnih, A. “Disentangling by Factorising.” In Proceedings of the 35th International Conference on Machine Learning, volume 80 of Proceedings of Machine Learning Research, pages 2649–2658. PMLR.

MIG: The Mutual Information Gap (MIG) evaluates disentanglement by computing the difference in mutual information between the top two latent variables linked to a specific ground-truth factor. A larger gap implies that the factor is predominantly captured by a single latent dimension. Reference: Chen, R. T. Q., Li, X., Grosse, R., & Duvenaud, D. “Isolating Sources of Disentanglement in Variational Autoencoders.” NeurIPS, 2018.

DISENTANGLEMENT: This overall metric reflects the degree to which the learned representation is disentangled. It typically aggregates per-dimension or per-factor scores to provide a single measure of representation quality. Reference: Eastwood, C., & Williams, C. K. I. “A Framework for the Quantitative Evaluation of Disentangled Representations.” ICLR, 2018.

COMPLETENESS: Completeness measures the concentration of information about each generative factor within a limited set of latent variables. High completeness indicates that each factor is encoded without excessive redundancy across different dimensions. Reference: Eastwood, C., & Williams, C. K. I. “A Framework for the Quantitative Evaluation of Disentangled Representations.” ICLR, 2018.

INFORMATIVENESS: This metric gauges how well the latent representation retains useful information about the underlying generative factors. It is often estimated via the performance of predictors that recover these factors from the latent codes, thus reflecting the overall predictive power of the representation. Reference: Eastwood, C., & Williams, C. K. I. “A Framework for the Quantitative Evaluation of Disentangled Representations.” ICLR, 2018.

COMPLETENESS = 'Completeness'

DISENTANGLEMENT = 'Disentanglement'

INFORMATIVENESS = 'Informativeness'

MIG = 'MIG'

Z_DIFF = 'z-diff'

Z_MIN_VAR = 'z-min-var'

langspace.metrics.interpolation module

class langspace.metrics.interpolation.InterpolationMetric(value, names=None, *, module=None, qualname=None, type=None, start=1, boundary=None)[source]

Bases: Enum

Metrics used to evaluate interpolation in generative models and latent space representations.

These metrics assess the quality and continuity of transitions between points in the latent space to ensure that interpolated samples are both realistic and smoothly connected.

QUALITY: This metric evaluates the overall perceptual fidelity and realism of interpolated outputs. It considers aspects such as artifact-free generation, text clarity, and consistency with the learned data distribution. High quality interpolations are expected to appear indistinguishable from real data. Research in generative adversarial networks and variational autoencoders has repeatedly emphasized the importance of qualitative assessments in generative processes.

SMOOTHNESS: This metric assesses the continuity of transitions along the latent space trajectory. A smooth interpolation implies gradual and coherent changes between successive points, ensuring that there are no abrupt jumps or artifacts. Smoothness is a key indicator of a well-behaved latent space, where semantic features change consistently and predictably. Studies evaluating latent space geometry have highlighted that smooth transitions are indicative of robust and disentangled representations.

QUALITY = 'Quality'

SMOOTHNESS = 'Smoothness'

langspace.metrics package

Submodules

langspace.metrics.disentanglement module

langspace.metrics.interpolation module

Module contents