LayerFlow: Layer-wise Exploration of LLM Embeddings using Uncertainty-aware Interlinked Projections
By: Rita Sevastjanova , Robin Gerling , Thilo Spinner and more
Potential Business Impact:
Shows how computer words change, avoiding mistakes.
Large language models (LLMs) represent words through contextual word embeddings encoding different language properties like semantics and syntax. Understanding these properties is crucial, especially for researchers investigating language model capabilities, employing embeddings for tasks related to text similarity, or evaluating the reasons behind token importance as measured through attribution methods. Applications for embedding exploration frequently involve dimensionality reduction techniques, which reduce high-dimensional vectors to two dimensions used as coordinates in a scatterplot. This data transformation step introduces uncertainty that can be propagated to the visual representation and influence users' interpretation of the data. To communicate such uncertainties, we present LayerFlow - a visual analytics workspace that displays embeddings in an interlinked projection design and communicates the transformation, representation, and interpretation uncertainty. In particular, to hint at potential data distortions and uncertainties, the workspace includes several visual components, such as convex hulls showing 2D and HD clusters, data point pairwise distances, cluster summaries, and projection quality metrics. We show the usability of the presented workspace through replication and expert case studies that highlight the need to communicate uncertainty through multiple visual components and different data perspectives.
Similar Papers
Layer by Layer: Uncovering Hidden Representations in Language Models
Machine Learning (CS)
Computers understand things better using middle parts.
Visualising Information Flow in Word Embeddings with Diffusion Tensor Imaging
Computation and Language
Shows how computers understand sentences, not just words.
On the Effect of Uncertainty on Layer-wise Inference Dynamics
Computation and Language
Helps AI know when it's unsure.