Add lm head vocab histogram animation #814
Open
klei22 wants to merge 5 commits into
Conversation
Pull request overview
This PR adds a new training-time visualization for tracking the evolution of `lm_head` per-token vector magnitudes: TensorBoard histogram logging during training plus an interactive HTML export for post-hoc inspection.
Changes:
- Add TensorBoard histogram logging of `lm_head` vocab-vector L2 magnitudes at a configurable interval.
- Capture `lm_head` magnitude "snapshots" during training and export them as an interactive Plotly-based HTML report.
- Add CLI flags and a demo script + docs to make the feature easy to run.
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 7 comments.
| File | Description |
|---|---|
| `train.py` | Implements histogram logging, snapshot capture, and HTML export hooks in the training loop. |
| `train_args.py` | Adds new CLI flags to enable/configure histogram logging and HTML export. |
| `demos/README.md` | Documents the new demo and how to view the TensorBoard/HTML outputs. |
| `demos/lm_head_vocab_histogram_demo.sh` | Provides a runnable example training command enabling the new logging/export. |
Comments suppressed due to low confidence (1)
train.py:1519
- Same issue as above: using `.get(...)` on `self.model.transformer` (an `nn.ModuleDict`) is likely to fail at runtime in multicontext mode. Use a membership check + `[]` indexing (or another ModuleDict-safe accessor) instead.
```python
lm_head = getattr(self.model, "lm_head", None)
if self.args.training_mode == "multicontext" and hasattr(self.model, "transformer"):
    try:
        dataset_idx = self.args.multicontext_datasets.index(target_dataset)
    except ValueError:
        dataset_idx = 0
    lm_head = self.model.transformer.get(f"lm_head_{dataset_idx}", lm_head)
if lm_head is None or not hasattr(lm_head, "weight"):
```
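Since `nn.ModuleDict` has no `.get()` method, the ModuleDict-safe pattern the review asks for is a membership check plus `[]` indexing. A minimal sketch of that pattern, using a stand-in class so the example runs without PyTorch installed (`StubModuleDict` and `resolve_lm_head` are illustrative names, not the PR's code):

```python
class StubModuleDict:
    """Stand-in for torch.nn.ModuleDict: supports `in` and `[]`, but no .get()."""

    def __init__(self, modules):
        self._modules = dict(modules)

    def __contains__(self, key):
        return key in self._modules

    def __getitem__(self, key):
        return self._modules[key]


def resolve_lm_head(transformer, dataset_idx, fallback):
    # Membership check + [] indexing instead of .get(), which ModuleDict lacks.
    key = f"lm_head_{dataset_idx}"
    if key in transformer:
        return transformer[key]
    return fallback


transformer = StubModuleDict({"lm_head_0": "head-0"})
assert resolve_lm_head(transformer, 0, "shared-head") == "head-0"
assert resolve_lm_head(transformer, 1, "shared-head") == "shared-head"
```

The same two-line pattern drops into the training-loop code in place of the `.get(...)` call.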
Comment on lines +1394 to +1397

```python
    or self.iter_num % self.args.log_lm_head_vocab_hist_interval != 0
):
    return
```
```python
        dataset_idx = self.args.multicontext_datasets.index(target_dataset)
    except ValueError:
        dataset_idx = 0
    lm_head = self.model.transformer.get(f"lm_head_{dataset_idx}", lm_head)
```
Comment on lines +1460 to +1461

```html
<script>
const snapshots = {payload};
```
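A common pitfall when inlining JSON into a `<script>` block is a literal `</script>` sequence inside the serialized data, which terminates the script element early. A hedged sketch of one way to build a safe `{payload}` string (the `make_payload` helper is illustrative, not the PR's code):

```python
import json


def make_payload(snapshots):
    """Serialize snapshots for inline embedding inside a <script> tag.

    Rewriting "</" as the equivalent JSON escape "<\\/" keeps the payload
    valid JSON while preventing a literal "</script>" in token text from
    closing the script element early.
    """
    return json.dumps(snapshots).replace("</", "<\\/")


snapshots = [{"iter_num": 0, "vocab": [{"id": 0, "token_display": "</script>"}]}]
payload = make_payload(snapshots)
html = f"<script>\nconst snapshots = {payload};\n</script>"

assert "</script>" not in payload          # safe to inline
assert json.loads(payload) == snapshots    # round-trips unchanged
```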
```python
out_path = self.args.lm_head_vocab_hist_html_path or os.path.join(
    self.args.out_dir, "lm_head_vocab_histogram.html"
)
os.makedirs(os.path.dirname(out_path), exist_ok=True)
```
Comment on lines +1521 to +1532

```python
with torch.no_grad():
    magnitudes = lm_head.weight.detach().norm(dim=1).float().cpu().tolist()
vocab_data = []
for i, m in enumerate(magnitudes):
    token_raw, token_display = self._get_vocab_label_parts(i)
    vocab_data.append({"id": i, "magnitude": float(m), "token_raw": token_raw, "token_display": token_display})
self.lm_head_hist_snapshots.append({
    "iter_num": int(self.iter_num),
    "tokens_trained": float(tokens_trained),
    "dataset": target_dataset,
    "vocab": vocab_data,
})
```
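For readers unfamiliar with the tensor call: `lm_head.weight.norm(dim=1)` is a row-wise L2 norm, one magnitude per vocabulary entry. A dependency-free Python sketch of the same computation and snapshot shape (illustrative only, assuming rows of the weight matrix correspond to tokens as in the PR):

```python
import math


def row_l2_magnitudes(weight):
    """Per-token L2 norm of an lm_head weight matrix (one row per vocab entry).

    Pure-Python equivalent of lm_head.weight.norm(dim=1) for illustration.
    """
    return [math.sqrt(sum(x * x for x in row)) for row in weight]


# Toy 3-token vocab with 2-dim embeddings.
weight = [[3.0, 4.0], [0.0, 0.0], [1.0, 1.0]]
mags = row_l2_magnitudes(weight)  # token 0 is the 3-4-5 triangle, token 1 is zero

snapshot = {
    "iter_num": 0,
    "vocab": [{"id": i, "magnitude": m} for i, m in enumerate(mags)],
}
assert snapshot["vocab"][0]["magnitude"] == 5.0
```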
Comment on lines 2454 to 2457

```python
# End of training actions
if self.iter_num > self.args.max_iters:
    self._export_lm_head_vocab_histogram_html()
    print(self.best_val_loss, self.best_iter, self.best_tokens)
```
Comment on lines +1454 to +1456

```python
logging_group.add_argument('--log_lm_head_vocab_hist', default=False, action=argparse.BooleanOptionalAction, help='Log TensorBoard histogram of per-token lm_head vector magnitudes over training')
logging_group.add_argument('--log_lm_head_vocab_hist_interval', default=100, type=int, help='Training-step interval for logging lm_head vocab magnitude histogram')
logging_group.add_argument('--export_lm_head_vocab_hist_html', default=False, action=argparse.BooleanOptionalAction, help='Export an interactive HTML report of final lm_head vocab-vector magnitudes')
```
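These flags use `argparse.BooleanOptionalAction` (Python 3.9+), which automatically registers a `--no-...` negation for each boolean flag. A standalone sketch reusing the flag names from the diff (the parser itself is illustrative, not the PR's `train_args.py`):

```python
import argparse

parser = argparse.ArgumentParser()
logging_group = parser.add_argument_group("logging")
# BooleanOptionalAction also registers --no-log_lm_head_vocab_hist automatically.
logging_group.add_argument('--log_lm_head_vocab_hist', default=False,
                           action=argparse.BooleanOptionalAction)
logging_group.add_argument('--log_lm_head_vocab_hist_interval', default=100, type=int)

on = parser.parse_args(['--log_lm_head_vocab_hist',
                        '--log_lm_head_vocab_hist_interval', '250'])
assert on.log_lm_head_vocab_hist is True
assert on.log_lm_head_vocab_hist_interval == 250

off = parser.parse_args(['--no-log_lm_head_vocab_hist'])
assert off.log_lm_head_vocab_hist is False
assert off.log_lm_head_vocab_hist_interval == 100  # default preserved
```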
This pull request introduces a new feature for visualizing the evolution of `lm_head` vocabulary vector magnitudes during model training, making it easier to analyze and debug model behavior. It adds both TensorBoard histogram logging and an interactive HTML export, along with the necessary configuration options and demo scripts. The changes are grouped as follows:

New Visualization Features:
- Log `lm_head` vector magnitudes as a histogram in TensorBoard during training, controlled by new command-line arguments. This enables users to track how the output layer's token representations change over time. [1] [2] [3] [4]
- Export an interactive HTML report of final `lm_head` vocab vector magnitudes, with sortable bars and hover labels for token ids and text. [1] [2]

Configuration and Usability:
- New command-line arguments enable and configure the `lm_head` vocab histogram, including logging interval and HTML output path.
- A new demo script (`demos/lm_head_vocab_histogram_demo.sh`) and updated documentation guide users in running and visualizing the new feature. [1] [2]

Internal Implementation:
- Helper methods capture `lm_head` vector magnitudes and associated token labels, supporting both TensorBoard and HTML visualization. [1] [2] [3] [4]

These changes make it much easier to monitor and analyze how the model's output token representations evolve during training, both interactively and post-hoc.