feat(tui): show output token throughput by Randy-sin · Pull Request #36 · MoonshotAI/kimi-code

Randy-sin · 2026-05-25T16:05:01Z

Related Issue

Resolve #35

Problem

See linked issue. Kimi Code did not surface the model output token throughput that users can see in kimi-cli.

What changed

Track the elapsed time for each model step and compute output tokens per second from reported usage.
Show the most recent output speed in the footer next to the context usage line.
Keep transient footer hints higher priority so exit/cancel hints can still replace the speed line temporarily.
Add tests for throughput calculation and footer rendering.

Checklist

I have read the CONTRIBUTING document.
I have linked a related issue, or explained the problem above.
I have added tests that prove my feature works.
Ran gen-changesets skill, or this PR needs no changeset.
Ran gen-docs skill, or this PR needs no doc update.

changeset-bot · 2026-05-25T16:05:06Z

🦋 Changeset detected

Latest commit: 4f65ae0

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 1 package

Name	Type
@moonshot-ai/kimi-code	Minor

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 33194ea4df

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-25T16:10:39Z

  // Starts or updates a rendered tool call from a tool-call start event.
  private handleToolCall(event: ToolCallStartedEvent): void {
    this.flushStreamingUiUpdatesNow();
+    this.state.currentStepModelFinishedAtMs ??= Date.now();


Capture model-finish time before tool preflight latency

Seting currentStepModelFinishedAtMs on tool.call.started makes the throughput calculation include non-model delays for tool steps. In this codebase, tool.call.started is emitted only after tool preflight/prepare work (including hooks and potential approval waits) completes, so updateOutputTokenThroughput can divide by a much larger elapsed time and report misleadingly low tok/s whenever a tool call is delayed before dispatch. This regresses the accuracy of the new footer speed metric specifically for tool-calling turns.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-25T16:10:39Z

+    const tokensPerSecond = outputTokensPerSecond(
+      usage,
+      this.state.currentStepStartedAtMs,
+      endedAtMs,


Exclude retry backoff time from throughput denominator

updateOutputTokenThroughput always divides by elapsed wall-clock time since turn.step.started, but turn.step.completed.usage reflects only the successful LLM attempt. When a step retries (network/provider retryable errors), this elapsed window includes retry sleeps and failed attempts, which can drastically underreport tok/s even though model generation speed was normal. The metric therefore becomes misleading in any retried step.

Useful? React with 👍 / 👎.

Randy-sin · 2026-05-26T02:32:59Z

Addressed in 4f65ae0 by deriving throughput from the observed model streaming window instead of step wall-clock time or tool.call.started.

The footer now records the first/last assistant, thinking, or tool-call argument delta for the current step, then computes output tok/s from that window once usage arrives. That excludes retry backoff, tool preflight/approval, and tool execution time. If no model deltas were observed, it skips the speed display rather than showing a misleading value.

Added regression coverage for delayed tool-step completion and the no-delta fallback.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4f65ae0fc6

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-26T02:37:30Z

+    const startedAtMs = this.state.currentStepFirstDeltaAtMs;
+    const endedAtMs = this.state.currentStepLastDeltaAtMs;
+    if (startedAtMs === undefined || endedAtMs === undefined) return;


Use step-end time when only one delta is observed

This throughput path uses currentStepLastDeltaAtMs as the end timestamp, so when a step emits exactly one model delta (a common pattern in this repo’s agent-core event snapshots), startedAtMs and endedAtMs are identical and outputTokensPerSecond returns null. In that case the new footer speed metric silently disappears even though turn.step.completed.usage.output is positive, so short or single-chunk responses never show a speed value.

Useful? React with 👍 / 👎.

liruifengv · 2026-05-26T04:23:32Z

Thank you for your interest in contributing to Kimi Code.

For new features, please create an issue or discuss it in an existing issue. We are not currently accepting pull requests for new features.

feat(tui): show output token throughput

33194ea

chatgpt-codex-connector Bot reviewed May 25, 2026

View reviewed changes

fix(tui): measure throughput from model deltas

4f65ae0

chatgpt-codex-connector Bot reviewed May 26, 2026

View reviewed changes

liruifengv closed this May 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(tui): show output token throughput#36

feat(tui): show output token throughput#36
Randy-sin wants to merge 2 commits into
MoonshotAI:mainfrom
Randy-sin:feat/footer-token-throughput

Randy-sin commented May 25, 2026

Uh oh!

changeset-bot Bot commented May 25, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 25, 2026

Uh oh!

chatgpt-codex-connector Bot May 25, 2026

Uh oh!

Randy-sin commented May 26, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 26, 2026

Uh oh!

liruifengv commented May 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Randy-sin commented May 25, 2026

Related Issue

Problem

What changed

Checklist

Uh oh!

changeset-bot Bot commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 25, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 25, 2026

Choose a reason for hiding this comment

Uh oh!

Randy-sin commented May 26, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 26, 2026

Choose a reason for hiding this comment

Uh oh!

liruifengv commented May 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

changeset-bot Bot commented May 25, 2026 •

edited

Loading