
feat(deepinfra): update model YAMLs [bot]#484

Open
harshiv-26 wants to merge 1 commit into main from bot/update-deepinfra-20260330-070739

Conversation


@harshiv-26 harshiv-26 commented Mar 30, 2026

Auto-generated by poc-agent for provider deepinfra.


Note

Medium Risk
Primarily configuration-only YAML updates, but changes to max_*_tokens, context_window, and declared features (e.g., json_output) can alter runtime behavior and token budgeting for callers.

Overview
Updates DeepInfra provider model YAMLs to reflect current capabilities and metadata: adds status: active broadly, introduces/expands modalities sections, and annotates more models with features like json_output, structured_output, and function_calling.

Adjusts several token limits/pricing knobs (e.g., adds missing max_output_tokens, reduces max_tokens for some Qwen models, increases context_window for Kimi models, adds tiered cache-read pricing for Qwen3-Max) and marks meta-llama/Llama-Guard-3-8B as deprecated via isDeprecated + deprecationDate.
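As a rough sketch of the kind of change described above (key names and values are assumed from this summary, not copied from the repository's actual schema), an updated DeepInfra model YAML might look like:

```yaml
# Hypothetical sketch of an updated model config.
# Field names follow the PR description; exact schema may differ.
status: active
modalities:
  input:
    - text
  output:
    - text
features:
  - json_output
  - structured_output
  - function_calling
limits:
  context_window: 262144     # e.g., an increased window for Kimi models
  max_output_tokens: 8192    # added where previously missing
```

A deprecated model would additionally carry `isDeprecated: true` plus a `deprecationDate`, per the Llama-Guard-3-8B change noted above.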

Written by Cursor Bugbot for commit e23d6c7.


@cursor cursor bot left a comment


Cursor Bugbot has reviewed your changes and found 2 potential issues.


Excerpt from the changed YAML:

```yaml
input:
- text
- image
output:
- text
```


Limits section accidentally removed from Janus-Pro-1B

High Severity

The entire limits section (including max_input_tokens: 2600) was removed from the Janus-Pro-1B config. The sibling model Janus-Pro-7B.yaml retains its limits with max_input_tokens: 2600. Removing input token constraints means clients won't know the model's actual input limit, potentially leading to failed API calls when exceeding the undocumented cap.
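A minimal fix would restore the dropped block, mirroring the sibling Janus-Pro-7B.yaml (the surrounding keys are assumed; only `max_input_tokens: 2600` is confirmed by the report):

```yaml
# Janus-Pro-1B.yaml (sketch): restore the limits section removed by the update
limits:
  max_input_tokens: 2600
```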


Excerpt from the changed YAML:

```yaml
- https://platform.claude.com/docs/en/docs/about-claude/pricing
status: active
supportedModes:
- chat
```


Redundant supportedModes field added to deepinfra model

Low Severity

A supportedModes field with - chat was added to this deepinfra model config, but this field is not used in any other deepinfra model. It only appears in azure-open-ai provider configs. The existing mode: chat field on line 24 already conveys the same information, making supportedModes redundant and inconsistent with the rest of the deepinfra provider.
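Assuming the layout described above (a sketch, not the verbatim file), the fix is to drop the added block and keep the existing `mode` key, matching the rest of the deepinfra provider:

```yaml
# Before (redundant with the existing mode key):
mode: chat
supportedModes:
  - chat

# After (consistent with other deepinfra configs):
mode: chat
```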


