feat: Add late interaction model training support for retrieval by rnyak · Pull Request #2283 · NVIDIA-NeMo/Automodel

rnyak · 2026-05-20T17:18:29Z

What does this PR do?

Updates train_bi_encoder.py to support local ColBERT-style pooling by adding colbert_scores_and_labels(), which computes MaxSim scores while honoring the query and passage attention masks. The training and validation paths now route ColBERT models through this function instead of the standard contrastive scoring over pooled embeddings.
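For reviewers less familiar with MaxSim, here is a minimal, framework-free sketch of masked late-interaction scoring. It is not the code in this PR: the names `maxsim_score` and `maxsim_score_matrix` are hypothetical, plain Python lists stand in for tensors, and the choice to skip masked query tokens as well as masked passage tokens is an assumption made for illustration.

```python
from typing import List, Sequence

Vector = Sequence[float]


def dot(u: Vector, v: Vector) -> float:
    """Dot product of two equal-length token embeddings."""
    return sum(a * b for a, b in zip(u, v))


def maxsim_score(
    query_tokens: Sequence[Vector],
    query_mask: Sequence[int],
    passage_tokens: Sequence[Vector],
    passage_mask: Sequence[int],
) -> float:
    """Hypothetical helper: sum over unmasked query tokens of the best
    similarity against any unmasked passage token (MaxSim)."""
    total = 0.0
    for q_vec, q_keep in zip(query_tokens, query_mask):
        if not q_keep:
            continue  # assumption: padded query tokens contribute nothing
        # Padded passage tokens are excluded before taking the max.
        sims = [
            dot(q_vec, p_vec)
            for p_vec, p_keep in zip(passage_tokens, passage_mask)
            if p_keep
        ]
        if sims:
            total += max(sims)
    return total


def maxsim_score_matrix(
    queries: Sequence[Sequence[Vector]],
    query_masks: Sequence[Sequence[int]],
    passages: Sequence[Sequence[Vector]],
    passage_masks: Sequence[Sequence[int]],
) -> List[List[float]]:
    """Score every query against every passage; rows are queries."""
    return [
        [maxsim_score(q, qm, p, pm) for p, pm in zip(passages, passage_masks)]
        for q, qm in zip(queries, query_masks)
    ]
```

In a tensor implementation the same idea is usually expressed as a batched token-by-token similarity, a masked max over the passage-token axis, and a sum over the query-token axis.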

Distributed in-batch negatives remain unsupported for ColBERT for now and still raise an explicit error.
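The explicit failure described above could look roughly like the guard below. This is only a sketch: the function name, its parameters, and the `"colbert"` pooling string are hypothetical and are not taken from train_bi_encoder.py.

```python
def check_in_batch_negatives_supported(
    pooling: str,
    use_distributed_negatives: bool,
    world_size: int,
) -> None:
    """Hypothetical guard: fail fast rather than silently fall back to a
    scoring path that does not apply to token-level embeddings."""
    if pooling == "colbert" and use_distributed_negatives and world_size > 1:
        raise NotImplementedError(
            "Distributed in-batch negatives are not supported for "
            "ColBERT-style pooling; disable them or run on a single rank."
        )
```

Raising up front, before any forward pass, keeps the unsupported combination from producing a misleading loss value.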

Before your PR is "Ready for review"

Pre checks:

Make sure you read and followed Contributor guidelines
Did you write any new necessary tests?
Did you add or update any necessary documentation?

copy-pr-bot · 2026-05-20T17:18:33Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

rnyak · 2026-05-20T23:56:19Z

/claude review

Signed-off-by: Ronay Ak <ronaya@nvidia.com>

add late interaction for retrieval

4d5e76e

rnyak requested review from a team, HuiyingLi, adil-a, akoumpa and hemildesai as code owners May 20, 2026 17:18

rnyak self-assigned this May 20, 2026

rnyak added the enhancement New feature or request label May 20, 2026

rnyak marked this pull request as draft May 20, 2026 17:18

rnyak temporarily deployed to public May 20, 2026 17:40 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 20, 2026 17:46 — with GitHub Actions Inactive

remove redundant query masking for colbert

4b31317

Signed-off-by: Ronay Ak <ronaya@nvidia.com>

rnyak temporarily deployed to public May 22, 2026 00:15 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 22, 2026 00:16 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 22, 2026 00:17 — with GitHub Actions Inactive

rnyak temporarily deployed to public May 22, 2026 00:23 — with GitHub Actions Inactive

rnyak wants to merge 2 commits into main from rny/late_interaction_retrieval
