Skip to content

Adding support to training Ministral 3 as embedding model for visual document retrieval#2648

Draft
gabrielspmoreira wants to merge 11 commits into
mainfrom
ministral3_vdr
Draft

Adding support to training Ministral 3 as embedding model for visual document retrieval#2648
gabrielspmoreira wants to merge 11 commits into
mainfrom
ministral3_vdr

Conversation

@gabrielspmoreira

Copy link
Copy Markdown

What does this PR do ?

Add a one line overview of what this PR aims to accomplish.

Changelog

  • Add specific line by line info of high level changes in this PR.

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?

If you haven't finished some of the above items you can still open "Draft" PR.

Additional Information

  • Related to # (issue)

gabrielspmoreira and others added 11 commits June 15, 2026 17:18
… using vision model processor as a collator. Also adding Nemotron VL 1B model for retrieval

Signed-off-by: Gabriel Moreira <gmoreira@nvidia.com>
Signed-off-by: Gabriel Moreira <gmoreira@nvidia.com>
Signed-off-by: Gabriel Moreira <gmoreira@nvidia.com>
Signed-off-by: Gabriel Moreira <gmoreira@nvidia.com>
…ion encoder with a dummy image to ensure multi-GPU synchronization

Signed-off-by: Gabriel Moreira <gmoreira@nvidia.com>
Signed-off-by: Gabriel Moreira <gmoreira@nvidia.com>
Signed-off-by: Gabriel Moreira <gmoreira@nvidia.com>
Signed-off-by: Gabriel Moreira <gmoreira@nvidia.com>
* fix(retrieval): gate dummy vision forward

Signed-off-by: Yuhe Zhang <yuhez@nvidia.com>

* perf(vlm): optimize Nemotron VL image token insertion

Signed-off-by: Yuhe Zhang <yuhez@nvidia.com>

---------

Signed-off-by: Yuhe Zhang <yuhez@nvidia.com>
…ing. Adding example YAMl for training Nemotron VL 1B
@gabrielspmoreira gabrielspmoreira self-assigned this Jun 19, 2026
@gabrielspmoreira gabrielspmoreira requested review from a team as code owners June 19, 2026 20:55
@copy-pr-bot

copy-pr-bot Bot commented Jun 19, 2026

Copy link
Copy Markdown

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@gabrielspmoreira gabrielspmoreira marked this pull request as draft June 19, 2026 20:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants