Enable LFM2.5 export and runner support for MLX by seyeong-han · Pull Request #19195 · pytorch/executorch

seyeong-han · 2026-04-28T19:43:55Z

Summary

Add LFM2.5 350M registration, Hugging Face repo mapping, and ExecuTorch model config.
Add an MLX 4-bit export config for LFM2.5 with KV cache metadata.
Add make lfm_2_5-mlx to build the shared Llama C++ runner with MLX enabled, plus README instructions for export and C++/pybindings runs.
Link to uploaded Hugging Face Hub artifacts: https://huggingface.co/younghan-meta/LFM2.5-ExecuTorch-MLX
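
For readers who want to fetch a prebuilt artifact rather than re-export, the following is a minimal sketch of building a direct-download URL for the Hub repo linked above. It relies on the Hub's usual `resolve/<revision>/<filename>` URL layout. The file name used here is an assumption: it reuses the local export name from the test plan and may not match what was actually uploaded.

```python
from urllib.parse import quote

# Repo id taken from the Hugging Face link in the summary.
REPO_ID = "younghan-meta/LFM2.5-ExecuTorch-MLX"


def hub_file_url(filename: str, revision: str = "main") -> str:
    """Build a direct-download URL for one file in the Hub repo."""
    return (
        f"https://huggingface.co/{REPO_ID}"
        f"/resolve/{quote(revision)}/{quote(filename)}"
    )


# ASSUMPTION: the uploaded file uses the same name as the local export
# listed in the test plan; check the repo's file listing before relying on it.
url = hub_file_url("lfm2_5_350m_mlx_4w.pte")
print(url)
```

The resulting URL can be passed to any HTTP client; no Hub-specific library is needed for a public repo.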

Test plan

pytest -q examples/models/lfm2/test_lfm2_5_mlx.py
make -n lfm_2_5-mlx
make lfm_2_5-mlx
Exported and runtime-loaded:
- lfm2_5_350m_mlx_4w.pte
- lfm2_5_1_2b_mlx_4w.pte
Ran pybindings smoke generation for both 350M and 1.2B.
Ran C++ llama_main smoke generation with lfm2_5_350m_mlx_4w.pte.
Benchmarked C++ MLX runner:
- 350M median decode: 330.43 tok/s
- 1.2B median decode: 147.93 tok/s
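
As a rough illustration of how a median decode rate like the ones above might be derived, the sketch below computes tokens per second for several runs and takes the median. The per-run token counts and timings are hypothetical placeholders, not measurements from this PR; only the two reported medians come from the test plan.

```python
from statistics import median


def decode_tok_per_s(tokens_generated: int, decode_seconds: float) -> float:
    """Decode throughput for one run, in tokens per second."""
    if decode_seconds <= 0:
        raise ValueError("decode_seconds must be positive")
    return tokens_generated / decode_seconds


# HYPOTHETICAL (tokens, seconds) pairs for repeated benchmark runs.
runs = [(256, 0.8), (256, 0.5), (256, 1.0)]
rates = [decode_tok_per_s(tokens, seconds) for tokens, seconds in runs]

# The median is less sensitive to one unusually slow or fast run than the mean.
median_rate = median(rates)
print(f"median decode: {median_rate:.2f} tok/s")

# Ratio of the two medians reported in the test plan (350M vs 1.2B).
speedup = 330.43 / 147.93
print(f"350M decodes about {speedup:.2f}x faster than 1.2B")
```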

pytorch-bot · 2026-04-28T19:43:58Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19195

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

Rolling out OSDC (ARC) runners on pull & trunk workflows in PyTorch main

✅ No Failures

As of commit 46dbe46 with merge base 7e43308:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

seyeong-han · 2026-04-28T21:09:08Z

@pytorchbot label "release notes: examples"

seyeong-han requested review from kirklandsign, larryliu0820, lucylq and mergennachin as code owners April 28, 2026 19:43

meta-cla Bot added the "CLA Signed" label Apr 28, 2026

pytorch-bot Bot added the "release notes: examples" label Apr 28, 2026

seyeong-han added 2 commits April 28, 2026 14:11

Enable LFM2.5 MLX export and runner build

7b00db3

Add LFM2.5 350M registration, MLX export config, focused regression coverage, and a make target for building the shared Llama C++ runner with MLX. Made-with: Cursor

Document LFM2.5 MLX Hub artifacts

46dbe46

Point the LFM2 README at the uploaded Hugging Face artifacts so users can run the MLX examples without re-exporting locally. Made-with: Cursor

seyeong-han force-pushed the lfm2_5_mlx branch from fd04017 to 46dbe46 Compare April 28, 2026 21:12

seyeong-han wants to merge 2 commits into pytorch:main from seyeong-han:lfm2_5_mlx
