Skip to content

Pull requests: ml-explore/mlx-lm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Adding --kv_bits as parameter.
#1309 opened May 24, 2026 by Wolfbane1 Loading…
feat(models): warn when MTP weights are discarded at load
#1306 opened May 24, 2026 by kru2710shna Loading…
4 tasks done
Fix _rstrip_until ValueError when until list is empty
#1305 opened May 23, 2026 by hadoobi Loading…
Log detected tool parser on server model load
#1295 opened May 21, 2026 by robertlangdonn Loading…
Add Cohere2 MoE (Command A+) model support
#1294 opened May 21, 2026 by eauchs Loading…
2 of 5 tasks
Fix KeyError: 'name' in qwen3_coder tool parser
#1289 opened May 19, 2026 by DShickle Loading…
Fix tokenizer test failure
#1287 opened May 19, 2026 by zcbenz Collaborator Loading…
[mlx_lm] Expose 'strict' parameter in load() function
#1284 opened May 18, 2026 by zyguy Loading…
Add per-request prompt cache files to server
#1283 opened May 18, 2026 by Quiet-Node-io Loading…
Add timings to server responses
#1279 opened May 16, 2026 by spicyneuron Contributor Loading…
Restrict think-state scan to assistant prefill tail
#1277 opened May 15, 2026 by eilidhmae Loading…
Add Gemma 4 assistant (MTP drafter) model class
#1276 opened May 14, 2026 by broomva Loading…
feat: add --idle-timeout to unload model after inactivity
#1274 opened May 14, 2026 by nish2292 Loading…
8 tasks done
Add logits processor arguments to mlx_lm.generate
#1273 opened May 13, 2026 by realyxl Loading…
Support max_kv_size configuration in HTTP server
#1272 opened May 13, 2026 by r-bahuguna Loading…
Add Olmo3 tool parser
#1271 opened May 11, 2026 by anthonyhchan Loading…
2 tasks done
ProTip! Mix and match filters to narrow down what you’re looking for.