-
Notifications
You must be signed in to change notification settings - Fork 706
Pull requests: ml-explore/mlx-lm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(models): warn when MTP weights are discarded at load
#1306
opened May 24, 2026 by
kru2710shna
Loading…
4 tasks done
Fix server XTC: accept int params and flatten special_tokens list
#1301
opened May 22, 2026 by
realyxl
Loading…
Fix enable_thinking TypeError in deepseek_v32 chat template
#1300
opened May 22, 2026 by
ivaniguarans
Loading…
generate: cache active samplers/processors in GenerationBatch hot path
#1299
opened May 22, 2026 by
erayack
Loading…
fix: add sanitize method to Granite model for tied embeddings
#1298
opened May 22, 2026 by
SahilChachra
Loading…
Add Prompt Lookup Decoding (ngram-simple) and Rolling-Hash Speculative Memory (ngram-mod)
#1297
opened May 22, 2026 by
mayank2130
Loading…
Add Cohere2 MoE (Command A+) model support
#1294
opened May 21, 2026 by
eauchs
Loading…
2 of 5 tasks
BatchGenerator: opt-in prefer_prefill_when_pending scheduler
#1288
opened May 19, 2026 by
benjamin-levin
Loading…
Fix nemotron_h MoEGate breaking load with per-path quantization
#1282
opened May 18, 2026 by
YBJ0000
Loading…
fix: make generation_stream per-thread to fix server crash on worker threads
#1275
opened May 14, 2026 by
nish2292
Loading…
4 tasks done
feat: add --idle-timeout to unload model after inactivity
#1274
opened May 14, 2026 by
nish2292
Loading…
8 tasks done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.