ml-explore / mlx-lm Public

Pull requests: ml-explore/mlx-lm

Labels 9 Milestones 0

168 Open 646 Closed

Pull requests list

Adding --kv_bits as parameter.

#1309 opened May 24, 2026 by Wolfbane1

feat(models): warn when MTP weights are discarded at load

#1306 opened May 24, 2026 by kru2710shna

4 tasks done

Fix _rstrip_until ValueError when until list is empty

#1305 opened May 23, 2026 by hadoobi

Add Gemma4 shared-KV sanitize regression

#1302 opened May 22, 2026 by rafaelescrich • Draft

Fix server XTC: accept int params and flatten special_tokens list

#1301 opened May 22, 2026 by realyxl

Fix enable_thinking TypeError in deepseek_v32 chat template

#1300 opened May 22, 2026 by ivaniguarans

generate: cache active samplers/processors in GenerationBatch hot path

#1299 opened May 22, 2026 by erayack

fix: add sanitize method to Granite model for tied embeddings

#1298 opened May 22, 2026 by SahilChachra

Add Prompt Lookup Decoding (ngram-simple) and Rolling-Hash Speculative Memory (ngram-mod)

#1297 opened May 22, 2026 by mayank2130

Log detected tool parser on server model load

#1295 opened May 21, 2026 by robertlangdonn

Add Cohere2 MoE (Command A+) model support

#1294 opened May 21, 2026 by eauchs

2 of 5 tasks

Fix KeyError: 'name' in qwen3_coder tool parser

#1289 opened May 19, 2026 by DShickle

BatchGenerator: opt-in prefer_prefill_when_pending scheduler

#1288 opened May 19, 2026 by benjamin-levin

Fix tokenizer test failure

#1287 opened May 19, 2026 by zcbenz Collaborator

[mlx_lm] Expose 'strict' parameter in load() function

#1284 opened May 18, 2026 by zyguy

Add per-request prompt cache files to server

#1283 opened May 18, 2026 by Quiet-Node-io

Fix nemotron_h MoEGate breaking load with per-path quantization

#1282 opened May 18, 2026 by YBJ0000

Add timings to server responses

#1279 opened May 16, 2026 by spicyneuron Contributor

Restrict think-state scan to assistant prefill tail

#1277 opened May 15, 2026 by eilidhmae

Add Gemma 4 assistant (MTP drafter) model class

#1276 opened May 14, 2026 by broomva

fix: make generation_stream per-thread to fix server crash on worker threads

#1275 opened May 14, 2026 by nish2292

4 tasks done

feat: add --idle-timeout to unload model after inactivity

#1274 opened May 14, 2026 by nish2292

8 tasks done

Add logits processor arguments to mlx_lm.generate

#1273 opened May 13, 2026 by realyxl

Support max_kv_size configuration in HTTP server

#1272 opened May 13, 2026 by r-bahuguna

Add Olmo3 tool parser

#1271 opened May 11, 2026 by anthonyhchan

2 tasks done
