Pull requests: Blaizzy/mlx-vlm
- #1023 Expose presence_penalty, frequency_penalty, and per-penalty context_size on the server API (opened Apr 14, 2026 by esaruoho)
- #1019 refactor: improve model loading and resource handling in utils.py (opened Apr 13, 2026 by SyedaAnshrahGillani)
- #1014 server: indicate finish reason properly when model made a tool call (opened Apr 12, 2026 by viktike, Contributor)
- #1013 Resolve no images crash for qwen3_vl and qwen3_vl_moe generate call (opened Apr 11, 2026 by urimem)
- #1012 perf: close 5.5% decode gap vs mlx_lm.server on streaming chat endpoint (opened Apr 11, 2026 by chilang)
- #1009 fix: use OpenAI chat-completion field names in /chat/completions usage (opened Apr 10, 2026 by chilang)
- #1006 fix: replace NaN from all-masked SDPA padding rows in Gemma 4 vision (opened Apr 10, 2026 by fabiopili)
- #996 feat: OpenAI Responses API with structured tool calling and multi-turn support (opened Apr 9, 2026 by eloe)
- #995 feat: prompt prefix caching with TTL eviction and TurboQuant support (opened Apr 9, 2026 by eloe)
- #990 fix: return finish_reason=tool_calls when tool calls detected (opened Apr 9, 2026 by eloe)
- #985 Add TriAttention KV cache compression (opened Apr 9, 2026 by Blaizzy, Owner)
- #979 fix(trainer): pass images to prepare_inputs for Gemma, Qwen, and SmolVLM (opened Apr 8, 2026 by ukint-vs)
- #977 fix(trainer): flatten input_ids before measuring length in batch padding (opened Apr 8, 2026 by ukint-vs)
- #974 Strip tool-call markup from streamed delta.content (opened Apr 7, 2026 by michaelstingl, Contributor; Draft)