InternLM / lmdeploy Public

Notifications You must be signed in to change notification settings
Fork 666
Star 7.7k

Code
Issues 512
Pull requests 58
Discussions
Actions
Projects
Security 1
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Pull requests: InternLM/lmdeploy

Labels 34 Milestones 0

New pull request New

58 Open 2,069 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Optimize Qwen3.5 improvement

#4434 opened Mar 19, 2026 by lzhangzz

Loading…

Split/tool call args json for qwen3coder tool calls (Qwen3.5)

#4433 opened Mar 19, 2026 by lapy

Loading…

feat: fully implement compressed-tensors gs32 support in TurboMind enhancement

New feature or request

#4429 opened Mar 19, 2026 by lapy

Loading…

update h config and add glm4.7 mtp test

#4424 opened Mar 18, 2026 by littlegy

Loading…

delete ray remote function return value improvement

#4422 opened Mar 18, 2026 by grimoire

Loading…

lmdeploy support kernel block size

#4421 opened Mar 17, 2026 by Tsundoku958

Loading…

[Feature] Support n parameter in /v1/chat/completions and /v1/completions

#4419 opened Mar 17, 2026 by ziyangliu-666

Loading…

support cache_seqlen on recurrent-gdr and causal-conv1d-update

#4417 opened Mar 17, 2026 by grimoire

Loading…

Assign sequential api_server ports when proxy_url is unset

#4416 opened Mar 16, 2026 by lvhan028

Loading…

[WIP] Support qwen3-omni

#4411 opened Mar 13, 2026 by CUHKSZzxy • Draft

2 of 4 tasks

fix metrics Bug:P1

#4410 opened Mar 13, 2026 by CUHKSZzxy

Loading…

[ci] add nightly docker build workflow

#4406 opened Mar 12, 2026 by zhulinJulia24

Loading…

Add model deployment best practice section in user guide

#4399 opened Mar 9, 2026 by lvhan028 • Draft

[Fix][Feat] Fix worker sorting with external pg bundles & Support persistent buffer for update_params

#4397 opened Mar 6, 2026 by CyCle1024

Loading…

[Ascend] support qwen3.5 27B

#4395 opened Mar 4, 2026 by wanfengcxz • Draft

Builtin mrope improvement

#4393 opened Mar 4, 2026 by grimoire

Loading…

Use pyupgrade and ruff to modernize LMDeploy Python Code

#4392 opened Mar 3, 2026 by windreamer

Loading…

add tool and reasoning test

#4388 opened Mar 2, 2026 by littlegy

Loading…

Fix Structured Output for GPT-OSS Models

#4386 opened Mar 2, 2026 by windreamer

Loading…

Improve proxy server improvement

#4354 opened Feb 12, 2026 by lvhan028

Loading…

Support MiniMax-M2 in TurboMind engine enhancement

New feature or request

#4343 opened Feb 10, 2026 by zh-nj

Loading…

[WIP]Support torch compile

#4336 opened Feb 8, 2026 by grimoire • Draft

add preliminary support for EP(single-node) of turbomind backend enhancement

New feature or request

#4332 opened Feb 6, 2026 by irexyc

Loading…

change ascend paged attention from BSH format to TND format for better performace

#4295 opened Jan 27, 2026 by jinminxi104 • Draft

return BadRequest for all invlid inputs Bug:P2

#4291 opened Jan 26, 2026 by lvhan028

Loading…

Previous 1 2 3 Next

Previous Next

ProTip! Add no:assignee to see everything that’s not assigned.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!