Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[V1] Add `uncache_blocks`

github.com/vllm-project/vllm - comaniac opened this pull request 9 days ago
Fixing the LoRA CI test.

github.com/vllm-project/vllm - Alexei-V-Ivanov-AMD opened this pull request 10 days ago
[Misc]: RoPE vs Sliding Windows

github.com/vllm-project/vllm - ccruttjr opened this issue 10 days ago
[Core] Fix an isort error from pre-commit

github.com/vllm-project/vllm - russellb opened this pull request 10 days ago
[Docs] Document vulnerability disclosure process

github.com/vllm-project/vllm - russellb opened this pull request 10 days ago
[Feature]: Use `uv` in pre-commit

github.com/vllm-project/vllm - NickLucche opened this issue 10 days ago
[Bug]: Speculative decoding does not work

github.com/vllm-project/vllm - JohnConnor123 opened this issue 10 days ago
[Doc] Add docs for prompt replacement

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 10 days ago
[Core] tokens in queue metric

github.com/vllm-project/vllm - annapendleton opened this pull request 10 days ago
[Core] Support `reset_prefix_cache`

github.com/vllm-project/vllm - comaniac opened this pull request 10 days ago
[Docs] Update FP8 KV Cache documentation

github.com/vllm-project/vllm - mgoin opened this pull request 11 days ago
[Model] Add Qwen2 PRM model support

github.com/vllm-project/vllm - Isotr0py opened this pull request 12 days ago
[Bugfix] Fix incorrect types in LayerwiseProfileResults

github.com/vllm-project/vllm - terrytangyuan opened this pull request 12 days ago
[V1][Spec Decode] Ngram Spec Decode

github.com/vllm-project/vllm - LiuXiaoxuanPKU opened this pull request 12 days ago
[torch.compile] fix sym_tensor_indices

github.com/vllm-project/vllm - youkaichao opened this pull request 13 days ago
[misc] add cuda runtime version to usage data

github.com/vllm-project/vllm - youkaichao opened this pull request 13 days ago
[Misc] Add Gemma2 GGUF support

github.com/vllm-project/vllm - Isotr0py opened this pull request 14 days ago
[Kernel] add triton fused moe kernel for gptq/awq

github.com/vllm-project/vllm - jinzhen-lin opened this pull request 14 days ago
[Misc] Add BNB support to GLM4-V model

github.com/vllm-project/vllm - Isotr0py opened this pull request 14 days ago
[torch.compile] store inductor compiled Python file

github.com/vllm-project/vllm - youkaichao opened this pull request 14 days ago
[Feature]: Multi-Token Prediction (MTP)

github.com/vllm-project/vllm - casper-hansen opened this issue 14 days ago
[Docs] Fix broken link in SECURITY.md

github.com/vllm-project/vllm - russellb opened this pull request 15 days ago
[Bug]: Unable to serve Qwen2-audio in V1

github.com/vllm-project/vllm - superfan89 opened this issue 15 days ago
[misc] fix cross-node TP

github.com/vllm-project/vllm - youkaichao opened this pull request 15 days ago
[New Model]: NV-Embed-v2

github.com/vllm-project/vllm - Hypothesis-Z opened this issue 15 days ago
[WIP] Multimodal model support for V1 TPU

github.com/vllm-project/vllm - mgoin opened this pull request 15 days ago
[V1] Add V1 support of Qwen2-VL

github.com/vllm-project/vllm - ywang96 opened this pull request 16 days ago
[core] further polish memory profiling

github.com/vllm-project/vllm - youkaichao opened this pull request 16 days ago
[Misc] Update to Transformers 4.48

github.com/vllm-project/vllm - tlrmchlsmth opened this pull request 16 days ago
[BUILD] Add VLLM_BUILD_EXT to control custom op build

github.com/vllm-project/vllm - MengqingCao opened this pull request 16 days ago
[V1] Collect env var for usage stats

github.com/vllm-project/vllm - simon-mo opened this pull request 16 days ago
[New Model]: internlm3-8b-instruct

github.com/vllm-project/vllm - engchina opened this issue 16 days ago
Add: Support for Sparse24Bitmask Compressed Models

github.com/vllm-project/vllm - rahul-tuli opened this pull request 17 days ago
[Bug]: whisper example issue?

github.com/vllm-project/vllm - silvacarl2 opened this issue 17 days ago
[Kernel] Flash Attention 3 Support

github.com/vllm-project/vllm - LucasWilkinson opened this pull request 17 days ago
[Bugfix] Fix _get_lora_device for HQQ marlin

github.com/vllm-project/vllm - varun-sundar-rabindranath opened this pull request 17 days ago
Various cosmetic/comment fixes

github.com/vllm-project/vllm - mgoin opened this pull request 17 days ago
[delete]

github.com/vllm-project/vllm - Aktsvigun opened this pull request 17 days ago
[V1][WIP] Add KV cache group dimension to block table

github.com/vllm-project/vllm - heheda12345 opened this pull request 17 days ago
[Usage]: Token Embeddings from LLMs/VLMs

github.com/vllm-project/vllm - conceptofmind opened this issue 17 days ago