Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[Bug]: Mismatch of tqdm when n > 1

github.com/vllm-project/vllm - MiDonkey opened this issue 15 days ago

[Model] Composite weight loading for multimodal Qwen2

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 15 days ago

Build tpu image in release pipeline

github.com/vllm-project/vllm - richardsliu opened this pull request 16 days ago

Update deploying_with_k8s.rst

github.com/vllm-project/vllm - AlexHe99 opened this pull request 16 days ago

Add Bamba Model

github.com/vllm-project/vllm - fabianlim opened this pull request 17 days ago

[Misc]: FP8/INT8 for AQLM ?

github.com/vllm-project/vllm - Duncan1115 opened this issue 17 days ago

[Doc]: How to make Multi-Node Inference

github.com/vllm-project/vllm - pygongnlp opened this issue 17 days ago

[Core] Support offloading KV cache to CPU

github.com/vllm-project/vllm - ApostaC opened this pull request 18 days ago

[Bugfix] Only require XGrammar on x86

github.com/vllm-project/vllm - mgoin opened this pull request 18 days ago

[CI] Turn on basic correctness tests for V1

github.com/vllm-project/vllm - tlrmchlsmth opened this pull request 18 days ago

[MISC][XPU] quick fix for XPU CI

github.com/vllm-project/vllm - yma11 opened this pull request 18 days ago

Add jamba classfication

github.com/vllm-project/vllm - yecohn opened this pull request 18 days ago

Update sampling_params.py

github.com/vllm-project/vllm - o2363286 opened this pull request 18 days ago

Regional compilation support

github.com/vllm-project/vllm - Kacper-Pietkun opened this pull request 18 days ago

[Feature]: add DoRA support

github.com/vllm-project/vllm - cmhungsteve opened this issue 18 days ago

Tmp whl

github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 19 days ago

[core][distributed] add pynccl broadcast

github.com/vllm-project/vllm - youkaichao opened this pull request 19 days ago

Lora scheduler

github.com/vllm-project/vllm - Scott-Hickmann opened this pull request 19 days ago

[Doc] add KubeAI to serving integrations

github.com/vllm-project/vllm - samos123 opened this pull request 19 days ago

[WIP] Xgrammar init in engine

github.com/vllm-project/vllm - mgoin opened this pull request 19 days ago

[Doc] Create a new "Usage" section

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 19 days ago

[Bug]: mistral tool choice error

github.com/vllm-project/vllm - warlockedward opened this issue 19 days ago

[Misc] Split up pooling tasks

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 19 days ago

[Misc] Remove deprecated names

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 20 days ago

[Model] Add support for embedding model GritLM

github.com/vllm-project/vllm - pooyadavoodi opened this pull request 20 days ago

[misc] remove xverse modeling file

github.com/vllm-project/vllm - youkaichao opened this pull request 20 days ago

[Bug]: Engine process (pid 76) died

github.com/vllm-project/vllm - 0xymoro opened this issue 20 days ago

[Kernel] Use `out` in flash_attn_varlen_func

github.com/vllm-project/vllm - WoosukKwon opened this pull request 20 days ago

[Bug]: vllm stream generate error

github.com/vllm-project/vllm - Wbxxx opened this issue 20 days ago

[Misc] Rename embedding classes to pooling

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 21 days ago

[LoRA] Change lora_tokenizers capacity

github.com/vllm-project/vllm - xyang16 opened this pull request 21 days ago

[Model] Add BNB support to Llava and Pixtral-HF

github.com/vllm-project/vllm - Isotr0py opened this pull request 21 days ago

Fix openvino on GPU

github.com/vllm-project/vllm - janimo opened this pull request 21 days ago

[Usage]: Question on max_model_len

github.com/vllm-project/vllm - mces89 opened this issue 21 days ago

[Bugfix] Fix OpenVino/Neuron `driver_worker` init

github.com/vllm-project/vllm - NickLucche opened this pull request 22 days ago

[Bugfix] Fix Idefics3 bug

github.com/vllm-project/vllm - jeejeelee opened this pull request 22 days ago

Prepare sin/cos buffers for rope outside model forward

github.com/vllm-project/vllm - tzielinski-habana opened this pull request 22 days ago

[Model]: add some tests for aria model

github.com/vllm-project/vllm - xffxff opened this pull request 22 days ago

[Model] Replace embedding models with pooling adapter

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 22 days ago

[Platform] Move `async output` check to platform

github.com/vllm-project/vllm - wangxiyuan opened this pull request 22 days ago

Drop ROCm load format check

github.com/vllm-project/vllm - wangxiyuan opened this pull request 22 days ago

[feature]:upstream quark format to vllm

github.com/vllm-project/vllm - kewang-xlnx opened this pull request 22 days ago

[Bug]: idefics3 doesn't stream

github.com/vllm-project/vllm - sjuxax opened this issue 22 days ago