Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[V1] Add all_token_ids attribute to Request

github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago
Rename vllm.logging to vllm.logging_utils

github.com/vllm-project/vllm - flozi00 opened this pull request 3 months ago
[Kernel]Enable HPU for Speculative Decoding

github.com/vllm-project/vllm - xuechendi opened this pull request 3 months ago
[Mistral] FP8 format

github.com/vllm-project/vllm - patrickvonplaten opened this pull request 3 months ago
[WIP] Prefix Cache Aware Scheduling [1/n]

github.com/vllm-project/vllm - rickyyx opened this pull request 3 months ago
[V1][Bugfix] Propagate V1 LLMEngine properly

github.com/vllm-project/vllm - comaniac opened this pull request 3 months ago
[Core] Add padding-aware scheduling for 2D prefills

github.com/vllm-project/vllm - kzawora-intel opened this pull request 3 months ago
[CI/Build] Always run mypy

github.com/vllm-project/vllm - russellb opened this pull request 3 months ago
Fix quantization config of vl model

github.com/vllm-project/vllm - jinzhen-lin opened this pull request 3 months ago
[Doc]: follow the doc but got error

github.com/vllm-project/vllm - husheng-liu opened this issue 3 months ago
Add hf_transfer to testing image

github.com/vllm-project/vllm - mgoin opened this pull request 3 months ago
[Kernel]Generalize Speculative decode from Cuda

github.com/vllm-project/vllm - xuechendi opened this pull request 3 months ago
Splitting attention kernel file

github.com/vllm-project/vllm - maleksan85 opened this pull request 3 months ago
[Misc] Improve Web UI

github.com/vllm-project/vllm - rafvasq opened this pull request 3 months ago
[CI/Build] Automate PR body text cleanup

github.com/vllm-project/vllm - russellb opened this pull request 3 months ago
[Core] Add dynamic chunk size calculation

github.com/vllm-project/vllm - prashantgupta24 opened this pull request 3 months ago
[Build] Fix for the Wswitch-bool clang warning

github.com/vllm-project/vllm - gshtras opened this pull request 3 months ago
[Doc] Updated TPU install instructions

github.com/vllm-project/vllm - mikegre-google opened this pull request 3 months ago
[Kernel] Refactor Cutlass c3x

github.com/vllm-project/vllm - varun-sundar-rabindranath opened this pull request 3 months ago
[Benchmark] guided decoding

github.com/vllm-project/vllm - aarnphm opened this pull request 3 months ago
[0/N] Rename `MultiModalInputs` to `MultiModalKwargs`

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 3 months ago
Online video support for VLMs

github.com/vllm-project/vllm - litianjian opened this pull request 3 months ago
Adding cascade inference to vLLM

github.com/vllm-project/vllm - raywanb opened this pull request 3 months ago
[WIP] Ray Backend V1

github.com/vllm-project/vllm - rkooo567 opened this pull request 3 months ago
[Bugfix] Upgrade to pytorch 2.5.1

github.com/vllm-project/vllm - bnellnm opened this pull request 3 months ago
[Misc] Modify BNB parameter name

github.com/vllm-project/vllm - jeejeelee opened this pull request 3 months ago
[Misc]Reduce BNB static variable

github.com/vllm-project/vllm - jeejeelee opened this pull request 3 months ago
[Bugfix] Fix E2EL mean and median stats

github.com/vllm-project/vllm - daitran2k1 opened this pull request 3 months ago
[5/N] pass the whole config to model

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[4/N] make quant config first-class citizen

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[Bugfix][OpenVINO] Fix circular reference #9939

github.com/vllm-project/vllm - MengqingCao opened this pull request 3 months ago
[Bugfix] Fix `MQLLMEngine` hanging

github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 3 months ago
[V1] Prefix caching (take 2)

github.com/vllm-project/vllm - comaniac opened this pull request 3 months ago
[CI] Basic Integration Test For TPU

github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 3 months ago
[help wanted]: fix broken xverse model

github.com/vllm-project/vllm - youkaichao opened this issue 3 months ago
[Hardware][CPU] Add ARM CPU backend

github.com/vllm-project/vllm - ShawnD200 opened this pull request 3 months ago
[V1] Support per-request seed

github.com/vllm-project/vllm - njhill opened this pull request 3 months ago
[Doc] Add documentation for Structured Outputs

github.com/vllm-project/vllm - ismael-dm opened this pull request 3 months ago
Bump the patch-update group with 3 updates

github.com/vllm-project/vllm - dependabot[bot] opened this pull request 3 months ago
[Core]Add New Run:ai Streamer Load format.

github.com/vllm-project/vllm - pandyamarut opened this pull request 3 months ago
[Bug]: Phi-3 cannot be used with bitsandbytes

github.com/vllm-project/vllm - yananchen1989 opened this issue 3 months ago
[CI] Prune down LM Eval test time

github.com/vllm-project/vllm - mgoin opened this pull request 3 months ago
Doc: Improve benchmark documentation

github.com/vllm-project/vllm - rafvasq opened this pull request 3 months ago
[RFC] Propose a vulnerability management team

github.com/vllm-project/vllm - russellb opened this pull request 3 months ago