Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

Add bitsandbytes fp4 support

github.com/vllm-project/vllm - thesues opened this pull request 2 months ago
[Kernel] Flashinfer correctness fix for v0.1.3

github.com/vllm-project/vllm - LiuXiaoxuanPKU opened this pull request 2 months ago
[Docs] Update readme

github.com/vllm-project/vllm - simon-mo opened this pull request 2 months ago
[Bugfix][Kernel] Increased atol to fix failing tests

github.com/vllm-project/vllm - ProExpertProg opened this pull request 2 months ago
[Usage]: how to save sharded state?

github.com/vllm-project/vllm - aldwnesx opened this issue 2 months ago
[Bug]: vllm hangs after model download / load

github.com/vllm-project/vllm - ArtificialEU opened this issue 2 months ago
[Core] Use Appropriate `torch.dtype` for FP8 KV Cache

github.com/vllm-project/vllm - jon-chuang opened this pull request 2 months ago
[CI/Build] Dockerfile.cpu improvements

github.com/vllm-project/vllm - dtrifiro opened this pull request 2 months ago
AsyncLLMEngine and LLMEngine

github.com/vllm-project/vllm - ngz-sun opened this issue 2 months ago
[Bugfix] Fix LoRA with PP

github.com/vllm-project/vllm - andoorve opened this pull request 2 months ago
[Feature]:

github.com/vllm-project/vllm - Jack-mi opened this issue 2 months ago
[Usage]: add multiple lora in docker

github.com/vllm-project/vllm - chintanshrinath opened this issue 2 months ago
[Kernel] Fix Flashinfer Correctness

github.com/vllm-project/vllm - LiuXiaoxuanPKU opened this pull request 2 months ago
[ Bugfix ] Fix Prometheus Metrics With `zeromq` Frontend

github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 2 months ago
[Misc] `compressed-tensors` code reuse

github.com/vllm-project/vllm - kylesayrs opened this pull request 2 months ago
[Model] Rename MiniCPMVQwen2 to MiniCPMV2.6

github.com/vllm-project/vllm - jeejeelee opened this pull request 2 months ago
[Kernel] AQ AZP 3/4: Asymmetric quantization kernels

github.com/vllm-project/vllm - ProExpertProg opened this pull request 2 months ago
[Bugfix] Fix new Llama3.1 GGUF model loading

github.com/vllm-project/vllm - Isotr0py opened this pull request 2 months ago
[Core] Support serving encoder/decoder models

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 2 months ago
[OpenVINO] migrate to latest dependencies versions

github.com/vllm-project/vllm - ilya-lavrenov opened this pull request 2 months ago
[mypy] Enable following imports for entrypoints

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 2 months ago
[RFC]: Initial support for RBLN NPU

github.com/vllm-project/vllm - rebel-jonghewk opened this issue 2 months ago
[Bug]: Miscalculation for ITL

github.com/vllm-project/vllm - AslanEZ opened this issue 2 months ago
[CI/Build][ROCm] Enabling tensorizer tests for ROCm

github.com/vllm-project/vllm - alexeykondrat opened this pull request 2 months ago
[wip][misc] register custom op for flash attention

github.com/vllm-project/vllm - youkaichao opened this pull request 2 months ago
[mypy] Enable mypy type checking for `vllm/core`

github.com/vllm-project/vllm - jberkhahn opened this pull request 2 months ago
[Model][LoRA]LoRA support added for MiniCPMV2.5

github.com/vllm-project/vllm - jeejeelee opened this pull request 2 months ago
Updating LM Format Enforcer version to v0.10.6

github.com/vllm-project/vllm - noamgat opened this pull request 2 months ago
[VLM][Model] TP support for ViTs

github.com/vllm-project/vllm - ChristopherCho opened this pull request 3 months ago
Suri vllm cpchung

github.com/vllm-project/vllm - chakpongchung opened this pull request 3 months ago
[Models] Add remaining model PP support

github.com/vllm-project/vllm - andoorve opened this pull request 3 months ago
[Bug]: vLLM latest version on Inf2 fails

github.com/vllm-project/vllm - ratnopamc opened this issue 3 months ago
[Performance] Optimize e2e overheads: Reduce python allocations

github.com/vllm-project/vllm - alexm-neuralmagic opened this pull request 3 months ago
[Bugfix][Frontend] Enable tools

github.com/vllm-project/vllm - tomeras91 opened this pull request 3 months ago
[Frontend] Support embeddings in the run_batch API

github.com/vllm-project/vllm - pooyadavoodi opened this pull request 3 months ago
[RFC]: vLLM plugin system

github.com/vllm-project/vllm - youkaichao opened this issue 3 months ago
add plugin

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[RFC]: Model architecture plugins

github.com/vllm-project/vllm - NadavShmayo opened this issue 3 months ago