Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

Rahul quant merged

github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 3 months ago
[Perf] Reduce peak memory usage of llama

github.com/vllm-project/vllm - andoorve opened this pull request 3 months ago
[bugfix] Fix static asymmetric quantization case

github.com/vllm-project/vllm - ProExpertProg opened this pull request 3 months ago
[Tool parsing] Improve / correct mistral tool parsing

github.com/vllm-project/vllm - patrickvonplaten opened this pull request 3 months ago
Nir b2b latest

github.com/vllm-project/vllm - nirda7 opened this pull request 3 months ago
[Docs] Publish meetup slides

github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago
[Feature] enable host memory for kv cache

github.com/vllm-project/vllm - YZP17121579 opened this pull request 3 months ago
Rs 24 sparse

github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 3 months ago
[Bugfix] Fix unable to load some models

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 3 months ago
[Model] Support telechat2

github.com/vllm-project/vllm - shunxing12345 opened this pull request 3 months ago
[TPU] Implement prefix caching for TPUs

github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago
[Model] Add Support for Multimodal Granite Models

github.com/vllm-project/vllm - alex-jw-brooks opened this pull request 3 months ago
[Feature]: 2D TP & EP

github.com/vllm-project/vllm - WenhaoHe02 opened this issue 3 months ago
[Misc] Update benchmark to support image_url file or http

github.com/vllm-project/vllm - kakao-steve-ai opened this pull request 3 months ago
[CI/Build] Make shellcheck happy

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 3 months ago
Bump to compressed-tensors v0.8.0

github.com/vllm-project/vllm - dsikka opened this pull request 3 months ago
Bump to `compressed-tensors` v0.8.0

github.com/vllm-project/vllm - dsikka opened this pull request 3 months ago
[core][distributed] use tcp store directly

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[help wanted]: add QwenModel to ci tests

github.com/vllm-project/vllm - youkaichao opened this issue 3 months ago
[V1] Fix CI tests on V1 engine

github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago
Revert "[ci][build] limit cmake version"

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[doc] improve debugging doc

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[V1] Enable Inductor when using piecewise CUDA graphs

github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago
[doc] fix location of runllm widget

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[TPU] Use numpy to compute slot mapping

github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago
[Doc] Fix typo in arg_utils.py

github.com/vllm-project/vllm - xyang16 opened this pull request 3 months ago
[Bug]: qwen cannot be quantized in vllm

github.com/vllm-project/vllm - yananchen1989 opened this issue 3 months ago
[Bugfix] Fix QwenModel argument

github.com/vllm-project/vllm - DamonFool opened this pull request 3 months ago
[Feature]: 2:4 sparsity + w4a16 support

github.com/vllm-project/vllm - arunpatala opened this issue 3 months ago
[Usage]:Qwen2-VL not support Lora

github.com/vllm-project/vllm - menglrskr opened this issue 3 months ago
[Misc]Fix Idefics3Model argument

github.com/vllm-project/vllm - jeejeelee opened this pull request 3 months ago
[misc] Layerwise profile updates

github.com/vllm-project/vllm - varun-sundar-rabindranath opened this pull request 3 months ago
[V1] TPU Prototype

github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 3 months ago
[Hardware] [HPU]add `mark_step` for hpu

github.com/vllm-project/vllm - jikunshang opened this pull request 3 months ago
[Bugfix] Fix for Spec model TP + Chunked Prefill

github.com/vllm-project/vllm - andoorve opened this pull request 3 months ago
[V1] Enable custom ops with piecewise CUDA graphs

github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago
[6/N] pass whole config to inner model

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[Docs] Misc updates to TPU installation instructions

github.com/vllm-project/vllm - mikegre-google opened this pull request 3 months ago
[Doc] Move PR template content to docs

github.com/vllm-project/vllm - russellb opened this pull request 3 months ago
Fix missing data type in flashinfer prefill

github.com/vllm-project/vllm - reyoung opened this pull request 3 months ago
[Bug]: Outlines w/ Mistral

github.com/vllm-project/vllm - matbee-eth opened this issue 3 months ago