Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[Frontend] Kill the server on engine death
github.com/vllm-project/vllm - joerunde opened this pull request 3 months ago

[ Kernel ] FP8 Dynamic Per Token Quant - Add scale_ub
github.com/vllm-project/vllm - varun-sundar-rabindranath opened this pull request 3 months ago

[Bug]: Intel GPU Test failing in CI
github.com/vllm-project/vllm - tdoublep opened this issue 3 months ago

Fp8 dyn per tok
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 3 months ago

[Feature]: LLM2Vec (Fine-Tuned Embeddings) Support
github.com/vllm-project/vllm - DorotheaMueller opened this issue 3 months ago

[Bugfix][Frontend] remove duplicate init logger
github.com/vllm-project/vllm - dtrifiro opened this pull request 3 months ago

[Docs] Update docs for wheel location
github.com/vllm-project/vllm - simon-mo opened this pull request 3 months ago

Fp8 support for mi300x
github.com/vllm-project/vllm - ferrybaltimore opened this issue 3 months ago

[Bug]: CUDA Error when print
github.com/vllm-project/vllm - NaNillll opened this issue 3 months ago

[Model]: Llava-Next-Video support
github.com/vllm-project/vllm - TKONIY opened this issue 3 months ago

Upgrade to numpy >= 2.0.0
github.com/vllm-project/vllm - fgebhart opened this issue 3 months ago

add tqdm when loading checkpoint shards
github.com/vllm-project/vllm - zhaotyer opened this pull request 3 months ago

[Misc] Enhance prefix-caching benchmark tool
github.com/vllm-project/vllm - Jeffwan opened this pull request 3 months ago

[Bug]: Distributed Inference and Serving
github.com/vllm-project/vllm - warlockedward opened this issue 3 months ago

[New Model]: Mistral-Nemo
github.com/vllm-project/vllm - Hambaobao opened this issue 3 months ago

[ Misc ] `fbgemm` checkpoints
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 3 months ago

[Core] Allow specifying custom Executor
github.com/vllm-project/vllm - Yard1 opened this pull request 3 months ago

[ci][test] add correctness test for cpu offloading
github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago

[Model] Support Mistral-Nemo
github.com/vllm-project/vllm - mgoin opened this pull request 3 months ago

[ Kernel ] Enable Dynamic Per Token `fp8`
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 3 months ago

[CI/Build] bump ruff version, fix linting issues
github.com/vllm-project/vllm - dtrifiro opened this pull request 3 months ago

[CI/Build] replace yapf with ruff
github.com/vllm-project/vllm - dtrifiro opened this pull request 3 months ago

[Misc] Small perf improvements
github.com/vllm-project/vllm - Yard1 opened this pull request 3 months ago

[Model] Pipeline Parallel Support for DeepSeek v2
github.com/vllm-project/vllm - tjohnson31415 opened this pull request 3 months ago

FP8 Dynamic-Per-Token Quant
github.com/vllm-project/vllm - varun-sundar-rabindranath opened this pull request 3 months ago

[TPU] Refactor TPU worker & model runner
github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago

[Misc] Use `torch.Tensor` for type annotation
github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago

[TPU] Remove multi-modal args in TPU backend
github.com/vllm-project/vllm - WoosukKwon opened this pull request 3 months ago

[New Model]: Support for Telechat
github.com/vllm-project/vllm - hzhaoy opened this issue 3 months ago

[Model] Add Support for GPTQ Fused MOE
github.com/vllm-project/vllm - izhuhaoran opened this pull request 3 months ago

deploying embedding model in same way as LLM
github.com/vllm-project/vllm - riyajatar37003 opened this issue 3 months ago

[core][model] yet another cpu offload implementation
github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago

[Bugfix] Fix for multinode crash on 4 PP
github.com/vllm-project/vllm - andoorve opened this pull request 3 months ago

[Bug]: The metrics have not improved.
github.com/vllm-project/vllm - zjjznw123 opened this issue 3 months ago

Sequence parallel
github.com/vllm-project/vllm - wbdr opened this pull request 3 months ago

[Not for review]test gemma lora
github.com/vllm-project/vllm - jeejeelee opened this pull request 3 months ago

[misc][distributed] add seed to dummy weights
github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago

[CI/Build] Update flashinfer to v0.0.9 (#6489)
github.com/vllm-project/vllm - 170928 opened this pull request 3 months ago

[misc][distributed] improve tests
github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago

[ Kernel ] Fp8 Channelwise Weight Support
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 3 months ago

[Model] Support Mamba
github.com/vllm-project/vllm - tlrmchlsmth opened this pull request 3 months ago

[Not for review] Spmd tp rebase
github.com/vllm-project/vllm - ruisearch42 opened this pull request 3 months ago

[ROCm] Cleanup Dockerfile and remove outdated patch
github.com/vllm-project/vllm - hongxiayang opened this pull request 3 months ago

[New Model]: Codestral Mamba
github.com/vllm-project/vllm - K-Mistele opened this issue 3 months ago

[Bug]: Gemma 27B crashes on GCP A100
github.com/vllm-project/vllm - noamgat opened this issue 3 months ago

[Misc][Speculative decoding] Typos and typing fixes
github.com/vllm-project/vllm - ShangmingCai opened this pull request 3 months ago

unable to run vllm model deployment
github.com/vllm-project/vllm - riyajatar37003 opened this issue 3 months ago

[Bugfix][Frontend] Fix missing `/metrics` endpoint
github.com/vllm-project/vllm - DarkLight1337 opened this pull request 3 months ago