Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[Model] Clean up MiniCPMV

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 2 months ago
[Misc] typo find in sampling_metadata.py

github.com/vllm-project/vllm - noooop opened this pull request 2 months ago
[New Model]: Qwen/QwQ-32B-Preview

github.com/vllm-project/vllm - SionicAI-Engineering opened this issue 2 months ago
[doc]Update config docstring

github.com/vllm-project/vllm - wangxiyuan opened this pull request 2 months ago
[WIP][V1] Ray executor

github.com/vllm-project/vllm - rkooo567 opened this pull request 2 months ago
[Bugfix] Fix BNB loader target_modules

github.com/vllm-project/vllm - jeejeelee opened this pull request 2 months ago
[CI]add genai-perf benchmark in nightly benchmark

github.com/vllm-project/vllm - jikunshang opened this pull request 2 months ago
[Model] Implement merged input processor for LLaVA model

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 2 months ago
[RFC]: Make any vLLM model a pooling model

github.com/vllm-project/vllm - DarkLight1337 opened this issue 2 months ago
[Doc] Add github links for source code references

github.com/vllm-project/vllm - russellb opened this pull request 2 months ago
[Bug]: Llama 3.2 90b crash

github.com/vllm-project/vllm - yessenzhar opened this issue 2 months ago
[core] improve cpu offloading implementation

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[Bug]: Authorization ignored when root_path is set

github.com/vllm-project/vllm - chaunceyjiang opened this pull request 3 months ago
[fix] Correct num_accepted_tokens counting

github.com/vllm-project/vllm - KexinFeng opened this pull request 3 months ago
[doc] update the code to add models

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[Misc]Further reduce BNB static variable

github.com/vllm-project/vllm - jeejeelee opened this pull request 3 months ago
[Interleaved ATTN] Support for Mistral-8B

github.com/vllm-project/vllm - patrickvonplaten opened this pull request 3 months ago
[Bug]: Duplicate request_id breaks the engine

github.com/vllm-project/vllm - tjohnson31415 opened this issue 3 months ago
[Core] Update to outlines >= 0.1.8

github.com/vllm-project/vllm - russellb opened this pull request 3 months ago
[Usage]: Fail to load config.json

github.com/vllm-project/vllm - dequeueing opened this issue 3 months ago
Add Sageattention backend

github.com/vllm-project/vllm - flozi00 opened this pull request 3 months ago
[8/N] enable cli flag without a space

github.com/vllm-project/vllm - youkaichao opened this pull request 3 months ago
[Bug]: Gemma2 becomes a fool.

github.com/vllm-project/vllm - Foreist opened this issue 3 months ago
[Kernel] Register punica ops directly

github.com/vllm-project/vllm - jeejeelee opened this pull request 3 months ago
[Model]: Add support for Aria model

github.com/vllm-project/vllm - xffxff opened this pull request 3 months ago
[Doc] fix a small typo in docstring of llama_tool_parser

github.com/vllm-project/vllm - FerdinandZhong opened this pull request 3 months ago
[Feature]: Multimodel prefix-caching features

github.com/vllm-project/vllm - justzhanghong opened this issue 3 months ago
[Usage]:

github.com/vllm-project/vllm - Lukas-123 opened this issue 3 months ago
[Platforms] Add `device_type` in `Platform`

github.com/vllm-project/vllm - MengqingCao opened this pull request 3 months ago
Need to update the jax and jaxlib version

github.com/vllm-project/vllm - vanbasten23 opened this pull request 3 months ago
Turn on V1 for H200 build

github.com/vllm-project/vllm - simon-mo opened this pull request 3 months ago
[Model] Add OLMo November 2024 model

github.com/vllm-project/vllm - 2015aroras opened this pull request 3 months ago
Support softcap in ROCm Flash Attention

github.com/vllm-project/vllm - hliuca opened this pull request 3 months ago
[CI/Build] Dockerfile build for ARM64 / GH200

github.com/vllm-project/vllm - drikster80 opened this pull request 3 months ago
[Bugfix] GPU memory profiling should be per LLM instance

github.com/vllm-project/vllm - tjohnson31415 opened this pull request 3 months ago
[Frontend] Add Command-R and Llama-3 chat template

github.com/vllm-project/vllm - ccs96307 opened this pull request 3 months ago