Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[Model] Port over CLIPVisionModel for VLMs

github.com/vllm-project/vllm - ywang96 opened this pull request 4 months ago
[Hardware][Intel] Add AWQ support for CPU backend

github.com/vllm-project/vllm - zhouyuan opened this pull request 4 months ago
[Kernel] Add punica dimensions for Granite 13b

github.com/vllm-project/vllm - joerunde opened this pull request 4 months ago
[Usage]: how to use enable-chunked-prefill?

github.com/vllm-project/vllm - chenchunhui97 opened this issue 4 months ago
[ci] Diff check step

github.com/vllm-project/vllm - khluu opened this pull request 4 months ago
[CI/Build] Disable LLaVA-NeXT CPU test

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 4 months ago
[Core][Distributed] improve p2p cache generation

github.com/vllm-project/vllm - youkaichao opened this pull request 4 months ago
[CI/Build] [1/3] Reorganize entrypoints tests

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 4 months ago
[Core] Remove duplicate processing in async engine

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 4 months ago
[Misc] Fix arg names

github.com/vllm-project/vllm - AllenDou opened this pull request 4 months ago
bump version to v0.5.0.post1

github.com/vllm-project/vllm - simon-mo opened this pull request 4 months ago
Limit visible devices for 2gpu tests

github.com/vllm-project/vllm - khluu opened this pull request 4 months ago
[Misc] Log cudagraph memory usage

github.com/vllm-project/vllm - ymwangg opened this pull request 4 months ago
[Kernel] Update Cutlass int8 kernel configs for SM90

github.com/vllm-project/vllm - varun-sundar-rabindranath opened this pull request 4 months ago
[misc] fix format.sh

github.com/vllm-project/vllm - youkaichao opened this pull request 4 months ago
[CI/Build] Disable test_fp8.py

github.com/vllm-project/vllm - tlrmchlsmth opened this pull request 4 months ago
[Bugfix]typofix

github.com/vllm-project/vllm - AllenDou opened this pull request 4 months ago
[Kernel] Disable CUTLASS kernels for fp8

github.com/vllm-project/vllm - tlrmchlsmth opened this pull request 4 months ago
support load qwen2-72b-instruct lora

github.com/vllm-project/vllm - NiuBlibing opened this pull request 4 months ago
[Bug]: ray not work when tp>=2

github.com/vllm-project/vllm - Jimmy-Lu opened this issue 4 months ago
[Hardware][Intel] fp8 kv cache support for CPU

github.com/vllm-project/vllm - jikunshang opened this pull request 4 months ago
[Bug]: NCCL hangs and causes timeout

github.com/vllm-project/vllm - wjj19950828 opened this issue 4 months ago
[Misc] add code to get git hash info for vllm

github.com/vllm-project/vllm - dhuangnm opened this pull request 4 months ago
[CI/Build] Enable CPU test for VLMs

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 4 months ago
Add `cuda_device_count_stateless`

github.com/vllm-project/vllm - Yard1 opened this pull request 4 months ago
[Doc] Update documentation on Tensorizer

github.com/vllm-project/vllm - sangstar opened this pull request 4 months ago
[ci] Upload wheels

github.com/vllm-project/vllm - khluu opened this pull request 4 months ago
[misc] add hint for AttributeError

github.com/vllm-project/vllm - youkaichao opened this pull request 4 months ago
[Bug]: Torch2.3 run fail

github.com/vllm-project/vllm - lucasjinreal opened this issue 4 months ago
[Feature]: PagedAttention multiple of 8

github.com/vllm-project/vllm - barschiiii opened this issue 4 months ago
[Bug]: vllm v0.5.0 internal assert failed

github.com/vllm-project/vllm - changshivek opened this issue 4 months ago
[Model] Bert Embedding Model

github.com/vllm-project/vllm - laishzh opened this pull request 4 months ago
multilora_inference调用qwen2-1.5b报错

github.com/vllm-project/vllm - zigangzhao-ai opened this issue 4 months ago
[Bugfix] TYPE_CHECKING for MultiModalData

github.com/vllm-project/vllm - kimdwkimdw opened this pull request 4 months ago
[Bug]: v0.4.3 AsyncEngineDeadError

github.com/vllm-project/vllm - changshivek opened this issue 4 months ago
[Bugfix] Avoid to warmup when world size is 1

github.com/vllm-project/vllm - kerthcet opened this pull request 4 months ago
[Kernel] Add punica dimension for Qwen2 LoRA

github.com/vllm-project/vllm - jinzhen-lin opened this pull request 4 months ago