Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[Doc]: Marlin does not support weight_bits = uint4b8
github.com/vllm-project/vllm - xiaotukuaipao12318 opened this issue about 2 months ago

Release v0.6.0
github.com/vllm-project/vllm - simon-mo opened this issue about 2 months ago

[Misc] add iteration_tokens metric
github.com/vllm-project/vllm - LucasWilkinson opened this pull request about 2 months ago

[New Model]: Qwen2-VL
github.com/vllm-project/vllm - krevas opened this issue about 2 months ago

[Misc] GPTQ Activation Ordering
github.com/vllm-project/vllm - kylesayrs opened this pull request about 2 months ago

[CI/Build] Use python 3.12 in cuda image
github.com/vllm-project/vllm - joerunde opened this pull request about 2 months ago

Adding Cascade Infer to FlashInfer
github.com/vllm-project/vllm - raywanb opened this pull request about 2 months ago

[MISC] Consolidate FP8 kv-cache tests
github.com/vllm-project/vllm - comaniac opened this pull request about 2 months ago

[Core][WIP] MPLLMEngine with async streaming (depends on 8090)
github.com/vllm-project/vllm - alexm-neuralmagic opened this pull request about 2 months ago

➕ add peft to common requirements
github.com/vllm-project/vllm - prashantgupta24 opened this pull request about 2 months ago

[Bugfix] Fix weight loading for the unfused pathway
github.com/vllm-project/vllm - dsikka opened this pull request about 2 months ago

[Bugfix] Fix bug in detokenizer.py
github.com/vllm-project/vllm - cafeii opened this pull request about 2 months ago

[Bugfix] Fix bug in detokenizer.py
github.com/vllm-project/vllm - cafeii opened this pull request about 2 months ago

Iboiko/flatpa blocksnumber
github.com/vllm-project/vllm - iboiko-habana opened this pull request about 2 months ago

[Bugfix] remove post_layernorm in siglip
github.com/vllm-project/vllm - wnma3mz opened this pull request about 2 months ago

chore: Update check-wheel-size.py to read MAX_SIZE_MB from env
github.com/vllm-project/vllm - haitwang-cloud opened this pull request about 2 months ago

[Frontend] Multimodal support in offline chat
github.com/vllm-project/vllm - DarkLight1337 opened this pull request about 2 months ago

[Core][Bugfix][Perf] Refactor Server to Avoid `AsyncLLMEngine`
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request about 2 months ago

[Bug]: vLLM 0.5.5 and FlashInfer0.1.6
github.com/vllm-project/vllm - wlwqq opened this issue about 2 months ago

[Bug]: when tensor-parallel-size>1,Stuck
github.com/vllm-project/vllm - wiluen opened this issue about 2 months ago

[Usage]: How to stop vllm serving properly?
github.com/vllm-project/vllm - phisinger opened this issue about 2 months ago

[Model] LoRA with lm_head fully trained
github.com/vllm-project/vllm - sergeykochetkov opened this pull request about 2 months ago

[Usage]: VLLM start
github.com/vllm-project/vllm - ChinChyi opened this issue about 2 months ago

[Bug]: RuntimeError: CUDA error: invalid argument
github.com/vllm-project/vllm - fengyang95 opened this issue about 2 months ago

[New Model]: quantized Qwen2 MoE models
github.com/vllm-project/vllm - BrenchCC opened this issue about 2 months ago

[Misc] Clean up RoPE forward_native
github.com/vllm-project/vllm - WoosukKwon opened this pull request about 2 months ago

[Feature]: Faster guided decoding for pre-defined output
github.com/vllm-project/vllm - captify-sivakhno opened this issue about 2 months ago

Add smoothquant support
github.com/vllm-project/vllm - ehartford opened this issue about 2 months ago

[New Model]: FM9GForCausalLM
github.com/vllm-project/vllm - Aiwenqiuyu opened this issue about 2 months ago

[Feature]: Beam Search with Temperature > 0
github.com/vllm-project/vllm - ekurtulus opened this issue about 2 months ago

[Misc] Optional installation of audio related packages
github.com/vllm-project/vllm - ywang96 opened this pull request about 2 months ago

[Frontend] Add progress reporting to run_batch.py
github.com/vllm-project/vllm - alugowski opened this pull request about 2 months ago

[BugFix][Core] Multistep Fix Crash on Request Cancellation
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request about 2 months ago

[Bug]: vllm0.4.3 guided decoding
github.com/vllm-project/vllm - TangJiakai opened this issue about 2 months ago

[Core][Bugfix] Accept GGUF model without .gguf extension
github.com/vllm-project/vllm - Isotr0py opened this pull request about 2 months ago

[Bugfix] Fix internlm2 tensor parallel inference
github.com/vllm-project/vllm - Isotr0py opened this pull request about 2 months ago

[Hardware][Ascend] Add Ascend NPU backend
github.com/vllm-project/vllm - wangshuai09 opened this pull request about 2 months ago

[Bugfix] Fix import error in Phi-3.5-MoE
github.com/vllm-project/vllm - DarkLight1337 opened this pull request about 2 months ago

[Bug]: flakey test found in #7874
github.com/vllm-project/vllm - noooop opened this issue about 2 months ago

[Core] Optimize Async + Multi-step
github.com/vllm-project/vllm - alexm-neuralmagic opened this pull request about 2 months ago

[not-for-review] test PR
github.com/vllm-project/vllm - khluu opened this pull request about 2 months ago

[WIP, Kernel] (3/N) Machete W4A8
github.com/vllm-project/vllm - LucasWilkinson opened this pull request about 2 months ago

[CI/Build] Use uv in the Dockerfile
github.com/vllm-project/vllm - mgoin opened this pull request about 2 months ago

[CI/Build][Kernel] Update CUTLASS to 3.5.1 tag
github.com/vllm-project/vllm - tlrmchlsmth opened this pull request about 2 months ago

[cleanup] remove engine-use-ray
github.com/vllm-project/vllm - simon-mo opened this pull request about 2 months ago

[Performance]: Sampler is too slow?
github.com/vllm-project/vllm - niuzheng168 opened this issue about 2 months ago

RayServe TPU example
github.com/vllm-project/vllm - richardsliu opened this pull request about 2 months ago

[Bugfix] Fix ModelScope models in v0.5.5
github.com/vllm-project/vllm - NickLucche opened this pull request about 2 months ago

[Feature]: Contribute T5 model to vLLM
github.com/vllm-project/vllm - shivance opened this issue about 2 months ago

[TPU][Bugfix] Fix tpu type api
github.com/vllm-project/vllm - WoosukKwon opened this pull request about 2 months ago

[Bugfix] Fix import error in Exaone model
github.com/vllm-project/vllm - DarkLight1337 opened this pull request about 2 months ago

[Kernel] Enable 8-bit weights in Fused Marlin MoE
github.com/vllm-project/vllm - ElizaWszola opened this pull request about 2 months ago