Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[Kernel] Enhance MoE benchmarking & tuning script

github.com/vllm-project/vllm - WoosukKwon opened this pull request 5 months ago
Virtual Office Hours: Jun 5 and Jun 20

github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this issue 5 months ago
[Bugfix] Fix dummy weight for fp8

github.com/vllm-project/vllm - mzusman opened this pull request 5 months ago
[Bug]: Phi3 lora module not loading

github.com/vllm-project/vllm - arunpatala opened this issue 5 months ago
[CI/Build] Make marlin kernel build conditional.

github.com/vllm-project/vllm - esmeetu opened this pull request 5 months ago
Update test_ignore_eos

github.com/vllm-project/vllm - simon-mo opened this pull request 5 months ago
feat: Add batch API

github.com/vllm-project/vllm - shehraj123 opened this pull request 5 months ago
v0.4.3 Release Tracker

github.com/vllm-project/vllm - simon-mo opened this issue 5 months ago
[Bugfix] Relax tiktoken to >= 0.6.0

github.com/vllm-project/vllm - mgoin opened this pull request 5 months ago
[Core] Sharded State Loader download from HF

github.com/vllm-project/vllm - aurickq opened this pull request 5 months ago
[Model] Add Phi-2 LoRA support

github.com/vllm-project/vllm - Isotr0py opened this pull request 5 months ago
[Bugfix] Fix with verifying model max len

github.com/vllm-project/vllm - dimaioksha opened this pull request 5 months ago
[Build/CI] Extending AMD Tests

github.com/vllm-project/vllm - Alexei-V-Ivanov-AMD opened this pull request 5 months ago
[Draft][CI/Build] Optimize models tests

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 5 months ago
[Misc] remove old comments

github.com/vllm-project/vllm - youkaichao opened this pull request 5 months ago
[Feature]: add local_files_only parameter

github.com/vllm-project/vllm - yananchen1989 opened this issue 5 months ago
[Doc] Highlight the fourth meetup in the README

github.com/vllm-project/vllm - zhuohan123 opened this pull request 5 months ago
Add a new kernel for fusing the dequantization in fused-moe gemm

github.com/vllm-project/vllm - RezaYazdaniAminabadi opened this pull request 5 months ago
[Build/CI] Enabling AMD Entrypoints Test

github.com/vllm-project/vllm - Alexei-V-Ivanov-AMD opened this pull request 5 months ago
[Bug]: llava inference result is wrong !

github.com/vllm-project/vllm - xiaoyudxy opened this issue 5 months ago
Support to serve vLLM on Kubernetes with LWS

github.com/vllm-project/vllm - kerthcet opened this pull request 5 months ago
[Bugfix] Avoid circular import in model loader

github.com/vllm-project/vllm - hiyouga opened this pull request 5 months ago
[Feature]: rope_scaling for qwen2

github.com/vllm-project/vllm - HappyLynn opened this issue 5 months ago
[Bug]: Llama 3 - Out of memory - RTX 4060 TI

github.com/vllm-project/vllm - savi8sant8s opened this issue 5 months ago
temporarily prioritize xformer for lora test

github.com/vllm-project/vllm - rkooo567 opened this pull request 5 months ago
[Core][Distributed] remove graph mode function

github.com/vllm-project/vllm - youkaichao opened this pull request 5 months ago
Add 4th meetup announcement to readme

github.com/vllm-project/vllm - simon-mo opened this pull request 5 months ago
Add marlin unit tests and marlin benchmark script

github.com/vllm-project/vllm - alexm-nm opened this pull request 5 months ago
[Bugfix][Model] Add base class for vision-language models

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 5 months ago
[Bugfix][Doc] Fix CI failure in docs

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 5 months ago
[Performance]: Deepseek-v2 support

github.com/vllm-project/vllm - ZixinxinWang opened this issue 5 months ago
[Doc] Add page for `PoolingParams`

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 5 months ago
[Doc] Shorten README by removing supported model list

github.com/vllm-project/vllm - zhuohan123 opened this pull request 5 months ago
[Frontend] Support OpenAI batch file format

github.com/vllm-project/vllm - wuisawesome opened this pull request 5 months ago
[CI/Build] PEP 517/518 improvements

github.com/vllm-project/vllm - dtrifiro opened this pull request 5 months ago
Add GPTQ Marlin 2:4 sparse structured support

github.com/vllm-project/vllm - alexm-neuralmagic opened this pull request 5 months ago
[Kernel] add bfloat16 support for gptq marlin kernel

github.com/vllm-project/vllm - jinzhen-lin opened this pull request 5 months ago
[Lora] Support long context lora

github.com/vllm-project/vllm - rkooo567 opened this pull request 5 months ago