Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[misc] Optimize speculative decoding

github.com/vllm-project/vllm - jacob-crux opened this pull request about 2 months ago
[CI/Build] Added OpenVINO backend tests run

github.com/vllm-project/vllm - ilya-lavrenov opened this pull request about 2 months ago
[Usage]: set num_crops in LVLM

github.com/vllm-project/vllm - Liyan06 opened this issue about 2 months ago
Inclusion of InternVLChatModel In PP_SUPPORTED_MODELS(Pipeline Parallelism)

github.com/vllm-project/vllm - Manikandan-Thangaraj-ZS0321 opened this pull request about 2 months ago
[Bug]: minicpmv2_6 OOM

github.com/vllm-project/vllm - Howe-Young opened this issue about 2 months ago
[ci][test] fix RemoteOpenAIServer

github.com/vllm-project/vllm - youkaichao opened this pull request about 2 months ago
[Bugfix][CI/Build] Fix model name being overwritten

github.com/vllm-project/vllm - DarkLight1337 opened this pull request about 2 months ago
[Misc] Remove snapshot_download usage in InternVL2 test

github.com/vllm-project/vllm - Isotr0py opened this pull request about 2 months ago
[ci][test] exclude model download time in server start time

github.com/vllm-project/vllm - youkaichao opened this pull request about 2 months ago
[BUG]: Support AI21-Jamba-1.5-Large (and mini)

github.com/vllm-project/vllm - pseudotensor opened this issue about 2 months ago
[misc][core] lazy import outlines

github.com/vllm-project/vllm - youkaichao opened this pull request about 2 months ago
[Bug]: Nemotron 340B does not generated EOS token

github.com/vllm-project/vllm - natolambert opened this issue about 2 months ago
[Bug]: tool_calls parsing error with CPU

github.com/vllm-project/vllm - xuechendi opened this issue about 2 months ago
[WIP][Model] Add support for multiple audio chunks/audio URLs

github.com/vllm-project/vllm - petersalas opened this pull request about 2 months ago
[Bugfix][Intel] Fix XPU Dockerfile Build

github.com/vllm-project/vllm - tylertitsworth opened this pull request about 2 months ago
Bump version to v0.5.5

github.com/vllm-project/vllm - simon-mo opened this pull request about 2 months ago
[CI/Build] Reorganize models tests

github.com/vllm-project/vllm - DarkLight1337 opened this pull request about 2 months ago
[MODEL] add Exaone model support

github.com/vllm-project/vllm - nayohan opened this pull request about 2 months ago
[MODEL] add Exaone model support

github.com/vllm-project/vllm - nayohan opened this pull request about 2 months ago
[Core] Chunked Prefill support for Multi Step Scheduling

github.com/vllm-project/vllm - varun-sundar-rabindranath opened this pull request about 2 months ago
[Bug]: Docker build for ROCm fails for latest release and main branch

github.com/vllm-project/vllm - Spurthi-Bhat-ScalersAI opened this issue about 2 months ago
[Hardware][Intel GPU] Add intel GPU pipeline parallel support.

github.com/vllm-project/vllm - jikunshang opened this pull request about 2 months ago
[github][misc] promote asking llm first

github.com/vllm-project/vllm - youkaichao opened this pull request about 2 months ago
[misc] Add Torch profiler support for CPU-only devices

github.com/vllm-project/vllm - DamonFool opened this pull request about 2 months ago
[Misc] Update `qqq` to use vLLMParameters

github.com/vllm-project/vllm - dsikka opened this pull request about 2 months ago
[Bug]: falcon-40B model support

github.com/vllm-project/vllm - jikunshang opened this issue about 2 months ago
[Misc] Update `marlin` to use vLLMParameters

github.com/vllm-project/vllm - dsikka opened this pull request about 2 months ago
[core][torch.compile] not compile for profiling

github.com/vllm-project/vllm - youkaichao opened this pull request about 2 months ago
[Bug]: Critical distributed executor bug

github.com/vllm-project/vllm - clintg6 opened this issue about 2 months ago
[Core] Add multi-step support to LLMEngine

github.com/vllm-project/vllm - alexm-neuralmagic opened this pull request about 2 months ago
[Bug]: install vllm ocurr the building error

github.com/vllm-project/vllm - Mysnake opened this issue about 2 months ago
[Bug]: llama3-405b-fp8 NCCL communication

github.com/vllm-project/vllm - wangwensuo opened this issue about 2 months ago
[Misc] Update `gptq_marlin_24` to use vLLMParameters

github.com/vllm-project/vllm - dsikka opened this pull request about 2 months ago
Add more percentiles and latencies

github.com/vllm-project/vllm - wschin opened this pull request 2 months ago
[Kernel] Add torch custom op for all_reduce

github.com/vllm-project/vllm - SageMoore opened this pull request 2 months ago
[BugFix] Fix server crash on empty prompt

github.com/vllm-project/vllm - maxdebayser opened this pull request 2 months ago
Combine async postprocessor and multi-step - first WIP version

github.com/vllm-project/vllm - alexm-neuralmagic opened this pull request 2 months ago
[Usage]: About bitsandbytes

github.com/vllm-project/vllm - emreekmekcioglu1 opened this issue 2 months ago
[Frontend]-config-cli-args

github.com/vllm-project/vllm - KaunilD opened this pull request 2 months ago
[Bugfix] chat method add_generation_prompt param

github.com/vllm-project/vllm - brian14708 opened this pull request 2 months ago
WIP

github.com/vllm-project/vllm - patrickvonplaten opened this pull request 2 months ago
[Bugfix] Pass PYTHONPATH from setup.py to CMake

github.com/vllm-project/vllm - sasha0552 opened this pull request 2 months ago
[Model] Adding support for MSFT Phi-3.5-MoE

github.com/vllm-project/vllm - wenxcs opened this pull request 2 months ago
[New Model]: MiniCPM-V-2_6-int4

github.com/vllm-project/vllm - tangent2018 opened this issue 2 months ago
[Model] 1.58bits BitNet Model Support

github.com/vllm-project/vllm - LeiWang1999 opened this pull request 2 months ago
[Bugfix] Mirror jinja2 in pyproject.toml

github.com/vllm-project/vllm - sasha0552 opened this pull request 2 months ago