Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[Doc] Fix typo in AMD installation guide

github.com/vllm-project/vllm - Imss27 opened this pull request 27 days ago
[Core] Rename input data types

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 27 days ago
[Doc]: Is Qwen2-VL-72B supported?

github.com/vllm-project/vllm - pseudotensor opened this issue 28 days ago
[Hardware][AWS] update neuron to 2.20

github.com/vllm-project/vllm - omrishiv opened this pull request 28 days ago
[Hardware][AMD] ROCm6.2 upgrade

github.com/vllm-project/vllm - hongxiayang opened this pull request 28 days ago
[Doc] neuron documentation update

github.com/vllm-project/vllm - omrishiv opened this pull request 28 days ago
[Kernel] Split Marlin MoE kernels into multiple files

github.com/vllm-project/vllm - ElizaWszola opened this pull request 28 days ago
[Model] Expose Phi3v num_crops as a mm_processor_kwarg

github.com/vllm-project/vllm - alex-jw-brooks opened this pull request 28 days ago
[Bugfix] Refactor composite weight loading logic

github.com/vllm-project/vllm - Isotr0py opened this pull request 28 days ago
[Bug]: RuntimeError in gptq_marlin_24_gemm

github.com/vllm-project/vllm - leoyuppieqnew opened this issue 28 days ago
[Misc] add non cuda hf benchmark_througput

github.com/vllm-project/vllm - park12sj opened this pull request 28 days ago
[Misc] Show AMD GPU topology in `collect_env.py`

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 28 days ago
[Core] CUDA Graphs for Multi-Step + Chunked-Prefill

github.com/vllm-project/vllm - varun-sundar-rabindranath opened this pull request 29 days ago
Create SECURITY.md

github.com/vllm-project/vllm - simon-mo opened this pull request 29 days ago
[Bugfix][Core] Fix tekken edge case for mistral tokenizer

github.com/vllm-project/vllm - patrickvonplaten opened this pull request 29 days ago
[Feature]: Output logps of given output

github.com/vllm-project/vllm - lycheeyolo opened this issue 29 days ago
[Doc] Add documentation for GGUF quantization

github.com/vllm-project/vllm - Isotr0py opened this pull request 29 days ago
[Bugfix] Use heartbeats instead of health checks

github.com/vllm-project/vllm - joerunde opened this pull request 29 days ago
[Feature]: DRY Sampling

github.com/vllm-project/vllm - Shreyansh1311 opened this issue 30 days ago
[Core] Allow IPv6 in VLLM_HOST_IP with zmq

github.com/vllm-project/vllm - russellb opened this pull request 30 days ago
[Usage]:

github.com/vllm-project/vllm - lauhaide opened this issue 30 days ago
[Feature]: Offline quantization for Pixtral-12B

github.com/vllm-project/vllm - KohakuBlueleaf opened this issue 30 days ago
[Misc]: Create ProfileConfig for Profiling

github.com/vllm-project/vllm - sylviayangyy opened this issue about 1 month ago
[Bug]: Profiling RuntimeError when `with_stack=True`

github.com/vllm-project/vllm - sylviayangyy opened this issue about 1 month ago
[MISC] add support custom_op check

github.com/vllm-project/vllm - jikunshang opened this pull request about 1 month ago
[Misc] Add argument to disable FastAPI docs

github.com/vllm-project/vllm - Jeffwan opened this pull request about 1 month ago
[doc] improve installation doc

github.com/vllm-project/vllm - youkaichao opened this pull request about 1 month ago
Enabling Agent Splitting.

github.com/vllm-project/vllm - Alexei-V-Ivanov-AMD opened this pull request about 1 month ago
[Bugfix] Validate SamplingParam n is an int

github.com/vllm-project/vllm - saumya-saran opened this pull request about 1 month ago
[Feature]: Quantisation Support with CPU Backend

github.com/vllm-project/vllm - Christofon opened this issue about 1 month ago
Dry sample

github.com/vllm-project/vllm - alxiang opened this pull request about 1 month ago
[Bugfix] Fix TP > 1 for new granite

github.com/vllm-project/vllm - joerunde opened this pull request about 1 month ago
[Core] zmq: bind only to localhost for local-only usage

github.com/vllm-project/vllm - russellb opened this pull request about 1 month ago
[CI/Build] fix Dockerfile.cpu on podman

github.com/vllm-project/vllm - dtrifiro opened this pull request about 1 month ago
[Installation]: Assets v0.6 for cuda 12+

github.com/vllm-project/vllm - GennVa opened this issue about 1 month ago
[CI/Build] Avoid CUDA initialization

github.com/vllm-project/vllm - DarkLight1337 opened this pull request about 1 month ago
ppc64le: Dockerfile and CI fix

github.com/vllm-project/vllm - sumitd2 opened this pull request about 1 month ago
[torch.compile] register allreduce operations as custom ops

github.com/vllm-project/vllm - youkaichao opened this pull request about 1 month ago
[Frontend] Improve Nullable kv Arg Parsing

github.com/vllm-project/vllm - alex-jw-brooks opened this pull request about 1 month ago
[refactor] remove triton based sampler

github.com/vllm-project/vllm - simon-mo opened this pull request about 1 month ago
[Feature]: APC introspection interface

github.com/vllm-project/vllm - lun-4 opened this issue about 1 month ago
Adding metrics to external cache services

github.com/vllm-project/vllm - happyandslow opened this pull request about 1 month ago
[CI/Build] Excluding kernels/test_gguf.py from ROCm

github.com/vllm-project/vllm - alexeykondrat opened this pull request about 1 month ago