Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[Model] Snowflake arctic model implementation
github.com/vllm-project/vllm - sfc-gh-hazhang opened this pull request 5 months ago

Support Deepseek-V2
github.com/vllm-project/vllm - zwd003 opened this pull request 5 months ago

[Scheduler] Warning upon preemption and Swapping
github.com/vllm-project/vllm - rkooo567 opened this pull request 5 months ago

[CORE] Adding support for insertion of soft-tuned prompts
github.com/vllm-project/vllm - SwapnilDreams100 opened this pull request 5 months ago

fix MiniCPM tie_word_embeddings
github.com/vllm-project/vllm - Receiling opened this pull request 5 months ago

[Frontend] Dynamic RoPE scaling
github.com/vllm-project/vllm - sasha0552 opened this pull request 5 months ago

[CI] Add llama 3 model test
github.com/vllm-project/vllm - rkooo567 opened this pull request 5 months ago

[Model] Add support for IBM Granite Code models
github.com/vllm-project/vllm - yikangshen opened this pull request 5 months ago

[CI] Add retry for agent lost
github.com/vllm-project/vllm - cadedaniel opened this pull request 6 months ago

Update lm-format-enforcer to 0.10.1
github.com/vllm-project/vllm - noamgat opened this pull request 6 months ago

[Feature]: MLA Support
github.com/vllm-project/vllm - chengtbf opened this issue 6 months ago

[Misc]: int4 support on CPU backend
github.com/vllm-project/vllm - leiwen83 opened this issue 6 months ago

[Bugfix] Fix `asyncio.Task` not being subscriptable
github.com/vllm-project/vllm - DarkLight1337 opened this pull request 6 months ago

[Usage]: doubt on computational complexity
github.com/vllm-project/vllm - Juelianqvq opened this issue 6 months ago

Main backup 2024 05 05
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 6 months ago

Upstream sync 2024 05 05
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 6 months ago

Revert to previous main
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this pull request 6 months ago

chunked-prefill-doc-syntax
github.com/vllm-project/vllm - simon-mo opened this pull request 6 months ago

[CI/Build] from scratch build for dockerfile
github.com/vllm-project/vllm - youkaichao opened this pull request 6 months ago

bump version to v0.4.2
github.com/vllm-project/vllm - simon-mo opened this pull request 6 months ago

[Bug]: 400 Bad Request
github.com/vllm-project/vllm - gaye746560359 opened this issue 6 months ago

[CI] Make mistral tests pass
github.com/vllm-project/vllm - rkooo567 opened this pull request 6 months ago

[Core] Optimize sampler get_logprobs
github.com/vllm-project/vllm - rkooo567 opened this pull request 6 months ago

[BugFix] Fix fp8 quantizer
github.com/vllm-project/vllm - Kev1ntan opened this pull request 6 months ago

add spec infer related into prometheus metrics.
github.com/vllm-project/vllm - leiwen83 opened this pull request 6 months ago

[Doc] Chunked Prefill Documentation
github.com/vllm-project/vllm - rkooo567 opened this pull request 6 months ago

[Misc]: openai compatible server
github.com/vllm-project/vllm - aqx95 opened this issue 6 months ago

[Core] Log more GPU memory reservation info
github.com/vllm-project/vllm - rkooo567 opened this pull request 6 months ago

[New Model]: Cogagent
github.com/vllm-project/vllm - leoozy opened this issue 6 months ago

[Misc] add installation time env vars
github.com/vllm-project/vllm - youkaichao opened this pull request 6 months ago

[Doc] add env vars to the doc
github.com/vllm-project/vllm - youkaichao opened this pull request 6 months ago

[Misc] remove chunk detected debug logs
github.com/vllm-project/vllm - DefTruth opened this pull request 6 months ago

[Kernel] Make static FP8 scaling more robust
github.com/vllm-project/vllm - pcmoritz opened this pull request 6 months ago

[RFC]: Automate Speculative Decoding
github.com/vllm-project/vllm - LiuXiaoxuanPKU opened this issue 6 months ago

Update requirements-dev.txt
github.com/vllm-project/vllm - yecohn opened this pull request 6 months ago

[CI/Build] Unpin outlines
github.com/vllm-project/vllm - br3no opened this pull request 6 months ago

[Core] Ignore infeasible swap requests.
github.com/vllm-project/vllm - rkooo567 opened this pull request 6 months ago

[mypy][7/N] Cover all directories
github.com/vllm-project/vllm - rkooo567 opened this pull request 6 months ago

[Bug fix][Core] fixup ngram not setup correctly
github.com/vllm-project/vllm - leiwen83 opened this pull request 6 months ago

[WIP] Enhance MoE Triton kernel & tuning
github.com/vllm-project/vllm - WoosukKwon opened this pull request 6 months ago

[New Model]: OpenELM-3B
github.com/vllm-project/vllm - Isotr0py opened this issue 6 months ago

[Misc] centralize all usage of environment variables
github.com/vllm-project/vllm - youkaichao opened this pull request 6 months ago

[Core] Sliding window for block manager v2
github.com/vllm-project/vllm - mmoskal opened this pull request 6 months ago

[Misc][Refactor] Introduce ExecuteModelData
github.com/vllm-project/vllm - comaniac opened this pull request 6 months ago

[Core] Add MultiprocessingGPUExecutor
github.com/vllm-project/vllm - njhill opened this pull request 6 months ago

Virtual Office Hours: May 15 2pm ET
github.com/vllm-project/vllm - robertgshaw2-neuralmagic opened this issue 6 months ago