Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

vLLM

vLLM is a high-throughput and memory-efficient inference and serving engine for large language models (LLMs).
Collective - Host: opensource - https://opencollective.com/vllm - Code: https://github.com/vllm-project/vllm

[Core] Support sparse KV cache framework

github.com/vllm-project/vllm - chizhang118 opened this pull request 4 months ago
[RFC]: Support sparse KV cache framework

github.com/vllm-project/vllm - chizhang118 opened this issue 4 months ago
compressed-tensors accuracy testing

github.com/vllm-project/vllm - dsikka opened this pull request 4 months ago
[ci][test] fix ca test in main

github.com/vllm-project/vllm - youkaichao opened this pull request 4 months ago
[BugFix] [Kernel] Add Cutlass2x fallback kernels

github.com/vllm-project/vllm - varun-sundar-rabindranath opened this pull request 4 months ago
[Bug]: KeyError: '/psm_ed65b7e3'

github.com/vllm-project/vllm - randydl opened this issue 4 months ago
[Bugfix] fix the bug for lora request

github.com/vllm-project/vllm - InkdyeHuang opened this pull request 4 months ago
[Bug]: VLLM usage on AWS Inferentia instances

github.com/vllm-project/vllm - ashutoshsaboo opened this issue 4 months ago
[Bug]: which torchvision version required

github.com/vllm-project/vllm - tusharraskar opened this issue 4 months ago
[Draft] Tensor parallel for CPU

github.com/vllm-project/vllm - bigPYJ1151 opened this pull request 4 months ago
[LoRA] Adds support for bias in LoRA

github.com/vllm-project/vllm - followumesh opened this pull request 4 months ago
[RFC]: Add runtime weight update API

github.com/vllm-project/vllm - lyuqin-scale opened this issue 4 months ago
[New Model]: Support Nemotron-4-340B

github.com/vllm-project/vllm - dskhudia opened this issue 4 months ago
[New Model]: Chameleon support

github.com/vllm-project/vllm - nopperl opened this issue 4 months ago
[Distributed] Add send and recv helpers

github.com/vllm-project/vllm - andoorve opened this pull request 4 months ago
[Kernel][CPU] Add Quick `gelu` to CPU

github.com/vllm-project/vllm - ywang96 opened this pull request 4 months ago
max_tokens must be at least 1, got -160

github.com/vllm-project/vllm - njhouse365 opened this issue 4 months ago
[Misc] optimize sampler with top_p=1 and top_k>0

github.com/vllm-project/vllm - gx16377 opened this pull request 4 months ago
[Usage]: TimeoutError()

github.com/vllm-project/vllm - ZZhangxian opened this issue 4 months ago
[Bug]:Qwen2-57B-A14B 两卡 推理报错

github.com/vllm-project/vllm - CXLiang123 opened this issue 4 months ago
[Bug]: Illegal memory access

github.com/vllm-project/vllm - w013nad opened this issue 4 months ago
[Installation]: pip install -e failed

github.com/vllm-project/vllm - chunniunai220ml opened this issue 4 months ago
[WIP][Misc] Create setup_files dir for cleanup

github.com/vllm-project/vllm - WoosukKwon opened this pull request 4 months ago
[BugFix] exclude version 1.15.0 for modelscope

github.com/vllm-project/vllm - zhyncs opened this pull request 4 months ago
Support CPU inference with VSX PowerPC ISA

github.com/vllm-project/vllm - ChipKerchner opened this pull request 4 months ago
[build][misc] remove nvidia runtime docker base image

github.com/vllm-project/vllm - youkaichao opened this pull request 4 months ago
test a100

github.com/vllm-project/vllm - khluu opened this pull request 4 months ago
[Misc]Add param max-model-len in benchmark_latency.py

github.com/vllm-project/vllm - DearPlanet opened this pull request 4 months ago
[Bugfix] Fix Phi-3 Long RoPE scaling implementation

github.com/vllm-project/vllm - ShukantPal opened this pull request 4 months ago
[Misc] Remove import from transformers logging

github.com/vllm-project/vllm - CatherineSue opened this pull request 4 months ago
[ci] Deprecate original CI template

github.com/vllm-project/vllm - khluu opened this pull request 4 months ago
[CI/Build][Misc] Update Pytest Marker for VLMs

github.com/vllm-project/vllm - ywang96 opened this pull request 4 months ago
[misc][typo] fix typo

github.com/vllm-project/vllm - youkaichao opened this pull request 4 months ago
[misc][distributed] use localhost for single-node

github.com/vllm-project/vllm - youkaichao opened this pull request 4 months ago
[Misc] Fix typo

github.com/vllm-project/vllm - DarkLight1337 opened this pull request 4 months ago
[Doc] Update docker references

github.com/vllm-project/vllm - rafvasq opened this pull request 4 months ago
[Model] Add support for Qwen2 for embeddings

github.com/vllm-project/vllm - mgoin opened this pull request 4 months ago
[Feature]: Initial LLM token

github.com/vllm-project/vllm - CHesketh76 opened this issue 4 months ago
feat: adds user information to the input of the scheduler

github.com/vllm-project/vllm - FerranAgulloLopez opened this pull request 4 months ago
[Feature]: Access to user information in scheduler

github.com/vllm-project/vllm - FerranAgulloLopez opened this issue 4 months ago
[Feature]: support Qwen2 embedding

github.com/vllm-project/vllm - DavidPeleg6 opened this issue 4 months ago
[Core] Add use_dummy_dirver to parallel config

github.com/vllm-project/vllm - DriverSong opened this pull request 4 months ago
[Model] Rename Phi3 rope scaling type

github.com/vllm-project/vllm - garg-amit opened this pull request 4 months ago