Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
SGLang
SGLang is a fast serving framework for large language models and vision language models.
Collective: https://opencollective.com/sglang
Host: opensource
Code: https://github.com/sgl-project/sglang
Add requests with curl
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
add native api docs
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
Update docs and workflow
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Native api documents
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
Fix links in the docs
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Add a FAQ documentation
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Add Tensor Parallel to torch_native_llama
github.com/sgl-project/sglang - kwen2501 opened this pull request 2 months ago
Improve docs and fix the broken links
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Benchmark torchao and torch.compile (need torch 2.5)
github.com/sgl-project/sglang - jerryzh168 opened this issue 2 months ago
Fix incorrect context length for llama3.2-11b
github.com/sgl-project/sglang - rchen19 opened this pull request 2 months ago
[Bug] Offline engine performance is not better than local server when running batch
github.com/sgl-project/sglang - jischein opened this issue 2 months ago
[3rdparty, document] Updated Documentation that covers performance tuning techniques for AMD Instinct GPUs.
github.com/sgl-project/sglang - yichiche opened this pull request 2 months ago
Question: Does sglang support prefix cache for multimodal models?
github.com/sgl-project/sglang - htrekker opened this issue 2 months ago
Unable to Load Gemma2 Model with SGLANG
github.com/sgl-project/sglang - hahmad2008 opened this issue 2 months ago
minor: update nightly eval
github.com/sgl-project/sglang - zhyncs opened this pull request 2 months ago
Add vlm document
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
[Feature] Create a benchmark script for offline inference
github.com/sgl-project/sglang - ByronHsu opened this issue 2 months ago
Add vlm tutorial
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
[Bug] Exception output when Cuda Graph is enabled for Qwen2.5-Coder
github.com/sgl-project/sglang - TechxGenus opened this issue 2 months ago
Update vocab embedding deps and add TP switch
github.com/sgl-project/sglang - ispobock opened this pull request 2 months ago
[Build, ROCm] Dockerfile.rocm for Instinct GPUs, with package updates
github.com/sgl-project/sglang - HaiShaw opened this pull request 2 months ago
Fix retraction + overlap
github.com/sgl-project/sglang - hnyls2002 opened this pull request 2 months ago
change file tree
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
Fix memory leak for chunked prefill 2
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
TP8 scheduling overhead is very high for small model, Llama 3 8B on AMD
github.com/sgl-project/sglang - hliuca opened this issue 2 months ago
Update vocab embedding deps and add TP switch
github.com/sgl-project/sglang - ispobock opened this pull request 2 months ago
delete unused character
github.com/sgl-project/sglang - geeker-smallwhite opened this pull request 2 months ago
delete unused characters
github.com/sgl-project/sglang - geeker-smallwhite opened this pull request 2 months ago
support prometheus metrics
github.com/sgl-project/sglang - Lzhang-hub opened this pull request 2 months ago
Fix warnings in doc build
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Simplify documentation
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Fix mixed chunked prefill
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
chore: update torch v2.5.1
github.com/sgl-project/sglang - zhyncs opened this pull request 2 months ago
Make decode log interval configurable
github.com/sgl-project/sglang - ByronHsu opened this pull request 2 months ago
Refactor tokenizer manager
github.com/sgl-project/sglang - ByronHsu opened this pull request 2 months ago
[Performance, Triton Kernel Args] _decode_grouped_softmax_reducev_fwd…
github.com/sgl-project/sglang - HaiShaw opened this pull request 2 months ago
Fix suggest edit
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
[Bug] sglang template import issue
github.com/sgl-project/sglang - multimodalpragmatic opened this issue 2 months ago
[Production] Drain requests before exit when receive SIGTERM
github.com/sgl-project/sglang - Ying1123 opened this pull request 2 months ago
Fix memroy leak caused by chunked prefill
github.com/sgl-project/sglang - hnyls2002 opened this pull request 2 months ago
[Performance, Hardware] MoE weights padding to AMD MI300x GPUs
github.com/sgl-project/sglang - HaiShaw opened this pull request 2 months ago
[Bug] stop_str of qwen2-vl template should be a tuple not a str
github.com/sgl-project/sglang - wellhowtosay opened this issue 2 months ago
fix get_memory_pool_size deadlock for DP
github.com/sgl-project/sglang - ByronHsu opened this pull request 2 months ago
Remove delay after cancelled streaming requests are aborted
github.com/sgl-project/sglang - matthew-hippocratic opened this pull request 2 months ago
Questions Regarding sglang vs vllm and Memory Management
github.com/sgl-project/sglang - hahmad2008 opened this issue 2 months ago
Imporve openai api documents
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
[Feature] Support QLoRA weights
github.com/sgl-project/sglang - zzh-www opened this issue 2 months ago
Fix update_weights deadlock for DP
github.com/sgl-project/sglang - ByronHsu opened this pull request 2 months ago
Support setting `use_thread` in the `run_program` for easier debugging.
github.com/sgl-project/sglang - liuyanyi opened this pull request 2 months ago
[3rdparty, document] Add 3rdparty/amd, with profiling and tuning instructions to be added
github.com/sgl-project/sglang - HaiShaw opened this pull request 2 months ago
Fix docs deploy ci
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
support token ids in `engine.generate`
github.com/sgl-project/sglang - ByronHsu opened this pull request 2 months ago
Fix Triton decode kernel & ut
github.com/sgl-project/sglang - ispobock opened this pull request 2 months ago
Granite and GraniteMoE models.
github.com/sgl-project/sglang - janimo opened this pull request 2 months ago
Add a watch dog thread
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Offline serving final
github.com/sgl-project/sglang - hnyls2002 opened this pull request 2 months ago
[Bug] Parameter Update API update_weights Fails in DP=2, TP=1 Configuration
github.com/sgl-project/sglang - rbao2018 opened this issue 2 months ago
Update hyperparameter_tuning.md
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
profile of M sizes for Torch native and TE (ignore)
github.com/sgl-project/sglang - Zhuohao-Li opened this pull request 2 months ago
Improve the user control of new_token_ratio
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Add openAI compatible API
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
Provide an argument to set the maximum batch size for cuda graph
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Simplify our docs with complicated functions into utils
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
detach two CI for documentation
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 2 months ago
Update ci workflows
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
fix int conversion for `SGLANG_CPU_COUNT`
github.com/sgl-project/sglang - ByronHsu opened this pull request 2 months ago
Allow consecutive ports when launching multiple sglang servers.
github.com/sgl-project/sglang - hnyls2002 opened this pull request 2 months ago
Set `ZMQ` buffer size heuristic
github.com/sgl-project/sglang - hnyls2002 opened this pull request 2 months ago
Fix possible ZMQ hanging
github.com/sgl-project/sglang - hnyls2002 opened this pull request 2 months ago
move max_position_embeddings to the last
github.com/sgl-project/sglang - hliuca opened this pull request 2 months ago
[Fix] Fix --skip-tokenizer-init
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Revert "Fix memory leak when doing chunked prefill"
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Release v0.3.4.post2
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Fix logprob in the overlapped mode
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
[Fix] Fix the log parsing in chunked prefill uni tests
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
Fix log parsing in the chunked prefill unit tests
github.com/sgl-project/sglang - merrymercy opened this pull request 2 months ago
[Bug] Got error with awq_marlin quantization args.
github.com/sgl-project/sglang - liangzelang opened this issue 2 months ago
[Bug] param of max_workers is int type while a string type value os.environ.get("SGLANG_CPU_COUNT") provided
github.com/sgl-project/sglang - wellhowtosay opened this issue 3 months ago
[router] rust-based router
github.com/sgl-project/sglang - ByronHsu opened this pull request 3 months ago
Fix seq_lens_sum for cuda graph runner in padded cases
github.com/sgl-project/sglang - merrymercy opened this pull request 3 months ago
[Bug] cutlass group_gemm.initialize failed
github.com/sgl-project/sglang - senlice opened this issue 3 months ago
Fix memory leak when doing chunked prefill
github.com/sgl-project/sglang - hnyls2002 opened this pull request 3 months ago
add support for ipynb
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 3 months ago
Enhance the test case for chunked prefill and check memory leak
github.com/sgl-project/sglang - merrymercy opened this pull request 3 months ago
Create deploy-docs.yml
github.com/sgl-project/sglang - zhaochenyang20 opened this pull request 3 months ago
Re-introduce `get_cuda_graph_seq_len_fill_value`
github.com/sgl-project/sglang - merrymercy opened this pull request 3 months ago
[Fix] Fix cuda graph padding for triton attention backend
github.com/sgl-project/sglang - merrymercy opened this pull request 3 months ago