Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/FedML-AI/FedML

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
https://github.com/FedML-AI/FedML

[CoreEngine] set the gpu id from the mlops scheduler.

683c7c709c91905fbf80a2f0993b43a2672b7586 authored about 1 year ago
Merge pull request #1801 from FedML-AI/alexleung/dev_branch

[DevOps] update devops files.

e79d6f8ea7e22ca92037f28eac99bb9420dabec0 authored about 1 year ago
[DevOps] update devops files.

22e069d53341c515994d9c798d182be32c80ba0a authored about 1 year ago
Merge pull request #1800 from FedML-AI/alexleung/dev_branch

[CoreEngine] check if the endpoint is ready, debug the federate job o…

87731451ea39848878f16c48ba4ad5a7ca0c47f6 authored about 1 year ago
Merge branch 'dev/v0.7.0' into alexleung/dev_branch

3e0ff523f0ce8c8413519d4ef8f5d22b9c2eb2d2 authored about 1 year ago
[CoreEngine] check if the endpoint is ready, debug the federate job on launch.

6cc57da3be733dd2044f11ecfc9c5527ee4e4e6c authored about 1 year ago
[train.llm] bugfix tokenizer creation

a22182adba8d6dc0bf3290bc974c96179e13d085 authored about 1 year ago
Merge pull request #1799 from FedML-AI/raphael/fix-multiple-worker

[Deploy] Fix the CPU usage indication

ae1bf62cd6a1e391f0370bad81fc9e6329868228 authored about 1 year ago
[Deploy] Fix the CPU usage indication

e805c744fb887b529a3cdeb19ee4635b27f0d144 authored about 1 year ago
Merge pull request #1797 from FedML-AI/zjh/fedllm

[FedLLM] migrate from FedML-AI/FedLLM

866f22013f2d106cf49ec20752a678fd6e360e51 authored about 1 year ago
[train.llm, FedLLM] migrate common utils to `train.llm`

dbc470a925a518c0a9f5aa0f536fb29a69ffaa96 authored about 1 year ago
[FedLLM] migrate from FedML-AI/FedLLM

bef4f61ae842f95b701dc6947156bcf50e4c3614 authored about 1 year ago
Merge pull request #1798 from FedML-AI/zjh/train/llm

[train.llm] refactor and update `setup.py`

b433c5cafafb6f92c75953aa1bd14ddf3307a8f8 authored about 1 year ago
[train.llm] update setup.py

9b4e40ec1281b82b0eda90fb8019d99b92754817 authored about 1 year ago
[train.llm] separate train script from library; update

- move train script to examples
- move `train.llm.src` -> `train.llm`
- update `DatasetArguments...

fabe101ba282ebb4bc0f8cc32199320ef6bb4953 authored about 1 year ago
Merge pull request #1796 from FedML-AI/zjh/train/llm

[train.llm] move train_utils, train_deepspeed: support hostfile

cf8a93498ccca87c11ff484a4a019002301bc957 authored about 1 year ago
[train.llm] move train_utils, train_deepspeed: support hostfile

a7d18317c85ef899baf866772a8035b2dac9daaa authored about 1 year ago
Merge pull request #1795 from FedML-AI/raphael/fix-multiple-worker

[Deploy] Fix Duplicated Logs; Add Dummy Test Case

269bd282d45bd5d54810af3ec39633ec0a33e011 authored about 1 year ago
[Deploy] Fix Duplicated Logs; Add Dummy Test Case

4e79677a6ebebae9577552312c197b7b70ec23f4 authored about 1 year ago
Merge pull request #1794 from FedML-AI/raphael/fix-multiple-worker

[Deploy] Fix bugs when using multiple workers

226043f1a9c2d243d6a10f1df2230f2ec3e562c3 authored about 1 year ago
[Deploy] Fix bugs when using multiple workers

f86c7461b5364f0a2d107cbdf8b149ac685b123d authored about 1 year ago
Merge pull request #1793 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

df9c406bbd7d75a1d65f0be64993842bd74cd099 authored about 1 year ago
[CoreEngine] change the gpu ids on deployment.

be499b3572c86d21f7efd03434a520c5db26c04c authored about 1 year ago
Merge pull request #1792 from FedML-AI/raphael/fix-login

[Deploy] Fix Container Name

14f574643766eec6a6a48cc1a30655f99c12bcac authored about 1 year ago
[CoreEngine] change the num gpus on deployment.

7ed77d37fb95f60901737d02cf26d865c43f3b52 authored about 1 year ago
[Deploy] Fix Container Name

1c3077054713be5f37d8dc98f178394967172d26 authored about 1 year ago
Merge pull request #1791 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

d8f4099fb16d2f477d54b7386dc363220d08662c authored about 1 year ago
[CoreEngine] change the condition for checking if the process is running.

1f3b433505b47f33e1aec363405f4ea5a6dcfa57 authored about 1 year ago
Merge pull request #1790 from FedML-AI/dev/v0.7.0

Dev/v0.7.0

bf39048cdc47a343ded334849cc67d2657a445e9 authored about 1 year ago
Merge pull request #1789 from FedML-AI/raphael/fix-login

[Login] Fix Linux Sleep Process Status Case

49ccbd0620a400a0b520b480ff5c4be6dc896744 authored about 1 year ago
Merge pull request #1788 from FedML-AI/dev/v0.7.0

Merge pull request #1787 from FedML-AI/alexleung/dev_branch

f6c000d99eeb52f008411a7c035c2bdf3e365dc1 authored about 1 year ago
[Deploy] Change the default behavior when updating the endpoint

1c0fdcdcfbd007a4f292502617db29f5996a5096 authored about 1 year ago
[Login] Fix Linux Sleep Progress case

a59bcfd81478c51d55fb1fb645e4049ef3fc276d authored about 1 year ago
Merge pull request #1787 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

b2c3b5fdef91c8af4a156a5c2fc95772445bb1bf authored about 1 year ago
Merge branch 'dev/v0.7.0' into alexleung/dev_branch

7736f5a4a50cc8883ffda9fdbf0dbefffb07c215 authored about 1 year ago
[DevOps] update devops files.

7c41935cab20a84758dc3709c1e27c6981d9417e authored about 1 year ago
[CoreEngine] support setting multiple deployment workers when logging in to MLOps.

959a822f56dbe29f1540bcf5525a7417f3c7d1bd authored about 1 year ago
Update __init__.py

b1a0a8e70e9c9a0e8aa303473937df23abbba4b9 authored about 1 year ago
Update setup.py

ab282592534f8c9df2a29a24465fc7438c1b1356 authored about 1 year ago
Merge pull request #1786 from FedML-AI/test/v0.7.0

Test/v0.7.0

990480b442db6982cc4ed6c7840d11dd7ec47104 authored about 1 year ago
Merge pull request #1785 from FedML-AI/master

sync master to test

2e9fcf330d7826439e884f29db3ec5482d4fb906 authored about 1 year ago
Merge pull request #1574 from vuittont60/master

[Docs] fix typos

f78d23ffcca6acd7f7874a8703df9c44e41f13a4 authored about 1 year ago
Merge pull request #1660 from FedML-AI/alaydshah-patch-1

Update README.md

cf1d68290fdd54f2e0b84fab6e621b3ed1a8d422 authored about 1 year ago
Merge pull request #1781 from FedML-AI/zjh/train/llm

[train.llm] support LLM train

86e2d84e45f1e489c83b642b7ab04b694e19cde8 authored about 1 year ago
Merge pull request #1697 from FedML-AI/zjh/cross_cloud

[CrossCloud] init from cross-silo

f73e346a3ab75b8b19b5d75b278d22c65d9ad0cb authored about 1 year ago
[CrossCloud] init from cross-silo

3f2485ecae8bc6012c7d4eb925a0ffc448c9f758 authored about 1 year ago
Merge pull request #1784 from FedML-AI/zjh/openai-on-prem

[Deploy] OpenAI compatible API support for on-prem

2b53e01a95a174793f819278f8d4c453adc96aa4 authored about 1 year ago
[Deploy] Allow user inference without endpoint_name & model_name

d9c7028b020d1493beabc6635a6a1f2503410270 authored about 1 year ago
[Deploy] on-prem support for OpenAI API

221119b76ef467c2bdb5f24221039c68ec23596b authored about 1 year ago
Merge pull request #1783 from FedML-AI/raphael/support-update-endpoint

[Deploy] Fix log report when having multiple containers

3a4a9508cb8cecf4a0df14d4827559e5446d05cd authored about 1 year ago
[Deploy] Fix log report when having multiple containers

fbf80caba4d4e0ee29cd9183deecee70ab705f38 authored about 1 year ago
[CoreEngine] 1. we will use db as the storage when the cache is not available.

2. show docker engine installation guide when binding devices.
3. login as the daemon mode when r...

6a854f4d063bb35f2e87deacc6c75ac12f4107a5 authored about 1 year ago
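The first item of the commit above (use the database as storage when the cache is not available) describes a common fallback pattern. The following is only a minimal sketch of that idea, not the FedML implementation: `UnavailableCache`, `FallbackStore`, and the `kv` table are hypothetical names invented for illustration, and SQLite stands in for whatever database the project actually uses.

```python
import sqlite3
from typing import Optional


class UnavailableCache:
    """Hypothetical cache client that is always down (for demonstration only)."""

    def get(self, key: str) -> Optional[str]:
        raise ConnectionError("cache not reachable")

    def set(self, key: str, value: str) -> None:
        raise ConnectionError("cache not reachable")


class FallbackStore:
    """Try the cache first; use a SQLite table when the cache raises."""

    def __init__(self, cache, db_path: str = ":memory:") -> None:
        self.cache = cache
        self.db = sqlite3.connect(db_path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS kv (key TEXT PRIMARY KEY, value TEXT)"
        )

    def set(self, key: str, value: str) -> None:
        try:
            self.cache.set(key, value)
        except ConnectionError:
            # Cache is unavailable, so persist the value in the database.
            self.db.execute(
                "INSERT OR REPLACE INTO kv (key, value) VALUES (?, ?)",
                (key, value),
            )
            self.db.commit()

    def get(self, key: str) -> Optional[str]:
        try:
            return self.cache.get(key)
        except ConnectionError:
            # Cache is unavailable, so read the value from the database.
            row = self.db.execute(
                "SELECT value FROM kv WHERE key = ?", (key,)
            ).fetchone()
            return row[0] if row else None


store = FallbackStore(UnavailableCache())
store.set("endpoint_status", "DEPLOYED")
print(store.get("endpoint_status"))
```

This sketch writes to the database only when the cache call fails, so a value written while the cache was healthy is not visible after the cache goes down. A production design would need to decide how to keep the two stores consistent.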
[train.llm] migrate from FedML-AI/llm-finetune

b03578d0681e02e622a34767910f9069db433f59 authored about 1 year ago
[train.llm] init

d89ddf21c91184901d5d077f34043c02eddc2630 authored about 1 year ago
Merge pull request #1779 from FedML-AI/alaydshah/storage/support_path_names

Support path in name

5518705bbd6f91801c9c8a0578148f3da0e3d8b5 authored about 1 year ago
Merge pull request #1780 from FedML-AI/raphael/fix-model-upload

[Deploy] Fix Model Upload API/CLI

29f4fff18814f44527557f66dbb0802231d049ef authored about 1 year ago
[Deploy] Fix Model Upload API/CLI

52df98d3183685be688753402965e917adacc810 authored about 1 year ago
Merge pull request #1776 from FedML-AI/raphael/support-update-endpoint

[Deploy] Support Update Endpoint Operation

da9562d771671d7de6986bb7830217e05c45c9f7 authored about 1 year ago
Support path in name

53399bf94958f263ae0740a13fc93dcce4d1ddec authored about 1 year ago
[Deploy] Support Update Endpoint Op

005ab80af213b4cd314a8a9a6e29faea48346939 authored about 1 year ago
[Deploy] Differentiate Update & Scale Op

e414fd1488c0232015c6871dc1499f41797ef96e authored about 1 year ago
Merge pull request #1778 from FedML-AI/alaydshah/storage/metadata

Support Storage Metadata

d31fde1555af65178749a8dc629d12c30ccec117 authored about 1 year ago
Support Metadata

ab7e1cfaa0699ad20324ca3e6ecea175c66eb6ce authored about 1 year ago
Merge pull request #1775 from FedML-AI/raphael/fix-async-inf

[Deploy] Fix the determination logic when a worker cannot respond to master

28cc10ead095b54d4354cae99a9e89f049e6dc13 authored about 1 year ago
[Deploy] Fix the determination logic when a worker cannot respond to master

2a092c3be01066950c6366f7f76ffc7c7cf0d84b authored about 1 year ago
Merge pull request #1774 from FedML-AI/merge-swap

Dev/v0.7.0

932b3f685a858776bce8a0ca1d544002a280cb2c authored about 1 year ago
Merge branch 'test/v0.7.0' into merge-swap

3ab3862a867ed825157919fc41ea901e8b2a9152 authored about 1 year ago
Merge pull request #1773 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

c64894ef8d9bfdc0cf102f9a19dfb98dbd1ee177 authored about 1 year ago
[DevOps] update devops files.

aa6e5e5c61c2e06e7a0f5bb0254ccc58a79f9b7c authored about 1 year ago
[CoreEngine] change the hint text when logging in multiple times.

79c061ec4a28650aa5cd7edac3fa6b7e3a8eeb31 authored about 1 year ago
[CoreEngine] change the error message when no resources are available for deploy jobs.

9d537d8c8284bde99d4aaae22e4fdeae7a035ed0 authored about 1 year ago
Merge pull request #1772 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

9c13a950957233e8ec0a9c034aa3655efe06410b authored about 1 year ago
[DevOps] update devops files.

1c49ebddb1433d1e283e94c3ad6de2c9de71c4a2 authored about 1 year ago
[CoreEngine] respond with 404 in the ready API.

46c728a62700cb57666604c07d3197f2a8f69657 authored about 1 year ago
Merge pull request #1771 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

4b30c2991060a919413907cc0f81537d4f8d013e authored about 1 year ago
[CoreEngine] the syntax '-> str | None' is not allowed in Python 3.8, so we need to remove it.

1c9efcbe4d1313d6b043115368cc239e159f8f40 authored about 1 year ago
[CoreEngine] the syntax '-> str | None' is not allowed in Python 3.8, so we need to remove it.

782841935f50ae2e6c1b750fe688e3ade5f3c83c authored about 1 year ago
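The two commits above remove the `-> str | None` return annotation. The `X | Y` union syntax for types (PEP 604) only became available at runtime in Python 3.10; on Python 3.8 the annotation is evaluated when the function is defined and raises a `TypeError`. The commits remove the annotation outright. Where the type hint is worth keeping, `typing.Optional` is one way to stay compatible with 3.8. A minimal sketch follows; `find_endpoint_name` and its arguments are hypothetical and not taken from the FedML code base.

```python
from typing import Dict, Optional


def find_endpoint_name(endpoints: Dict[int, str], endpoint_id: int) -> Optional[str]:
    """Return the endpoint name, or None when the id is unknown.

    Optional[str] means the same as `str | None` but also works on Python 3.8.
    """
    return endpoints.get(endpoint_id)


# Hypothetical data, for illustration only.
known = {1: "llm-endpoint"}
print(find_endpoint_name(known, 1))
print(find_endpoint_name(known, 2))
```

Another option is to add `from __future__ import annotations` at the top of the module, which defers evaluation of annotations so that `str | None` no longer fails at definition time. Code that inspects annotations at runtime may still break on older interpreters.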
[DevOps] update devops files.

cf833c40a5dd5a6a921d8f8bdcb36051883112d0 authored about 1 year ago
Merge pull request #1770 from FedML-AI/alexleung/dev_branch

[CoreEngine] 1. change the inference request to async mode.

9056117d3e90c84255f894a4a20c34949bfcd70c authored about 1 year ago
Merge branch 'dev/v0.7.0' into alexleung/dev_branch

b72738d2a0e02e4b73761e2963cbf9ab648c2204 authored about 1 year ago
Merge pull request #1765 from FedML-AI/dimitris/quick_fix_local_s3

quick fix local s3

16b220ee24c9cc0d8e808e56e37224cd61d99e97 authored about 1 year ago
[CoreEngine] 1. change the inference request to async mode.

2. refactor the monitor for endpoints.

40ac75cd2dd2a58ee9350dad9d63f4809abc03c3 authored about 1 year ago
Merge pull request #1769 from FedML-AI/alaydshah/storage/apiKeyToUserId

Update storage logic; support download dest

a0a5d3493b39150dd130b82aa777e227b6a8cb9c authored about 1 year ago
Enhance help message

58bc06ee1fececf602d9497bcb6684f92d3fb740 authored about 1 year ago
Update storage logic; support download dest

bed8bf0c8fdf4da35b8021962a43dcbb2b1cc558 authored about 1 year ago
Merge pull request #1768 from FedML-AI/alaydshah/model/bug/fix

Set master and worker id default to None

0eaf4b7548cb4162b9bae5315c9fe7bd4af8de29 authored about 1 year ago
Set master and worker id default to None

e30413b202ddbfb23b1e7fb5e3b5238ecf6fa240 authored about 1 year ago
Merge pull request #1767 from FedML-AI/raphael/add-gpu-ids

[Deploy] Support more container config

6ba3920f3eacd354eb4f1d8e8b4b96403d2f9225 authored about 1 year ago
[Deploy] Support more container config

c7d448285751615065ec6e5b95b7ed4575624b27 authored about 1 year ago
Merge pull request #1766 from FedML-AI/raphael/add-gpu-ids

[Deploy] Fix the logic of using proxy inference

679464c773bdb7b7914be05e454b586e967d312e authored about 1 year ago
[Deploy] Fix the logic of using proxy inference

1e390cd1874be8359efadfaa61db560529d1c078 authored about 1 year ago
Fixing path locator for uploading and downloading from local s3.

c5381faac20d59019cf2a68fd7dac7a0cf35cd3e authored about 1 year ago
Merge pull request #1764 from FedML-AI/raphael/add-gpu-ids

[Deploy] Support Async Inference using predict method

c1a672aa6158d25912bd8c604eaa2322f2e57476 authored about 1 year ago
[Deploy] Support Async Inf for predict method

2aab150765ec6c4b4dcc9d7f72937691f32149cb authored about 1 year ago
Merge pull request #1763 from FedML-AI/alexleung/dev_branch

[CoreEngine] adjust the policy for monitoring endpoints.

8a92a24701a71fd2719333c1d5f826e0a9cc3f97 authored about 1 year ago
[CoreEngine] adjust the policy for monitoring endpoints.

01e2d44ef45d41aa83ed7c64eed4cb8a9fd29212 authored about 1 year ago
Merge pull request #1762 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

e8700dee409500fa5d7721abc07a26e6afacff61 authored about 1 year ago
[CoreEngine] check the master and slave endpoint status with the ready mode.

9ef6932f917b889da0edc66d974f6d58019a99fc authored about 1 year ago
Merge pull request #1761 from FedML-AI/dev/v0.7.0

Dev/v0.7.0

4fc93a3ea2fac956560b3ea942b669737c60fe3e authored about 1 year ago
Merge pull request #1760 from FedML-AI/alexleung/dev_branch

[CoreEngine] add the monitor process to upload the inference logs.

07b50f73c20ded19084e114208f59b9d43421efe authored about 1 year ago