Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/FedML-AI/FedML

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
https://github.com/FedML-AI/FedML

Merge branch 'dev/v0.7.0' into alaydshah/launch/docker

db3c8ba6059fec584ce7773c4a562e4cc35a7fc1 authored about 1 year ago
Debugging decorator

09ab29ad28d1bb4b10d9ee89b8500d4e00f630d7 authored about 1 year ago
Containerize Launch v1

629728da623c7fa583c24197f36d53cdb642ab78 authored about 1 year ago
Merge pull request #1841 from FedML-AI/zjh/deploy-hf-prebuild

[Deploy] update HF template

107380007267459b5c2965d0d09dcc3b65a7ac82 authored about 1 year ago
[Deploy] update HF template

51302ebb42782f0e57395e14da96273d6c5d4fe1 authored about 1 year ago
Merge pull request #1840 from FedML-AI/zjh/update-hf-template

[Deploy] update HF template

a08a2bfede04e2d8054caee0bedcce992fb698c1 authored about 1 year ago
[Deploy] Update template creation logic

33da6cd2bd32fffbd4b443395ada96f5b16d9e31 authored about 1 year ago
[Deploy] update HF template

1a918ad7981b144491df987d6f56d2625462f12f authored about 1 year ago
Update __init__.py

99599f996056056f44a881e74982fef1630ac434 authored about 1 year ago
Update setup.py

4a698becc9aca16797a059f5031c79f3a01ec178 authored about 1 year ago
Merge pull request #1839 from FedML-AI/test/v0.7.0

Test/v0.7.0

e3505a6a6b9f3612d08dabf4b86408178e6a092f authored about 1 year ago
Merge pull request #1838 from FedML-AI/merge-swap-7

Dev/v0.7.0

99140183652c6e9d0008bd69f3517b2ed9660ef4 authored about 1 year ago
Merge branch 'test/v0.7.0' into merge-swap-7

6d13c02e30ac50e71d40be0e8931dfceec289329 authored about 1 year ago
Merge pull request #1837 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

2a9c3677de608f9d25cd1ba3e485308137ae84a6 authored about 1 year ago
[DevOps] update devops files.

4a494d1457c22d8f87d5779cf874bddd9528104b authored about 1 year ago
Update __init__.py

05a1275ab0d05ea9af15e95559098b7a3b7bade1 authored about 1 year ago
Update setup.py

6cfa447536d5fcbdd22e21c0e7a292a904cd8509 authored about 1 year ago
Merge pull request #1826 from FedML-AI/test/v0.7.0

Test/v0.7.0

7896338395e956732285c38cc7f72ebb3501dbe2 authored about 1 year ago
Merge branch 'master' into test/v0.7.0

d033e9e5622b4dcf1d2e100750c891fb736d9cfe authored about 1 year ago
Merge pull request #1836 from FedML-AI/merge-swap-6

Dev/v0.7.0

e163738f55e39831f611b725871b91368bfa6811 authored about 1 year ago
Merge branch 'test/v0.7.0' into merge-swap-6

e4bb23d5f9707e384502766cc19e05a60216586e authored about 1 year ago
[CoreEngine] clean the exit file when logging in server.

fa4338492b08371e6e85093030a65f481369d9f1 authored about 1 year ago
[CoreEngine] set the user name to mqtt client id.

79abb2d8cadab18ddcb94a012bb8bfb178af67ed authored about 1 year ago
[CoreEngine] set the edge status when exceptions occurred.

051e6c06a045d2ec4d9adc7d25b3d7ee58942e2e authored about 1 year ago
Merge pull request #1835 from FedML-AI/dev/v0.7.0

Dev/v0.7.0

0ecf74275abc540ef34f30014197328c8ea91fe1 authored about 1 year ago
Merge pull request #1834 from FedML-AI/alexleung/dev_branch

[CoreEngine] change the mqtt parameter to clean session, no retain msg

0949e5d14c0407ca52aa859a7dbf51b8d2f20559 authored about 1 year ago
Merge pull request #1827 from FedML-AI/raphael/fix-deploy

[Deploy] Add load balance logic

42c2db287b6fd76ceba0cfe8d591b4a628e2496e authored about 1 year ago
Merge pull request #1832 from FedML-AI/dimitris/launch_help_print

Prettifying launch help print msg when config not given.

7f5ce58d3a4a983c2684f00e6e81b10fd1fa91e4 authored about 1 year ago
Merge pull request #1830 from FedML-AI/dimitris/mpi_fixes

MPI Graceful Shutdown and Import Stmt Fix

e1612fda44cde823dddb11d77a721dee2fb60d74 authored about 1 year ago
[CoreEngine] change the mqtt parameter to clean session, no retain msg

bfcf8b2d188a6c4abe8296df47fb0e95e29a4a24 authored about 1 year ago
Merge pull request #1833 from FedML-AI/alaydshah/launch/error_message/nit

Fix order to have accurate error message

1a3d1c9b6af47c92689abddc2a3625fec110b836 authored about 1 year ago
Fix order to have accurate error message

a43faf7d9994edc2d80f39e3bb3a801fc460127b authored about 1 year ago
Modifying launch.py to print help msg on noconfig file.

85f684cd65a34a78555f7fe3d0cd1897a342d5ac authored about 1 year ago
(1) Changed mpi4py import statement throughout the library to avoid reference error

(2) removed ABORT call from fedml_comm_mamager and use MPI.Finalize() for graceful shutdown of th...

68f30b0430920e6b46324f56da3ff819f73e5025 authored about 1 year ago
Merge pull request #1829 from FedML-AI/alexleung/dev_branch

[CoreEngine] deploy the model with the model version from the job yaml.

faad90577ef4996f943a9179864214c2411ccb46 authored about 1 year ago
[CoreEngine] deploy the model with the model version from the job yaml.

d0c0142c44024bb5bf45c366a9f7579f620a6714 authored about 1 year ago
Merge pull request #1811 from FedML-AI/alaydshah/storage/backend

Storage Backend Integration

4e0f2acd5382e0913fa4caea6f2bba6f2494d313 authored about 1 year ago
Fixing storage bugs

08a5235ae8cc22c83b7b7069dafe882f0574a422 authored about 1 year ago
Merge branch 'dev/v0.7.0' into alaydshah/storage/backend

26e43de4beac84b78f833e2414ab83cb83d1996e authored about 1 year ago
[fedllm] bugfix arg parsing

1c0dcbc9d4b437ee2d311d12ec4d720b266209a9 authored about 1 year ago
[Deploy] Add load balance logic

1bec1a4dad1071cdee2161198b0ae9c1b3e33951 authored about 1 year ago
Merge pull request #1819 from FedML-AI/raphael/fix-deploy-failed

[Deploy] Delete deployment run info when deploy failed

a5833d66235cf154a131ce306a95a8da396ebd5a authored about 1 year ago
Update __init__.py

efdff3a5be70e21c7b472590f1793767c5dd3018 authored about 1 year ago
Update setup.py

6fbf85c6a79a851248451f4376fc2c0803e1f8c6 authored about 1 year ago
Update __init__.py

94ffa8d7294479ca1ffb036bce5158642996c25f authored about 1 year ago
Update setup.py

d0bccd2c25841d45bcc41ed90e79380604f99504 authored about 1 year ago
Merge pull request #1825 from FedML-AI/test/v0.7.0

Test/v0.7.0

ec71bc6256350cb9429fd0e353d696b844156607 authored about 1 year ago
Merge branch 'master' into test/v0.7.0

423960e9db38c4635a277b9d73873aad715a4888 authored about 1 year ago
Merge pull request #1824 from FedML-AI/merge-swap-5

Dev/v0.7.0

0bffb9e197e43eab844f7f4331e798f7eb9bb15d authored about 1 year ago
Merge branch 'test/v0.7.0' into merge-swap-5

5d00e82aef73005ad4a7423816b2644aa2958ced authored about 1 year ago
Merge pull request #1823 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

5c74e15ea9e251bcec6f9bfe2c841862f5356a85 authored about 1 year ago
[DevOps] update devops files.

71283597c46d2226a6296b0c92db250c0007297a authored about 1 year ago
[CoreEngine] check the inference ready with response codes.

27681d825418566de626a74f8286576b35f8c6b7 authored about 1 year ago
[Deploy] Fix Status Checking List When Updating

42fba4ae0131486b5cbeaac10c0bdfdfd6bb0768 authored about 1 year ago
Update __init__.py

9a010bd143152b6da0b9dc28289b0f88887fa315 authored about 1 year ago
Update setup.py

63a10ba7848107af1d2517c94c9f7530d80c3963 authored about 1 year ago
Merge pull request #1822 from FedML-AI/test/v0.7.0

Test/v0.7.0

12b554e72dcdf53aad9d04638ec6cfa2bde70b07 authored about 1 year ago
Merge pull request #1821 from FedML-AI/merge-swap-4

Dev/v0.7.0

33521f1111696ec9db79ddc6ef28dde92b5c9572 authored about 1 year ago
Merge branch 'test/v0.7.0' into merge-swap-4

4e672fb5c751cfe34ed3e0c4fe0e7ab04e7c11f7 authored about 1 year ago
Merge pull request #1820 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

e3b75f0688ba9b3289b2fc9709e1da8df12e50e3 authored about 1 year ago
[CoreEngine] use the endpoint id defined in the job yaml to launch deploy jobs.

3957bcd1f58eb4d42311cf5d1445ecb3702909c1 authored about 1 year ago
[Deploy] Support Target Mounting Path

964937af0e86316601c5b55d8f1f3a2f96706a67 authored about 1 year ago
[Deploy] Delete deployment run info when deploy failed

25b04e2e674050a212184bcf7d32c8146f320f08 authored about 1 year ago
[CoreEngine] don't need to check if the endpoint is activated on the monitor.

10d50dc71e4f64ece1469ad4589e7e56a878d540 authored about 1 year ago
[CoreEngine] report the realtime gpu available count, just change the endpoint active flag when activating and deactivating the endpoint.

cfeb89b2bc522b5a21e924265a98a312460fcd9c authored about 1 year ago
Merge pull request #1817 from FedML-AI/merge-swap-3

Dev/v0.7.0

550fdcb504d25e12d65cb99e2049ef90bda85769 authored about 1 year ago
Merge branch 'test/v0.7.0' into merge-swap-3

24a7017f8eb3afffecc96e21a7343203ca2f1422 authored about 1 year ago
Merge pull request #1816 from FedML-AI/dev/v0.7.0

Merge pull request #1815 from FedML-AI/alexleung/dev_branch

720afe2a9f8df3c0ef96035c3b1b78e42cba5521 authored about 1 year ago
Merge pull request #1815 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

a54c7d14a0c0fde4128aab64d460400f1004c7b5 authored about 1 year ago
Merge branch 'dev/v0.7.0' into alexleung/dev_branch

e10693e91d3bb39240d59321adc91605309ac731 authored about 1 year ago
Merge pull request #1812 from FedML-AI/raphael/fix-redisLock

[Deploy] Fix redis resource-competition; Fix failure judgement

89ebcfd9f6d477e4a85b3d290b9958bd3a44e582 authored about 1 year ago
[CoreEngine] adjust the lock range for gpu ids.

85b5a68e31c79a66071de873670f7e19deb69c3c authored about 1 year ago
[CoreEngine] print the releasing infos.

b16a3ff726d924da0bb35eee0b33dfe0b007efe9 authored about 1 year ago
Merge pull request #1814 from FedML-AI/zjh/deploy_fix

[Deploy] fix response status code error

141bcc3fb11a87ca76c4564c45ed76771a195a71 authored about 1 year ago
Merge pull request #1813 from FedML-AI/alaydshah/fix/cluster/autostop

Cluster autostop bug fix

d932c80a67059bc46e0f48680a7d799802159df9 authored about 1 year ago
[Deploy] fix response status code error

cfd33cb16c48e1b7841faa2bcdfe8e0372f7b549 authored about 1 year ago
Cluster autostop bug fix

e21f250d5dccba2c6db3e315d9e6b3aca60c28a7 authored about 1 year ago
[Deploy] Delete the data when deploy failed

13e311f9a6eba90bdf4ac81708b4d64e5a30567c authored about 1 year ago
[CoreEngine] process the exceptions when deleting endpoints.

51f2a0dc2eb4fafad5e8e4822fe2dd7c366242ed authored about 1 year ago
[CoreEngine] print releasing info to diagnose the gpu resources.

d218f10c88af0c338515301531c82ec77b1cc712 authored about 1 year ago
[Deploy] Fix redis resource-competition; Fix failure judgement

15cd898f606686c1b8bcc99879e06aa37ceeecae authored about 1 year ago
Storage Backend Integration

a2f5d638f2ad9e1576608bf7b05a2bbb39ba3df7 authored about 1 year ago
Merge pull request #1810 from FedML-AI/merge-swap-2

Dev/v0.7.0

bf7502436447389891f30eafc456b56b1020d0a2 authored about 1 year ago
Merge branch 'test/v0.7.0' into merge-swap-2

c5ec60962391b6a69b26495ace884fc8f883fa28 authored about 1 year ago
Merge pull request #1809 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

687a3786f9a38899bee5252007629f6212a4364b authored about 1 year ago
[DevOps] update devops files.

3f81551877ab53c77249cbcb4060d53b9db2b51c authored about 1 year ago
[CoreEngine] not release the gpu when job is deployment in the monitor.

68afd03a58c0c632155eff64b837c9cae4071a0e authored about 1 year ago
Merge pull request #1808 from FedML-AI/merge-swap-1

Dev/v0.7.0

94d3338ce5571e7f8c9d54fabc42b5071235edf6 authored about 1 year ago
Merge branch 'test/v0.7.0' into merge-swap-1

53cb14d07c321ba7421afaf1c2ac5e19e287f599 authored about 1 year ago
Merge pull request #1807 from FedML-AI/alexleung/dev_branch

Alexleung/dev branch

be8d35acfef3cb1cdc4882ab6e35ab21ae1ae555 authored about 1 year ago
[CoreEngine] report the exception status when starting failed.

463c56d2dfcece13c76051cd97218261ab435d34 authored about 1 year ago
Merge pull request #1806 from FedML-AI/dev/v0.7.0

Dev/v0.7.0

befb8c6af7f2e952bf631cddee4b4eb634cb35a6 authored about 1 year ago
Merge pull request #1805 from FedML-AI/alexleung/dev_branch

[CoreEngine] fixed the gpu ids assignment.

d46b4b16d88eb27bdfce23be1d76435cfead1956 authored about 1 year ago
[CoreEngine] fixed the gpu ids assignment.

606ca7151a1d0eaddfdbfc66454b7bd1662143dd authored about 1 year ago
Merge pull request #1804 from FedML-AI/raphael/fix-auto-restart

[Deploy] Fix auto detection of container service health

23029e72d77b37373fbf4c549f90636cfca23620 authored about 1 year ago
[Deploy] Fix auto detection of container service health

3f2ed008355629adffda8925256e902ee1c7ff3d authored about 1 year ago
Update __init__.py

b19b22eb29bad22e013f03729fef8aa141244db1 authored about 1 year ago
Update setup.py

61541bacb94aa955e20d9bcd551b3f4b8b377fe2 authored about 1 year ago
Merge pull request #1803 from FedML-AI/dev/v0.7.0

Dev/v0.7.0

9345f72c77de03aa65f53088854e5fd9eaf91d70 authored about 1 year ago
Merge pull request #1802 from FedML-AI/alexleung/dev_branch

[CoreEngine] set the gpu id from the mlops scheduler.

4706f5deb094662c2b78c51e54badba4b3ed8960 authored about 1 year ago