Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/FedML-AI/FedML
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
https://github.com/FedML-AI/FedML
683c7c709c91905fbf80a2f0993b43a2672b7586 authored about 1 year ago
[DevOps] update devops files.
e79d6f8ea7e22ca92037f28eac99bb9420dabec0 authored about 1 year ago22e069d53341c515994d9c798d182be32c80ba0a authored about 1 year ago
[CoreEngine] check if the endpoint is ready, debug the federate job o…
87731451ea39848878f16c48ba4ad5a7ca0c47f6 authored about 1 year ago3e0ff523f0ce8c8413519d4ef8f5d22b9c2eb2d2 authored about 1 year ago
6cc57da3be733dd2044f11ecfc9c5527ee4e4e6c authored about 1 year ago
a22182adba8d6dc0bf3290bc974c96179e13d085 authored about 1 year ago
[Deploy] Fix the CPU usage indication
ae1bf62cd6a1e391f0370bad81fc9e6329868228 authored about 1 year agoe805c744fb887b529a3cdeb19ee4635b27f0d144 authored about 1 year ago
[FedLLM] migrate from FedML-AI/FedLLM
866f22013f2d106cf49ec20752a678fd6e360e51 authored about 1 year agodbc470a925a518c0a9f5aa0f536fb29a69ffaa96 authored about 1 year ago
bef4f61ae842f95b701dc6947156bcf50e4c3614 authored about 1 year ago
[train.llm] refactor and update `setup.py`
b433c5cafafb6f92c75953aa1bd14ddf3307a8f8 authored about 1 year ago9b4e40ec1281b82b0eda90fb8019d99b92754817 authored about 1 year ago
- move train script to examples
- move `train.llm.src` -> `train.llm`
- update `DatasetArguments...
[train.llm] move train_utils, train_deepspeed: support hostfile
cf8a93498ccca87c11ff484a4a019002301bc957 authored about 1 year agoa7d18317c85ef899baf866772a8035b2dac9daaa authored about 1 year ago
[Deploy] Fix Duplicated Logs; Add Dummy Test Case
269bd282d45bd5d54810af3ec39633ec0a33e011 authored about 1 year ago4e79677a6ebebae9577552312c197b7b70ec23f4 authored about 1 year ago
[Deploy] Fix bugs when using multiple workers
226043f1a9c2d243d6a10f1df2230f2ec3e562c3 authored about 1 year agof86c7461b5364f0a2d107cbdf8b149ac685b123d authored about 1 year ago
Alexleung/dev branch
df9c406bbd7d75a1d65f0be64993842bd74cd099 authored about 1 year agobe499b3572c86d21f7efd03434a520c5db26c04c authored about 1 year ago
[Deploy] Fix Container Name
14f574643766eec6a6a48cc1a30655f99c12bcac authored about 1 year ago7ed77d37fb95f60901737d02cf26d865c43f3b52 authored about 1 year ago
1c3077054713be5f37d8dc98f178394967172d26 authored about 1 year ago
Alexleung/dev branch
d8f4099fb16d2f477d54b7386dc363220d08662c authored about 1 year ago1f3b433505b47f33e1aec363405f4ea5a6dcfa57 authored about 1 year ago
Dev/v0.7.0
bf39048cdc47a343ded334849cc67d2657a445e9 authored about 1 year ago[Login] Fix Linux Sleep Process Status Case
49ccbd0620a400a0b520b480ff5c4be6dc896744 authored about 1 year agoMerge pull request #1787 from FedML-AI/alexleung/dev_branch
f6c000d99eeb52f008411a7c035c2bdf3e365dc1 authored about 1 year ago1c0fdcdcfbd007a4f292502617db29f5996a5096 authored about 1 year ago
a59bcfd81478c51d55fb1fb645e4049ef3fc276d authored about 1 year ago
Alexleung/dev branch
b2c3b5fdef91c8af4a156a5c2fc95772445bb1bf authored about 1 year ago7736f5a4a50cc8883ffda9fdbf0dbefffb07c215 authored about 1 year ago
7c41935cab20a84758dc3709c1e27c6981d9417e authored about 1 year ago
959a822f56dbe29f1540bcf5525a7417f3c7d1bd authored about 1 year ago
b1a0a8e70e9c9a0e8aa303473937df23abbba4b9 authored about 1 year ago
ab282592534f8c9df2a29a24465fc7438c1b1356 authored about 1 year ago
Test/v0.7.0
990480b442db6982cc4ed6c7840d11dd7ec47104 authored about 1 year agosync master to test
2e9fcf330d7826439e884f29db3ec5482d4fb906 authored about 1 year ago[Docs] fix typos
f78d23ffcca6acd7f7874a8703df9c44e41f13a4 authored about 1 year agoUpdate README.md
cf1d68290fdd54f2e0b84fab6e621b3ed1a8d422 authored about 1 year ago[train.llm] support LLM train
86e2d84e45f1e489c83b642b7ab04b694e19cde8 authored about 1 year ago[CrossCloud] init from cross-silo
f73e346a3ab75b8b19b5d75b278d22c65d9ad0cb authored about 1 year ago3f2485ecae8bc6012c7d4eb925a0ffc448c9f758 authored about 1 year ago
[Deploy] OpenAI compatible API support for on-prem
2b53e01a95a174793f819278f8d4c453adc96aa4 authored about 1 year agod9c7028b020d1493beabc6635a6a1f2503410270 authored about 1 year ago
221119b76ef467c2bdb5f24221039c68ec23596b authored about 1 year ago
[Deploy] Fix log report when having multiple containers
3a4a9508cb8cecf4a0df14d4827559e5446d05cd authored about 1 year agofbf80caba4d4e0ee29cd9183deecee70ab705f38 authored about 1 year ago
2. show docker engine installation guide when binding devices.
3. login as the daemon mode when r...
b03578d0681e02e622a34767910f9069db433f59 authored about 1 year ago
d89ddf21c91184901d5d077f34043c02eddc2630 authored about 1 year ago
Support path in name
5518705bbd6f91801c9c8a0578148f3da0e3d8b5 authored about 1 year ago[Deploy] Fix Model Upload API/CLI
29f4fff18814f44527557f66dbb0802231d049ef authored about 1 year ago52df98d3183685be688753402965e917adacc810 authored about 1 year ago
[Deploy] Support Update Endpoint Operation
da9562d771671d7de6986bb7830217e05c45c9f7 authored about 1 year ago53399bf94958f263ae0740a13fc93dcce4d1ddec authored about 1 year ago
005ab80af213b4cd314a8a9a6e29faea48346939 authored about 1 year ago
e414fd1488c0232015c6871dc1499f41797ef96e authored about 1 year ago
Support Storage Metadata
d31fde1555af65178749a8dc629d12c30ccec117 authored about 1 year agoab7e1cfaa0699ad20324ca3e6ecea175c66eb6ce authored about 1 year ago
[Deploy] Fix the determination logic when a worker cannot response to master
28cc10ead095b54d4354cae99a9e89f049e6dc13 authored about 1 year ago2a092c3be01066950c6366f7f76ffc7c7cf0d84b authored about 1 year ago
Dev/v0.7.0
932b3f685a858776bce8a0ca1d544002a280cb2c authored about 1 year ago3ab3862a867ed825157919fc41ea901e8b2a9152 authored about 1 year ago
Alexleung/dev branch
c64894ef8d9bfdc0cf102f9a19dfb98dbd1ee177 authored about 1 year agoaa6e5e5c61c2e06e7a0f5bb0254ccc58a79f9b7c authored about 1 year ago
79c061ec4a28650aa5cd7edac3fa6b7e3a8eeb31 authored about 1 year ago
9d537d8c8284bde99d4aaae22e4fdeae7a035ed0 authored about 1 year ago
Alexleung/dev branch
9c13a950957233e8ec0a9c034aa3655efe06410b authored about 1 year ago1c49ebddb1433d1e283e94c3ad6de2c9de71c4a2 authored about 1 year ago
46c728a62700cb57666604c07d3197f2a8f69657 authored about 1 year ago
Alexleung/dev branch
4b30c2991060a919413907cc0f81537d4f8d013e authored about 1 year ago1c9efcbe4d1313d6b043115368cc239e159f8f40 authored about 1 year ago
782841935f50ae2e6c1b750fe688e3ade5f3c83c authored about 1 year ago
cf833c40a5dd5a6a921d8f8bdcb36051883112d0 authored about 1 year ago
[CoreEngine] 1. change the inference request to async mode.
9056117d3e90c84255f894a4a20c34949bfcd70c authored about 1 year agob72738d2a0e02e4b73761e2963cbf9ab648c2204 authored about 1 year ago
quick fix local s3
16b220ee24c9cc0d8e808e56e37224cd61d99e97 authored about 1 year ago2. refactor the monitor for endpoints.
40ac75cd2dd2a58ee9350dad9d63f4809abc03c3 authored about 1 year agoUpdate storage logic; support download dest
a0a5d3493b39150dd130b82aa777e227b6a8cb9c authored about 1 year ago58bc06ee1fececf602d9497bcb6684f92d3fb740 authored about 1 year ago
bed8bf0c8fdf4da35b8021962a43dcbb2b1cc558 authored about 1 year ago
Set master and worker id default to None
0eaf4b7548cb4162b9bae5315c9fe7bd4af8de29 authored about 1 year agoe30413b202ddbfb23b1e7fb5e3b5238ecf6fa240 authored about 1 year ago
[Deploy] Support more container config
6ba3920f3eacd354eb4f1d8e8b4b96403d2f9225 authored about 1 year agoc7d448285751615065ec6e5b95b7ed4575624b27 authored about 1 year ago
Deploy] Fix the logic of using proxy inference
679464c773bdb7b7914be05e454b586e967d312e authored about 1 year ago1e390cd1874be8359efadfaa61db560529d1c078 authored about 1 year ago
c5381faac20d59019cf2a68fd7dac7a0cf35cd3e authored about 1 year ago
[Deploy] Support Async Inference using predict method
c1a672aa6158d25912bd8c604eaa2322f2e57476 authored about 1 year ago2aab150765ec6c4b4dcc9d7f72937691f32149cb authored about 1 year ago
[CoreEngine] adjust the policy for monitoring endpoints.
8a92a24701a71fd2719333c1d5f826e0a9cc3f97 authored about 1 year ago01e2d44ef45d41aa83ed7c64eed4cb8a9fd29212 authored about 1 year ago
Alexleung/dev branch
e8700dee409500fa5d7721abc07a26e6afacff61 authored about 1 year ago9ef6932f917b889da0edc66d974f6d58019a99fc authored about 1 year ago
Dev/v0.7.0
4fc93a3ea2fac956560b3ea942b669737c60fe3e authored about 1 year ago[CoreEngine] add the monitor process to upload the inference logs.
07b50f73c20ded19084e114208f59b9d43421efe authored about 1 year ago