Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/FedML-AI/FedML
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
https://github.com/FedML-AI/FedML
[CoreEngine] refactor the entire job flow to make the job automatically created and started without specify the project name and job name at launching.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[Docs] update the README.md for launching jobs to show launching results.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] when starting a job or listing jobs, prompt for API key and save to local storage for next use.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] prompt for API key and save to local storage for next use.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] show job running details to end users after directly launching the job.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] refactor the flow to confirm the launch job with the console mode and web mode.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Cheetah dev
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] report the job status with reasonable edge id when it is running in the public cloud agent.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] check if fedml is running in a docker container, from which we generate the device id.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] ended the instance of public cloud servers when received failed and finished messages from all edges.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] refactor to make the job list work.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] change the field name to correct value when reporting the computing cost.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] report the job status with the public server id when running on the public cloud agent mode.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] Automatically delete the k8s deployment when the public server finished.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Cheetah dev
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] write the sample job yaml for printing the Hello World text, printing the GPU information, downloading a file, training the vision transformer model using PyTorch.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] fix the issue that the server runner in the cloud agent mode can not work when receiving the start job request
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] refactor section names in the job yaml file and update related codes.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] refactor the README.md for launching jobs and check the result of launching jobs.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Cheetah dev
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] add the requirements.txt for hello world example.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] refactor the job yaml to use three lines to define a job and adjust the launcher manager and client runner to make the scheduler work.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Cheetah dev
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Ubuntu 22 compatibility
hpdic opened this issue over 1 year ago
hpdic opened this issue over 1 year ago
[CoreEngine] change the url for stopping a job and make the API for stopping a job work.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Cheetah dev
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Cheetah dev
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Cheetah dev
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Cheetah dev
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Sync the cheetah branch to the dev branch.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] make the compatibility with federated learning jobs.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Sync the cheetah branch to dev branch.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Sync dev to cheetah dev.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Decentralized Federated Learning Application
afaanbayes opened this issue over 1 year ago
afaanbayes opened this issue over 1 year ago
An error occurred when running this code: args=fedml.init()
FryLcm opened this issue over 1 year ago
FryLcm opened this issue over 1 year ago
Add yolov6 to object detection
Xiaoyang-Wang opened this pull request over 1 year ago
Xiaoyang-Wang opened this pull request over 1 year ago
release 0.8.7
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
release 0.8.7
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] upgrade to 0.8.7a5.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] upgrade to 0.8.7a5.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[Serving] Fix bugs and add serval features
Raphael-Jin opened this pull request over 1 year ago
Raphael-Jin opened this pull request over 1 year ago
[CrossSilo] add additional verifications & cleanup
fedml-zijianhu opened this pull request over 1 year ago
fedml-zijianhu opened this pull request over 1 year ago
Sync dev to cheetah-dev
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[Serving] change the model url to open.fedml.ai [Tested].
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CrossSilo] support customized hierarchical cross-silo
fedml-zijianhu opened this pull request over 1 year ago
fedml-zijianhu opened this pull request over 1 year ago
ERROR while running FedML-master/python/examples/simulation/mpi_torch_async_fedavg/run.sh
yaokunxu opened this issue over 1 year ago
yaokunxu opened this issue over 1 year ago
[CoreEngine] fix `FileExistsError` for all `os.makedirs`
fedml-zijianhu opened this pull request over 1 year ago
fedml-zijianhu opened this pull request over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Bugs while running ~/FedML/python/examples/simulation/mpi_torch_async_fedavg/torch_fedavg_mnist_lr_custum_data_and_model_example.py
yaokunxu opened this issue over 1 year ago
yaokunxu opened this issue over 1 year ago
errors in Network Connection Checking
Oswald1997 opened this issue over 1 year ago
Oswald1997 opened this issue over 1 year ago
TypeError: __init__() takes from 1 to 4 positional arguments but 5 were given
ggiggit opened this issue over 1 year ago
ggiggit opened this issue over 1 year ago
[CoreEngine] cleanup `skip_log_model_net` logic
fedml-zijianhu opened this pull request over 1 year ago
fedml-zijianhu opened this pull request over 1 year ago
A problem running the example stuck after "using_mlops=true"
yaokunxu opened this issue over 1 year ago
yaokunxu opened this issue over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] upgrade to 0.8.7a4.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Fix Bug: add exception handler when utf-8 cannot decode the output/err
Raphael-Jin opened this pull request over 1 year ago
Raphael-Jin opened this pull request over 1 year ago
[CoreEngine] added the skip_log_model_net option for llm training, fixed the loss is not clipped to float numbers.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] added the skip_log_model_net option for llm training, fixed the loss is not clipped to float numbers.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[Heart-Disease-App] Fix Bug When Installing flamby
Raphael-Jin opened this pull request over 1 year ago
Raphael-Jin opened this pull request over 1 year ago
[CrossDevice] fixed issues that the test metrics are reported twice to MLOps and loss metrics are clipped to integers.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CrossDevice] fixed issues that the test metrics are reported twice to MLOps and loss metrics are clipped to integers.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] make the serving k8s cluster work with latest images, update related chart files.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] make the serving k8s cluster work with latest images, update related chart files.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update to 0.8.7a1.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update to 0.8.7a1.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update to 0.8.6.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] fixed the issue that failed to verify the pip ssl certificate when checking OTA versions.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update to 0.8.6a3.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update to 0.8.6a2
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] added timestamp when reporting system metrics.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] added timestamp when reporting system metrics.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Updated fedml.data.load parameters to match source code arguments
antoniolang1107 opened this pull request over 1 year ago
antoniolang1107 opened this pull request over 1 year ago
[CoreEngine] build light docker and upgrade to 0.8.5
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] build light docker and upgrade to 0.8.5
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update jenkinsfile.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update jenkinsfile.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update jenkinsfile.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] select the python program based on current running python version.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] select the python program based on current running python version.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Fix Bugs: MPI Mode Do Not Have Client Rank -1
Raphael-Jin opened this pull request over 1 year ago
Raphael-Jin opened this pull request over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] refactor the log api.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Why aggregation algorithm has different name but identical implementation
Adeelbek opened this issue over 1 year ago
Adeelbek opened this issue over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
Dev/v0.7.0
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
How to get the mAP50 after server aggregation when training the fedcv_detection?
howhomiee opened this issue over 1 year ago
howhomiee opened this issue over 1 year ago
[Serving] make the inference backend for deepspeed be working.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update jenkinsfile.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] make the public cloud server scheduled into specific nodes.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[MLOps] support LLM record logging
fedml-zijianhu opened this pull request over 1 year ago
fedml-zijianhu opened this pull request over 1 year ago
[CoreEngine] make the compatibility when opening subprocess on windows.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[CoreEngine] make the compatibility when opening subprocess on windows.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[DevOps] update jenkinsfile.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[Examples] update bootstrap.sh.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago
[Examples] update bootstrap.sh.
fedml-alex opened this pull request over 1 year ago
fedml-alex opened this pull request over 1 year ago