Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/dmlc/xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow.
https://github.com/dmlc/xgboost

Fix buildkite label. (#8287)

299e5000a422a939f6ecf7970a09c590fee40711 authored about 2 years ago by Jiaming Yuan <[email protected]>
Obtain CSR matrix from DMatrix. (#8269)

55cf24cc329ae35bd4daa6a3d09a0419304df0ae authored about 2 years ago by Jiaming Yuan <[email protected]>
[CI] Put Multi-GPU test suites in separate pipeline (#8286)

* [CI] Put Multi-GPU test suites in separate pipeline

* Avoid unset var error in Bash

b14c44ee5e4f941c59de3d91633f4b0acdff5b23 authored about 2 years ago by Philip Hyunsu Cho <[email protected]>
[pyspark][doc] add more doc for pyspark (#8271)

Co-authored-by: fis <[email protected]>

cbf3a5f91899d0fe1046bd4fcf9a4c8698d76529 authored about 2 years ago by Bobby Wang <[email protected]>
[pyspark] disable repartition_random_shuffle by default (#8283)

c91fed083d727d2d4c0e59675586d4a5eba5d104 authored about 2 years ago by Bobby Wang <[email protected]>
Fix mixed types with cuDF. (#8280)

6925b222e0dc268803b59c93b28db65cf130beb7 authored about 2 years ago by Jiaming Yuan <[email protected]>
Mark next release as 1.7 instead of 2.0 (#8281)

f835368bcffec8cf0675d22bd4a1e27832b6e616 authored about 2 years ago by Jiaming Yuan <[email protected]>
Remove MGPU cpp tests. (#8276)

Co-authored-by: Hyunsu Philip Cho <[email protected]>

6d1452074a3399dfd4a8e857cb64de3d705ef480 authored about 2 years ago by Jiaming Yuan <[email protected]>
Support more pandas nullable types (#8262)

- Float32/64
- Category.

fcab51aa8222df5d09a90715d5941158078d0181 authored about 2 years ago by Jiaming Yuan <[email protected]>
GitHub Workflows security hardening (#8267)

Signed-off-by: Alex <[email protected]>

1082ccd3cc82317da5d546dc0063d1478382180d authored about 2 years ago by Alex <[email protected]>
Use quantised gradients in gpu_hist histograms (#8246)

8f776771934478acdbe0b9b4a2724ffa812c034b authored about 2 years ago by Rory Mitchell <[email protected]>
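
The commit title above does not say why the gradients are quantised. One commonly cited benefit of accumulating fixed-point integers rather than floats is that integer addition is associative, so the sum in a histogram bin does not depend on the order in which parallel threads add to it. The sketch below illustrates only that property in plain Python. The `quantise` and `accumulate` helpers and the scale are hypothetical and are not XGBoost's actual scheme, and the input values are deliberately extreme to make the effect visible.

```python
def quantise(value, scale):
    """Map a float to a fixed-point integer (hypothetical scheme)."""
    return round(value * scale)


def accumulate(values, start):
    """Add values one at a time, as a histogram bin would."""
    total = start
    for v in values:
        total = total + v
    return total


# The same three "gradients" arriving in two different orders.
gradients = [1e16, 1.0, -1e16]
reordered = [1e16, -1e16, 1.0]

# Floating-point accumulation depends on the order of addition.
float_sum_a = accumulate(gradients, 0.0)
float_sum_b = accumulate(reordered, 0.0)

# Integer accumulation gives the same result in any order.
scale = 1024
int_sum_a = accumulate([quantise(g, scale) for g in gradients], 0)
int_sum_b = accumulate([quantise(g, scale) for g in reordered], 0)

print("float sums equal:", float_sum_a == float_sum_b)
print("integer sums equal:", int_sum_a == int_sum_b)
```
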
Fix sparse threshold warning. (#8268)

4056974e371439280f3e74b875dd80f5aafb1cc9 authored about 2 years ago by Jiaming Yuan <[email protected]>
[pyspark] Add validation for param 'early_stopping_rounds' and 'validation_indicator_col' (#8250)

Signed-off-by: Weichen Xu <[email protected]>

ff71c69adf801f0dd9b8121f0f0a8d7cb18dfdb7 authored about 2 years ago by WeichenXu <[email protected]>
[doc] Fix sphinx build. (#8270)

0cd11b893a7952a3913c1fe9bba516f49a9a568c authored about 2 years ago by Jiaming Yuan <[email protected]>
Enable OpenSSF Scorecard GitHub Action (#8263)

* chore: enable scorecard github action

Signed-off-by: Joyce Brum <[email protected]>

...

be5b95e74378d89717d6c1dc7ccb5ab9c03c3aba authored about 2 years ago by Joyce <[email protected]>
[jvm-packages] fix spark-rapids compatibility issue (#8240)

* [jvm-packages] fix spark-rapids compatibility issue

spark-rapids (from 22.10) has shimmed G...

8d247f0d64bee392857b330b20bfe6086798eb7b authored about 2 years ago by Bobby Wang <[email protected]>
[pyspark] Fix xgboost spark estimator dataset repartition issues (#8231)

ab342af24261b4708c5a2b61e40c25c1b433a2d1 authored about 2 years ago by WeichenXu <[email protected]>
Add checks to C pointer arguments. (#8254)

3fd331f8f2984c51ef2fbb40358a3cf9115d9274 authored about 2 years ago by Jiaming Yuan <[email protected]>
Optional by-column histogram build. (#8233)

Co-authored-by: dmitry.razdoburdin <[email protected]>

eb7bbee2c9ceec9961eb2ebe91999bc4de2f7b71 authored about 2 years ago by Dmitry Razdoburdin <[email protected]>
Initial support for IPv6 (#8225)

- Merge rabit socket into XGBoost.
- Dask interface support.
- Add test to the socket.

b791446623977516a47d4795b2860dd590242335 authored about 2 years ago by Jiaming Yuan <[email protected]>
JNI wrapper for the collective communicator (#8242)

7d43e74e71c1a0f6526fda78d46395c883d14cb0 authored about 2 years ago by Rong Ou <[email protected]>
Calculate `base_score` based on input labels for mae. (#8107)

Fit an intercept as base score for abs loss.

fffb1fca522da1524350b1a563580f2e4e115ef4 authored about 2 years ago by Jiaming Yuan <[email protected]>
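
The intercept idea in the commit above can be illustrated without XGBoost: for absolute error, the constant prediction that minimises the total loss is a median of the labels. The sketch below is a plain-Python illustration of that fact only. The helper names `total_abs_error` and `best_constant_for_mae` are hypothetical, and the code does not reproduce how XGBoost itself estimates `base_score`.

```python
from statistics import median


def total_abs_error(labels, constant):
    """Sum of absolute errors when every prediction equals `constant`."""
    return sum(abs(y - constant) for y in labels)


def best_constant_for_mae(labels):
    """Return a constant that minimises total absolute error (a median)."""
    return median(labels)


labels = [1.0, 2.0, 3.0, 4.0, 100.0]
intercept = best_constant_for_mae(labels)
mean = sum(labels) / len(labels)

# The median ignores the outlier; the mean is pulled towards it
# and therefore incurs a larger absolute error.
print("median:", intercept, "loss:", total_abs_error(labels, intercept))
print("mean:", mean, "loss:", total_abs_error(labels, mean))
```
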
[pyspark] make the model saved by pyspark compatible (#8219)

Co-authored-by: Hyunsu Cho <[email protected]>

4f42aa5f1224cdbf4e2fff8631e055182a3f7e35 authored about 2 years ago by Bobby Wang <[email protected]>
[pyspark] fix empty data issue when constructing DMatrix (#8245)

Co-authored-by: Hyunsu Philip Cho <[email protected]>

520586ffa71c000da154452cc901008069a0256c authored about 2 years ago by Bobby Wang <[email protected]>
[CI] Retire Jenkins server (#8243)

70df36c99cf7112319a02c25f7aac3f39a718f33 authored about 2 years ago by Philip Hyunsu Cho <[email protected]>
Mitigate flaky data iter test. (#8244)

- Reduce the number of batches.
- Verify labels.

2e63af61174504e84d66804f009bed165dd3fdbf authored about 2 years ago by Jiaming Yuan <[email protected]>
Make `QuantileDMatrix` default to sklearn estimators. (#8220)

bdf265076d85d84ca6baba05d8524280ddfe5637 authored about 2 years ago by Jiaming Yuan <[email protected]>
Common interface for collective communication (#8057)

* implement broadcast for federated communicator

* implement allreduce

* add communicator ...

a2686543a9febfbc960a6596156b04b6fb2e4096 authored about 2 years ago by Rong Ou <[email protected]>
Prepare for improving Windows networking compatibility. (#8234)

* Prepare for improving Windows networking compatibility.

* Include dmlc filesystem indirectl...

bc818316f25fc28d8ab8c1b42eeafc211fd94065 authored about 2 years ago by Jiaming Yuan <[email protected]>
[CI] Use binary R dependencies on Windows. (#8241)

dd44ac91b844c47f2b97a20a3e7f7a7a5273748a authored about 2 years ago by Jiaming Yuan <[email protected]>
[CI] Don't require manual approval for master branch (#8235)

23faf656ad47d89b0d230b06d8f7e8c61ae583c6 authored about 2 years ago by Philip Hyunsu Cho <[email protected]>
[CI] Migrate CI pipelines from Jenkins to BuildKite (#8142)

* [CI] Migrate CI pipelines from Jenkins to BuildKite

* Require manual approval

* Less ver...

e888eb2fa9e8033881f222852f58820e167c557c authored about 2 years ago by Philip Hyunsu Cho <[email protected]>
Drop use of deleted virtual function to support older MacOS (#8226)

* Support older MacOS

* Update json.h

b397d64c968dd9bde14c28cddfa0263236bbc303 authored about 2 years ago by Philip Hyunsu Cho <[email protected]>
Updated dart.rst with correct links (#8229)

Updated the DART paper link because the old link was broken.

dc07137a2c5804399e99f1893594e3ab5beadbf7 authored about 2 years ago by Rehan Guha <[email protected]>
Add `max_cat_threshold` to GPU and handle missing cat values. (#8212)

b5eb36f1afaff0338c1493d3dcf1602a7b770200 authored about 2 years ago by Jiaming Yuan <[email protected]>
Copy data from Ellpack to GHist. (#8215)

441ffc017aee142cbcbae9b7f74754fd5d9d1ebd authored about 2 years ago by Jiaming Yuan <[email protected]>
[pyspark] Cleanup the comments (#8217)

7ee10e3dbdd81f8dc00bf34da34c90dd8e7b244f authored about 2 years ago by Bobby Wang <[email protected]>
Fix dask interface with latest cupy. (#8210)

ada4a86d1c3687ba78392993e6b2902d69175ef2 authored about 2 years ago by Jiaming Yuan <[email protected]>
Optimization/buildhist/hist util (#8218)

* BuildHistKernel optimization

Co-authored-by: dmitry.razdoburdin <[email protected]...

deae99e662caf3c126184a8416e960295aa6024a authored about 2 years ago by Dmitry Razdoburdin <[email protected]>
Fix dask.py lint error (#8216)

b78bc734d9f75e3d6179583e72a450cd899e12d3 authored about 2 years ago by Rong Ou <[email protected]>
Work around MSVC behavior wrt constexpr capture (#8211)

* Work around MSVC behavior wrt constexpr capture

* Fix lint

56395d120b29a51a0980f64bd1e36285dbf11400 authored about 2 years ago by Philip Hyunsu Cho <[email protected]>
[doc] Update prediction.rst (#8214)

a868498c18aa73190295897b8cd143bdb5efe903 authored about 2 years ago by CW <[email protected]>
Mark parameter validation non-experimental. (#8206)

8dac90a593421f8804f32fa659e2f879b2a23589 authored about 2 years ago by Jiaming Yuan <[email protected]>
Set max message size in insecure gRPC (#8203)

d6e2013c5f3df0fff702cf3bfbdbdbc911e566dc authored about 2 years ago by Rong Ou <[email protected]>
[pyspark] Fixing xgboost.spark python doc (#8200)

Signed-off-by: Weichen Xu <[email protected]>

651f0a8889924397f3a1263e2c4f7625a8b88bc7 authored about 2 years ago by WeichenXu <[email protected]>
[pyspark] Add param validation for "objective" and "eval_metric" param, and remove invalid booster params (#8173)

Signed-off-by: Weichen Xu <[email protected]>

d03794ce7a6d5772bd5c3b7d74b7a92557f2e984 authored about 2 years ago by WeichenXu <[email protected]>
Fix release script. (#8187) (#8195)

9b32e6e2dc69c64cac89a074ef72074f386fec75 authored about 2 years ago by Jiaming Yuan <[email protected]>
[pyspark] Implement SparkXGBRanker estimator (#8172)

Signed-off-by: Weichen Xu <[email protected]>

f4628c22a4a26eca581255bac41133e48dcf9ed2 authored about 2 years ago by WeichenXu <[email protected]>
[CI] Prune unused archs from libnccl (#8179)

* [CI] Prune unused archs from libnccl

* Put pruning logic in CI directory

* Don't use --c...

35ef8abc27c1d6900427619dd5789d2801809792 authored about 2 years ago by Philip Hyunsu Cho <[email protected]>
Allow insecure gRPC connections for federated learning (#8181)

* Allow insecure gRPC connections for federated learning

* format

ad3bc0edeedc25c917cbf4699df61c91cd9fce65 authored about 2 years ago by Rong Ou <[email protected]>
[pyspark] Make XGBoost estimator support using a sparse matrix as an optimization (#8145)

Signed-off-by: Weichen Xu <[email protected]>

53d2a733b0237ae611bf7c751c33e1be9192a1e2 authored about 2 years ago by WeichenXu <[email protected]>
Optimise histogram kernels (#8118)

1703dc330f1d2624463b2ba861e592ce40fcc158 authored about 2 years ago by Rory Mitchell <[email protected]>
Use make on i system (#8178)

Co-authored-by: GavinZhang <[email protected]>

40a10c217de15cea7b0fa51f175b5c65633f788a authored about 2 years ago by Gavin Zhang <[email protected]>
Bump hadoop-common from 3.2.3 to 3.2.4 in /jvm-packages/xgboost4j-flink (#8157)

Bumps hadoop-common from 3.2.3 to 3.2.4.

---
updated-dependencies:
- dependency-name: org.a...

93966b0d1907e73638fb225958bbb4fec4e977a8 authored about 2 years ago by dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Grammar Fix in Introduction to Boosted Trees (#8166)

Added "of" to "objective functions is that they consist of two parts" in line 32 of ./doc/tutori...

a9458fd8440dd23a0a26238346bf35f718423e79 authored about 2 years ago by Andy Kattine <[email protected]>
Edit grammar in custom metric tutorial (#8163)

fa869eebd9501a3f9f876388ddc04a416182e89f authored about 2 years ago by Ravi Makhija <[email protected]>
Tune cuda architectures (#8152)

f421c26d3563da351f54a7baf89574521929b916 authored about 2 years ago by Rory Mitchell <[email protected]>
Support CPU input for device `QuantileDMatrix`. (#8136)

- Copy `GHistIndexMatrix` to `Ellpack` when needed.

16bca5d4a189af8febbf05150fc512f283e56eef authored about 2 years ago by Jiaming Yuan <[email protected]>
[dask] Deterministic rank assignment. (#8018)

36e7c5364da6baa0bb0bcae9307902d4d6e7881b authored about 2 years ago by Jiaming Yuan <[email protected]>
Simplify Python getting started example (#8153)

Load data set via `sklearn` rather than a local file path.

20d1bba1bb8b3aa0ba3762e99e21117b416a8567 authored about 2 years ago by Ravi Makhija <[email protected]>
[CI] Fix R build on Jenkins. (#8154)

d868126c394744e8b458bc16f8c63cc003b75024 authored about 2 years ago by Jiaming Yuan <[email protected]>
Use black on more Python files. (#8137)

570f8ae4ba4ce6a14c0e334662be0f3fda2ff5eb authored about 2 years ago by Jiaming Yuan <[email protected]>
[doc] Clarification for feature importance. (#8151)

bdb291f1c216025d75b2155b29e2c5396195741c authored about 2 years ago by Jiaming Yuan <[email protected]>
Fix loading DMatrix binary in distributed env. (#8149)

- Try to load DMatrix binary before trying to parse text input.
- Remove some unmaintained code.

446d536c23c5451eaf2879c5b266a2a68ceb07ec authored about 2 years ago by Jiaming Yuan <[email protected]>
Update PyPI wheel size limit. (#8150)

8fc60b31bc21b344e641f68092f14b809fbf63aa authored about 2 years ago by Jiaming Yuan <[email protected]>
Use config_context in sklearn interface. (#8141)

9ae547f9944bacabf158f487972fba632c62f786 authored about 2 years ago by Jiaming Yuan <[email protected]>
[pyspark] support a list of feature column names (#8117)

03cc3b359c98f267a39bfcfe5c7186b410bf986c authored about 2 years ago by Bobby Wang <[email protected]>
Update CUDA docker image and NCCL. (#8139)

bcc8679a052fb417de327b0271a7d5a5e47d29b3 authored about 2 years ago by Jiaming Yuan <[email protected]>
In PySpark Estimator example use the model with validation_indicator (#8131)

* use the validation_indicator model

* use the validation_indicator model for regression

ff471b3fabd1cb597761cd28335081e5f763fff8 authored about 2 years ago by Praateek Mahajan <[email protected]>
Quantile DMatrix for CPU. (#8130)

- Add a new `QuantileDMatrix` that works for both CPU and GPU.
- Deprecate `DeviceQuantileDMa...

d87f69215e729415c193c84ebe7343e1e5158acc authored about 2 years ago by Jiaming Yuan <[email protected]>
Fix compatibility with latest cupy. (#8129)

* Fix compatibility with latest cupy.

* Freeze mypy.

2cba1d9fcc2e03b366bbadac9db90693b083bbf1 authored about 2 years ago by Jiaming Yuan <[email protected]>
[Doc] Indicate lack of py-xgboost-gpu on Windows (#8127)

24c237308097b693b744af2ad1f86f44be068523 authored about 2 years ago by Philip Hyunsu Cho <[email protected]>
Implement iterative DMatrix for CPU. (#8116)

2c70751d1ed9d51e1108457ae216027883993845 authored about 2 years ago by Jiaming Yuan <[email protected]>
[pyspark] Cleanup data processing. (#8088)

- Use numpy stack for handling list of arrays.
- Reuse concat function from dask.
- Prepare ...

546de5efd2aba5fc1e1e1dc3bfe90d82641cdd11 authored about 2 years ago by Jiaming Yuan <[email protected]>
Move pylint helper from dmlc-core. (#8101)

* Move pylint helper from dmlc-core.

- Move the helper into the XGBoost ci_build.
- Run it w...

3970e4e6bb1688b662abd3c58cda615c5ba7892d authored about 2 years ago by Jiaming Yuan <[email protected]>
Fix feature weights with multiple column sampling. (#8100)

7785d65c8a086bc757a86472fb0ffc4d9c918216 authored about 2 years ago by Jiaming Yuan <[email protected]>
Prepare gradient index for Quantile DMatrix. (#8103)

* Prepare gradient index for Quantile DMatrix.

- Implement push batch with adapter batch.
- ...

4a4e5c7c186cd65aeb2037bf6b187cb87fd58d46 authored about 2 years ago by Jiaming Yuan <[email protected]>
Refactor split valuation kernel (#8073)

1be09848a75033d4e11cbf9cf91224d9e036a2e6 authored about 2 years ago by Rory Mitchell <[email protected]>
docs: fix simple typo, cannonical -> canonical (#8099)

There is a small typo in src/common/partition_builder.h.

Should read `canonical` rather than ...

cb40bbdadd44db3e49dba3d34dc1ff76c3107760 authored about 2 years ago by Tim Gates <[email protected]>
[MSVC][std:c++latest] Fix compiler error (#8093)

Co-authored-by: QuellaZhang <[email protected]>

703261e78f9639ccb58751f51ff3073a7311b118 authored about 2 years ago by QuellaZhang <[email protected]>
Cleanup data generator. (#8094)

- Avoid duplicated definition of data shape.
- Explicitly define numpy iterator for CPU data.

ef11b024e8c18a590ee7d0c8f9bba6fee064e441 authored about 2 years ago by Jiaming Yuan <[email protected]>
Limit `max_depth` to 30 for GPU. (#8098)

5156be0f49238476511adfe541e643e65b4a897f authored about 2 years ago by Jiaming Yuan <[email protected]>
[Python] Require black and isort for new Python files. (#8096)

* [Python] Require black and isort for new Python files.

- Require black and isort for spark ...

8bdea72688a3d1999e084d683ef1e14469233800 authored about 2 years ago by Jiaming Yuan <[email protected]>
[pyspark] User guide doc and tutorials (#8082)

Co-authored-by: Bobby Wang <[email protected]>

f23cc92130a172fa7a5fbdf5bb605c0bc5183ffe authored over 2 years ago by WeichenXu <[email protected]>
[PySpark] change the returning model type to string from binary (#8085)

* [PySpark] change the returning model type to string from binary

XGBoost pyspark can be can ...

f801d3cf15b5b87e78377a8524391f005c29aea0 authored over 2 years ago by Bobby Wang <[email protected]>
[dask] Mitigate non-deterministic test. (#8077)

2365f82750ce29f11df7939800e2bd5e463e078c authored over 2 years ago by Jiaming Yuan <[email protected]>
Remove unused updater basemaker (#8091)

7a6b711eb88eb3b1bbd64faa0ab6c809298dc9ac authored over 2 years ago by Rong Ou <[email protected]>
[CI] Clear workspace after budget check (#8092)

* [CI] Clear workspace after budget check

* Windows too

4325178822dd8c5c89e743fb05610a3307526a65 authored over 2 years ago by Philip Hyunsu Cho <[email protected]>
Small cleanups to various data types. (#8086)

- Use `bst_bin_t` in batch param constructor.
- Use `StringView` to avoid `std::string` when ap...

4083440690d5c4bd7a2e5768afcb248b950055ab authored over 2 years ago by Jiaming Yuan <[email protected]>
[doc] Integrate pyspark module into sphinx doc [skip ci] (#8066)

e28f6f66577eb31451e5d3fbf4be0a21626f7372 authored over 2 years ago by Jiaming Yuan <[email protected]>
Check cudf lazily (#8084)

579ab23b10969f473d30c5f18b546963941eccff authored over 2 years ago by Rafail Giavrimis <[email protected]>
[PySpark] add gpu support for spark local mode (#8068)

a33f35eecf6b4ed1244a4369bd3e0aaaf64d71f5 authored over 2 years ago by Bobby Wang <[email protected]>
[PySpark] fix raw_prediction_col parameter and minor cleanup (#8067)

91bb9e2cb3ef926d88e307ef9864459ca097a664 authored over 2 years ago by Bobby Wang <[email protected]>
Mitigate flaky GPU test. (#8078)

The flakiness is caused by the global random engine, which will take some time to fix.

0ce80b7bcf118758960e54b353801e92254441cb authored over 2 years ago by Jiaming Yuan <[email protected]>
Fix GPU quantile distributed test. (#8076)

7a5586f3db8585dbec2b08f660b765ad5d2c1d2e authored over 2 years ago by Jiaming Yuan <[email protected]>
[dask] Fix potential error in demo. (#8079)

* Use dask_cudf instead.

8fccc3c4ad3ea4821ecfc74942771c10cd9b074a authored over 2 years ago by Jiaming Yuan <[email protected]>
Make test for categorical data deterministic. (#8080)

647d3844ddfa141351c183daa4d82ab9bdebef79 authored over 2 years ago by Jiaming Yuan <[email protected]>
Update Python requirement to >=3.8. (#8071)

Additional changes:
- Use mamba for CPU test on Jenkins.
- Cleanup CPU test dependencies.
-...

dae7a41baa8305597843ddab1219170124172663 authored over 2 years ago by Jiaming Yuan <[email protected]>
Split up column matrix initialization. (#8060)

* Split up column matrix initialization.

This PR splits the column matrix initialization into...

8dd96013f1ee6d9dd78882ab752896355a5ee355 authored over 2 years ago by Jiaming Yuan <[email protected]>
[CI] Fix S3 uploads (#8069)

* [CI] Fix S3 upload issues

* Don't launch Docker containers when uploading to S3

36cf979b823f2bb45efad2f90aa61eed05da7195 authored over 2 years ago by Philip Hyunsu Cho <[email protected]>