Ecosyste.ms: OpenCollective

An open API service for software projects hosted on Open Collective.

github.com/dmlc/xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
https://github.com/dmlc/xgboost

Modify caching allocator/vector and fix issues relating to inability to train large datasets (#4615)

7a388cbf8b6b80e5a6fe159c8abe2e52af12d33e authored over 5 years ago by sriramch <[email protected]>
fix auc error in distributed mode caused by unbalanced dataset (#4645)

cd1526d3b1155432fca82e8b4895bc0b827b4d29 authored over 5 years ago by Xu Xiao <[email protected]>
fix spark tests on machines with many cores (#4634)

30204b50fe4a39576b61491920166cc390cc9e25 authored over 5 years ago by Rong Ou <[email protected]>
[jvm-packages] Expose setMissing method in XGBoostClassificationModel / XGBoostRegressionModel (#4643)

d333918f5e95ce0af7a71993541632514cb5beba authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Fix early stopping in the Python package (#4638)

* Fix #4630, #4421: Preserve correct ordering between metrics, and always use last metric for ea...

1aaf4a679de62cbb03062aa4953813da5af4a94e authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Eliminate FutureWarning: Series.base is deprecated (#4337)

* Remove all references to data.base

Should eliminate the deprecation warning in issue #4300
...

562d9ae963acdb45b52840ac29361ec83c038ba9 authored over 5 years ago by Marcos <[email protected]>
Fix CPU hist init for sparse dataset. (#4625)

* Fix CPU hist init for sparse dataset.

* Implement sparse histogram cut.
* Allow empty feat...

d9a47794a5b5ea47808f83e559b363e53c96a26f authored over 5 years ago by Jiaming Yuan <[email protected]>
Empty evaluation list in early stopping should produce meaningful error message (#4633)

* Empty evaluation list should not break early stopping

* Fix lint

* Update callback.py

b7a1f22d24eb551cd9525e85c8c3ec98febcf429 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Add warning when save_model() is called from scikit-learn interface (#4632)

4df246191f9c0dcbe38ff6736fc489eac9d7d6a6 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Support ndcg- and map- (#4635)

96bf91725b4d283dcf05d931ffe5bec1e1042ba7 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
[R] Use built-in label when xgb.DMatrix is given to xgb.cv() (#4631)

* Use built-in label when xgb.DMatrix is given to xgb.cv()

* Add a test

* Fix test

* Bu...

4e9fad74eb4df6a04758a6b919826388c00530f3 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
pytest tests/python fails if no pandas installed (#4620)

* _maybe_pandas_xxx should return their arguments unchanged if no pandas installed

* Tests sh...

986fee60229c77920e93ae3b374c2e674faee78d authored over 5 years ago by Oleksandr Pryimak <[email protected]>
Fix external memory for get column batches. (#4622)

* Fix external memory for get column batches.

This fixes two bugs:

* Use PushCSC for get c...

45876bf41bc48cc54e01f7fc327b066c2e060011 authored over 5 years ago by Jiaming Yuan <[email protected]>
Support Dask 2.0 (#4617)

a30176907f5628020f8238464c2f58907ee724d1 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Add to documentation how to run tests locally (#4610)

* Add to documentation how to build native unit tests

* Add instructions to run Python tests ...

923e6c86baf6e7f9614b042d7334583bd81ba711 authored over 5 years ago by Oleksandr Pryimak <[email protected]>
fix gpu predictor when dmatrix is mismatched with model (#4613)

63ec95623dc3d6269201d65e27d57ced86186605 authored over 5 years ago by Rong Ou <[email protected]>
Optimize ‘hist’ for multi-core CPU (#4529)

* Initial performance optimizations for xgboost

* remove includes

* revert float->double
...

4d6590be3c9a043d44d9e4fe0a456a9f8179ec72 authored over 5 years ago by Egor Smirnov <[email protected]>
[jvm-packages] delete all constraints from spark layer about obj and eval metrics and handle error in jvm layer (#4560)

* temp

* prediction part

* remove supported*

* add for test

* fix param name

* ad...

abffbe014eb1a897f22fbd7671797c4e65415a0c authored over 5 years ago by Nan Zhu <[email protected]>
Use Sphinx 2.1+ to compile documentation [skip ci] (#4609)

dd01f7c4f5c4043cb52cf1e76b2c0593d5dea5c3 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Fix doc for customized objective/metric [skip ci] (#4608)

cd3a3f99da3869d00c82701f34eef026fc9b204a authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Doc and demo for customized metric and obj. (#4598)

Co-Authored-By: Theodore Vasiloudis <[email protected]>

5b2f805e7474c1d18f0a5262f8964d49f0ce4d89 authored over 5 years ago by Jiaming Yuan <[email protected]>
Implement tree model dump with code generator. (#4602)

* Implement tree model dump with a code generator.

* Split up generators.
* Implement graphv...

8bdf15120a3b2f2f698f6f2444fb145fe9a71817 authored over 5 years ago by Jiaming Yuan <[email protected]>
[jvm-packages] fix silly bug in feature scoring (#4604)

fe2de6f415ffbd7ae09bd7812a966c43f26938f2 authored over 5 years ago by Nan Zhu <[email protected]>
return values in Init and Finalize (#96)

* make init function return values

* address the comments

65b718a5e786bd7d0a850f3fa1df0dbdab023eb1 authored over 5 years ago by Nan Zhu <[email protected]>
allow not stop process in error (#97)

* allow not stop process in error

* fix merge error

fc85f776f41ab25af2f85ff58713c33128148611 authored over 5 years ago by Nan Zhu <[email protected]>
Add instruction to run formatting checks locally [skip ci] (#4591)

1f98f18cb891996724af0d8d911059c1b896a053 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Update doc for feature constraints and `n_gpus`. (#4596)

* Update doc for feature constraints.

* Fix some warnings.

* Clean up doc for `n_gpus`.

2cff7351265ad9bc4450d9b03b9963a4b64ada87 authored over 5 years ago by Jiaming Yuan <[email protected]>
Set reg_lambda=1e-5 for scikit-learn-like random forest classes. (#4558)

9fa29ad7530de6f02eec0f3d54a2f550130cfe5e authored over 5 years ago by Andy Adinets <[email protected]>
Fix docstring for XGBModel.predict() [skip ci] (#4592)

30e1cb4e9e6eec58f90db4ea06523b076374479a authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
fix benchmark_tree.py (#4593)

77fc28427d96808e74d028697ae8122c900c9ee5 authored over 5 years ago by Rong Ou <[email protected]>
Address some sphinx warnings and errors, add doc for building doc. (#4589)

9494950ee717d73f8594962f3878b18d9019ae08 authored over 5 years ago by Jiaming Yuan <[email protected]>
fix compiler warning (#4588)

6125521caf96d33493054b1808e1988cd2a69442 authored over 5 years ago by Rong Ou <[email protected]>
Fix race condition in interaction constraint. (#4587)

* Split up the kernel to sync write.

* QueryNode is no-longer used in Query, but kept for tes...

fdf27a5b8294895b4287e48a4d6ae4ee6f84d04a authored over 5 years ago by Jiaming Yuan <[email protected]>
Refactor out row partitioning logic from gpu_hist, introduce caching device vectors (#4554)

221e163185061d5ca4c548bb3ca65f63ece3bf42 authored over 5 years ago by Rory Mitchell <[email protected]>
[CI] Specify account ID when logging into ECR Docker registry (#4584)

* [CI] Specify account ID when logging into ECR Docker registry

* Do not display awscli login...

0c50f8417a36ff731b35a5405fb25f70c5ad8c93 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Feature interaction for GPU Hist. (#4534)

* GPU hist Interaction Constraints.
* Duplicate related parameters.
* Add tests for CPU intera...

ae05948e32214740c48ec5bb4ef3fa0785842226 authored over 5 years ago by Jiaming Yuan <[email protected]>
[CI] Remove CUDA 8.0 from CI pipeline (#4580)

570374effe2f84373164dec864868519ca48b0e9 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Deprecate single node multi-gpu mode (#4579)

* deprecate multi-gpu training

* add single node

* add warning

e94f85f0e4268f01947a951d3a946a74b8fb97c3 authored over 5 years ago by Rong Ou <[email protected]>
Optimizations for quantisation on device (#4572)

* - do not create device vectors for the entire sparse page while computing histograms...
- ...

675765433758ad1399aba97d61a0ccc8a207cb7f authored over 5 years ago by sriramch <[email protected]>
Remove doc about not supporting cuda 10.1 (#4578)

ba1d8487670f4ff446a74eb0343de70101162cef authored over 5 years ago by Rong Ou <[email protected]>
[jvm-packages] updated kryo dependency to 2.22 (#4575)

7ae11c9284fa221f72d9daa8caf844b55e617a83 authored over 5 years ago by Daniel Stahl <[email protected]>
Set the appropriate device before freeing device memory... (#4566)

* - set the appropriate device before freeing device memory...
- pr #4532 added a global mem...

90f683b25be28150487dead9a695b20dc464d0b4 authored over 5 years ago by sriramch <[email protected]>
Choose the appropriate tree method *only* when the tree method is auto (#4571)

* Remove redundant checks.

a22368d2100d8f964c1c47fe8c04bfbe17b65060 authored over 5 years ago by sriramch <[email protected]>
Mark SparsePageDmatrix destructor default. (#4568)

66f9951d7087801cd4a2d5aa360551029b6835b7 authored over 5 years ago by Jiaming Yuan <[email protected]>
Offload some configurations into GBM. (#4553)

This is part 1 of refactoring configuration.

* Move tree heuristic configurations.
* Split...

c5719cc45757945e9010f3d230ca59e4c7a84c8e authored over 5 years ago by Jiaming Yuan <[email protected]>
- training with external memory - part 2 of 2 (#4526)

* - training with external memory - part 2 of 2
- when external memory support is enabled, b...

a2042b685a5c8664aac5c25ba0f03c8dc497a0af authored over 5 years ago by sriramch <[email protected]>
Remove remaining reg:linear. (#4544)

4591039ebace6cf5a3f859a00f0212b072585285 authored over 5 years ago by Jiaming Yuan <[email protected]>
Fix Python demo and doc. (#4545)

* Remove old doc.
* Fix checking __stdin__.

4e9965cb9d9db723194196eaa62943d004bdd6f0 authored over 5 years ago by Jiaming Yuan <[email protected]>
Add `rmsle` metric and `reg:squaredlogerror` objective (#4541)

2f1319f273339f5c485d9bbaba865e662f714627 authored over 5 years ago by Jiaming Yuan <[email protected]>
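
For readers unfamiliar with the metric named in the commit above, here is a minimal sketch of root mean squared logarithmic error (RMSLE) as it is conventionally defined: the square root of the mean of squared differences between log(1 + prediction) and log(1 + label). The function name `rmsle` and its input checks are illustrative assumptions. This is not XGBoost's own implementation, which the listing does not show.

```python
import math


def rmsle(preds, labels):
    """Conventional RMSLE over two equal-length sequences (illustrative helper)."""
    if len(preds) != len(labels) or len(preds) == 0:
        raise ValueError("preds and labels must be non-empty and of equal length")
    total = 0.0
    for p, y in zip(preds, labels):
        # log1p(x) is only defined for x > -1
        if p <= -1 or y <= -1:
            raise ValueError("all values must be greater than -1")
        total += (math.log1p(p) - math.log1p(y)) ** 2
    return math.sqrt(total / len(preds))


if __name__ == "__main__":
    print(rmsle([2.5, 0.0, 2.0], [3.0, 0.5, 2.0]))
```

Because the error is taken on a log scale, under-predicting a large label is penalised in proportion to the ratio rather than the absolute gap.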
Overload device memory allocation (#4532)

* Group source files, include headers in source files

* Overload device memory allocation

9683fd433e7a50c73be5c41a4e4f6075dcefb56e authored over 5 years ago by Rory Mitchell <[email protected]>
Fix tweedie metric string. (#4543)

da21ac0cc22dc283fc4f551da7ebdb66d61fad20 authored over 5 years ago by Jiaming Yuan <[email protected]>
Ensure gcc is at least 5.x (#4538)

* make sure that xgboost has gcc 5.x at the very least to build on gcc tool chain

59ae42a1792697165507a04f4ef4879487aba2ca authored over 5 years ago by sriramch <[email protected]>
Use yaml.safe_load. (#4537)

afa99e6d9deec31ee249c12694ab9e1af9bea4b8 authored over 5 years ago by Jiaming Yuan <[email protected]>
Fix C++11 config parser (#4521)

* Fix C++11 config parser
* Use raw strings to improve readability of regex
* Fix compilation ...

3f2fe25a32f8ca3b1f160488e3a152c2c8649a0f authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Refactor histogram building code for gpu_hist (#4528)

23a10c8339fbe5fed13b6b51924929705bd34b38 authored over 5 years ago by Rory Mitchell <[email protected]>
Deprecate gpu_exact, bump required cuda version in docs (#4527)

399fabed49a3c8e1feb7db26d13a1277d380326d authored over 5 years ago by Rory Mitchell <[email protected]>
Fix #4497: Enable feature importance property for DART booster (#4525)

c2a3902ba3cf3263c733ef9ed686b24ee9223e8b authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Enforce exclusion between pred_contribs=True and pred_interactions=True (#4522)

ea4441775405b97d99ebc6b997b726b64564723b authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Smarter choice of histogram construction for distributed gpu_hist (#4519)

* Smarter choice of histogram construction for distributed gpu_hist

* Limit omp team size in ...

fbbae3386a9b6ff3ed31fe4ef5106a3c4665c6e5 authored over 5 years ago by Rory Mitchell <[email protected]>
Simplify INI-style config reader using C++11 STL (#4478)

* simplify the config.h file

* revise config.h

* revised config.h

* revise format

* ...

dd60fc23e6dbe67a57b3e131bacc2140f59c02f7 authored over 5 years ago by fuhaoda <[email protected]>
Fix prediction from loaded pickle. (#4516)

b48f895027fcb159961857f1e24ab7a67eda14dd authored over 5 years ago by Jiaming Yuan <[email protected]>
- training with external memory part 1 of 2 (#4486)

* - training with external memory part 1 of 2
- this pr focuses on computing the quantiles u...

fed665ae8a98fb64818b60e75bebda6c4d32e770 authored over 5 years ago by sriramch <[email protected]>
Fix crash with approx tree method on cpu (#4510)

6e169007113c48b1bb95de473e31bce2dfb4bc30 authored over 5 years ago by sriramch <[email protected]>
De-duplicate GPU parameters. (#4454)

* Only define `gpu_id` and `n_gpus` in `LearnerTrainParam`
* Pass LearnerTrainParam through X...

c589eff941aacaa6386859af11d4ccedab266475 authored over 5 years ago by Jiaming Yuan <[email protected]>
- fix issues with training with external memory on cpu (#4487)

* - fix issues with training with external memory on cpu
- use the batch size to determine t...

a3fedbeaa884a40986dcd60bddd678c89a41b18c authored over 5 years ago by sriramch <[email protected]>
Fix dask API sphinx docstrings (#4507)

* Fix dask API sphinx docstrings

* Update GPU docs page

972f693eaf8d20b49b7a6dc57f8a1aa3ea6323bb authored over 5 years ago by Rory Mitchell <[email protected]>
add dll_path for cygwin users (#4499)

3f7e5d9c478e0cb94651e04a00a05fb9e2135c18 authored over 5 years ago by yellowdolphin <[email protected]>
Add native support for Dask (#4473)

* Add native support for Dask

* Add multi-GPU demo

* Add sklearn example

09b90d932961deef7a6fe44f19e4b0b2720fb2aa authored over 5 years ago by Rory Mitchell <[email protected]>
Revert hist init optimization. (#4502)

55e645c5f56559def985c0450ca747c00a538239 authored over 5 years ago by Jiaming Yuan <[email protected]>
Add python RF documentation (#4500)

8ddd2715eefa21555c77bbf1ac2ea314ac9f279e authored over 5 years ago by Rory Mitchell <[email protected]>
[jvm-packages] Add back reg:linear for scala. (#4490)

* Add back reg:linear for scala.

* Fix linter.

0ce300e73ad9c586e41762d07f7acdcfa03f323d authored over 5 years ago by Jiaming Yuan <[email protected]>
Add support for cross-validation using query ID (#4474)

* adding support for matrix slicing with query ID for cross-validation

* hail mary test of un...

278562db133005f93edbffc74afc4b02b49bccca authored over 5 years ago by Bryan Woods <[email protected]>
Ensure pandas DataFrame column names are treated as strings in type error message (#4481)

5a567ec24979ddd3bd033f336ff15585e736701e authored over 5 years ago by Sean Owen <[email protected]>
[RFC] Version 0.90 release candidate (#4475)

* Release 0.90

* Add script to automatically generate acknowledgment

* Update NEWS.md

515f5f5c4779ff5361dcd796e22d55937e1048ea authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Update xgboost4j_spark_tutorial.rst (#4476)

adcd8ea7c6723e2297e35d52a56a2fa3dcc661bf authored over 5 years ago by Nan Zhu <[email protected]>
[CI] Add Python and C++ tests for Windows GPU target (#4469)

* Add CMake option to use bundled gtest from dmlc-core, so that it is easy to build XGBoost with...

cf2400036e4d4cac7fcd4daf70f0061b02137c07 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
added JSON vignette (#4439)

3e930e4f2d807f63f30be1d743bf5d3e4906a0cf authored over 5 years ago by ras44 <[email protected]>
only copy the model once when predicting multiple batches (#4457)

a9ec2dd295356978d61a4229e48e574cdca3aff9 authored over 5 years ago by Rong Ou <[email protected]>
add cuda 10.1 support (#4468)

df2cdaca502238f2dee5f2ef006778ed0491cd50 authored over 5 years ago by Rong Ou <[email protected]>
[CI] Add Windows GPU to Jenkins CI pipeline (#4463)

* Fix #4462: Use /MT flag consistently for MSVC target

* First attempt at Windows CI

* Dis...

c6f2a7e186c5fa4b745414fbd1bda1db8aac491f authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
[CI] Build XGBoost wheels with CUDA 9.0 (#4459)

* [CI] Build XGBoost wheels with CUDA 9.0

* Do not call archiveArtifacts for 8.0 wheel

e7d17ec4f4a091bac58c1d241be3f4969400b874 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
[CI] Cache two R build Docker containers (#4458)

b5f7cbfadf203f6e28ddd7543726346046df656b authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
mgpu predictor using explicit offsets (#4438)

* mgpu prediction using explicit sharding

be0f346ec907b9b8c76597c2ee308fcd53ba311a authored over 5 years ago by Rong Ou <[email protected]>
Correctly determine cuda version (#4453)

d16d9a99880c8e3a1872f84fc7242751bb5ac67e authored over 5 years ago by Rory Mitchell <[email protected]>
[BLOCKING][CI] Upgrade to Spark 2.4.3 (#4414)

* [CI] Upgrade to Spark 2.4.2

* Pass Spark version to build script

* Allow multiple --buil...

6ff994126a953ba01d581b5a1e61654fd14ae7a8 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
[jvm-packages] Automatically set maximize_evaluation_metrics if not explicitly given in XGBoost4J-Spark (#4446)

* Automatically set maximize_evaluation_metrics if not explicitly given.

* When custom_eval i...

18e4fc3690bf1309a747ca3273cf367dfeea5a00 authored over 5 years ago by Shaochen Shi <[email protected]>
Enable building with shared NCCL. (#4447)

* Add `BUILD_WITH_SHARED_NCCL` to CMake.

8da4907e89ae0cb174d2ed1ff06ca56d85cc1c78 authored over 5 years ago by Jiaming Yuan <[email protected]>
Fix list formatting in missing value tutorial in XGBoost4J-Spark

ade3f302373cf630f5115aca26cb40ea9cac78ac authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Fix list formatting in missing value tutorial in XGBoost4J-Spark

b511638ca10a4719e2df6f6e7571fd95a03aaf0a authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
[jvm-packages] Tutorial on handling missing values (#4425)

Add tutorial on missing values and how to handle those within XGBoost.

eabcc0e21081ce17224a7c3dffd39c979a27543d authored over 5 years ago by Daniel Hen <[email protected]>
Change obj name to `reg:squarederror` in learner. (#4427)

* Change memory dump size in R test.

5de7e12704b38d81a46471e02a791d4c586a7286 authored over 5 years ago by Jiaming Yuan <[email protected]>
In AUC and AUCPR metrics, detect whether weights are per-instance or per-group (#4216)

* In AUC and AUCPR metrics, detect whether weights are per-instance or per-group

* Fix C++ st...

8d1098a983e36479a7b3dc354425154a9ff35ea9 authored over 5 years ago by Xin Yin <[email protected]>
Make AUCPR work with multiple query groups (#4436)

* Make AUCPR work with multiple query groups

* Check AUCPR <= 1.0 in distributed setting

9252b686ae875997bac21ca411c3da15fd902732 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
max_digits10 guarantees float decimal roundtrip (#4435)

2 additional digits are not needed to guarantee that casting the decimal representation will res...

2be85fc62abf467286027018ef7869177155e4c8 authored over 5 years ago by ras44 <[email protected]>
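
The round-trip property named in the commit above can be checked with a short sketch. For IEEE-754 single precision, `max_digits10` is 9, so nine significant decimal digits are enough to recover any finite float32 exactly, while eight are not always enough. The helpers `to_float32` and `roundtrips` are hypothetical names for this illustration. The code emulates float32 with Python's `struct` module and is not taken from XGBoost.

```python
import struct


def to_float32(x):
    """Round a Python float (double) to the nearest IEEE-754 single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def roundtrips(value, digits):
    """Return True if printing with `digits` significant digits and parsing back
    yields the same float32 value."""
    f = to_float32(value)
    text = f"{f:.{digits}g}"
    return to_float32(float(text)) == f


if __name__ == "__main__":
    # 1000 + 2**-14 is exactly representable in float32; neighbouring floats
    # here are closer together than 8-digit decimals can distinguish.
    x = 1000.0 + 2.0 ** -14
    print(roundtrips(x, 8), roundtrips(x, 9))
```

The sample value lies just below 1024, where adjacent float32 values are 2^-14 apart but 8-digit decimals are 10^-4 apart, which is why the eighth digit alone cannot distinguish neighbours.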
Initial support for external memory in gpu_predictor (#4284)

feb6ae3e18686fbe87844ab0ca0116bbc68c14e4 authored over 5 years ago by Rong Ou <[email protected]>
Fix typo in xgboost_R.h (#4432)

54980b8959680a0da06a3fc0ec776e47c8cbb0a1 authored over 5 years ago by ras44 <[email protected]>
Upgrade dmlc-core (#4430)

c1e4a0f2c6fe48d1b00b26ea236f1b0f987f9d6c authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Make CMakeLists.txt compatible with CMake 3.3 (#4420)

* Make CMakeLists.txt compatible with CMake 3.3; require CMake 3.11 for MSVC

* Use CMake 3.12...

bfddc2c42cd7e9a3b6f50db2cdea815053cbbf9f authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Simplify bound checking in feature interaction constraints (#4428)

17df5fd296aaff18123c53a777f24111b3dd7336 authored over 5 years ago by Philip Hyunsu Cho <[email protected]>
Use feature interaction constraints to narrow search space for split candidates (#4341)

* Use feature interaction constraints to narrow search space for split candidates.

* fix clan...

4c74336384e2e4e32434850d9f2e76c8ed923238 authored over 5 years ago by Xu Xiao <[email protected]>