Ecosyste.ms: OpenCollective
An open API service for software projects hosted on Open Collective.
github.com/JuliaReinforcementLearning/ReinforcementLearning.jl
A reinforcement learning package for Julia
https://github.com/JuliaReinforcementLearning/ReinforcementLearning.jl
* fix 129
* add tests
* fix tests
92435e03572ef5d8eec54dffe700bcc9ee7e07ec authored almost 4 years ago by Jun Tian <[email protected]>5359c959d2145676ccd361536d1e3fd570872886 authored almost 4 years ago by findmyway <[email protected]>
6cffc7030a9173cde7b3be2b82b8e577c739ff50 authored almost 4 years ago by norci <[email protected]>
6899b5ee4921859d457a8735477baf6671ddf18c authored almost 4 years ago by Jun Tian <[email protected]>
Co-authored-by: norci <[email protected]>
102d39a30d92ce75aeadb742590677d65b32a118 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>0dc63a4bd4379fa7cdc2c7a4dd133af28570e478 authored almost 4 years ago by Jun Tian <[email protected]>
* docs: update README.md [skip ci]
* docs: update .all-contributorsrc [skip ci]
Co-authore...
efcc26007e4e1977ba29c48b22171126c2456ada authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>7010547018471eac97428aa2e54a507b86eb9bcd authored almost 4 years ago by Jun Tian <[email protected]>
* ignore vim generated temp files
* add JuliaRL_BasicDQN_EmptyRoom experiment
* add per-st...
7f4abb31fd0fd41c0f3607bae4ed788fbb5db1e8 authored almost 4 years ago by Sid-Bhatia-0 <[email protected]>* Hack to allow multidim actions in ppo
* Fix for single dim envs
* Handle single and mult...
5c7fefc0ebfc04fdd09b232c4b3f8696dbbdc77a authored almost 4 years ago by Albin Heimerson <[email protected]>Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
ec2b8496ac5a66ee913789f6235e6bc96f536d49 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>Co-authored-by: norci <[email protected]>
7ca5ab7cf2c1297a0c4b3aa748aa4dc44edecb90 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
461d62fa729a6a4f419913a781853f2030bb4301 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>b90712f59da5ebe083a534c8f101eadcdc24ef3a authored almost 4 years ago by Nerd <[email protected]>
* docs: update README.md [skip ci]
* docs: update .all-contributorsrc [skip ci]
Co-authore...
6eb2a301f9f5c6daa9ab50869bb3ea435955f4b6 authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>which sets the time badget of each experiment.
d978b7229b766e6d443af2dcfda3eb2b0e1381ca authored almost 4 years ago by norci <[email protected]>Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
c1403341ab04cffc04937dc7b6665c31f1f1d4a9 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>95a091b0cdcfebcb0a6de85d643f145dce468320 authored almost 4 years ago by Jun Tian <[email protected]>
* simplify the definition of environment wrappers
* simplify further
f936b96a9f7f9a9bd53e4e10912e8ce0b8520685 authored almost 4 years ago by Jun Tian <[email protected]>* docs: update README.md [skip ci]
* docs: update .all-contributorsrc [skip ci]
Co-authore...
01df29210a77044d9a88a4255018b54e607c7a68 authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>d973a4237f5dac813213147bd11301a7bb9e5a94 authored almost 4 years ago by Jun Tian <[email protected]>
* Add episode length for Bit Flipping Env
Add episode length for Bit Flipping Env
Fix typo
...
* add behavior cloning
* add TODO
* add experiment for bc
* fix test error
* update ...
9a8cb2d55d97847b119247ea1369ac2656f49654 authored almost 4 years ago by Jun Tian <[email protected]>60069e3a6ba4bfcf1620c3cfd71d569e0dad854f authored almost 4 years ago by norci <[email protected]>
10403d36ce257bdd95c6a48510936d378b483728 authored almost 4 years ago by Jun Tian <[email protected]>
facdcc02ad492ae42a81966877c92a6ee0d8df54 authored almost 4 years ago by Jun Tian <[email protected]>
f13dff7d7219e5d2160dd1a1f6d15d6ded3e9227 authored almost 4 years ago by Jun Tian <[email protected]>
95da206404c8ac493c617dbdeca745f66b85f0a0 authored almost 4 years ago by Jun Tian <[email protected]>
* simplify trajectory
* add VectorSATrajectory
51cab361883e1cea2acf3c92e8de535fa04cc8f1 authored almost 4 years ago by Jun Tian <[email protected]>Then it can be used for PreEpisodeStage & PostEpisodeStage.
454493ca08b29620632204a1ea6aa53e4a035e40 authored almost 4 years ago by norci <[email protected]>* docs: update README.md [skip ci]
* docs: update .all-contributorsrc [skip ci]
Co-authore...
ae8ca6f29adf8c0fcf7e7069b40173dce5d95f8f authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>57d1f5f24fae3d9b1d3ea5bf59fa6070a1ee1044 authored almost 4 years ago by Jun Tian <[email protected]>
1620e85e8b1c0395e2fdfb110dc4a7781bb6c6d2 authored almost 4 years ago by Jun Tian <[email protected]>
f39c0adf3eecdcae240493ac5d1144aa40cc4bc8 authored almost 4 years ago by norci <[email protected]>
Automatic JuliaFormatter.jl run
25d55d42ed7e9c901fe2257a33d06d45c406c669 authored almost 4 years ago by norci <[email protected]>* docs: update README.md [skip ci]
* docs: update .all-contributorsrc [skip ci]
Co-authore...
b0d39b21f5841d350e3f77f5d44707e9e2773a25 authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>af5977edd9b920bb264e288f9d6e3fd9e10eb585 authored almost 4 years ago by Jun Tian <[email protected]>
9fc50411b6f6888eace0146ddcdde148e1adb287 authored almost 4 years ago by Jun Tian <[email protected]>
2906e63b33f271e94f8dd2e93201100e730d1740 authored almost 4 years ago by Jun Tian <[email protected]>
dec20f0fef87996cf4d63efeae39df926b15537a authored almost 4 years ago by findmyway <[email protected]>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
0d2066273b47e86f8c0ef26982c86d368db79d2b authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
d5d4c0001cfeba2638affbd7e8191fb3bbe6f611 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>4a6f4924cbfb662674728c6b9a27806f2b82e54b authored almost 4 years ago by Jun Tian <[email protected]>
* docs: update README.md [skip ci]
* docs: update .all-contributorsrc [skip ci]
Co-authore...
4c5b85a7586d8f7699620dc27d37794c38f94827 authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>6def4590207eaa4a38da83eb217e671d1fa91d60 authored almost 4 years ago by Jun Tian <[email protected]>
95d7b882f38564d8e36302f8b3419402d6697b24 authored almost 4 years ago by Jun Tian <[email protected]>
10a3870fc16c3d3165d75332702f34083d66231d authored almost 4 years ago by Jun Tian <[email protected]>
f9c4e8d0a6989956adc2820049eab9a28f45daa7 authored almost 4 years ago by Jun Tian <[email protected]>
* docs: update README.md [skip ci]
* docs: update .all-contributorsrc [skip ci]
Co-authore...
57139e6d9f57e454f0b4c3eff27db9a4c6d6187b authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>1056d5c458e05e67a9c8d7760c347432537599f3 authored almost 4 years ago by Jun Tian <[email protected]>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-...
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
4753f9e0313290f3dccadc1789cc24c712e72f2e authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>* update index.md
* update README.md
* update readme
* update index.md
4398cab28bc0c86d660c25dd70b0e310c7b7786d authored almost 4 years ago by Jun Tian <[email protected]>7a1ef578a985e6c4d9ba3ac0bb04799693b5f169 authored almost 4 years ago by Jun Tian <[email protected]>
fabcff5fab160e4441021d77314a966a1845c780 authored almost 4 years ago by Jun Tian <[email protected]>
cede8023274a785b2b82ec96b0b3aa165b49748e authored almost 4 years ago by Jun Tian <[email protected]>
23adcb383eec7ce0f980f73e487bd8c183f8a343 authored almost 4 years ago by Jun Tian <[email protected]>
142e3ac89de4b694c778193995cd96e1f93eaf6f authored almost 4 years ago by Jun Tian <[email protected]>
1971bc855ecf7cd8b42a628f22bf67ee6051dcfd authored almost 4 years ago by Jun Tian <[email protected]>
4952db9c48a8b9d547d4f6b4265ccba04f16473d authored almost 4 years ago by Jun Tian <[email protected]>
a94b018d3914b0a6e5cb550637d4f940b2663b14 authored almost 4 years ago by Jun Tian <[email protected]>
6ea0097d1494b45579c7916489f1eff19a52e6c1 authored almost 4 years ago by Jun Tian <[email protected]>
28bd31b7b964f576f6d3a6357a0f8fc2fa92a4b2 authored almost 4 years ago by Jun Tian <[email protected]>
47c1c3f753e92a6f62302c1bcb55878515b2dd5c authored almost 4 years ago by Jun Tian <[email protected]>
5f40c48ea78a5a6327e7fe17ee1fb8e0dc0458f9 authored almost 4 years ago by Jun Tian <[email protected]>
75e4b7484d8bfa2845006ed5981d6e771c732cd8 authored almost 4 years ago by Jun Tian <[email protected]>
d71899cd6c1608d14be5f667d6e9f7ff7b8e0594 authored almost 4 years ago by Jun Tian <[email protected]>
a09bddb2ae0244e3b8ee03ee27f96cede4a78b84 authored almost 4 years ago by Jun Tian <[email protected]>
fd23dc4527d3559d738c50786b876696eab939d8 authored almost 4 years ago by Jun Tian <[email protected]>
27c4700c389011dca6fa38ca0cc7190a1f673edf authored almost 4 years ago by Jun Tian <[email protected]>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
8b19329c6bffd468226f0c950703437f451e83f7 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>* rename TabularLearner to TabularRandomPolicy
* sync chapter01
* sync changes related to ...
1acc3e62e9157d2c6e3efe6d03e1d5bfeb28d5c7 authored almost 4 years ago by Jun Tian <[email protected]>fea4d05a02aa8d9b58ef252295964296f14666e4 authored almost 4 years ago by Jun Tian <[email protected]>
a4a056a65f3072aeff07e0424aad0a9816dadfc2 authored almost 4 years ago by Jun Tian <[email protected]>
26a48ac86b180da10c55d6e566cac4ec6ac767e5 authored almost 4 years ago by Jun Tian <[email protected]>
Removed CumulativeReward.
Bugfix with MultiAgentHook
Add VectorSARTTrajectory
Bugfix with Mul...
* CompatHelper: bump compat for "Adapt" to "3.1"
* Update compat of Adapt
Co-authored-by: ...
1336a4fb30045769f707d202a30ede385d052070 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
bc0f8bdec9715bf47a52693048a11fd50cb46ce8 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>6e43963305ee2eb4489079d622ec180173bcedf9 authored almost 4 years ago by Jun Tian <[email protected]>
* support string representation for TicTacToeEnv
* fix bugs with DefaultStateStyleEnv
* ex...
99d69d25830f14b741e259acd602779974278891 authored almost 4 years ago by Jun Tian <[email protected]>00dcd326213079d81cdbec8f13e56ebda9a9d1d5 authored almost 4 years ago by Jun Tian <[email protected]>
CompatHelper: bump compat for "FillArrays" to "0.11"
edc11cf761fa7e47cd0c5213a8ceb6188c23bc86 authored almost 4 years ago by norci <[email protected]>Automatic JuliaFormatter.jl run
2376aeb7c1b99fbb9824ac2e334bf39e6d0ad3d5 authored almost 4 years ago by norci <[email protected]>bugfix in StopAfterNoImprovement.
1a878ae12a269a5f0c26de9d311ce257222d31c3 authored almost 4 years ago by norci <[email protected]>35c3dfcc59962946e5902cdba1c9b3350366f172 authored almost 4 years ago by norci <[email protected]>
9324ed409229d049f43938fef07906f6aa3652af authored almost 4 years ago by Jun Tian <[email protected]>
* Refactor exports
Remove GoalState as it is supported by RLBase
* Update compat of RLBase
Bu...
0f518b7fd8066de2d7afe8b2a787803ca4ea4516 authored almost 4 years ago by Sriram <[email protected]>1f008d9a5751733846e206444cbefc624e548596 authored almost 4 years ago by Jun Tian <[email protected]>
Add GoalState to support goal conditioned reinforcement learning
2ba59300fc1cb0afa519faeab46064195442ad8e authored almost 4 years ago by Sriram <[email protected]>28e675020f64d8f11b7479d1e2f5ab2e84d1099b authored about 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
a44eed2e83a67c41cec6a71b36e0138c772f8f1e authored about 4 years ago by Jun Tian <[email protected]>
4d674f31e32bc787dbedfde8130ca60ba367f5d6 authored about 4 years ago by findmyway <[email protected]>
Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
6ca62199c4cfc0472b6793f813eae3d9191fdeee authored about 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
049d0b81ce4eb6f3a09b2fef65661a4992daac27 authored about 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>* Add BitFlipping Environment
Add BitFlipping Environment
* Bug FIxes
Bug FIxes
abc7f2b3f069d946ceead625573ac168688c0343 authored about 4 years ago by Sriram <[email protected]>MultiThreadEnv is already moved into RLZoo
ce14073ab8d8b046d20421ab6440266c7df1b705 authored about 4 years ago by Jun Tian <[email protected]>a2ef6188630c710b6f9256684705462694519ff4 authored about 4 years ago by Sid-Bhatia-0 <[email protected]>
e27c61fc1ad1aa1cd04f2e6af52f41002b32ded6 authored about 4 years ago by Sid-Bhatia-0 <[email protected]>
0030e07b97e58add67c372eccd37117cbd6dbe13 authored about 4 years ago by Sid-Bhatia-0 <[email protected]>
Co-authored-by: findmyway <[email protected]>
560ce6f5db6ab9c000b3ab974f67e2d1904bf6fc authored about 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>