github.com/JuliaReinforcementLearning/ReinforcementLearning.jl commits

Fix129 (#130)

* fix 129

* add tests

* fix tests

92435e03572ef5d8eec54dffe700bcc9ee7e07ec authored almost 4 years ago by Jun Tian <[email protected]>

Format .jl files

5359c959d2145676ccd361536d1e3fd570872886 authored almost 4 years ago by findmyway <[email protected]>

refactor NeuralNetworkApproximator with `@forward` (#221)

6cffc7030a9173cde7b3be2b82b8e577c739ff50 authored almost 4 years ago by norci <[email protected]>

Update README.md

6899b5ee4921859d457a8735477baf6671ddf18c authored almost 4 years ago by Jun Tian <[email protected]>

Format .jl files (#153)

Co-authored-by: norci <[email protected]>

102d39a30d92ce75aeadb742590677d65b32a118 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update README.md

0dc63a4bd4379fa7cdc2c7a4dd133af28570e478 authored almost 4 years ago by Jun Tian <[email protected]>

docs: add albheim as a contributor (#207)

* docs: update README.md [skip ci]

* docs: update .all-contributorsrc [skip ci]

Co-authore...

efcc26007e4e1977ba29c48b22171126c2456ada authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>

Bump version

7010547018471eac97428aa2e54a507b86eb9bcd authored almost 4 years ago by Jun Tian <[email protected]>

add GridWorlds environments (#152)

* ignore vim generated temp files

* add JuliaRL_BasicDQN_EmptyRoom experiment

* add per-st...

7f4abb31fd0fd41c0f3607bae4ed788fbb5db1e8 authored almost 4 years ago by Sid-Bhatia-0 <[email protected]>

Allow multidimensional actions in ppo (#151)

* Hack to allow multidim actions in ppo

* Fix for single dim envs

* Handle single and mult...

5c7fefc0ebfc04fdd09b232c4b3f8696dbbdc77a authored almost 4 years ago by Albin Heimerson <[email protected]>

CompatHelper: bump compat for "GR" to "0.55" (#128)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

ec2b8496ac5a66ee913789f6235e6bc96f536d49 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Format .jl files (#142)

Co-authored-by: norci <[email protected]>

7ca5ab7cf2c1297a0c4b3aa748aa4dc44edecb90 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

CompatHelper: bump compat for "BSON" to "0.3" (#149)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

461d62fa729a6a4f419913a781853f2030bb4301 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update StructArrays dependency (#150)

b90712f59da5ebe083a534c8f101eadcdc24ef3a authored almost 4 years ago by Nerd <[email protected]>

docs: add metab0t as a contributor (#204)

* docs: update README.md [skip ci]

* docs: update .all-contributorsrc [skip ci]

Co-authore...

6eb2a301f9f5c6daa9ab50869bb3ea435955f4b6 authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>

add a new stop condition (#220)

which sets the time badget of each experiment.

d978b7229b766e6d443af2dcfda3eb2b0e1381ca authored almost 4 years ago by norci <[email protected]>

Format .jl files (#126)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

c1403341ab04cffc04937dc7b6665c31f1f1d4a9 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Bump version

95a091b0cdcfebcb0a6de85d643f145dce468320 authored almost 4 years ago by Jun Tian <[email protected]>

Simplify env wrapper (#127)

* simplify the definition of environment wrappers

* simplify further

f936b96a9f7f9a9bd53e4e10912e8ce0b8520685 authored almost 4 years ago by Jun Tian <[email protected]>

docs: add xiruizhao as a contributor (#203)

* docs: update README.md [skip ci]

* docs: update .all-contributorsrc [skip ci]

Co-authore...

01df29210a77044d9a88a4255018b54e607c7a68 authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>

Bump version

d973a4237f5dac813213147bd11301a7bb9e5a94 authored almost 4 years ago by Jun Tian <[email protected]>

Add episode length for Bit Flipping Env (#125)

* Add episode length for Bit Flipping Env

Add episode length for Bit Flipping Env
Fix typo
...

c6b171aa54ad9e4f0f6677a8eaac46b2c4f1f746 authored almost 4 years ago by Sriram <[email protected]>

Add behavior cloning (#146)

* add behavior cloning

* add TODO

* add experiment for bc

* fix test error

* update ...

9a8cb2d55d97847b119247ea1369ac2656f49654 authored almost 4 years ago by Jun Tian <[email protected]>

in _run, add hook for PRE_EXPERIMENT_STAGE & Post... (#219)

60069e3a6ba4bfcf1620c3cfd71d569e0dad854f authored almost 4 years ago by norci <[email protected]>

Fix typo

10403d36ce257bdd95c6a48510936d378b483728 authored almost 4 years ago by Jun Tian <[email protected]>

Bump version

facdcc02ad492ae42a81966877c92a6ee0d8df54 authored almost 4 years ago by Jun Tian <[email protected]>

support general trajectory (#218)

f13dff7d7219e5d2160dd1a1f6d15d6ded3e9227 authored almost 4 years ago by Jun Tian <[email protected]>

add cache to speed up sampling (#216)

95da206404c8ac493c617dbdeca745f66b85f0a0 authored almost 4 years ago by Jun Tian <[email protected]>

Simplify trajectories (#217)

* simplify trajectory

* add VectorSATrajectory

51cab361883e1cea2acf3c92e8de535fa04cc8f1 authored almost 4 years ago by Jun Tian <[email protected]>

In DoEveryNEpisode, added a keyword argument stage. (#214)

Then it can be used for PreEpisodeStage & PostEpisodeStage.

454493ca08b29620632204a1ea6aa53e4a035e40 authored almost 4 years ago by norci <[email protected]>

docs: add norci as a contributor (#200)

* docs: update README.md [skip ci]

* docs: update .all-contributorsrc [skip ci]

Co-authore...

ae8ca6f29adf8c0fcf7e7069b40173dce5d95f8f authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>

Update Project.toml

57d1f5f24fae3d9b1d3ea5bf59fa6070a1ee1044 authored almost 4 years ago by Jun Tian <[email protected]>

add tests for POMDP models (#123)

1620e85e8b1c0395e2fdfb110dc4a7781bb6c6d2 authored almost 4 years ago by Jun Tian <[email protected]>

refactor StopAfterNoImprovement (#213)

f39c0adf3eecdcae240493ac5d1144aa40cc4bc8 authored almost 4 years ago by norci <[email protected]>

Merge pull request #210 from JuliaReinforcementLearning/auto-juliaformatter-pr

Automatic JuliaFormatter.jl run

25d55d42ed7e9c901fe2257a33d06d45c406c669 authored almost 4 years ago by norci <[email protected]>

docs: add rcnlee as a contributor (#199)

* docs: update README.md [skip ci]

* docs: update .all-contributorsrc [skip ci]

Co-authore...

b0d39b21f5841d350e3f77f5d44707e9e2773a25 authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>

Delete changelog.yml

af5977edd9b920bb264e288f9d6e3fd9e10eb585 authored almost 4 years ago by Jun Tian <[email protected]>

Bump version

9fc50411b6f6888eace0146ddcdde148e1adb287 authored almost 4 years ago by Jun Tian <[email protected]>

fix #181 (#122)

2906e63b33f271e94f8dd2e93201100e730d1740 authored almost 4 years ago by Jun Tian <[email protected]>

Format .jl files

dec20f0fef87996cf4d63efeae39df926b15537a authored almost 4 years ago by findmyway <[email protected]>

Format .jl files (#120)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

0d2066273b47e86f8c0ef26982c86d368db79d2b authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

CompatHelper: bump compat for "GR" to "0.54" (#124)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

d5d4c0001cfeba2638affbd7e8191fb3bbe6f611 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Add link to blog

4a6f4924cbfb662674728c6b9a27806f2b82e54b authored almost 4 years ago by Jun Tian <[email protected]>

docs: add drozzy as a contributor (#195)

* docs: update README.md [skip ci]

* docs: update .all-contributorsrc [skip ci]

Co-authore...

4c5b85a7586d8f7699620dc27d37794c38f94827 authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>

minor fix with RandomStartPolicy (#212)

6def4590207eaa4a38da83eb217e671d1fa91d60 authored almost 4 years ago by Jun Tian <[email protected]>

Bump version

95d7b882f38564d8e36302f8b3419402d6697b24 authored almost 4 years ago by Jun Tian <[email protected]>

fix atari related experiments (#145)

10a3870fc16c3d3165d75332702f34083d66231d authored almost 4 years ago by Jun Tian <[email protected]>

Update README.md

f9c4e8d0a6989956adc2820049eab9a28f45daa7 authored almost 4 years ago by Jun Tian <[email protected]>

docs: add findmyway as a contributor (#189)

* docs: update README.md [skip ci]

* docs: update .all-contributorsrc [skip ci]

Co-authore...

57139e6d9f57e454f0b4c3eff27db9a4c6d6187b authored almost 4 years ago by allcontributors[bot] <46447321+allcontributors[bot]@users.noreply.github.com>

Automated commit made by MassInstallAction.jl (#99)

1056d5c458e05e67a9c8d7760c347432537599f3 authored almost 4 years ago by Jun Tian <[email protected]>

CompatHelper: bump compat for "Reexport" to "1.0" (#172)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-...

f7e3f48132f5ad23dddfd7640996a9e4a50fc5e1 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Format .jl files (#176)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

4753f9e0313290f3dccadc1789cc24c712e72f2e authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Update readme (#188)

* update index.md

* update README.md

* update readme

* update index.md

4398cab28bc0c86d660c25dd70b0e310c7b7786d authored almost 4 years ago by Jun Tian <[email protected]>

Update TagBot.yml

7a1ef578a985e6c4d9ba3ac0bb04799693b5f169 authored almost 4 years ago by Jun Tian <[email protected]>

Update TagBot.yml

fabcff5fab160e4441021d77314a966a1845c780 authored almost 4 years ago by Jun Tian <[email protected]>

Update TagBot.yml

cede8023274a785b2b82ec96b0b3aa165b49748e authored almost 4 years ago by Jun Tian <[email protected]>

Delete .travis.yml

23adcb383eec7ce0f980f73e487bd8c183f8a343 authored almost 4 years ago by Jun Tian <[email protected]>

Update Project.toml

142e3ac89de4b694c778193995cd96e1f93eaf6f authored almost 4 years ago by Jun Tian <[email protected]>

fix doc string to pass doc building (#211)

1971bc855ecf7cd8b42a628f22bf67ee6051dcfd authored almost 4 years ago by Jun Tian <[email protected]>

Update ci.yml

4952db9c48a8b9d547d4f6b4265ccba04f16473d authored almost 4 years ago by Jun Tian <[email protected]>

Update TagBot.yml

a94b018d3914b0a6e5cb550637d4f940b2663b14 authored almost 4 years ago by Jun Tian <[email protected]>

Create ci.yml

6ea0097d1494b45579c7916489f1eff19a52e6c1 authored almost 4 years ago by Jun Tian <[email protected]>

Delete changelog.yml

28bd31b7b964f576f6d3a6357a0f8fc2fa92a4b2 authored almost 4 years ago by Jun Tian <[email protected]>

add Dockerfile (#187)

47c1c3f753e92a6f62302c1bcb55878515b2dd5c authored almost 4 years ago by Jun Tian <[email protected]>

Update README.md

5f40c48ea78a5a6327e7fe17ee1fb8e0dc0458f9 authored almost 4 years ago by Jun Tian <[email protected]>

Update README.md

75e4b7484d8bfa2845006ed5981d6e771c732cd8 authored almost 4 years ago by Jun Tian <[email protected]>

Update Project.toml

d71899cd6c1608d14be5f667d6e9f7ff7b8e0594 authored almost 4 years ago by Jun Tian <[email protected]>

Bump version

a09bddb2ae0244e3b8ee03ee27f96cede4a78b84 authored almost 4 years ago by Jun Tian <[email protected]>

remove duplicated code in BitFlippingEnv (#123)

fd23dc4527d3559d738c50786b876696eab939d8 authored almost 4 years ago by Jun Tian <[email protected]>

Add compat for DataStructures

27c4700c389011dca6fa38ca0cc7190a1f673edf authored almost 4 years ago by Jun Tian <[email protected]>

CompatHelper: bump compat for "Zygote" to "0.6" (#138)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

8b19329c6bffd468226f0c950703437f451e83f7 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Support rlintro (#144)

* rename TabularLearner to TabularRandomPolicy

* sync chapter01

* sync changes related to ...

1acc3e62e9157d2c6e3efe6d03e1d5bfeb28d5c7 authored almost 4 years ago by Jun Tian <[email protected]>

Bump version

fea4d05a02aa8d9b58ef252295964296f14666e4 authored almost 4 years ago by Jun Tian <[email protected]>

bugfix with TabularRandomPolicy (#209)

a4a056a65f3072aeff07e0424aad0a9816dadfc2 authored almost 4 years ago by Jun Tian <[email protected]>

bugfix with TabularRandomPolicy (#208)

26a48ac86b180da10c55d6e566cac4ec6ac767e5 authored almost 4 years ago by Jun Tian <[email protected]>

Support rlintro (#200)

Removed CumulativeReward.
Bugfix with MultiAgentHook
Add VectorSARTTrajectory
Bugfix with Mul...

0122ccc10533f73ae4fd6655a618c957f11fe65b authored almost 4 years ago by Jun Tian <[email protected]>

CompatHelper: bump compat for "Adapt" to "3.1" (#204)

* CompatHelper: bump compat for "Adapt" to "3.1"

* Update compat of Adapt

Co-authored-by: ...

1336a4fb30045769f707d202a30ede385d052070 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

CompatHelper: bump compat for "Functors" to "0.2" (#207)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

bc0f8bdec9715bf47a52693048a11fd50cb46ce8 authored almost 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Bump version

6e43963305ee2eb4489079d622ec180173bcedf9 authored almost 4 years ago by Jun Tian <[email protected]>

Support rlintro (#119)

* support string representation for TicTacToeEnv

* fix bugs with DefaultStateStyleEnv

* ex...

99d69d25830f14b741e259acd602779974278891 authored almost 4 years ago by Jun Tian <[email protected]>

Bump version

00dcd326213079d81cdbec8f13e56ebda9a9d1d5 authored almost 4 years ago by Jun Tian <[email protected]>

Merge pull request #203 from JuliaReinforcementLearning/compathelper/new_version/2021-01-07-00-19-04-399-867728912

CompatHelper: bump compat for "FillArrays" to "0.11"

edc11cf761fa7e47cd0c5213a8ceb6188c23bc86 authored almost 4 years ago by norci <[email protected]>

Merge pull request #201 from JuliaReinforcementLearning/auto-juliaformatter-pr

Automatic JuliaFormatter.jl run

2376aeb7c1b99fbb9824ac2e334bf39e6d0ad3d5 authored almost 4 years ago by norci <[email protected]>

Merge pull request #206 from norci/bugfix

bugfix in StopAfterNoImprovement.

1a878ae12a269a5f0c26de9d311ce257222d31c3 authored almost 4 years ago by norci <[email protected]>

bugfix in StopAfterNoImprovement.

35c3dfcc59962946e5902cdba1c9b3350366f172 authored almost 4 years ago by norci <[email protected]>

update dependency (#177)

9324ed409229d049f43938fef07906f6aa3652af authored almost 4 years ago by Jun Tian <[email protected]>

Refactor exports (#122)

* Refactor exports

Remove GoalState as it is supported by RLBase

* Update compat of RLBase

Bu...

0f518b7fd8066de2d7afe8b2a787803ca4ea4516 authored almost 4 years ago by Sriram <[email protected]>

Update Project.toml

1f008d9a5751733846e206444cbefc624e548596 authored almost 4 years ago by Jun Tian <[email protected]>

Add GoalState to support goal conditioned reinforcement learning (#121)

Add GoalState to support goal conditioned reinforcement learning

2ba59300fc1cb0afa519faeab46064195442ad8e authored almost 4 years ago by Sriram <[email protected]>

CompatHelper: bump compat for "FillArrays" to "0.11"

28e675020f64d8f11b7479d1e2f5ab2e84d1099b authored about 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

remove unnecessary plot (#121)

a44eed2e83a67c41cec6a71b36e0138c772f8f1e authored about 4 years ago by Jun Tian <[email protected]>

Format .jl files

4d674f31e32bc787dbedfde8130ca60ba367f5d6 authored about 4 years ago by findmyway <[email protected]>

Format .jl files (#118)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

6ca62199c4cfc0472b6793f813eae3d9191fdeee authored about 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Format .jl files (#117)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

049d0b81ce4eb6f3a09b2fef65661a4992daac27 authored about 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Add Bit Flipping Environment[WIP] (#116)

* Add BitFlipping Environment

Add BitFlipping Environment

* Bug FIxes

Bug FIxes

abc7f2b3f069d946ceead625573ac168688c0343 authored about 4 years ago by Sriram <[email protected]>

Remove MultiThreadEnv from README.md

MultiThreadEnv is already moved into RLZoo

ce14073ab8d8b046d20421ab6440266c7df1b705 authored about 4 years ago by Jun Tian <[email protected]>

reset current_t upon resetting MaxTimeoutEnv #115

a2ef6188630c710b6f9256684705462694519ff4 authored about 4 years ago by Sid-Bhatia-0 <[email protected]>

add is_terminated test after reset!

e27c61fc1ad1aa1cd04f2e6af52f41002b32ded6 authored about 4 years ago by Sid-Bhatia-0 <[email protected]>

add test env.current_t == 1 after reset!

0030e07b97e58add67c372eccd37117cbd6dbe13 authored about 4 years ago by Sid-Bhatia-0 <[email protected]>

Format .jl files (#198)

Co-authored-by: findmyway <[email protected]>

560ce6f5db6ab9c000b3ab974f67e2d1904bf6fc authored about 4 years ago by github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Ecosyste.ms: OpenCollective

github.com/JuliaReinforcementLearning/ReinforcementLearning.jl