Choice of discount rate in reinforcement learning with long-delay rewards |
| |
Authors: | LIN Xiangyang XING Qinghua LIU Fuxian |
| |
Abstract: | In the world, most of the successes are results of long-term efforts. The reward of success is extremely high, but be-fore that, a long-term investment process ... |
| |
Keywords: | |
|
|