Bill Zou Garner Options
The theoretical Assessment demonstrates that EDIS exhibits minimized suboptimality in comparison with exclusively employing online knowledge or instantly reusing offline details. EDIS is really a plug-in strategy and can be combined with current techniques in offline-to-on-line RL setting. By applying EDIS to off-the-shelf strategies Cal-QL and IQL