Three of the four reviewers strongly support acceptance. The fourth review (technically R5) leans towards reject, but is extremely short and doesn't consider author response. Thus, I discount this negative review. Overall, reviewers agree that the novel RL environment this submission presents is well-constructed, has novel aspects relative to related environments, and will be broadly useful. Thus I support accept.