Behavior Consistent Rl Preprint