SubSearch: Intermediate Rewards for Unsupervised Guided Reasoning in Complex Retrieval | Read Paper on Bytez