About this episode
Seventy3: paper walkthroughs powered by NotebookLM, focused on artificial intelligence, large models, robotics algorithms, and crypto, so that everyone can keep improving alongside AI.

Today's topic: Outcome-based Reinforcement Learning to Predict the Future

Summary

The paper looks at extending Reinforcement Learning with Verifiable Rewards (RLVR) to forecasting future events. It reports that a 14-billion-parameter model trained this way reaches accuracy comparable to o1, and that a simulated trading strategy on Polymarket yields a return on investment (ROI) on the order of 10%. The training setup also relies on guardrails to keep learning stable.

Source: https://arxiv.org/abs/2505.17989