[Article Review] “Q-learning is Not Yet Scalable” by Seohong Park
·
카테고리 없음
글 원문: https://seohong.me/blog/q-learning-is-not-yet-scalable/ Q-learning is not yet scalableQ-learning is not yet scalable Seohong ParkUC BerkeleyJune 2025 Does RL scale? Over the past few years, we've seen that next-token prediction scales, denoising diffusion scales, contrastive learning scales, and so on, all the way to the point where we canseohong.me 강화학습에 대한 좋은 아티클이 있어서 한국어로도 다시 정리해보면 좋을 ..