SYNTHETIC-2: Scaling Distributed Synthetic Data Generation for Verified Reasoning
M. Senghaas, J. Ong, M. Basra, J. Mattern, J. Straube, S. Jaghouar, J. Hagemann
INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning
S. Jaghouar, J. Mattern, J. Ong, J. Straube, M. Basra, A. Pazdera, K. Thaman, M. Di Ferrante, F. Gabriel, F. Obeid, K. Erdem, M. Keiblinger, M. Senghaas, J. Hagemann