Developers caught DeepSeek R1 having an ‘aha moment’ on its own during training
The DeepSeek R1 developers caught the reasoning model having an “aha moment” while solving a math problem.
Source: https://bgr.com/tech/developers-caught-deepseek-r1-having-an-aha-moment-on-its-own-during-training/