Has your local thinking model had an 'Aha!' moment similar to the one in the DeepSeek R1 paper?
Here's a link to the paper; the relevant paragraph starts around the end of page 8. Thank you, hendrik! https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSeek_R1.pdf