China’s DeepSeek applying trial-and-error learning to its AI ‘reasoning’

September 18, 2025 Yanac

Model can also explain its answers, researchers find
Chinese AI company DeepSeek has shown it can improve the reasoning of its LLM DeepSeek-R1 through trial-and-error based reinforcement learning, and even be made to explain its reasoning on math and coding problems, even though explanations might sometimes be unintelligible.…The RegisterRead More