What's new?

DeepSeek's Aha Moment

February 1, 2025 - By takekida

DeepSeek-R1’s bold bet on reinforcement learning: How it outpaced OpenAI at 3% of the cost

DeepSeek-R1’s Monday release has sent shockwaves through the AI community, disrupting assumptions about what’s required to achieve cutting-edge AI performance. T…

VentureBeat

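DeepSeek's cost breakthrough is said to come from achieving its results almost entirely through reinforcement learning, without relying on supervised fine-tuning (SFT). In their report, the engineers use the phrase "aha moment," a metaphor for a sudden flash of insight. The major players will likely catch up quickly, so lighter-weight LLM models will no doubt push toward even higher training accuracy… Just as in the semiconductor world, this progress looks set to expand the range of applications.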
