Synced
DeepSeek Signals Next-Gen R2 Model, Unveils Novel Approach to Scaling Inference with SPCT
DeepSeek AI has published research introducing a novel approach to scaling general reward models (GRMs) during inference, signaling progress toward its next-generation R2 model. The technique, known as SPCT, addresses a critical bottleneck in how AI systems evaluate and rank outputs at scale, potentially improving the efficiency of large language models during deployment.
Read more