JD.com and Research Partners Unveil RLSD Technology to Reduce AI Reasoning Model Training Costs - Wedoany

Homepage News Details

JD.com and Research Partners Unveil RLSD Technology to Reduce AI Reasoning Model Training Costs

2026-04-29 15:14

Favorite

en.Wedoany.com Reported - The high cost of training AI reasoning models has long been a challenge for enterprise teams. Researchers from JD.com, in collaboration with multiple academic institutions, have proposed a new training paradigm called RLSD, designed to build custom reasoning agents with fewer computational resources. This technology combines reinforcement learning with self-distillation, addressing the problem of sparse signals or high computational overhead found in traditional methods.

RLVR Graph

In experiments, models trained with RLSD achieved an average accuracy of 56.18% on multiple visual reasoning benchmarks, surpassing both the base model and standard RLVR methods. According to Yang Chenxu, co-author of the paper, RLSD decouples the direction and magnitude of updates, using verifiable reward signals to determine the direction and achieving fine-grained, per-token feedback through self-distillation. This avoids information leakage issues and maintains training stability.

RLSD requires only one additional forward pass and converges approximately twice as fast as traditional methods. It is suitable for tasks with verifiable rewards, such as code compilation or mathematical verification, and can flexibly leverage privileged information. The technique can be lightly integrated into existing open-source frameworks, offering enterprises a new way to optimize models using internal data.

This article is compiled by Wedoany. All AI citations must indicate the source as "Wedoany". If there is any infringement or other issues, please notify us promptly, and we will modify or delete it accordingly. Email: news@wedoany.com

Information and Communication Artificial Intelligence Engineering

Previous：XCMG XLC150M Crawler Crane for Wind Power Auxiliary Lifting Launched

Next：UK Housing Finance Company commits £550 million for affordable housing development in three regions

GYTA Type Non-Self-Supporting Aerial/Duct Optical Cable

TONGDING INTERCONNECTION INFORMATION CO., LTD.

Industrial and Commercial Point-Type Gas Detector

Jinan Benan Technology Development Co., Ltd.

G.652.D Wavelength-extended Non-dispersion Shifted Single-mode Optical Fiber

HONGAN GROUP CO., LTD.

Negotiable /Unit

Multi-function Marine Monitoring Low-altitude Detection Radar Sea-air Boundary Ocean Dynamic Environment Measurement

Chengdu Dixin Technology Co., Ltd.

SIS Safety Instrumentation Solution

Beijing Consen Automation Technology Co., Ltd.

Intelligent Warehousing

Jiangsu Zhongtian Technology Co., Ltd.

QPS-20A Redundant Power Fast Switcher

CHN ENERGY ZHISHEN CONTROL TECHNOLOGY CO., LTD.

TWP16 P-Band Tropospheric Wind Profile Radar

China Huayun Meteorological Technology Group Co., Ltd.

Intelligent Manufacturing

DONGFANG ELECTRIC CORPORATION

Mine Explosion-proof and Intrinsically Safe Industrial Ethernet Ring Network (10 Gigabit/Gigabit)

Chongqing Mas Sci&Tech Co., Ltd.

Mining Intelligent Rope-replacement Robot

Fiber Optic Connector

Jingye Group Co., Ltd.

Related Recommendations

Poolside from the US Releases Open-Source Programming Model Laguna XS.2

Eino Launches Agentic Network Observability Platform in the US

Majestic Labs Launches Prometheus AI Server with Single-Node 128 TB Memory, Breaking Through Memory Wall Bottleneck

Blaize, Nokia and Datacomm to Deploy Hybrid AI in Indonesia

RFOptic Launches 8GHz RF over Fiber Link Supporting 5G and C-Band

u-blox launches GNSS module ZED-X20P-01B achieving decimeter-level accuracy

SouthernCrossAI Joins Equinix Fabric AI, Deploying Sovereign AI Inference Nodes in Australia Based on SambaNova SN50

EU Plans to Shift Digital Regulatory Focus to Cloud Services and AI

US Tech Giants' AI Investment Nears $600 Billion; Investors Look for Returns

JD.com and Research Partners Unveil RLSD Technology to Reduce AI Reasoning Model Training Costs

Lastest Bulletin

Foundation Work Completed for India's Sabarmati River Rail Bridge, Superstructure Construction Underway

New Paths for AI in Industrial Automation: Augment Existing Systems, Don't Replace Them

Poolside from the US Releases Open-Source Programming Model Laguna XS.2

Accenture Invests in U.S.-based General Robotics to Accelerate AI-Powered Autonomous Operations

Eino Launches Agentic Network Observability Platform in the US

China's C909 aircraft operates second regular Central Asian route

Majestic Labs Launches Prometheus AI Server with Single-Node 128 TB Memory, Breaking Through Memory Wall Bottleneck

Hormel Foods Completes Sale of Whole Turkey Business to Life Sciences Innovation Company

Emerson Wireless Acoustic Transmitter Enables Online Valve Monitoring

Australia's Beaudesert-Beenleigh Road highway upgrade completed, dual lanes widened to four lanes