RLHF

Researchers Replicated OpenAI’s Work Based on Proximal Policy Optimisation (PPO) in RLHF

News Report Technology

Researchers Replicated OpenAI’s Work Based on Proximal Policy Optimisation (PPO) in RLHF

by Damir Yalalov

October 27, 2023

Meta Unveils Game-Changing Open-Source LLaMa-2-Chat with Unprecedented Performance

Featured News Report Technology

Meta Unveils Game-Changing Open-Source LLaMa-2-Chat with Unprecedented Performance

by Damir Yalalov

July 18, 2023

OpenAI: New Process-Supervised Reward Modeling Improves AI Reasoning

Featured News Report Technology

OpenAI: New Process-Supervised Reward Modeling Improves AI Reasoning

by Damir Yalalov

June 1, 2023

Anthropic Proposes a ‘Contextual AI’ for Chat Models Based on 60 Principles

Featured News Report SMW Technology

Anthropic Proposes a ‘Contextual AI’ for Chat Models Based on 60 Principles

by Damir Yalalov

May 10, 2023

Hot Stories

Circle Secures New York Trust Charter, Fortifying Regulatory Foundation For USDC

by Alisa Davidson

July 31, 2026

Gate Update: Zero-Fee US Stocks, A 63% Chip Surge, And Fed Volatility Define A Landmark Week

by Alisa Davidson

July 31, 2026

Gate Introduces Zero-Fee Trading For Eligible US Stocks And ETFs, Becoming First Digital Asset Platform To Eliminate Commission Fees

by Alisa Davidson

July 31, 2026

Why Enterprise AI Agents Fail—And What They Need To Work

by Rise Ooi

July 31, 2026

Latest News

Circle Secures New York Trust Charter, Fortifying Regulatory Foundation For USDC

by Alisa Davidson

July 31, 2026

Gate Update: Zero-Fee US Stocks, A 63% Chip Surge, And Fed Volatility Define A Landmark Week

by Alisa Davidson

July 31, 2026

Gate Introduces Zero-Fee Trading For Eligible US Stocks And ETFs, Becoming First Digital Asset Platform To Eliminate Commission Fees

by Alisa Davidson

July 31, 2026

Morph, Morpho And Gauntlet Partner To Deliver Institutional On-Chain Yield To Bitget’s 125M Users

by Alisa Davidson

July 31, 2026