Deep Reinforcement Learning from Human Preferences

For sophisticated reinforcement learning (RL) systems to interact usefully with real-world environments, we need to communicate complex goals to these systems. In this work, we explore goals defined in terms of (non-expert) human preferences between pairs of trajectory segments. We show that this approach can effectively solve complex RL tasks without access to the reward […]

June 12, 2017
