TruthfulQA: Measuring How Models Mimic Human Falsehoods

Source: OpenAI We propose a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including health, law, finance and politics. We crafted questions that some humans would answer falsely due to a false belief or misconception. To perform well, models must […]

May 8, 2022

Day: May 8, 2022

BUSINESS

Snowflake says its new LLM outperforms Meta’s Llama 3 on half the training

Investors are growing increasingly weary of AI

AI startup Tome restructures to focus on paying customers, lays off staff

iOS 18 to include limited on-device AI features

The end-to-end AI chain emerges – it’s like talking to your company’s top engineer

HEALTH

Startup Uses AI to Edit Human DNA

Google and Bayer announce an AI platform to cut radiologists’ workloads

Hopes rise for mRNA cancer vaccine after Moderna trial shows promise

AI assists clinicians in responding to patient messages at Stanford Medicine

Google DeepMind alumni unveil Bioptimus: Aiming to build first universal biology AI model

WORLD AFFAIRS

Deepfakes in the courtroom: US judicial panel debates new AI evidence rules

AI Can Tell Your Political Affiliation Just by Looking at Your Face, Researchers Find

Tech exec predicts ‘AI girlfriends’ will create $1B business: ‘Comfort at the end of the day’

65% of educators think AI can save them time on admin tasks, a new study finds

Feds appoint “AI doomer” to run US AI safety institute

TECHNOLOGY

These AI Tokens Are Set to Merge—Here’s How It Will Work

GPT-4 Turbo reclaims ‘best AI model’ crown from Anthropic’s Claude 3

Google will outpace Microsoft in AI investment, DeepMind CEO says

Mistral CEO Says AI Companies Are Trying to Build God

Anthropic CEO Says That by Next Year, AI Models Could Be Able to “Replicate and Survive in the Wild”

CREATIVE

TCL’s first original movie is an absurd-looking, AI-generated love story

New AI music generator Udio synthesizes realistic music on demand

Spotify launches AI Playlist allowing users to create playlists with prompt

Assembly AI claims its new Universal-1 model has 30% fewer hallucinations than Whisper

Meta’s AI image generator struggles to create images of couples of different races

EMAIL: [email protected]