Textbooks Are All You Need II: phi-1.5 technical report
Source: Microsoft. We continue the investigation into the power of smaller Transformer-based language models as initiated by TinyStories, a 10 million parameter model that can produce coherent English, and the follow-up work on phi-1, a 1.3 billion parameter model with Python coding performance close to the state-of-the-art. The latter work proposed to use […]
Prisoners in Finland Are Training AI
AI Chatbots Are Invading Your Local Government – Making Everyone Nervous
Meta sets GPT-4 as the bar for its next AI model, says a new report
Russian AI bot, YandexGPT, shows larger potential than ChatGPT
Senators unveil bipartisan proposal to require ChatGPT-Level AI to obtain a Government License
ChatGPT gulps up 500 milliliters of water (close to what’s in a 16-ounce water bottle) every time you ask it a series of between 5 and 50 prompts
MIT student uses AI to design buildings with less concrete
MADLAD-400: A Multilingual And Document-Level Large Audited Dataset
Source: Google. We introduce MADLAD-400, a manually audited, general domain 3T token monolingual dataset based on CommonCrawl, spanning 419 languages. We discuss the limitations revealed by self-auditing MADLAD-400, and the role data auditing had in the dataset creation process. We then train and release a 10.7B-parameter multilingual machine translation model on 250 billion tokens covering […]
Congress releases a framework for the future of AI
Manuscripts that don’t disclose AI assistance are slipping past peer reviewers
Jerry Jones, Dallas Cowboys Billionaire Owner, Creates Virtual AI Clone for Commoners to Interact With for $55
IRS to start using AI to identify sophisticated schemes to avoid taxes
TSMC warns AI chip crunch will last another 18 months
Roblox’s new AI chatbot will help you build virtual worlds
McKinsey partners with Salesforce to offer speedy AI adoption plans for enterprises
Nasdaq receives SEC approval for AI-based trade orders
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Source: Salesforce. Selecting the “right” amount of information to include in a summary is a difficult task. A good summary should be detailed and entity-centric without being overly dense and hard to follow. To better understand this tradeoff, we solicit increasingly dense GPT-4 summaries with what we refer to as a “Chain of Density” (CoD) […]