Textbooks Are All You Need II: phi-1.5 technical report

Source: Microsoft. We continue the investigation into the power of smaller Transformer-based language models as initiated by TinyStories — a 10 million parameter model that can produce coherent English — and the follow-up work on phi-1, a 1.3 billion parameter model with Python coding performance close to the state-of-the-art. The latter work proposed to use […]

MADLAD-400: A Multilingual And Document-Level Large Audited Dataset

Source: Google. We introduce MADLAD-400, a manually audited, general domain 3T token monolingual dataset based on CommonCrawl, spanning 419 languages. We discuss the limitations revealed by self-auditing MADLAD-400, and the role data auditing had in the dataset creation process. We then train and release a 10.7B-parameter multilingual machine translation model on 250 billion tokens covering […]

From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting

Source: Salesforce. Selecting the “right” amount of information to include in a summary is a difficult task. A good summary should be detailed and entity-centric without being overly dense and hard to follow. To better understand this tradeoff, we solicit increasingly dense GPT-4 summaries with what we refer to as a “Chain of Density” (CoD) […]
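The iterative densification idea behind CoD can be sketched in a few lines. The prompt wording and the `call_llm` function below are illustrative assumptions, not the paper's verbatim template or a real API: the sketch only shows the loop structure of starting from a sparse summary and repeatedly rewriting it at the same length while folding in missing entities.

```python
def build_cod_prompt(article: str, n_steps: int = 5) -> str:
    """Construct a single CoD-style prompt asking for increasingly dense
    summaries (paraphrased wording, not the paper's exact template)."""
    return (
        "Article:\n" + article + "\n\n"
        f"Generate {n_steps} increasingly dense summaries of the article.\n"
        "Step 1: write an initial sparse summary (~80 words) covering few entities.\n"
        f"Steps 2-{n_steps}: identify 1-3 informative entities from the article "
        "missing from the previous summary, then rewrite the summary at the SAME "
        "length so it also covers them (fuse, compress, remove filler).\n"
        "For each step, return the missing entities and the new summary."
    )


def chain_of_density(article: str, call_llm, n_steps: int = 5) -> list[str]:
    """Run densification as one model call per step.

    `call_llm` is a hypothetical stand-in for a GPT-4 completion call:
    it takes a prompt string and returns the model's text response.
    """
    # Step 1: sparse, entity-light starting summary.
    summary = call_llm("Summarize in ~80 words, mentioning few entities:\n" + article)
    summaries = [summary]
    # Steps 2..n: rewrite at constant length while adding missing entities.
    for _ in range(n_steps - 1):
        summary = call_llm(
            "Previous summary:\n" + summary +
            "\n\nRewrite it at the same length, adding 1-3 informative "
            "entities from this article that it currently misses:\n" + article
        )
        summaries.append(summary)
    return summaries
```

In the single-prompt variant, `build_cod_prompt` asks the model to perform all steps in one response; the loop variant trades extra calls for tighter control over each densification step.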