Monday, May 18, 2026

24 articles

Stochastic Gradient Descent (SGD’s) Frequency Bias and How Adam Fixes It

MarkTechPost

65 days ago1 min read

Stochastic Gradient Descent (SGD’s) Frequency Bias and How Adam Fixes It

Modern language models are trained on data with extremely uneven token distributions. A small number of words appear in almost every sentence, while many rare but meaningful tokens occur only occasionally. This creates a hidden optimization challenge: parameters associated with common tokens...

NVIDIA Introduces a 4-Bit Pretraining Methodology Using NVFP4, Validated on a 12B Hybrid Mamba-Transformer at 10T Token Horizon

MarkTechPost

65 days ago1 min read

NVIDIA Introduces a 4-Bit Pretraining Methodology Using NVFP4, Validated on a 12B Hybrid Mamba-Transformer at 10T Token Horizon

Products

NVIDIA introduces a 4-bit pretraining methodology built around the NVFP4 microscaling format — combining selective BF16 layers, 16×16 Random Hadamard Transforms on Wgrad inputs, 2D weight scaling, and stochastic rounding on gradients — validated on a 12B hybrid Mamba-Transformer trained on 10...

Meet MemPrivacy: An Edge-Cloud Framework that Uses Local Reversible Pseudonymization to Protect User Data Without Breaking Memory Utility

MarkTechPost

65 days ago1 min read

Meet MemPrivacy: An Edge-Cloud Framework that Uses Local Reversible Pseudonymization to Protect User Data Without Breaking Memory Utility

Research

As LLM-powered agents move from research to production, one design tension is becoming harder to ignore: the more useful cloud-hosted memory becomes, the more private user data it exposes. Researchers from MemTensor (Shanghai), HONOR Device and Tongji University have introduced MemPrivacy, a...

Fast-tracking genetic leads to reverse cellular aging

DeepMind

66 days ago1 min read

Fast-tracking genetic leads to reverse cellular aging

Products

Biologists use Co-Scientist to find novel factors that successfully rejuvenate human cells.

TechCrunch

67 days ago1 min read

SandboxAQ brings its drug discovery models to Claude — no PhD in computing required

Anthropic

Other venture-backed companies like Chai Discovery and Isomorphic Labs have raced to build better models. SandboxAQ is betting that access is the bigger obstacle and that Claude solves it.

MIT Technology Review

67 days ago1 min read

What to expect from Google this week

Google

This story originally appeared in The Algorithm, our weekly newsletter on AI. To get stories like this in your inbox first, sign up here. When Google opens its doors tomorrow for its annual developer conference, I/O, it will do so as a clear third place in the foundation model race. A year ago, at...

TechCrunch

67 days ago1 min read

Anthropic has acquired the dev tools startup used by OpenAI, Google, and Cloudflare

OpenAI

Stainless, a New York-based startup, founded in 2022, rose to prominence in the emerging AI industry for automating the creation and maintenance of software development kits, or SDKs — the libraries developers use to interact with APIs.

The Verge

67 days ago1 min read

Musk v. Altman proved that AI is led by the wrong people

OpenAI

The tech trial of the year, Musk v. Altman, was ultimately a fight for control. Elon Musk argued that Sam Altman, with whom he helped found the now-massive company OpenAI, shouldn't direct the future of AI. Altman's lawyers, in turn, poked at Musk's own credibility. A jury came to a verdict on...

NVIDIA

67 days ago1 min read

Vera Arrives: NVIDIA’s First CPU Built for Agents Lands at Top AI Labs

OpenAI

The first NVIDIA Vera CPUs arrived at three of the world's leading AI labs on Friday — Anthropic in San Francisco, OpenAI in Mission Bay, SpaceXAI in Palo Alto — followed by a delivery to Oracle Cloud Infrastructure in Santa Clara on Monday. NVIDIA Vice President of Hyperscale and High-Performance...

NVIDIA

67 days ago1 min read

NVIDIA CEO Jensen Huang at Dell Technologies World: “Demand Is Going Parabolic, Utterly Parabolic”

Products

Agentic AI inference at one-tenth the cost per token with NVIDIA Vera Rubin NVL72. Agent sandboxes run 50% faster on NVIDIA Vera than traditional CPUs — while enterprise data queries are up to 3x faster with the Vera CPU. And 5,000 enterprises like Lilly, Samsung, and Honeywell are running AI...

TechCrunch

67 days ago1 min read

Elon Musk has lost his lawsuit against Sam Altman and OpenAI

OpenAI

Elon Musk's claim that he was mistreated by his OpenAI cofounders failed after nine California jurors decided in a unanimous verdict that his lawsuits had been filed too late.

Wired

67 days ago1 min read

Elon Musk Loses Landmark Lawsuit Against OpenAI

OpenAI

The nine-member panel took only two hours to return a verdict in favor of OpenAI on Monday, which the judge quickly adopted as her own final decision.

OpenAI

67 days ago1 min read

OpenAI and Dell partner to bring Codex to hybrid and on-premise enterprise environments

OpenAI

OpenAI and Dell partner to bring Codex to hybrid and on-premise environments, helping enterprises deploy AI coding agents securely across data and workflows.

TechCrunch

67 days ago1 min read

Amazon’s new Alexa+ powered feature can generate podcast episodes

Products

Amazon’s Alexa+ can now generate custom AI podcasts on demand, as the company expands its assistant into a personalized AI content platform.

MIT Technology Review

67 days ago1 min read

Inside Anduril and Meta’s quest to make smart glasses for warfare

Products

The defense-tech company Anduril has shared new details about the augmented-reality headset for the military it’s prototyping with Meta, including a vision for ordering drone strikes via eye-tracking and voice commands. Quay Barnett, who leads the efforts as a vice president at Anduril following a...

The Verge

67 days ago1 min read

Elon Musk lost his case against Sam Altman

Products

After around two hours of deliberation, the jury has reached a unanimous verdict in Musk v. Altman, the tech trial of the year. The group found that two claims were barred by the statute of limitations, and a third failed thanks to the dismissal of one of these. The jury here is an advisory jury,...

The Verge

67 days ago1 min read

Amazon Alexa Plus can now create AI-generated podcasts

Products

Alexa Plus, Amazon's upgraded AI assistant, can now generate podcasts on "virtually any topic," according to an announcement on Monday. With the update, Amazon says you can give Alexa Plus a topic, and the AI assistant will offer an overview of what its AI hosts plan to talk about, allowing you to...

Hugging Face

67 days ago1 min read

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

Products

Hugging Face

67 days ago1 min read

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

Products

Hugging Face

67 days ago1 min read

The Open Agent Leaderboard

Products

Simon Willison

67 days ago1 min read

Glaucous-winged Gull, Brown Pelican, Snowy Egret, Canada Goose

Products

Glaucous-winged Gull, Brown Pelican, Snowy Egret, Canada Goose, in Los Angeles River, CA, USI'm heading home from PyCon US today so I went on a last morning walk to try and spot a pelican. I saw one! Didn't get a great photo of that, but I did see some goslings down by the swan boat lake.

Import AI

67 days ago1 min read

Import AI 457: AI stuxnet; cursed Muon optimizer; and positive alignment

Research

Welcome to Import AI, a newsletter about AI research. Import AI runs on arXiv, cappuccinos, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now Stuxnet before Stuxnet:…Fast16 bugs software likely used in weapons programs…Here’s a fascinating investigation of a...

I’m a Normie. Can Normies Really Vibe Code?

Wired

67 days ago1 min read

I’m a Normie. Can Normies Really Vibe Code?

Anthropic

Apparently anyone can vibe code anything these days. So Claude and I tried to make a database for tracking the petty grievances of the masses.

TechCrunch

67 days ago1 min read

South Korea’s LetinAR is building optics behind AI glasses

Startups

A lens the size of a thumbnail — and the South Korean startup that makes it could become the optical backbone of the AI glasses era.