Shelly Fan in Singularity Hub:
How do you translate a Roman inscription found on a tombstone? How many pairs of tendons are supported by one bone in hummingbirds? Here is a chemical reaction that requires three steps: What are they? Based on the latest research on Tiberian pronunciation, identify all syllables ending in a consonant sound from this Hebrew text.
These are just a few example questions from the latest attempt to measure the capability of large language models. These algorithms power ChatGPT and Gemini. They’re getting “smarter” in specific domains—math, biology, medicine, programming—and developing a sort of common sense. Like the dreaded standardized tests we endured in school, researchers have long relied on benchmarks to track AI performance. But as cutting-edge algorithms now regularly score over 90 percent on such tests, older benchmarks are increasingly becoming obsolete.
More here.
Enjoying the content on 3QD? Help keep us going by donating now.

Which country has the highest rates of diabetes? Many people would guess the United States. Perhaps Canada or Australia? Mexico? The United Kingdom?
Investigating the crypto-religious is Elie’s venture in this book. Readers of Elie’s first two books,
In the genre of sanatorium literature, An Infinite Sadness stands apart. It doesn’t have much of the fellow feeling that defines Thomas Mann’s classic The Magic Mountain (1924), in which Hans Castorp, bowled over by love and intellectual companionship, struggles to leave the Berghof hospital. In Christa Wolf’s August (2012), the protagonist reflects on the affecting compassion of a fellow resident in a TB clinic, while in Ken Kesey’s One Flew over the Cuckoo’s Nest (1962), the bonds among the patients form the basis of their resistance to institutional authority. Alphonse Daudet, while seeking relief from spinal pain in a thermal spa, wrote that “the patients, in all their weirdness and diversity, draw comfort from the demonstration that their respective illnesses all have something in common.” Stories set in sanatoriums tend to show their characters slowly settling into their new homes, the world slipping away, time taking on different proportions. Separated from the imperatives of productivity, the sanatorium is an imaginative space in which the future is null and progress uncertain. As sickness becomes the rule instead of the exception, the patient begins to exist authentically, in a reality defined by a community of fellow sufferers.
I am not writing this essay as an author, nor as an activist, nor as a representative of the Iranian people. I am writing this as the daughter of former dissidents who lost the best days of their lives in the prisons of the Islamic Republic. As the daughter of a woman who was forced to give birth behind bars, interrogated while going through labor. As the niece of a man executed on a summer morning alongside thousands of others, his body swallowed by an unmarked mass grave. As the granddaughter of working-class grandparents who endured endless humiliations and hardships to raise grandchildren whose parents languished in the regime’s houses of horror.
F
Nearly 40% of new
Hamrah’s most recent project was Last Week in End Times Cinema, a weekly newsletter collecting together “pathetic and ridiculous” news stories about the movie business. (True to form, Hamrah blasted these digests out from his EarthLink account rather than bothering with Substack.) Those columns are now available in a separate collection as well. There are dispiriting headlines like “Winnie-the-Pooh: Blood and Honey 2 Ending Explained” and summaries of news stories about Sam Altman, the “eyebrowless CEO of OpenAI,” suggesting “AI might figure out on its own how to stop itself from ending the human race.” Read enough of these missives and it becomes obvious why studio heads were too focused on replacing actors with algorithms to properly market a film like Train Dreams, filing it away in Netflix’s library of slop after a curtailed theatrical release.

We think the current evidence is clear. By inference to the best explanation — the same reasoning we use in attributing general intelligence to other people — we are observing AGI of a high degree. Machines such as those envisioned by Turing have arrived. Similar arguments have been made before, and have engendered controversy and push-back. Our argument benefits from substantial advances and extra time. As of early 2026, the case for AGI is considerably more clear-cut.
NEW YORK — A Doberman pinscher named Penny was awarded best in show at the Westminster Kennel Club Dog Show on Tuesday night, the fifth of its breed to take the title of top dog. “You can’t attribute it to one thing, but she’s as great a Doberman as I’ve ever seen,” said her handler, Andy Linton, who previously won best in show 37 years ago and had hoped for a second victory, adding, “I had some goals, and this was one of them.”