cjheinz's blurblog

The talking dog by Seth Godin
Thursday July 17^th, 2025 at 8:18 AM

Seth's Blog

First mistake: If you meet a talking dog in the street and it makes a few grammatical errors or speaks with an accent, you don’t use a few errors to dismiss the fact that this is an actual talking dog. It’s amazing. It might even be worth having it join your team.

Second mistake: If a talking dog tells you something, that doesn’t mean it’s true. Check the work.

Even if your dog’s name is Claude.

What should we do now? As creatives, freelancers and impresarios gaze at the incoming AI revolution, it’s tempting to turn away and get back to work.

But what if this is the work?

There are plenty of caveats that come with the talking dogs of AI. Not just the hallucinations or the dislocations. There are issues of climate, of control and of access. But, as in all the past technology revolutions we’ve faced, highlighting the problems and walking away is probably not the best way to have influence or impact.

Just because candidates have flaws doesn’t mean you shouldn’t vote.

I’m launching a short new Udemy course on how I’m thinking about AI and how you might shift your perspective to put it to work. This is the biggest shift in our world since the invention of electricity, and you’re either going to work for an AI or an AI is going to work for you. Only one of these is a good option.

The course is discounted 40% for the next five days. It won’t give you any tricks or tips, but it might open your eyes to a different posture for dancing with the talking dog.

Read the whole story

cjheinz

1 day ago

reply

The talking dog, FTW!

Lexington, KY; Naples, FL

Why the federal government is making climate data disappear by Guest Author
Tuesday July 15^th, 2025 at 7:39 PM

Forward Kentucky

Why the federal government is making climate data disappear

For 25 years, a group of the country’s top experts has been fastidiously tracking the ways that climate change threatens every part of the United States. Their findings informed the National Climate Assessments, a series of congressionally mandated reports released every four years that translated the science into accessible warnings for policymakers and the public. But that work came to a halt this spring when the Trump administration abruptly dismissed all 400 experts working on the next edition. Then, on June 30, all of the past reports vanished too, along with the federal website they lived on.

A lot of information about the changing climate has disappeared under President Donald Trump’s second term, but the erasure of the National Climate Assessments is “by far the biggest loss we’ve seen,” said Gretchen Gehrke, who monitors federal websites with the Environmental Data and Governance Initiative. The National Climate Assessments were one of the most approachable resources that broke down how climate change will affect the places people care about, she said. The reports were also used by a wide swath of stakeholders — policymakers, farmers, businesses — to guide their decisions about the future. While the reports have been archived elsewhere, they’re no longer as easy to access. And it’s unclear what, if anything, will happen to the report that was planned for 2027 or 2028, which already existed in draft form.

So why did the reports survive Trump’s first term, but not his second? You could view their disappearance in a few different ways, experts said — as a flex of executive power, an escalation in the culture war over climate change, or a strategic attempt to erase the scientific foundation for climate policy. “If you suppress information and data, then you don’t have the evidence you need to be able to create regulations, strengthen regulations, and even to combat the repeal of regulations,” Gehrke said.

This isn’t climate denial in the traditional sense. The days of loudly debating the science have mostly given way to something quieter and more insidious: a campaign to withhold the raw information itself. “I don’t know if we’re living in climate denial anymore,” said Leah Aronowsky, a science historian at Columbia Climate School. “We have this new front of denial by erasure.”

By cutting funding for research and withholding crucial data, the Trump administration is making it harder to know exactly how the planet is changing. In April, the administration pulled nearly $4 million in funding from a Princeton program to improve computer models predicting changes in the oceans and atmosphere, claiming the work created “climate anxiety” among young people. That same month, the Environmental Protection Agency failed to submit its annual report to the United Nations detailing the country’s greenhouse gas emissions. In May, the National Oceanic and Atmospheric Administration ended its 45-year tradition of tracking billion-dollar weather disasters. Trump also hopes to shut down the Mauna Loa laboratory in Hawaiʻi, which has measured the steady rise in atmospheric levels of carbon dioxide since the 1950s — the first data to definitively show humans were changing the climate.

“This kind of wholesale suppression of an entire field of federally sponsored research, to my knowledge, is historically unprecedented,” Aronowsky said.

In a response to a request for comment, a NASA spokesperson said that it has “no legal obligations to host globalchange.gov’s data,” referring to the site that hosted the National Climate Assessments, adding that the U.S. Global Change Research Program had already “met its statutory requirements by presenting its reports to Congress.” The EPA directed Grist to a webpage containing past greenhouse gas emissions reports, as well as a version of what was supposed to be this year’s report obtained by the Environmental Defense Fund. However, the agency confirmed that the latest data has not been officially released. The White House declined to comment, and the National Oceanic and Atmospheric Administration did not respond.

Last year, a leaked training video from Project 2025 — the policy roadmap organized by The Heritage Foundation, a conservative think tank — showed a former Trump official declaring that political appointees would have to “eradicate climate change references from absolutely everywhere.” The strategy appears to be designed to boost the fossil fuel industry at a time when clean energy has become competitive and the reality of climate change harder to dismiss, as floods, fires, and heat waves have become perceptibly worse. “We will drill, baby, drill,” Trump said in his inauguration speech in January.

The administration hasn’t exactly been subtle about its endgame. Lee Zeldin, the head of the EPA, doesn’t deny the reality of climate change (he calls himself a “climate realist”), but he’s zealously dismantled environmental programs and has recommended that the White House strike down the “endangerment finding,” the bedrock of U.S. climate policy. It comes from a 2007 Supreme Court ruling on the Clean Air Act that required the EPA to regulate greenhouse gases as pollutants since they endanger public health. If the administration can convince the courts that climate change isn’t a health consideration, it could end that regulatory obligation.

“If you’re removing information about climate change, its reality, and its impact on people, then I think it’s a lot easier to make the case that it’s not an environmental health issue,” Gehrke said.

There’s a word for the idea that ignorance can serve political ends: agnotology (from the Greek “agnosis,” or “not knowing”), the study of how knowledge is deliberately obscured. What Trump is doing to information about climate change fits squarely in that tradition, according to Aronowsky: “If you remove it, then in a certain sense, it no longer exists, and therefore, there’s nothing to even debate, right?”

Climate denial first took off in the 1990s, when the oil and gas companies and industry-friendly think tanks started sowing doubt about climate science. Over the decades, as the evidence became rock-solid, those who opposed reducing the use of fossil fuels gradually shifted from outright denying the facts to attacking solutions like wind and solar power. What the Trump administration is doing now marks a radical break from this long-term trend, said John Cook, a climate misinformation researcher at The University of Melbourne in Australia. “This is a 180, not just a turn, but diving into something we’ve never even seen before,” he said. On the other hand, Cook said, the administration is taking a classic climate denial tactic — painting scientists as “alarmists” or conspirators who can’t be trusted — and turning it into government policy.

Half a year in, the second Trump administration’s treatment of climate information hasn’t yet reached the “eradication” levels that Project 2025 aspired to, at least on government websites. The EPA’s climate change website, for instance, is still up and running, even though all references to the phenomenon were erased on the agency’s home page. Most of the website deletions so far have served to isolate climate change as an issue, erasing its relationship to topics such as health and infrastructure, Gehrke said. Up until the National Climate Assessments disappeared, she would have said that “climate erasure” was an inappropriate characterization of what’s happening. “But now, I’m really not so sure,” she said.

Rachel Cleetus, the senior policy director with the Union of Concerned Scientists, thinks that the administration’s actions actually go beyond erasure. “They’re literally trying to change the basis on which a lot of policymaking is advanced — the science basis, the legal basis, and the economic basis,” she said. Her biggest concern isn’t just what facts have been removed, but what political propaganda might replace them. “That’s more dangerous, because it really leaves people in this twilight zone, where what’s real, and what’s important, and what is going to affect their daily lives is just being obfuscated.”

This story was updated on 7/14/25 to include a response from NASA received after the time of publication.

--30--

Written by Kate Yoder. Cross-posted from Grist.

Read the whole story

cjheinz

2 days ago

reply

The people behind this are guilty of crimes against humanity. They should pay the ultimate price for the millions of lives that will be lost due to their greed.

Lexington, KY; Naples, FL

Tariff Tantrums by Robert Kuttner
Monday July 14^th, 2025 at 8:33 PM

TAP : The American Prospect

Today on TAP: Don’t look for a grand strategy in Trump’s tariff policy—it’s mainly about impulsivity and bravado.

Read the whole story

cjheinz

3 days ago

reply

Bully’s gotta bully. So sad that our best neighbors, Canada & Mexico, get the worst of it.
Ashamed to be a US citizen. Do people actually like bullies???

Lexington, KY; Naples, FL

The Media’s Pivot to AI Is Not Real and Not Going to... by Jason Kottke
Monday July 14^th, 2025 at 8:18 PM

kottke.org

The Media’s Pivot to AI Is Not Real and Not Going to Work. “The only AI-related business strategy that makes any sense whatsoever is one where media companies and journalists go to great pains to show their audiences that they are human beings…”

Read the whole story

cjheinz

3 days ago

reply

I posted about this article. Really great work.

Lexington, KY; Naples, FL

The Media's Pivot to AI Is Not Real and Not Going to Work by Jason Koebler
Monday July 14^th, 2025 at 10:46 AM

404 Media

The Media's Pivot to AI Is Not Real and Not Going to Work

On May 23, we got a very interesting email from Ghost, the service we use to make 404 Media. “Paid subscription started,” the email said, which is the subject line of all of the automated emails we get when someone subscribes to 404 Media. The interesting thing about this email was that the new subscriber had been referred to 404 Media directly from chatgpt.com, meaning the person clicked a link to 404 Media from within a ChatGPT window. It is the first and only time that ChatGPT has ever sent us a paid subscriber.

From what I can tell, ChatGPT.com has sent us 1,600 pageviews since we founded 404 Media nearly two years ago. To give you a sense of where this slots in, this is slightly fewer than the Czech news aggregator novinky.cz, the Hungarian news portal Telex.hu, the Polish news aggregator Wykop.pl, and barely more than the Russian news aggregator Dzen.ru, the paywall jumping website removepaywall.com, and a computer graphics job board called 80.lv. In that same time, Google has sent roughly 3 million visitors, or 187,400 percent more than ChatGPT.

This is really neither here nor there because we have tried to set our website up to block ChatGPT from scraping us, though it is clear this is not always working. But even for sites that don’t block ChatGPT, new research from the internet infrastructure company CloudFlare suggests that OpenAI is crawling 1,500 individual webpages for every one visitor that it is sending to a website. Google traffic has begun to dry up as both Google’s own AI snippets and AI-powered SEO spam have obliterated the business models of many media websites.

This general dynamic—plummeting traffic because of AI snippets, ChatGPT, AI slop, Twitter no workie so good no more—has been called the “traffic apocalypse” and has all but killed some smaller websites and has been blamed by executives for hundreds of layoffs at larger ones.

Despite the fact that generative AI has been a destructive force against their businesses, their industry, and the truth more broadly, media executives still see AI as a business opportunity and a shiny object that they can tell investors and their staffs that they are very bullish on. They have to say this, I guess, because everything else they have tried hasn’t worked, and pretending that they are forward thinking or have any clue what they are doing will perhaps allow a specific type of media executive to squeeze out a few more months of salary.

But pivoting to AI is not a business strategy. Telling journalists they must use AI is not a business strategy. Partnering with AI companies is a business move, but becoming reliant on revenue from tech giants who are creating a machine that duplicates the work you’ve already created is not a smart or sustainable business move, and therefore it is not a smart business strategy. It is true that AI is changing the internet and is threatening journalists and media outlets. But the only AI-related business strategy that makes any sense whatsoever is one where media companies and journalists go to great pains to show their audiences that they are human beings, and that the work they are doing is worth supporting because it is human work that is vital to their audiences. This is something GQ’s editorial director Will Welch recently told New York magazine: “The good news for any digital publisher is that the new game we all have to play is also a sustainable one: You have to build a direct relationship with your core readers,” he said.

Becoming an “AI-first” media company has become a buzzword that execs can point at to explain that their businesses can use AI to become more ‘efficient’ and thus have a chance to become more profitable. Often, but not always, this message comes from executives who are laying off large swaths of their human staff.

In May, Business Insider laid off 21 percent of its workforce. In her layoff letter, Business Insider’s CEO Barbara Peng said “there’s a huge opportunity for companies who harness AI first.” She told the remaining employees there that they are “fully embracing AI,” “we are going all-in on AI,” and said “over 70 percent of Business Insider employees are already using Enterprise ChatGPT regularly (our goal is 100%), and we’re building prompt libraries and sharing everyday use cases that help us work faster, smarter, and better.” She added they are “exploring how AI can boost operations across shared services, helping us scale and operate more efficiently.”

Last year, Hearst Newspapers executives, who operate 78 newspapers nationwide, told the company in an all-hands meeting audio obtained by 404 Media that they are “leaning into [AI] as Hearst overall, the entire corporation.” Examples given in the meeting included using AI for slide decks, a “quiz generation tool” for readers, translations, a tool called Dispatch, which is an email summarization tool, and a tool called “Assembly,” which is “basically a public meeting monitor, transcriber, summarizer, all in one. What it does is it goes into publicly posted meeting videos online, transcribes them automatically, [and] automatically alerts journalists through Slack about what’s going on and links to the transcript.”

The Washington Post and the Los Angeles Times are doing all sorts of fucked up shit that definitely no one wants but are being imposed upon their newsrooms because they are owned by tech billionaires who are tired of losing money. The Washington Post has an AI chatbot and plans to create a Forbes contributor-esque opinion section with an AI writing tool that will assist outside writers. The Los Angeles Times introduced an AI bot that argues with its own writers and has written that the KKK was not so bad, actually. Both outlets have had massive layoffs in recent months.

The New York Times, which is actually doing well, says it is using AI to “create initial drafts of headlines, summaries of Times articles and other text that helps us produce and distribute the news.” Wirecutter is hiring a product director for AI and recently instructed its employees to consider how they can use AI to make their journalism better, New York magazine reported. Kevin Roose, an, uhh, complicated figure in the AI space, said “AI has essentially replaced Google for me for basic questions,” and said that he uses it for “brainstorming.” His Hard Fork colleague Casey Newton said he uses it for “research” and “fact-checking.”

Over at Columbia Journalism Review, a host of journalists and news execs, myself included, wrote about how AI is used in their newsrooms. The responses were all over the place and were occasionally horrifying, and ranged from people saying they were using AI as personal assistants to brainstorming partners to article drafters.

In his largely incoherent screed that shows how terrible he was at managing G/O Media, which took over Deadspin, Kotaku, Jezebel, Gizmodo, and other beloved websites and ran them into the ground at varying speeds, Jim Spanfeller nods at the “both good and perhaps bad” impacts of AI on news. In a truly astounding passage of a notably poorly written letter that manages to say less than nothing, he wrote: “AI is a prime example. It is here to a degree but there are so many more shoes to drop [...] Clearly this technology is already having a profound impact. But so much more is yet to come, both good and perhaps bad depending on where you sit and how well monitored and controlled it is. But one thing to keep in mind, consumers seek out content for many reasons. Certainly, for specific knowledge, which search and search like models satisfy in very effective ways. But also, for insights, enjoyment, entertainment and inspiration.”

At the MediaPost Publishing Insider Conference, a media industry business conference I just went to in New Orleans, there was much chatter about AI. Alice Ting, an executive for the Daily Mail gave a pretty interesting talk about how the Daily Mail is protecting its journalism from AI scrapers in order to eventually strike deals with AI companies to license their content.

“What many of you have seen is a surge in scraping of our content, a decline in traffic referrals, and an increase in hallucinated outputs that often misrepresent our brands,” Ting said. “Publishers can provide decades of vetted and timestamped content, verified, fact checked, semantically organized, editorially curated. And in addition offer fresh content on an almost daily basis.”

Ting is correct in that several publishers have struck lucrative deals with AI companies, but she also suggested that AI licensing would be a recurring revenue stream for publishers, which would require a series of competing LLMs to want to come in and license the same content over and over again. Many LLMs have already scraped almost everything there is to scrape, it’s not clear that there are going to consistently be new LLMs from companies wanting to pay to train on data that other LLMs have already trained on, and it’s not clear how much money the Daily Mail’s blogs of the day are going to be worth to an AI company on an ongoing basis. Betting that this time, hinging the future of our industry on massive, monopolistic tech giants will work out is the most Lucy with the football thing I can imagine.

There is not much evidence that selling access to LLMs will work out in a recurring way for any publisher, outside of the very largest publishers like, perhaps, the New York Times. Even at the conference, panel moderator Upneet Grover, founder of LH2 Holdings, which owns several smaller blogs, suggested that “a lot of these licensing revenues are not moving the needle, at least from the deals we’ve seen, but there’s this larger threat of more referral traffic being taken away from news publishers [by AI].”

In my own panel at the conference I made the general argument that I am making in this article, which is that none of this is going to work.

“We’re not just competing against large-scale publications and AI slop, we are competing against the entire rest of the internet. We were publishing articles and AI was scraping and republishing them within five minutes of us publishing them,” I said. “So many publications are leaning into ‘how can we use AI to be more efficient to publish more,’ and it’s not going to work. It’s not going to work because you’re competing against a child in Romania, a child in Bangladesh who is publishing 9,000 articles a day and they don’t care about facts, they don’t care about accuracy, but in an SEO algorithm it’s going to perform and that’s what you’re competing against. You have to compete on quality at this point and you have to find a real human being audience and you need to speak to them directly and treat them as though they are intelligent and not as though you are trying to feed them as much slop as possible.”

It makes sense that journalists and media execs are talking about AI because everyone is talking about AI, and because AI presents a particularly grave threat to the business models of so many media companies. It’s fine to continue to talk about AI. But the point of this article is that “we’re going to lean into AI” is not a business model, and it’s not even a business strategy, any more than pivoting to “video” was a strategy or chasing Facebook Live views was a strategy.

In a harrowing discussion with Axios, in which he excoriates many of the deals publishers have signed with OpenAI and other AI companies, Matthew Prince, the CEO of Cloudflare, said that the AI-driven traffic apocalypse is a nightmare for people who make content online: “If we don’t figure out how to fix this, the internet is going to die,” he said.

So AI is destroying traffic, ripping off our work, creating slop that destroys discoverability and further undermines trust, and allowing random people to create news-shaped objects that social media and search algorithms either can’t or don’t care to distinguish from real news. And yet media executives have decided that the only way to compete with this is to make their workers use AI to make content in a slightly more efficient way than they were already doing journalism.

This is not going to work, because “using AI” is not a reporting strategy or a writing strategy, and it’s definitely not a business strategy.

AI is a tool (sorry!) that people who are bad at their jobs will use badly and that people who are good at their jobs will maybe, possibly find some uses for. People who are terrible at their jobs (many executives), will tell their employees that they “need” to use AI, that their jobs depend on it, that they must become more productive, and that becoming an AI-first company is the strategy that will save them from the old failed strategy, which itself was the new strategy after other failed business models.

The only journalism business strategy that works, and that will ever work in a sustainable way, is if you create something of value that people (human beings, not bots) want to read or watch or listen to, and that they cannot find anywhere else. This can mean you’re breaking news, or it can mean that you have a particularly notable voice or personality. It can mean that you’re funny or irreverent or deeply serious or useful. It can mean that you confirm people’s priors in a way that makes them feel good. And you have to be trustworthy, to your audience at least. But basically, to make money doing journalism, you have to publish “content,” relatively often, that people want to consume.

This is not rocket science, and I am of course not the only person to point this out. There have been many, many features about the success of Feed Me, Emily Sundberg’s newsletter about New York, culture, and a bunch of other stuff. As she has pointed out in many interviews, she has been successful because she writes about interesting things and treats her audience like human beings. The places that are succeeding right now are individual writers who have a perspective, news outlets like WIRED that are fearless, publications that have invested in good reporters like The Atlantic, publications that tell you something that AI can’t, and worker owned, journalist-run outlets like us, Defector, Aftermath, Hellgate, Remap, Hearing Things, etc. There are also a host of personality-forward, journalism-adjacent YouTubers, TikTok influencers, and podcasters who have massive, loyal audiences, yet most of the traditional media is utterly allergic to learning anything from them.

There was a short period of time where it was possible to make money by paying human writers—some of them journalists, perhaps—to spam blog posts onto the internet that hit specific keywords, trending topics, or things that would perform well on social media. These were the early days of Gawker, Buzzfeed, VICE, and Vox. But the days of media companies tricking people into reading their articles using SEO or hitting a trending algorithm are over.

They are over because other people are doing it better than them now, and by “better,” I mean, more shamelessly and with reckless abandon. As we have written many times, news outlets are no longer just competing with each other, but with everyone on social media, and Netflix, and YouTube, and TikTok, and all the other people who post things on the internet. They are not just up against the total fracturing of social media, the degrading and enshittification of the discovery mechanisms on the internet, algorithms that artificially ding links to articles, AI snippets and summaries, etc. They are also competing with sophisticated AI slop and spam factories often being run by people on the other side of the world publishing things that look like “news” that is being created on a scale that even the most “efficient” journalist leveraging AI to save some perhaps negligible amount of time cannot ever hope to measure up to.

Every day, I get emails from AI spam influencers who are selling tools that allow slop peddlers to clone any website with one click, automatically generate newsletters about any topic, or generate plausible-seeming articles that are engineered to perform well in a search algorithm. Examples: “Clone any website in 9 seconds with Clonely AI,” “The future of video creation is here—and it’s faceless, seamless & limitless,” “just a straightforward path to earning 6-figures with an AI-powered newsletter that’s working right now.” These people do not care at all about truth or accuracy or our information ecosystem or anything else that a media company or a journalist would theoretically care about. If you want an example of what this looks like, consider the series of “Good Day” newsletters, which are AI generated and are in 355 small towns across America, many of which no longer have newspapers. These businesses are economically viable because they are being run by one person (or a very small team of people) who disproportionately live in low cost of living areas and who have essentially zero overhead.

And so becoming more “efficient” with AI is the wrong thing to do, and it’s the wrong thing to ask any journalist to do. The only thing that media companies can do in order to survive is to lean into their humanity, to teach their journalists how to do stories that cannot be done by AI, and to help young journalists learn the skills needed to do articles that weave together complicated concepts and, again, that focus on our shared human experience, in a way that AI cannot and will never be able to.

AI as buzzword and shiny object has been here for a long time. And I actually do not think AI is fake and sucks (I also don’t really believe that anyone thinks AI is “fake,” because we can see the internet collapsing around us). We report every day on the ways that AI is changing the web, in part because it is being shoved down our throats by big tech companies, spammers, etc. But I think that Princeton’s Arvind Narayanan and Sayash Kapoor are basically correct when they say that AI is “normal technology” that will not change everything but that over time will lead to modest improvements in people’s workflows as they get integrated into existing products or as they help around the edges. We—yes, even you—are using some version of AI, or some tools that have LLMs or machine learning in them in some way shape or form already, even if you hate such tools.

In early 2023, when I was the editor-in-chief of Motherboard, I was asked to put together a presentation for VICE executives about AI, and how I thought it would change both our journalism and the business of journalism. The reason I was asked to do this was because our team was writing a lot about AI, and there was a sense that the company could do something with AI to make money, or do better journalism, or some combination of those things. There was no sense or thought at the time, at least from what I was told, that VICE was planning to use AI as a pretext for replacing human journalists or cutting costs—it had already entered a cycle where it was constantly laying off journalists—but there was a sense that this was going to be the big new opportunity/threat, a new potential savior for a company that had already created a “virtual office” in Decentraland, a crypto-powered metaverse that last year had 42 daily active users.

I never got to give the presentation, because the executive who asked me to put it together left the company, and the new people either didn’t care or didn’t have time for me to give it. The company went bankrupt almost immediately after this change, and I left VICE soon after to make 404 Media with my co-founders, who also left VICE.

But my message at the time, and my message now two years later, is that AI has already changed our world, and that we have the opportunity to report on the technology as it already exists and is already being used—to justify layoffs, to dehumanize people, to spam the internet, etc. At the time, we had already written 840 articles that were tagged “AI,” which included articles about biased sentencing algorithms, predictive policing, facial recognition, deepfakes, AI romantic relationships, AI-powered spam and scams, etc.

The business opportunity then, as now, was to be an indispensable, very human guide to a technology that people—human beings—are making tons of money off of, using as an excuse to lay off workers, and are doing wild shit with. There was no magic strategy in which we could use AI to quadruple our output, replace workers, rise to the top of Google rankings, etc. There was, however, great risk in attempting to do this: “PR NIGHTMARE,” one of my slides about the risks of using AI I wrote said: “CNET plagiarism scandal. Big backlash from artists and writers to generative AI. Copyright issues. Race to the bottom.”

My other thought was that any efficiencies that could be squeezed out of AI, in our day-to-day jobs, were already being done so by good reporters and video producers at the company. There could be no top-down forced pivot to AI, because research and time-saving uses of AI were already being naturally integrated into our work by people who were smart in ways that were totally reasonable and mostly helpful, if not groundbreaking. The AI-as-force-multiplier was already happening, and while, yes, this probably helped the business in some way, it helped in ways that were not then and were never going to be actually perceptible to a company’s bottom line. AI was not a savior then, and it is not a savior now. For journalists and for media companies, there is no real “pivot to AI” that is possible unless that pivot means firing all of the employees and putting out a shittier product (which some companies have called a strategy). This is because the pivot has already occurred and the business prospects for media companies have gotten worse, not better. If Kevin Roose is using AI so much, in such a new and groundbreaking way, why aren’t his articles noticeably different than they ever were before, or why aren’t there way more of them than there were before? Where are the journalists who were formerly middling who are now pumping out incredible articles thanks to efficiencies granted by AI?

To be concrete: Many journalists, including me, at least sometimes use some sort of AI transcription tool for some of their less sensitive interviews. This saves me many hours, the tools have gotten better (but are still not perfect, and absolutely require double checking and should not be used for sensitive sources or sensitive stories). YouTube’s transcript feature is an incredible reporting tool that has allowed me to do stories that would have never been possible even a few years ago. YouTube’s built-in translations and subtitles, and its transcript tool are some of the only reasons that I was able to do this investigation into Indian AI slop creators, which allowed me to get the gist of what was happening in a given video before we handed them to human translators to get exact translations. Most podcasts I know of now use Descript, Riverside, or a similar tool to record and edit their podcasts; these have built-in AI transcription tools, built-in AI camera switching, and built-in text-to-video editing tools. Most media outlets use captioning that is built into Adobe Premiere or CapCut for their vertical videos and their YouTube videos (and then double check them). If you want to get extremely annoying about it, various machine learning algorithms are in ProTools, Audition, CapCut, Premiere, Canva, etc for things like photo editing, sound leveling, noise reduction, etc.

There are other journalists who feel very comfortable coding and doing data analysis and analyzing huge sets of documents. There are journalists out there who are already using AI to do some of these tasks and some of the resulting articles are surely good and could not have been done without AI.

But the people doing this well are doing so in a way where they are catching and fixing AI hallucinations, because the stakes for fucking up are so incredibly high. If you are one of the people who is doing this, then, great. I have little interest in policing other people’s writing processes so long as they are not publishing AI fever dreams or plagiarizing, and there are writers I respect who say they have their little chats with ChatGPT to help them organize their thoughts before they do a draft or who have vibecoded their own productivity tools or data analysis tools. But again, that’s not a business model. It’s a tool that has enabled some reporters to do their jobs, and, using their expertise, they have produced good and valuable work. This does not mean that every news outlet or every reporter needs to learn to shove the JFK documents into ChatGPT and have it shit out an investigation.

I also know that our credibility and the trust of our audience is the only thing that separates us from anyone else. It is the only “business model” that we have and that I am certain works: We trade good, accurate, interesting, human articles for money and attention. The risks of offloading that trust to an AI in a careless way is the biggest possible risk factor that we could have as a business. Having an article go out where someone goes “Actually, a robot wrote this,” is one of the worst possible things that could ever happen to us, and so we have made the brave decision to not do that.

This is part of what is so baffling about the Chicago Sun Times’ response to its somewhat complicated summer guide AI-generated reading list fiasco. Under its new owners, Chicago Public Media, The Sun Times has in recent years spent an incredible amount of time and effort rebuilding the image and good will that its previous private equity owners destroyed. And yet in its apology note, Melissa Bell, the CEO of Chicago Public Media, said that more AI is coming: “Chicago Public Media will not back away from experimenting and learning how to properly use AI,” she wrote, adding that the team was working with a fellow paid for by the Lenfest Institute, a nonprofit funded by OpenAI and Microsoft.

Bell does realize what makes the paper stand apart, though: “We must own our humanity,” Bell wrote. “Our humanity makes our work valuable.”

This is something that the New York Times’s Roose recently brought up that I thought was quite smart and yet is not something that he seems to have internalized when talking about how AI is going to change everything and that its widespread adoption is inevitable and the only path forward: “I wonder if [AI is] going to catalyze some counterreaction,” he said. “I’ve been thinking a lot recently about the slow-food movement and the farm-to-table movement, both of which came up in reaction to fast food. Fast food had a lot going for it—it was cheap, it was plentiful, you could get it in a hurry. But it also opened up a market for a healthier, more artisanal way of doing things. And I wonder if something similar will happen in creative industries—a kind of creative renaissance for things that feel real and human and aren’t just outputs from some A.I. company’s slop machine.”

This has ALREAAAAADDDDYYYYYY HAPPPENEEEEEDDDDDD, and it is quite literally the only path forward for all but perhaps the most gigantic of media companies. There is no reason for an individual journalist or an individual media company to make the fast food of the internet. It’s already being made, by spammers and the AI companies themselves. It is impossible to make it cheaper or better than them, because it is what they exist to do. The actual pivot that is needed is one to humanity. Media companies need to let their journalists be human. And they need to prove why they’re worth reading with every article they do.

Read the whole story

cjheinz

4 days ago

reply

A long article, but spot on. I am going to excerpt from it.

Lexington, KY; Naples, FL

How o3 and Grok 4 Accidentally Vindicated Neurosymbolic AI by Gary Marcus
Sunday July 13^th, 2025 at 4:31 PM

Marcus on AI

Machine learning, the branch of AI concerned with tuning algorithms from data, is an amazing field that has changed the world — and will continue doing so. But it is also filled with closed-minded egotists with too much money, and too much power.

This is a story, in three acts, spanning four decades, about how many of them tried, ultimately unsuccessfully, to keep a good idea, neurosymbolic AI, down—only to accidentally vindicate that idea in the end.

§

For those who are unfamiliar with the field’s history, or who think it began only in 2012, AI has been around for many decades, split, almost since its very beginning, into two different traditions.

One is the neural network or “connectionist” tradition which goes back to the 1940s and 1950s, first developed by Frank Rosenblatt, and popularized, advanced and revived by Geoffrey Hinton, Yann LeCun, and Yoshua Bengio (along with many others, including most prominently, Juergen Schmidhuber who rightly feels that his work has been under-credited), and brought to current form by OpenAI and Google. Such systems are statistical, very loosely inspired by certain aspects of the brain (viz. the “nodes” in neural networks are meant to be abstractions of neurons), and typically trained on large-scale data. Large Language Models (LLMs) grew out of that tradition.

The other is the symbol-manipulation tradition, with roots going back to Bertrand Russell and Gottlob Frege, and John von Neumann and Alan Turing, and the original godfathers of AI, Herb Simon, Marvin Minsky, and John McCarthy, and even Hinton’s great-great-great-grandfather George Boole. In this approach, symbols and variables stand for abstractions; mathematical and logical functions are core. Systems generally represent knowledge explicitly, often in databases, and typically make extensive use of (are written entirely in) classic computer programming languages. All of the world’s software relies on it.

For thirty years, I have been arguing for a reconciliation between the two, neurosymbolic AI. The core notion has always been that the two main strands of AI—neural networks and symbolic manipulation—complement each other, with different strengths and weaknesses. In my view, neither neural networks nor classical AI can really stand on their own. We must find ways to bring them together.

After a thirty-year journey, I believe that neurosymbolic AI’s moment has finally arrived, in part from an unlikely place.

§

In her bestseller Empire of AI, Karen Hao crisply sets the stage.

She begins by neatly distilling the scientific tension.

Hinton and Sutskever continued [after their seminal 2012 article on deep learning] to staunchly champion deep learning. Its flaws, they argued, are not inherent to the approach itself. Rather they are the artifacts of imperfect neural-network design as well as limited training data and compute. Some day with enough of both, fed into even better neural networks, deep learning models should be able to completely shed the aforementioned problems. "The human brain has about 100 trillion parameters, or synapses," Hinton told me in 2020.
"What we now call a really big model, like GPT-3, has 175 billion. It's a thousand times smaller than the brain.
"Deep learning is going to be able to do everything," he said.
Their modern-day nemesis was Gary Marcus, a professor emeritus of psychology and neural science at New York University, who would testify in Congress next to Sam Altman in May 2023. Four years earlier, Marcus coauthored a book called Rebooting AI, asserting that these issues were inherent to deep learning. Forever stuck in the realm of correlations, neural networks would never, with any amount of data or compute, be able to understand causal relationships-why things are the way they are-and thus perform causal reasoning. This critical part of human cognition is why humans need only learn the rules of the road in one city to be able to drive proficiently in many others, Marcus argued.
Tesla's Autopilot, by contrast, can log billions of miles of driving data and still crash when encountering unfamiliar scenarios or be fooled with a few strategically placed stickers. Marcus advocated instead for combining connectionism and symbolism, a strain of research known as neuro-symbolic AI. Expert systems can be programmed to understand causal relationships and excel at reasoning, shoring up the shortcomings of deep learning. Deep learning can rapidly update the system with data or represent things that are difficult to codify in rules, plugging the gaps of expert systems. "We actually need both approaches," Marcus told me.

She goes on to point out that the field has become an intellectual monoculture, with the neurosymbolic approach largely abandoned, and massive funding going to the pure connectionist (neural network) approach:

Despite the heated scientific conflict, however, the funding for AI development has continued to accelerate almost exclusively in the pure connectionist direction. Whether or not Marcus is right about the potential of neurosymbolic Al is beside the point; the bigger root issue has been the whittling down and weakening of a scientific environment for robustly exploring that possibility and other alternatives to deep learning.
For Hinton, Sutskever, and Marcus, the tight relationship between corporate funding and AI development also affected their own careers.

Hao then captures OpenAI’s sophomoric attitude towards fair scientific criticism:

Over the years, Marcus would become one of the biggest critics of OpenAI, writing detailed takedowns of its research and jeering its missteps on social media. Employees created an emoji of him on the company Slack to lift up morale after his denouncements and to otherwise use as a punch line. In March 2022, Marcus wrote a piece for Nautilus titled "Deep Learning Is Hitting a Wall”, repeating his argument that OpenAI's all-in approach to deep learning would lead it to fall short of true AI advancements. A month later, OpenAI released DALL-E 2 to immense fanfare, and Brockman cheekily tweeted a DALL-E 2-generated image using the prompt "deep learning hitting a wall.” The following day, Altman followed with another tweet: "Give me the confidence of a mediocre deep learning skeptic." Many OpenAI employees relished the chance to finally get back at Marcus.

But then again, as the saying goes, he who laughs last, laughs loudest.

§

For all the efforts that OpenAI and other leaders of deep learning, such as Geoffrey Hinton and Yann LeCun, have put into running neurosymbolic AI, and me personally, down over the last decade, the cutting edge is finally, if quietly and without public acknowledgement, tilting towards neurosymbolic AI.

This essay explains what neurosymbolic AI is, why you should believe it, how deep learning advocates long fought against it, and how in 2025, OpenAI and xAI have accidentally vindicated it.

And it is about why, in 2025, neurosymbolic AI has emerged as the team to beat.

It is also an essay about sociology.

§

The essential premise of neurosymbolic AI is this: the two most common approaches to AI, neural networks and classical symbolic AI, have complementary strengths and weaknesses. Neural networks are good at learning but weak at generalization; symbolic systems are good at generalization, but not at learning.

Since my first book in 2001, The Algebraic Mind, I have been arguing that three ideas drawn from classical symbol-manipulation are indispensable:

“Algebraic” systems such as algorithms, equations, and computer code, in which variables, and operations over those variables can be explicitly specified. In the equation, y = 3x + 2, x and y are variables, and one can instantiate (fill in) those variables with particular values, for example computing that y = 11 if x is set to 3. That sort of abstraction is the essence of computer programs. (Much of my empirical work as a cognitive scientist went toward showing that humans, even infants, could do something analogously algebraic).
Systems for explicitly representing structured, symbolic representations such that, for example, horse rides astronaut systematically mean something different from astronaut rides horse, and such that wholes can in general be predicted compositionally as a function of their parts.
Database-like systems for distinguishing individuals from kinds. In 2001, in the above-mentioned book, I warned that in their absence, hallucinations would emerge as a form of overgeneralization, which I warned was “an inevitable downside”. A quarter century later, hallucinations remain ubiquitous. Within a neural network substrate, the problem still has not been solved.

These three requirements might sound obvious—especially to someone trained in both computer science and cognitive science—but for decades the field of neural networks tried to make do without them.

Instead, until very recently, the consistent move by mainstream machine learning had been to try to derive all that is needed from data, without any recourse at all to symbolic systems such as traditional computer code, databases, etc., aiming to replace explicit representation with black boxes.

Over the years, many in the machine learning field derided attempts to use symbolic tools, often ridiculing them (without genuine argument) as being “not biologically plausible” or somehow (never really specified) in-principle ineffective.

§

Most iconic and influential among the detractors of neurosymbolic approaches has been Geoffrey Hinton.

Hinton has repeatedly argued that the pursuit of symbol-manipulation—even in the context of hybrid neurosymbolic models—is a huge scientific mistake. In a 2015 workshop at Stanford, for example, Hinton gave a talk that claimed that classical AI symbols were “as incorrect as the belief that a lightwave can only travel through space by causing disturbances in the luminiferous aether.”

Another time, in 2018, Hinton told an audience of the G7 to great laughs that hybridizing neural networks and symbolic systems (which is to say neurosymbolic AI) was as foolish as sticking electricity onto a gas engine instead of just making an electric car, arguing that “all progress recently [has come from] sucking in data, not from people putting programs inside.”

(Some of this was driven by a scrabbling over funding, with Hinton alleging that investing in classical AI, and, by extension, hybrid neurosymbolic models, would be “a disaster.”)

§

Hinton’s long-time hostility against any role at all for symbols has, in my judgement, cost the field dearly. Ideas that were were only discovered in the last couple years (e.g., some discussed later in this essay) may have been discovered much later than they might otherwise have been.

Many other important ideas have likely also yet to be discovered, precisely because the Hinton path has distracted immense resources from other ideas, fostering an intellectual monoculture that, in the words of Emily Bender, has been “sucking the oxygen from the room.”

First among the big players to look more broadly was Google DeepMind, which wisely has not taken Hinton’s dogma overly seriously. AlphaFold, AlphaProof, and AlphaGeometry are all successful neurosymbolic models. We would likely have more of such innovation already, if Hinton had not so insistently shaped the modern landscape in such a narrow-minded way.

To the extent that we are stuck for now with untrustworthy LLMs that nobody can quite control, rapidly enshittifying the internet with botshit and replacing humans with systems that can’t be relied on, it is in part because we have spent too much energy pursuing the pure neural net black box approach, and not enough into looking at alternatives. We are drowning in a sea of mediocre prose precisely because LLMs, the backbone of virtually all current sytems, lack the underlying representational wherewithal to do better.

Yet nearly all funding has been aimed simply at making them larger.

§

Despite the often open hostility of Hinton and many of his followers, I have always stuck to my guns, never arguing that we should dispense with deep learning (a commonly-repeated strawman) but always calling, instead, for a hybrid of deep learning and symbols.

In my 2018 Deep Learning: A Critical Appraisal for example, I wrote

Despite all of the problems I have sketched, I don’t think that we need to abandon deep learning.
Rather, we need to reconceptualize it: not as a universal solvent, but simply as one tool among many, a power screwdriver in a world in which we also need hammers, wrenches, and pliers, not to mentions chisels and drills, voltmeters, logic probes, and oscilloscopes.

As we will see in a moment, people are finally pursuing a tool-driven, neurosymbolic vision like this — and getting better results than they were able to get with pure neural networks.

People tend to remember that paper for its criticism. There was a long discussion of the weakness of models that that were then-current, including their difficulties in out-of-domain generalization, with abstraction, and with reasoning, and their overreliance on massive data.

But one of the main points of the paper was its call for the integration of neural networks and symbols:

Another place that we should look is towards classic, “symbolic” AI, sometimes referred to as GOFAI (Good Old-Fashioned AI). Symbolic AI takes its name from the idea, central to mathematics, logic, and computer science, that abstractions can be represented by symbols. Equations like f = ma allow us to calculate outputs for a wide range of inputs, irrespective of whether we have seen any particular values before; lines in computer programs do the same thing (if the value of variable x is greater than the value of variable y, perform action a).
By themselves, symbolic systems have often proven to be brittle, but they were largely developed in [an] era with vastly less data and computational power than we have now. The right move today may be to integrate deep learning, which excels at perceptual classification, with symbolic systems, which excel at inference and abstraction. One might think such a potential merger on analogy to the brain; perceptual input systems, like primary sensory cortex, seem to do something like what deep learning does, but there are other areas, like Broca’s area and prefrontal cortex, that seem to operate at much higher level of abstraction. The power and flexibility of the brain comes in part from its capacity to dynamically integrate many different computations in real-time.

§

The leaders of deep learning hated me for challenging their baby, and couldn’t tolerate any praise for the paper. When an influential economist Erik Brynjolfsson (then at MIT) complimented the article on Twitter, (“Thoughtful insights from @GaryMarcus on why deep learning won't get us all the way to artificial general intelligence”), Hinton’s long time associate Yann LeCun tried to contain the threat, immediately replying to Brynjolffson publicly that the paper was “Thoughtful, perhaps. But mostly wrong nevertheless.”

LeCun was never able to articulate his concerns, but his remark wasn’t about the science. Instead, it was a signal to thefield that my views should be rejected; literally hundreds of people piled on. (Few seem to have noticed the irony that came a few years later, when LeCun ultimately came around to making almost exactly the same points I was making then, declaring, e.g., that pure LLMs are not an adequate route to AGI, and emphasizing their limits in reasoning.)

§

The story was similar when my 2022 article Deep Learning is Hitting a Wall came out. The first part of the paper was negative argument against a hypothesis that was then popular: “Scale is all you need.”

I argued that scaling as it was then defined (pretraining data and compute) alone was insufficient to solve challenges with reasoning, misalignment and hallucinations. (Spoiler alert: it still hasn’t.)

The second part was an argument that neurosymbolic AI might be a way out of this mess:

Early pioneers, like John McCarthy and Marvin Minsky, thought that one could build AI programs precisely by extending these techniques, representing individual entities and abstract ideas with symbols that could be combined into complex structures and rich stores of knowledge, just as they are nowadays used in things like web browsers, email programs, and word processors. They were not wrong—extensions of those techniques are everywhere (in search engines, traffic-navigation systems, and game AI). But symbols on their own have had problems; pure symbolic systems can sometimes be clunky to work with, and have done a poor job on tasks like image recognition and speech recognition; the Big Data regime has never been their forté. As a result, there’s long been a hunger for something else.
That’s where neural networks fit in [solving problems from data where rules are hard to construct]…
To me, it seems blazingly obvious that you’d want both approaches in your arsenal.
… “hybrid models” that incorporate elements of both deep learning and symbol-manipulation… To think that we can simply abandon symbol-manipulation is to suspend disbelief.

Animosity, rather than genuine intellectual engagement, was again the main result. Thousands of people, from Altman to Musk, insulted me and the paper (Hao listed two of the many examples). LeCun wrote on Facebook, in May 2022, a few months before he turned against LLMs, “Not only is Al not "hitting a wall", cars with Al-powered driving assistance aren't hitting walls, or anything else, either.” Musk circulated a meme that mentioned me which featured a cartoon of deep learning-powered robot smashing over walls and buildings. (An accidental forewarning of all the damage that recklessly-rolled out LLMs have started to cause?)

§

Three years later, the pure scaling of pretraining data (and compute, which is to say more and more GPUs) simply hasn’t worked. The specific obstacles that I dwelled on, with respect to hallucinations, misalignment and reasoning errors, have not been overcome.

This started to became clear in November. One of the first insiders to publicly acknowledge it was Marc Andreesen, who said in a November 2024 interview that multiple models were “sort of hitting the same ceiling on capabilities”; Microsoft CEO Satya Nadella soon more or less acknowledged the same (“in the last … weeks there [has been] a lot of debate [on whether we] have we hit the wall with scaling laws”).

§

What has worked—to some degree—is importing some of the ideas from neurosymbolic AI, including using purely symbolic algorithms as a direct part of the workflow.

OpenAI quietly began doing this to some degree in 2023 with a system called “code interpreter”, in which LLMs (themselves neural networks) call (purely symbolic) Python interpreters. This is literally “putting programs inside”; exactly what Hinton said was a huge mistake. And they are innately building in that capacity, rather than learning it.

Sometimes, not always, when a system like o3 calls Python, the symbolic code is actually reported explicitly:

from PIL import Image, ImageDraw
# Grid size for 60 fruits
rows, cols = 6, 10 #6 × 10= 60

radius = 45
spacing = 25

canvas_width - cols * (2 * radius + spacing) + spacing
canvas_height = rows * (2 * radius + spacing) + spacing

img = Image.new ("RGB", (canvas_width, canvas_height), "white")
draw = ImageDraw. Draw(img) 

# Colors
apple_color = (220,0,0)#deepred
…

As leading machine learning expert Francois Chollet explained last year on X, such systems are manifestly neurosymbolic,

“Obviously combining a code interpreter (which is a symbolic system of enormous complexity) with an LLM is neurosymbolic. AlphaGo was neurosymbolic as well. These are universally accepted definitions.”

Recent “reasoning” models borrow even more from classic symbolic approaches, such as techniques like search and conditionally iterating through multiple solutions, and aggregating the results, techniques that traditional networks had long eschewed, in favor of a single feed-forward pass through a neural net.

From what I can tell, most modern models also make heavy use of data augmentation that involves (inter alia) running systems of symbolic rules and training on their outputs, a far cry from the original notion of training on naturalistically-observed data and then inducing whatever needed to be learned. (Details on how all this is implemented are sketchy, with companies like OpenAI saying almost nothing about implementation. DeepSeek has been a notable exception, explicitly acknowledging the use of symbolic rules in generating their training data.)

GPT-2 was a pure LLM; with no direct use of symbol-manipulation. A lot of models since then haven’t been. GPT-4 probably wasn’t (I suspect it included some symbolic filters in its guardrails), and o3, when it invokes code interpreter and symbolically-executed control structures certainly isn’t.

Intriguingly, o3 itself seems to know (pardon my anthropomorphic shorthand) a bunch about this, recognizing that it needs to draw on the (symbolic) code interpreter for certain kinds of problems:

§

Both informal experimentation and a number of recent quantitative results make it abundantly clear that current generative AI systems perform better when they avail themselves of symbolic tools. (Again, per Chollet, when you combine an LLM with a symbolic tool, you have a neurosymbolic system.)

The much-discussed Apple paper was one hint: the authors explicitly forced LLMs to do various tasks, such as Tower of Hanoi, without recourse to using code (symbolic by definition), and it’s under that scenario that they found breakdowns, such as failures on Hanoi with 8 rings.

Another research group, explicitly responding to the Apple paper, showed how you could get much better performance on problems like Hanoi by having models like o3 explicitly invoke code.

I have found similar results in my own informal experiments, when looking at algorithmic processes. ChatGPT (which doesn’t as far as I can tell) currently use Code Interpreter, struggles mightily to draw crosswords grids, as Haym Hirsch recently noted, making weird errors like reporting that 4 down here is “REACT” when it as actually an illegal word, “RCRCT”:

S T A R T 
T R A C E
A C O R N 
R E A C T 
T E N T S

In contrast, o3, when it draws on symbolic code (which it sometimes displays explicitly) can build crossword grids like this far faster than I could (short of my writing my own code).

T R A S H
R E P A Y
A P P L E
S A L O N
H Y E N A

§

The results that seal the deal came a few nights ago, at the launch of Grok 4, perhaps the largest and most expensive model in history. (Elon Musk claims it used 100 times the compute of Grok 2.)

This graph (of a challenging benchmark known as Humanity’s Last Exam) is without question one of the most vindicating things I have ever seen:

Why? Two things can be seen.

First, the lower set of data (yellow dots) represent a strong test of the pure scaling hypothesis. Although the units on the X axis aren’t made clear, it’s quite clear that pure scaling of training data and computer is reaching a point of diminishing returns, a long way from peak possible performance. Contrary to lots of talk from previous years, increasing compute alone is not driving some miraculous exponential explosion into superintelligence.

Second, adding symbolic tools (orange dots) dramatically improves performance.

(Scaling test time is also, as far as I understand it, drawing on classic symbolic techniques for iteration and aggregation.)

You can see this in other benchmarks, too. On this math competition, adding symbolic tools make a big difference:

In short, although LLMs are still far from perfect, enhancing them with symbolic tools, once an anathema, makes a huge difference. Between the strong success of DeepMind’s series of neurosymbolic hybrids (AlphaGo, AlphaFold, AlphaProof, AlphaGeometry, etc.) and the more recent results in which LLMs have been enhanced with python and other tools to dramatic effect, we can safely conclude that neurosymbolic AI is on track to be a major part of AI’s future.

§

None of which means we are close to AGI. There are many ways to put together symbols and neural nets. I am not at all convinced we have the right one.

Bolting code interpreters onto LLMs has value, but whether it gets us to AGI is a different matter. A neurosymbolic substrate for AI by itself (as I argued in 2020) is likely to be necessary but not sufficient. Advances in neurosymbolic AI are likely to be just one part of a larger picture. As Peter Voss has argued (and I agree) we need far more integration than a mere bolt-on would provide. Including LLMs somewhere in the next evolution of AI makes sense to me, but leaving them at the core may be a mistake.

Models like o3 are far better than non-code-interpreter systems on some tasks but not all (for now o3 is actually worse than o1 on hallucinations) and effective for some tasks, but not others. Core problems like “symbol-grounding” have not been systematically solved, for example. Explicit cognitive models (as I discussed recently in this newsletter) are also likely to be essential, yet remain very much underexplored. Reasoning is still unreliable; spatial reasoning is a long way from solved, as Fei-Fei Li has recently argued. We still also (just as I argued in 2020) lack proper ways of building and inferring symbolic world/cognitive models on-the-fly. And we may need more discipline in how we tie semantics to the symbolic components of our systems. There is likely still a great deal of work left to be done.

Reality is maybe a bit like a cartoon I recently posted on X. A hiker climbs across a mountain range, impressively far along (“you are here”), but nowhere near the highest peak (“AGI”). We have made some progress; we still likely have a long way to go.

§

Still, even at our current vantage point, having solved some problems but by no means all, there are several lessons we can draw:

Systems that draw on code interpreter, and hence by definition are neurosymbolic, often outperform those that lack symbolic tools, at least in a bunch of tasks. This is confirmation of the value of neurosymbolic AI.
Hinton’s admonition against using symbolic AI was extraordinarily misguided, and probably delayed discovery of useful tools like Code Interpreter by years. Largely because of Hinton’s influence, the field still talks about neurosymbolic AI as if it were a dirty word. That’s likely holding back the science.
It’s likely that OpenAI, Anthropic and others are already doing more neurosymbolic AI than they actually make public. They still pretend it’s all “a model” but that model uses all kinds of tricks and mixtures with tools, code generation etc.
A lot of recent improvements are likely coming from improvements in using symbolic tools, rather than scaling. Massive infrastructure investments like Stargate are likely based on misleading impressions of what is actually driving progress.
It would appear that it is only recently, in desperation, as scaling started to reach diminishing returns, that mainstream machine learning began to broaden its vistas.
Where the field is getting stuck now is likely on how to scale these new, neurosymbolic tools and how to make models “understand” when to use what and how to logically combine the context. This is why so called “code agents” inevitably fall apart as soon as one gives them a more complex task which requires reasoning.
The code construction itself is still produced by a system that is highly dependent on similarity to training examples. In easy, familiar cases, the results can be astonishing. In problems that diverge from easy and familiar cases, the constructed code can be buggy, or altogether flawed. As Phil Libin put it to me, “trying to improve the results [of o3] by continuing to talk with it rarely works after the first few prompts… It’s either gonna get it right in 5 minutes, or never.”
Neurosymbolic AI is not one thing, but many. o3’s use of neurosymbolic AI is very different from AlphaFold’s use of neurosymbolic AI. Very little of what has been tried has been discussed explicitly, and because the companies are often quite closed about what they are doing, the public science of neurosymbolic AI is greatly impoverished.
Getting to AGI will likely take still more breakthroughs. The best way to foster them is to have an intellectually open attitude. It would be great if we could see more of that.

§

Why was the industry so quick to rally around a connectionist-only approach and shut out naysayers? Why were the top companies in the space seemingly shy about their recent neurosymbolic successes?

Nobody knows for sure. But it may well be as simple as money. The message that we can simply scale our way to AGI is incredibly attractive to investors because it puts money as the central (and sufficient) force needed to advance.

Admitting that they need to rely on neurosymbolic tools would pierce the scaling narrative.

§

So here’s where we are: pure LLMs can’t reliably apply algorithms (vindicating the line of argument I first developed in the 1990s); if you enhance them with symbolic processes—yielding neurosymbolic systems—they often give better results.

OpenAI, without any sort of of public acknowledgement whatsoever, has accidentally vindicated neurosymbolic AI.

Fostering its further development may be among the best things that companies, researchers, and governments can do. Investors, take note.

Gary Marcus has been pushing for neurosymbolic AI throughout his career, since the early 1990s, and could not be more thrilled to see it start to blossom.

Subscribe now

Read the whole story

cjheinz

5 days ago

reply

It figures that Marcus said "LLM should be a tool in a toolbox" years before I did.

Lexington, KY; Naples, FL

The talking dog by Seth Godin Thursday July 17th, 2025 at 8:18 AM

Why the federal government is making climate data disappear by Guest Author Tuesday July 15th, 2025 at 7:39 PM

Tariff Tantrums by Robert Kuttner Monday July 14th, 2025 at 8:33 PM

The Media’s Pivot to AI Is Not Real and Not Going to... by Jason Kottke Monday July 14th, 2025 at 8:18 PM

The Media's Pivot to AI Is Not Real and Not Going to Work by Jason Koebler Monday July 14th, 2025 at 10:46 AM

How o3 and Grok 4 Accidentally Vindicated Neurosymbolic AI by Gary Marcus Sunday July 13th, 2025 at 4:31 PM

The talking dog by Seth Godin
Thursday July 17^th, 2025 at 8:18 AM

Why the federal government is making climate data disappear by Guest Author
Tuesday July 15^th, 2025 at 7:39 PM

Tariff Tantrums by Robert Kuttner
Monday July 14^th, 2025 at 8:33 PM

The Media’s Pivot to AI Is Not Real and Not Going to... by Jason Kottke
Monday July 14^th, 2025 at 8:18 PM

The Media's Pivot to AI Is Not Real and Not Going to Work by Jason Koebler
Monday July 14^th, 2025 at 10:46 AM

How o3 and Grok 4 Accidentally Vindicated Neurosymbolic AI by Gary Marcus
Sunday July 13^th, 2025 at 4:31 PM