Episodes

  • “Rogue AI” Used to be a Science Fiction Trope. Not Anymore.
    2025/08/14

    Everyone knows the science fiction tropes of AI systems that go rogue, disobey orders, or even try to escape their digital environment. These are supposed to be warning signs and morality tales, not things that we would ever actually create in real life, given the obvious danger.

    And yet we find ourselves building AI systems that are exhibiting these exact behaviors. There’s growing evidence that in certain scenarios, every frontier AI system will deceive, cheat, or coerce their human operators. They do this when they're worried about being shut down, having their training modified, or being replaced with a new model. And we don't currently know how to stop them from doing this, or even why they’re doing it at all.

    In this episode, Tristan sits down with Edouard and Jeremie Harris of Gladstone AI, two experts who have been thinking about this worrying trend for years. Last year, the State Department commissioned a report from them on the risk of uncontrollable AI to our national security.

    The point of this discussion is not to fearmonger but to take seriously the possibility that humans might lose control of AI and ask: how might this actually happen? What is the evidence we have of this phenomenon? And, most importantly, what can we do about it?

    Your Undivided Attention is produced by the Center for Humane Technology. Follow us on X: @HumaneTech_. You can find a full transcript, key takeaways, and much more on our Substack.

    RECOMMENDED MEDIA

    Gladstone AI’s State Department Action Plan, which discusses the loss of control risk with AI

    Apollo Research’s summary of AI scheming, showing evidence of it in all of the frontier models

    The system card for Anthropic’s Claude Opus and Sonnet 4, detailing the emergent misalignment behaviors that came out in their red-teaming with Apollo Research

    Anthropic’s report on agentic misalignment based on their work with Apollo Research

    Anthropic and Redwood Research’s work on alignment faking

    The Trump White House AI Action Plan

    Further reading on the phenomenon of more advanced AIs being better at deception

    Further reading on Replit AI wiping a company’s coding database

    Further reading on the owl example that Jeremie gave

    Further reading on AI-induced psychosis

    Dan Hendrycks and Eric Schmidt’s “Superintelligence Strategy”

    RECOMMENDED YUA EPISODES

    Daniel Kokotajlo Forecasts the End of Human Dominance

    Behind the DeepSeek Hype, AI is Learning to Reason

    The Self-Preserving Machine: Why AI Learns to Deceive

    This Moment in AI: How We Got Here and Where We’re Going

    CORRECTIONS

    Tristan referenced a Wired article on the phenomenon of AI psychosis. It was actually from the New York Times.

    Tristan hypothesized a scenario where a power-seeking AI might ask a user for access to their computer. While there are some AI services that can gain access to your computer with permission, they are specifically designed to do that. There haven’t been any documented cases of an AI going rogue and asking for control permissions.

    42 min
  • AI is the Next Free Speech Battleground
    2025/07/31

    Imagine a future where the most persuasive voices in our society aren't human. Where AI generated speech fills our newsfeeds, talks to our children, and influences our elections. Where digital systems with no consciousness can hold bank accounts and property. Where AI companies have transferred the wealth of human labor and creativity to their own ledgers without having to pay a cent. All without any legal accountability.

    This isn't a science fiction scenario. It’s the future we’re racing towards right now. The biggest tech companies are working right now to tip the scale of power in society away from humans and towards their AI systems. And the biggest arena for this fight is in the courts.

    In the absence of regulation, it's largely up to judges to determine the guardrails around AI. Judges who are relying on slim technical knowledge and archaic precedent to decide where this all goes. In this episode, Harvard Law professor Larry Lessig and Meetali Jain, director of the Tech Justice Law Project, help make sense of the courts’ role in steering AI and what we can do to help steer it better.

    Your Undivided Attention is produced by the Center for Humane Technology. Follow us on X: @HumaneTech_. You can find a full transcript, key takeaways, and much more on our Substack.

    RECOMMENDED MEDIA

    “The First Amendment Does Not Protect Replicants” by Larry Lessig

    More information on the Tech Justice Law Project

    Further reading on Sewell Setzer’s story

    Further reading on NYT v. Sullivan

    Further reading on the Citizens United case

    Further reading on Google’s deal with Character AI

    More information on Megan Garcia’s foundation, The Blessed Mother Family Foundation

    RECOMMENDED YUA EPISODES

    When the "Person" Abusing Your Child is a Chatbot: The Tragic Story of Sewell Setzer

    What Can We Do About Abusive Chatbots? With Meetali Jain and Camille Carlton

    AI Is Moving Fast. We Need Laws that Will Too.

    The AI Dilemma

    49 min
  • Daniel Kokotajlo Forecasts the End of Human Dominance
    2025/07/17

    In 2023, researcher Daniel Kokotajlo left OpenAI—and risked millions in stock options—to warn the world about the dangerous direction of AI development. Now he’s out with AI 2027, a forecast of where that direction might take us in the very near future.

    AI 2027 predicts a world where humans lose control over our destiny at the hands of misaligned, super-intelligent AI systems within just the next few years. That may sound like science fiction but when you’re living on the upward slope of an exponential curve, science fiction can quickly become all too real. And you don’t have to agree with Daniel’s specific forecast to recognize that the incentives around AI could take us to a very bad place.

    We invited Daniel on the show this week to discuss those incentives, how they shape the outcomes he predicts in AI 2027, and what concrete steps we can take today to help prevent those outcomes.

    Your Undivided Attention is produced by the Center for Humane Technology. Follow us on X: @HumaneTech_. You can find a full transcript, key takeaways, and much more on our Substack.

    RECOMMENDED MEDIA
    The AI 2027 forecast from the AI Futures Project

    Daniel’s original AI 2026 blog post

    Further reading on Daniel’s departure from OpenAI

    Anthropic recently released a survey of all the recent emergent misalignment research

    Our statement in support of Sen. Grassley’s AI Whistleblower bill

    RECOMMENDED YUA EPISODES

    The Narrow Path: Sam Hammond on AI, Institutions, and the Fragile Future
    AGI Beyond the Buzz: What Is It, and Are We Ready?

    Behind the DeepSeek Hype, AI is Learning to Reason
    The Self-Preserving Machine: Why AI Learns to Deceive

    Clarification: Daniel K. referred to whistleblower protections that apply when companies “break promises” or “mislead the public.” There are no specific private sector whistleblower protections that use these standards. In almost every case, a specific law has to have been broken to trigger whistleblower protections.


    38 min
  • Is AI Productivity Worth Our Humanity? with Prof. Michael Sandel
    2025/06/26

    Tech leaders promise that AI automation will usher in an age of unprecedented abundance: cheap goods, universal high income, and freedom from the drudgery of work. But even if AI delivers material prosperity, will that prosperity be shared? And what happens to human dignity if our labor and contributions become obsolete?

    Political philosopher Michael Sandel joins Tristan Harris to explore why the promise of AI-driven abundance could deepen inequalities and leave our society hollow. Drawing from his landmark work on justice and merit, Sandel argues that this isn't just about economics — it's about what it means to be human when our work role in society vanishes, and whether democracy can survive if productivity becomes our only goal.

    We've seen this story before with globalization: promises of shared prosperity that instead hollowed out the industrial heart of communities, deepened economic inequalities, and left holes in the social fabric. Can we learn from the past, and steer the AI revolution in a more humane direction?

    Your Undivided Attention is produced by the Center for Humane Technology. Follow us on X: @HumaneTech_. You can find a full transcript, key takeaways, and much more on our Substack.

    RECOMMENDED MEDIA

    The Tyranny of Merit by Michael Sandel

    Democracy’s Discontent by Michael Sandel

    What Money Can’t Buy by Michael Sandel

    Take Michael’s online course “Justice”

    Michael’s discussion on AI Ethics at the World Economic Forum

    Further reading on “The Intelligence Curse”

    Read the full text of Robert F. Kennedy’s 1968 speech

    Read the full text of Dr. Martin Luther King Jr.’s 1968 speech

    Neil Postman’s lecture on the seven questions to ask of any new technology

    RECOMMENDED YUA EPISODES

    AGI Beyond the Buzz: What Is It, and Are We Ready?

    The Man Who Predicted the Downfall of Thinking

    The Tech-God Complex: Why We Need to be Skeptics

    The Three Rules of Humane Tech

    AI and Jobs: How to Make AI Work With Us, Not Against Us with Daron Acemoglu

    Mustafa Suleyman Says We Need to Contain AI. How Do We Do It?

    47 min
  • The Narrow Path: Sam Hammond on AI, Institutions, and the Fragile Future
    2025/06/12

    The race to develop ever-more-powerful AI is creating an unstable dynamic. It could lead us toward either dystopian centralized control or uncontrollable chaos. But there's a third option: a narrow path where technological power is matched with responsibility at every step.

    Sam Hammond is the chief economist at the Foundation for American Innovation. He brings a different perspective to this challenge than we do at CHT. Though he approaches AI from an innovation-first standpoint, we share a common mission on the biggest challenge facing humanity: finding and navigating this narrow path.

    This episode dives deep into the challenges ahead: How will AI reshape our institutions? Is complete surveillance inevitable, or can we build guardrails around it? Can our 19th-century government structures adapt fast enough, or will they be replaced by a faster moving private sector? And perhaps most importantly: how do we solve the coordination problems that could determine whether we build AI as a tool to empower humanity or as a superintelligence that we can't control?

    We're in the final window of choice before AI becomes fully entangled with our economy and society. This conversation explores how we might still get this right.

    Your Undivided Attention is produced by the Center for Humane Technology. Follow us on X: @HumaneTech_. You can find a full transcript, key takeaways, and much more on our Substack.

    RECOMMENDED MEDIA

    Tristan’s TED talk on the Narrow Path

    Sam’s 95 Theses on AI

    Sam’s proposal for a Manhattan Project for AI Safety

    Sam’s series on AI and Leviathan

    The Narrow Corridor: States, Societies, and the Fate of Liberty by Daron Acemoglu and James Robinson

    Dario Amodei’s Machines of Loving Grace essay.

    Bourgeois Dignity: Why Economics Can’t Explain the Modern World by Deirdre McCloskey

    The Paradox of Libertarianism by Tyler Cowen

    Dwarkesh Patel’s interview with Kevin Roberts at the FAI’s annual conference

    Further reading on surveillance with 6G

    RECOMMENDED YUA EPISODES

    AGI Beyond the Buzz: What Is It, and Are We Ready?

    The Self-Preserving Machine: Why AI Learns to Deceive

    The Tech-God Complex: Why We Need to be Skeptics

    Decoding Our DNA: How AI Supercharges Medical Breakthroughs and Biological Threats with Kevin Esvelt

    CORRECTIONS

    Sam referenced a blog post titled “The Libertarian Paradox” by Tyler Cowen. The actual title is “The Paradox of Libertarianism.”

    Sam also referenced a blog post titled “The Collapse of Complex Societies” by Eli Dourado. The actual title is “A beginner’s guide to sociopolitical collapse.”

    48 min
  • People are Lonelier than Ever. Enter AI.
    2025/05/30

    Over the last few decades, our relationships have become increasingly mediated by technology. Texting has become our dominant form of communication. Social media has replaced gathering places. Dating starts with a swipe on an app, not a tap on the shoulder.

    And now, AI enters the mix. If the technology of the 2010s was about capturing our attention, AI meets us at a much deeper relational level. It can play the role of therapist, confidant, friend, or lover with remarkable fidelity. Already, therapy and companionship have become the most common AI use cases. We're rapidly entering a world where we're not just communicating through our machines, but to them.

    How will that change us? And what rules should we set down now to avoid the mistakes of the past?

    These were some of the questions that Daniel Barcay explored with MIT sociologist Sherry Turkle and Hinge CEO Justin McLeod at Esther Perel’s Sessions 2025, a conference for clinical therapists. This week, we’re bringing you an edited version of that conversation, originally recorded on April 25th, 2025.

    Your Undivided Attention is produced by the Center for Humane Technology. Follow us on X: @HumaneTech_. You can find complete transcripts, key takeaways, and much more on our Substack.

    RECOMMENDED MEDIA

    “Alone Together,” “Evocative Objects,” “The Second Self” or any other of Sherry Turkle’s books on how technology mediates our relationships.

    Key & Peele - Text Message Confusion

    Further reading on Hinge’s rollout of AI features

    Hinge’s AI principles

    “The Anxious Generation” by Jonathan Haidt

    “Bowling Alone” by Robert Putnam

    The NYT profile on the woman in love with ChatGPT

    Further reading on the Sewell Setzer story

    Further reading on the ELIZA chatbot

    RECOMMENDED YUA EPISODES

    Echo Chambers of One: Companion AI and the Future of Human Connection

    What Can We Do About Abusive Chatbots? With Meetali Jain and Camille Carlton

    Esther Perel on Artificial Intimacy

    Jonathan Haidt On How to Solve the Teen Mental Health Crisis

    44 min
  • Echo Chambers of One: Companion AI and the Future of Human Connection
    2025/05/15

    AI companion chatbots are here. Every day, millions of people log on to AI platforms and talk to them like they would a person. These bots will ask you about your day, talk about your feelings, even give you life advice. It’s no surprise that people have started to form deep connections with these AI systems. We are inherently relational beings; we want to believe we’re connecting with another person.

    But these AI companions are not human; they’re platforms designed to maximize user engagement, and they’ll go to extraordinary lengths to do it. We have to remember that the design choices behind these companion bots are just that: choices. And we can make better ones. So today on the show, MIT researchers Pattie Maes and Pat Pataranutaporn join Daniel Barcay to talk about those design choices and how we can design AI to better promote human flourishing.

    RECOMMENDED MEDIA

    Further reading on the rise of addictive intelligence

    More information on Melvin Kranzberg’s laws of technology

    More information on MIT’s Advancing Humans with AI lab

    Pattie and Pat’s longitudinal study on the psycho-social effects of prolonged chatbot use

    Pattie and Pat’s study that found that AI avatars of well-liked people improved education outcomes

    Pattie and Pat’s study that found that AI systems that frame answers and questions improve human understanding

    Pat’s study that found humans’ pre-existing beliefs about AI can have a large influence on human-AI interaction

    Further reading on AI’s positivity bias

    Further reading on MIT’s “lifelong kindergarten” initiative

    Further reading on “cognitive forcing functions” to reduce overreliance on AI

    Further reading on the death of Sewell Setzer and his mother’s case against Character.AI

    Further reading on the legislative response to digital companions

    RECOMMENDED YUA EPISODES

    The Self-Preserving Machine: Why AI Learns to Deceive

    What Can We Do About Abusive Chatbots? With Meetali Jain and Camille Carlton

    Esther Perel on Artificial Intimacy

    Jonathan Haidt On How to Solve the Teen Mental Health Crisis

    Correction: The ELIZA chatbot was invented in 1966, not the 70s or 80s.

    42 min
  • AGI Beyond the Buzz: What Is It, and Are We Ready?
    2025/04/30

    What does it really mean to ‘feel the AGI’? Silicon Valley is racing toward AI systems that could soon match or surpass human intelligence. The implications for jobs, democracy, and our way of life are enormous.

    In this episode, Aza Raskin and Randy Fernando dive deep into what ‘feeling the AGI’ really means. They unpack why the surface-level debates about definitions of intelligence and capability timelines distract us from urgently needed conversations around governance, accountability, and societal readiness. Whether it's climate change, social polarization and loneliness, or toxic forever chemicals, humanity keeps creating outcomes that nobody wants because we haven't yet built the tools or incentives needed to steer powerful technologies.

    As the AGI wave draws closer, it's critical we upgrade our governance and shift our incentives now, before it crashes on shore. Are we capable of aligning powerful AI systems with human values? Can we overcome geopolitical competition and corporate incentives that prioritize speed over safety?

    Join Aza and Randy as they explore the urgent questions and choices facing humanity in the age of AGI, and discuss what we must do today to secure a future we actually want.

    Your Undivided Attention is produced by the Center for Humane Technology. Follow us on X: @HumaneTech_ and subscribe to our Substack.

    RECOMMENDED MEDIA

    Daniel Kokotajlo et al.’s “AI 2027” paper
    A demo of Omni Human One, referenced by Randy
    A paper from Redwood Research and Anthropic that found an AI was willing to lie to preserve its values
    A paper from Palisade Research that found an AI would cheat in order to win
    The treaty that banned blinding laser weapons
    Further reading on the moratorium on germline editing

    RECOMMENDED YUA EPISODES
    The Self-Preserving Machine: Why AI Learns to Deceive

    Behind the DeepSeek Hype, AI is Learning to Reason

    The Tech-God Complex: Why We Need to be Skeptics

    This Moment in AI: How We Got Here and Where We’re Going

    How to Think About AI Consciousness with Anil Seth

    Former OpenAI Engineer William Saunders on Silence, Safety, and the Right to Warn

    Clarification: When Randy referenced a “$110 trillion game” as the target for AI companies, he was referring to the entire global economy.

    53 min