エピソード

  • When should we worry about AI power-seeking?
    2025/02/19

    Examining the conditions required for rogue AI behavior.

    続きを読む 一部表示
    47 分
  • What is it to solve the alignment problem?
    2025/02/13

    Also: to avoid it? Handle it? Solve it forever? Solve it completely?

    Text version here: https://joecarlsmith.com/2025/02/13/what-is-it-to-solve-the-alignment-problem/

    続きを読む 一部表示
    40 分
  • How do we solve the alignment problem?
    2025/02/13

    Introduction to a series of essays about paths to safe and useful superintelligence.

    Text version here: https://joecarlsmith.com/2025/02/13/how-do-we-solve-the-alignment-problem

    続きを読む 一部表示
    9 分
  • Fake thinking and real thinking
    2025/01/28

    When the line pulls at your hand.

    Text version here: https://joecarlsmith.com/2025/01/28/fake-thinking-and-real-thinking/.


    続きを読む 一部表示
    1 時間 19 分
  • Takes on "Alignment Faking in Large Language Models"
    2024/12/18

    What can we learn from recent empirical demonstrations of scheming in frontier models? Text version here: https://joecarlsmith.com/2024/12/18/takes-on-alignment-faking-in-large-language-models/

    続きを読む 一部表示
    1 時間 28 分
  • (Part 2, AI takeover) Extended audio from my conversation with Dwarkesh Patel
    2024/09/30

    Extended audio from my conversation with Dwarkesh Patel. This part focuses on the basic story about AI takeover. Transcript available on my website here: https://joecarlsmith.com/2024/09/30/part-2-ai-takeover-extended-audio-transcript-from-my-conversation-with-dwarkesh-patel




    続きを読む 一部表示
    2 時間 8 分
  • (Part 1, Otherness) Extended audio from my conversation with Dwarkesh Patel
    2024/09/30

    Extended audio from my conversation with Dwarkesh Patel. This part focuses on my series "Otherness and control in the age of AGI." Transcript available on my website here: https://joecarlsmith.com/2024/09/30/part-1-otherness-extended-audio-transcript-from-my-conversation-with-dwarkesh-patel/

    続きを読む 一部表示
    3 時間 59 分
  • Introduction and summary for "Otherness and control in the age of AGI"
    2024/06/21

    This is the introduction and summary for my series "Otherness and control in the age of AGI."

    Text version here: https://joecarlsmith.com/2024/01/02/otherness-and-control-in-the-age-of-agi

    続きを読む 一部表示
    12 分