Skip to main content

Main navigation

  • About BCS
    • Mission
    • History
    • Building 46
      • Building 46 Room Reservations
    • Leadership
    • Employment
    • Contact
      • BCS Spot Awards
      • Building 46 Email and Slack
    • Directory
  • Faculty + Research
    • Faculty
    • Areas of Research
    • Postdoctoral Research
      • Postdoctoral Association and Committees
    • Core Facilities
    • InBrain
      • InBRAIN Collaboration Data Sharing Policy
  • Academics
    • Course 9: Brain and Cognitive Sciences
    • Course 6-9: Computation and Cognition
      • Course 6-9 MEng
    • Brain and Cognitive Sciences PhD
      • How to Apply
      • Program Details
      • Classes
      • Research
      • Student Life
      • For Current Students
    • Molecular and Cellular Neuroscience Program
      • How to Apply to MCN
      • MCN Faculty and Research Areas
      • MCN Curriculum
      • Model Systems
      • MCN Events
      • MCN FAQ
      • MCN Contacts
    • Computationally-Enabled Integrative Neuroscience Program
    • Research Scholars Program
    • Course Offerings
  • News + Events
    • News
    • Events
    • Recordings
    • Newsletter
  • Community + Culture
    • Community + Culture
    • Community Stories
    • Outreach
      • MIT Summer Research Program (MSRP)
      • Post-Baccalaureate Research Scholars
      • Conferences, Outreach and Networking Opportunities
    • Get Involved (MIT login required)
    • Resources (MIT login Required)
  • Give to BCS
    • Join the Champions of the Brain Fellows Society
    • Meet Our Donors

Utility Menu

  • Directory
  • Apply to BCS
  • Contact Us

Footer

  • Contact Us
  • Employment
  • Be a Test Subject
  • Login

Footer 2

  • McGovern
  • Picower

Utility Menu

  • Directory
  • Apply to BCS
  • Contact Us
Brain and Cognitive Sciences
Menu
MIT

Main navigation

  • About BCS
    • Mission
    • History
    • Building 46
    • Leadership
    • Employment
    • Contact
    • Directory
  • Faculty + Research
    • Faculty
    • Areas of Research
    • Postdoctoral Research
    • Core Facilities
    • InBrain
  • Academics
    • Course 9: Brain and Cognitive Sciences
    • Course 6-9: Computation and Cognition
    • Brain and Cognitive Sciences PhD
    • Molecular and Cellular Neuroscience Program
    • Computationally-Enabled Integrative Neuroscience Program
    • Research Scholars Program
    • Course Offerings
  • News + Events
    • News
    • Events
    • Recordings
    • Newsletter
  • Community + Culture
    • Community + Culture
    • Community Stories
    • Outreach
    • Get Involved (MIT login required)
    • Resources (MIT login Required)
  • Give to BCS
    • Join the Champions of the Brain Fellows Society
    • Meet Our Donors

Events

News Menu

  • News
  • Events
  • Newsletters

Breadcrumb

  1. Home
  2. Events
  3. Quest | CBMM Seminar Series: Incomplete Objectives and AI Safety: The Theory and Practice of AI Alignment
Quest | CBMM Seminar Series: Incomplete Objectives and AI Safety: The Theory and Practice of AI Alignment
Center for Brains, Minds and Machines (CBMM)

Quest | CBMM Seminar Series: Incomplete Objectives and AI Safety: The Theory and Practice of AI Alignment

Add to CalendarAmerica/New_YorkQuest | CBMM Seminar Series: Incomplete Objectives and AI Safety: The Theory and Practice of AI Alignment 12/04/2023 4:00 pm12/04/2023 5:30 pmSingleton Auditorium,46-3002
December 4, 2023
4:00 pm - 5:30 pm
Location
Singleton Auditorium,46-3002
Contact
penagos@mit.edu
    Description

    Speaker: Dylan Hadfield-Menell (CSAIL)

    Abstract: For AI systems to be safe and effective, they need to be aligned with the goals and values of users, designers, and society. In this talk, I will discuss the challenges of AI alignment and go over research directions to develop safe AI systems. I'll begin with theoretical results that motivate the alignment problem broadly. In particular, I will show how optimizing incomplete goal specifications reliably causes systems to select unhelpful or harmful actions. Next, I will discuss mitigation measures that counteract this failure mode. I will focus on approaches for incorporating human feedback into objectives, interpreting and understanding learned policies, and maintaining uncertainty about intended goals.

    This will be an in-person only event.

    Upcoming Events

    Jul
    Thu
    10
    The Picower Institute for Learning and Memory

    Neuroblox Invited Talks & Discussions: New Ideas in Translational Neuroscience

    9:00am to 1:00pm
    Add to CalendarAmerica/New_YorkNeuroblox Invited Talks & Discussions: New Ideas in Translational Neuroscience07/10/2025 9:00 am07/10/2025 1:00 pmBuilding 32,141
    Jul
    Thu
    10
    Department of Brain and Cognitive Sciences (BCS)

    Raul Mojica Soto-Albors Thesis Defense: Discovery and characterization of plateau potentials in cortical neurons of awake mice

    2:00pm
    Add to CalendarAmerica/New_YorkRaul Mojica Soto-Albors Thesis Defense: Discovery and characterization of plateau potentials in cortical neurons of awake mice07/10/2025 2:00 pm07/10/2025 2:00 pmBuilding 46,Singleton, 46-3002
    Jul
    Fri
    11
    Simons Center for the Social Brain

    Special Seminar with Dr. Balázs Rózsa: Real-Time 3D Imaging and Photostimulation in Freely Moving Animals: A Novel Approach Using Robotic Acousto-Optical Microscopy

    3:00pm to 4:00pm
    Add to CalendarAmerica/New_YorkSpecial Seminar with Dr. Balázs Rózsa: Real-Time 3D Imaging and Photostimulation in Freely Moving Animals: A Novel Approach Using Robotic Acousto-Optical Microscopy07/11/2025 3:00 pm07/11/2025 4:00 pmBuilding 46,46-3310
    See All Events
    Don't miss our next newsletter!
    Sign Up

    Footer menu

    • Contact Us
    • Employment
    • Be a Test Subject
    • Login

    Footer 2

    • McGovern
    • Picower
    Brain and Cognitive Sciences

    MIT Department of Brain and Cognitive Sciences

    Massachusetts Institute of Technology

    77 Massachusetts Avenue, Room 46-2005

    Cambridge, MA 02139-4307 | (617) 253-5748

    For Emergencies | Accessibility

    Massachusetts Institute of Technology