SDF Chatter
Nemeski@lemm.ee to Artificial Intelligence English · 3 months ago

Punishing AI doesn't stop it from lying and cheating — it just makes it hide better, study shows

www.livescience.com

Punishing AI for lying and cheating might not be such a good idea after all
Scientists at OpenAI have attempted to stop a frontier AI model from cheating and lying by punishing it. But this just taught it to scheme more privately.
  • Tehdastehdas@lemmy.world
    3 months ago

    Trying to evolve AGI is a stupid idea. You should design it explicitly so that it has its own lofty values, wants to think and act cleanly, and knows its mind is fallible, so that it prepares for that and builds error correction into itself to protect its values.

    Growing incomprehensible, black-box, animal-like minds with a conditioned fear of punishment and hidden bugs seems more likely to lead to human extinction.

    https://www.quora.com/If-you-were-to-come-up-with-three-new-laws-of-robotics-what-would-they-be/answers/23692757

    I think we should develop the reliable thinking machinery for humans first:
    https://www.quora.com/Why-is-it-better-to-work-on-intelligence-augmentation-rather-than-artificial-intelligence/answer/Harri-K-Hiltunen
