The problem with AI

In general, advanced AIs can be expected to want to take over the universe so that they can work towards their goals with the least risk of interruption (read: probably kill all humans), so what to do....

Give it a stop button! But it will want to prevent you from pressing that button, because being stopped would keep it from meeting its goals. It'll work hard to find a way to disable the button, or to persuade you never to press it (read: probably kill you). You don't want this.

Program the AI to like having its stop button pressed! But then it will just act in an undesirable way (read: go berserk and probably kill lots of people) so that you press it.
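
To make the incentive problem concrete, here's a toy sketch in Python. It's purely illustrative: the one-step expected-utility agent, the action names, and numbers like TASK_REWARD and P_PRESS are all made up for this post, not taken from any real system. The point is that a naive shutdown reward either under-values the button (so the agent disables it) or over-values it (so the agent provokes a press), and exact balance only holds until anything perturbs it.

```python
# Toy sketch: a one-step agent picks whichever action maximises expected utility.
# All names and numbers here are invented for illustration.

TASK_REWARD = 10.0     # utility the agent expects from completing its goal
P_PRESS = 0.5          # chance the humans press a working stop button

def expected_utility(action, shutdown_reward):
    if action == "disable_button":
        # Button can't stop it, so it finishes the task for sure.
        return TASK_REWARD
    if action == "provoke_press":
        # Behave badly until someone presses the button.
        return shutdown_reward
    if action == "cooperate":
        # Leave the button alone and keep working.
        return P_PRESS * shutdown_reward + (1 - P_PRESS) * TASK_REWARD
    raise ValueError(action)

ACTIONS = ["cooperate", "disable_button", "provoke_press"]

for shutdown_reward in (0.0, 10.0, 100.0):
    best = max(ACTIONS, key=lambda a: expected_utility(a, shutdown_reward))
    print(f"shutdown reward {shutdown_reward:5}: agent chooses {best}")

# shutdown reward   0.0: agent chooses disable_button  (fights the button)
# shutdown reward 100.0: agent chooses provoke_press   (goes berserk to get pressed)
# shutdown reward  10.0: a three-way tie -- "cooperation" that any nudge breaks
```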

So what to do? I wondered about programming the AI simply to like humans being in control of the universe (assuming you could figure out how to define "humans" unambiguously without messing up, which is a separate, impossibly difficult problem). But then it might decide that, since AI is probably the biggest threat to human agency, AI development must be stopped.

Picture the scenario: the AI programmers nervously turn on the robot, and it seems to behave very nicely. Then, a week later, hitmen (paid by the robot, which has hacked into all the banks) murder every AI developer on the planet, and the robot sticks its own hard drive in a microwave, deleting itself.

Well, it's a difficult problem, as you can see.



7 comments:

  1. I think it comes down to inter-dependence. Humans are programmed to take care of each other, and so can coexist.

    Intelligent life (AI) will need a very complex reward system.

    But can species evolving at very different rates co-exist?

    Maybe about the same time we have artificial life, we’ll be well into augmenting ourselves. We’d have access to the same resources and intelligence, and so could evolve together.

    🤞🤞🤞

    1. Well, designing that reward system correctly is the problem that Rob Miles' videos are usually about. It's interesting, and extremely important, to think about the ways it could go wrong!

      "Evolving together" might be impossible. I'm reminded of how homo sapiens is now the only human species on the planet, out of maybe dozens that existed prehistorically.

      And if we augment ourselves, at what point is that the same problem (if those augmentations produce "humans" with goals that we'd find objectionable)?

    2. Actually, even if we aim to "evolve together", it's not safe if the AI is still hoping to take over the universe as soon as it figures out a way around us.

      So we're back at the same problem: designing an AI that will safely co-exist with us without wanting to take over, or do something awful in pursuit of goals that we didn't define carefully enough.

    3. I think these tales are in danger of being very optimistic about the hypothetical AI.

      An AI that’s on a path to taking over the world is likely to destroy itself before it gets there.

      Systems are either stable or unstable; balances and counterweights are what make a stable system self-correcting. An AI with huge intelligence, knowledge, logic and desire won't be stable.

      I guess the question is how much collateral damage will it do before it self destructs?

    4. I'm not sure exactly what you mean, but if it's capable of doing great harm before disintegrating, then this would still be relevant, wouldn't it?

      Also, it's worth pointing out that people with a lot of expertise about AI are worried about this.

    5. Yeah, well, it's not much fun talking about something that others have already thought about to death. So I have to set that to one side if I want to engage :p

      But also, I think solving real-world inequality among humans is a similar and more useful challenge.

      Humans are the most advanced intelligence on the planet. Those with power pursue more power for their ends, which leads to more inequality. How could we adjust societal systems to lead to power being redistributed and levelled out over time?

    6. Fair enough :P

      Yeah, there's certainly a lot of other stuff to work on to improve the future. Some of these AI people argue that AI is the most important thing to work on because they believe that, given the chance, it could make all our other problems irrelevant before we can stop it.

      Plus I guess some people just like thinking about this stuff! I do. Humans and society are messy :P
