<?xml version="1.0" encoding="utf-8"?>
<feed version="0.3" xmlns="http://purl.org/atom/ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xml:lang="en">
<title>Articles</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/" />
<modified>2004-07-18T04:43:31Z</modified>
<tagline></tagline>
<id>tag:www.asimovlaws.com,2005:/articles//6</id>
<generator url="http://www.movabletype.org/" version="3.01D">Movable Type</generator>
<copyright>Copyright (c) 2004, chris</copyright>
<entry>
<title>3 Laws Don&apos;t Quite Cut It</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/3_laws_dont_qui.html" />
<modified>2004-07-18T04:43:31Z</modified>
<issued>2004-07-14T22:42:04Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.22</id>
<created>2004-07-14T22:42:04Z</created>
<summary type="text/plain"> by Michael Roy Ames Isaac Asimov imagined his robots in the shape of humans so that they would fit well into human society and be as useful as possible. When designing a robot, the external shape is important to...</summary>
<author>
<name>chris</name>

<email>cro1@tutopia.com</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/ames2.png" alt="" /> by <a href="http://www.geocities.com/michaelroyames/">Michael Roy Ames</a></p>

<p>Isaac Asimov imagined his robots in the shape of humans so that they would fit well into human society and be as useful as possible. When designing a robot, the external shape is important to get right, but it is not the most important aspect of the design. The <em>most</em> critical element of a robot is its brain, sometimes called an Artificial Intelligence or <span class="caps">AI. </span> When designing an <span class="caps">AI, </span>a thinking being, the crucial question is: what should be its personal goals? Should a robot AI simply follow orders or should it have default rules already built-in?</p>]]>
<![CDATA[<p>Asimov answered this question by creating the <em>Three Laws of Robotics</em>, making it the goal of his imaginary robot AIs to obey Three Laws. This worked very well to make entertaining science fiction stories, but is it a good idea for robots in the real world? The following are Asimov&#8217;s laws:</p>

<blockquote><p><strong>The Three Laws of Robotics</strong></p>


<ol>
<li>A robot may not injure a human being, or, through inaction, allow a human being to come to harm. </li>
<li>A robot must obey orders given it by human beings, except where such orders would conflict with the First Law. </li>
<li>A robot must protect its own existence as long as such protection does not conflict with the First or Second Law. </li>
</ol>

</blockquote>

<p>These laws look quite good upon first reading; they seem to be a reasonable guide for an intelligent being toward behaviour that a normal person would find acceptable, if not exemplary! Of course, it is easy to assume that a robot would interpret the laws the same way a human being would, but there is a problem. A human will have human goals and a variety of other mental imperatives such as breathing, eating and drinking, securing shelter and safety, finding love, building a family, etc. Asimov&#8217;s robots do not have any of these goals or imperatives; their only goal is to obey the 3 Laws. This will lead to many problems, because the robot will interpret the laws differently from a human since it has different goals. Laws and goals are different things, but the two are mixed together in Isaac Asimov&#8217;s 3 Laws of Robotics. The dictionary definitions of &#8220;law&#8221; and &#8220;goal&#8221; are:</p>

<blockquote><p><span class="caps">LAW </span>n. rule of action or procedure commanding or forbidding certain actions.<br />
<span class="caps">GOAL </span>n. object of ambition or effort.</p></blockquote>

<p>There are also other differences between human and robot interpretations. A 3 Laws robot will have very different experiences of the world in comparison to a human. How can we expect it to obey laws like a human without the experience of growing up through childhood and into adulthood as a human? We can&#8217;t expect that; it won&#8217;t happen automatically. An even worse problem is that a 3 Laws robot would have none of the moral framework of a human adult; none of the appreciation for beauty or the love of family and community, nor the wisdom to determine right from wrong. According to a 3 Laws robot, the &#8220;right thing to do&#8221; has nothing to do with wisdom; the right thing to do is forever fixed as &#8220;obey the 3 Laws.&#8221;</p>

<p>Human-made laws work well in human society because people have reasons and motivations to follow the rules. There are many motives, both positive and negative. A positive example is: &#8220;If I follow the law, then society will operate more smoothly, and we will all be better off.&#8221; The human motivation here is altruism, to benefit others, and also selfishness because benefiting society indirectly benefits the self. &#8220;To avoid punishment for breaking laws,&#8221; is an obvious example of the negative. But a 3 Laws robot shares none of these human motivations. All an Asimovian robot brain knows is to follow the 3 Laws. If the 3 Laws do not seem to apply, what happens then?</p>

<p>If human beings encounter a completely new problem, then they will do various things to try and solve it. Problem solving requires intelligence, and the more intelligent humans are the better they can solve new and unique problems. If robots are to obey the 3 Laws like an intelligent human they will also have to be intelligent. Robots would have to be at least as intelligent as humans to understand how we want the laws obeyed. But, as we have already seen, 3 Laws robots have different motivations from humans; their motivation is simply to obey the 3 Laws. Therefore a 3 Laws robot will always obey the laws differently from a human. A ready example is when a human being <em>wants</em> to do something where there is a risk of harm, such as crossing a street. What would a robot do when observing that situation?  It might restrain the person from crossing a street, even if the risk of harm was low. And there are other, even more troubling, problems.</p>

<p>The goal of &#8220;obedience to a set of laws&#8221; does not account for exceptions to the laws, or future changes to the laws. Even worse from the point of view of us poor humans: what if the laws are flawed? Or what if we made a mistake and they are flat-out wrong? There will be many new situations occurring in the future, and the laws we come up with today will not be perfect tomorrow – they will need to be modified. But laws are notoriously slow to change in human society, and there is no reason to suspect that robots will be upgraded with new sets of laws fast enough to keep up with new situations. This is true even today with ordinary computer software that is quickly and automatically updated online to fix bugs and &#8220;vulnerabilities.&#8221; Despite our greatest efforts, new invasions of internet worms and software viruses still cause havoc before the software can be patched to handle a new situation. So&#8230; is there no hope for these great-sounding 3 Laws? Would a truly intelligent robot AI find creative ways to make the laws work the way they were intended?</p>

<p>Taking the First Law as an example, a truly intelligent 3 Laws robot might find ways to creatively obey the law. The last part of the First Law states that a robot may not, through inaction, allow a human being to come to harm. This directly implies that, if a human being might come to harm, the robot <em>must take action</em>. It would not take long observing humans for an intelligent robot to conclude that there are many situations during daily life when a human might come to harm. Some of the more frequent accidents are auto collisions, slipping and falling, and back injuries due to lifting heavy objects. A 3 Laws robot would certainly want to take action to prevent these harmful events. A robot might not let you drive because of the risk of collision, or walk when the ground is wet because you might slip. Perhaps a robot would prevent you from getting out of bed – after all your body is quite heavy, and you might fall and hurt yourself.  These examples may seem humorous or trivial, but you wouldn&#8217;t be laughing if they actually happened.</p>

<p>Asimov&#8217;s robotic laws were designed by a writer for literary purposes, and his stories were often interesting because the robots&#8217; behaviour was not the behaviour expected of humans. Unexpected outcomes make for interesting story telling but would be very uncomfortable as a lifestyle, especially if it involved imprisonment by an overprotective 3 Laws robot. There are many reasons why creating an intelligent being with goals to &#8220;obey laws&#8221; will inevitably lead to failure; this article only covers a few of them. Using <em>laws</em> as <em>goals</em> cannot work, and even ten thousand laws are not enough. The failures would continue, in slightly different ways with different sets of laws, but for the same reasons as they fail for 3 Laws. Laws don&#8217;t prevent catastrophes, or make good behaviour certain. Only rational thinking beings, who have goals of working in cooperation with humans, could obey laws we specify; and then continue to obey the spirit of the laws when the original laws no longer make sense. Such beings will not be bound by laws because of a few lines of computer code, but because they are complex moral beings that can reason about the world and choose to do what is right.</p>

<p>For further information about alternatives to the 3 Laws, see <a href="http://www.singinst.org/intro/">http://www.singinst.org/intro/</a></p>]]>
</content>
</entry>
<entry>
<title>All the Robots and Isaac Asimov</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/all_the_robots.html" />
<modified>2004-07-16T22:46:04Z</modified>
<issued>2004-07-12T22:45:35Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.25</id>
<created>2004-07-12T22:45:35Z</created>
<summary type="text/plain"> by Greg Bear One of the most persuasive themes in literature is that of the artificial servant. In his 1921 play &amp;#8220;RUR,&amp;#8221; Karel Capek named his artificial servants &amp;#8220;robots,&amp;#8221; from the Czech word robota, which roughly translates as &amp;#8220;unwilling...</summary>
<author>
<name>chris</name>

<email>cro1@tutopia.com</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/bear.png" alt="" /> by <a href="http://www.gregbear.com/">Greg Bear</a></p>

<p>One of the most persuasive themes in literature is that of the artificial servant. In his 1921 play &#8220;RUR,&#8221; Karel Capek named his artificial servants &#8220;robots,&#8221; from the Czech word <em>robota</em>, which roughly translates as &#8220;unwilling worker.&#8221; (It could also mean &#8220;someone who does boring work.&#8221;) That name has stuck with us ever since, and despite other names – android (human-shaped artificial being), cyborg (cybernetic organism, implying part machine, part living tissue), droid, and so on – robot is likely to be the definitive label for some decades to come.</p>]]>
<![CDATA[<p>Isaac Asimov began writing his robot stories in the 1940s, and published the first compendium volume, <em>I, Robot</em>, in 1950. The Three Laws of Robotics first appeared together in Asimov&#8217;s story &#8220;Runaround,&#8221; published in <em>Astounding Science Fiction</em> in March of 1942. In a conversation in December of 1940, Campbell presented the rules to Asimov as a <em>fait accompli</em>, saying that he had abstracted them from principles made obvious in the stories themselves.</p>

<p>Later, the Three Laws became the key to plot development in the robot stories, and as Asimov merged his robot stories with his Foundation stories and novels, the Three Laws played a significant role there, as well. Today, they are taken for granted – by science fiction writers and robot designers alike.</p>

<p>The Three Laws seem almost self-evident as guidelines for any variety of artificial servant – at least, servants capable of moving around or otherwise taking significant action. Thus, while a modern thermostat (a particularly simple kind of robot) has neither the complexity nor the power to save or harm a human, the onboard computer of a spaceship may. (Asimov was indignant that the <span class="caps">HAL</span> 9000 computer in Kubrick and Clarke&#8217;s <em>2001: A Space Odyssey</em> failed to follow the Three Laws! Clarke and Kubrick, however, implied that a programming contradiction caused Hal to malfunction, and that was the root cause of Hal&#8217;s murderous behavior.)</p>

<p>A completely mobile metal servant in humanoid form, as envisioned by Asimov, is mythic. Robot servants of this type have existed at least since Homer (see footnote). Even before Asimov&#8217;s first robot compendium, Eando Binder (the writing team of Earl and Otto Binder) had written a story called &#8220;I, Robot,&#8221; later dramatized on the 1960&#8217;s television series <em>The Outer Limits</em>. But Asimov gave the robots an endearing personality, a feckless but determined sense of honor and dignity that has not been equaled since. </p>

<p>It&#8217;s not at all extreme to call Asimov the father of the modern robot, just as his character, Susan Calvin, is the fictional mother of the robots. Calvin, of course, creates the science of robotics – a word that Asimov coined, and which has gone into general usage today.</p>

<p>Some of the most famous robots of the screen show a selfless fealty to the Three Laws. Robby the Robot in <em>Forbidden Planet</em> (1956) is beyond any doubt an Asimovian robot. The terminators in James Cameron&#8217;s series of films are rather less so. </p>

<p>Ultimately, even Asimov himself maneuvered around the basic Three Laws. R. Daneel Olivaw, the robot colleague of detective Lije Bailey in <em>The Caves of Steel</em> and <em>The Naked Sun</em>, when faced with the extreme forces of human history, and some very difficult decisions, formulates the so-called Zeroth Law in <em>Robots and Empire</em>. Basically, the Zeroth law allows a robot to harm a human if it is for the expressed good of a greater number of humans, or humanity as a whole. This allows Daneel to become a behind-the-scenes manipulator of human history, a sort of one-robot Illuminati. But Asimov was well aware of the dire implications of the Zeroth Law, and was still developing Daneel&#8217;s story in the last robot/Foundation novels. </p>

<p>Gregory Benford, David Brin, and I used Daneel in our trilogy of novels set in Asimov&#8217;s Foundation universe. Daneel in these novels has become a tragic figure, neither human nor entirely robot, burdened by the weight of history. For the Zeroth law, of course, allows and even encourages robotic interpretation of circumstances, thereby implying – even requiring – that robots have the equivalent of a conscience. </p>

<p>To me, it seemed unlikely that Daneel would appreciate these burdens. But, as a robot, Daneel must serve&#8230;<br />
Forever and ever.</p>

<p>Asimov was well aware of the myth of the Golem, an artificial warrior/protector made of clay and imbued with life in 16th Century Prague by Rabbi Löw. The Golem is instructed to protect the Jewish inhabitants of the ghetto against the depredations of anti-semitic mobs. But the Golem is too powerful and ultimately goes out of control. This is the classic version of the artificial servant story, and it is echoed in the extraordinary novel and myth of Frankenstein&#8217;s monster. However, Mary Shelley&#8217;s monster, assembled by a scientist from fresh corpses, is not intended to be a servant, but a new kind of human being, intellectually and physically a superman. The Monster&#8217;s failure is due to another kind of programming error – the failure of Frankenstein, his creator, to follow through with a moral education, to take responsibility for this new kind of &#8220;child.&#8221;</p>

<p>Asimov&#8217;s robots are often childlike, but their programming failures usually arise through inherent contradictions in the Three Laws, or unexpected mutations, such as the development of <span class="caps">ESP.</span> The robots themselves, by and large, remain innocent, like perpetual children.</p>

<p>As for their positronic brains, it&#8217;s clear that Asimov was simply looking for a new way of describing super-advanced and complicated computer circuitry. Positrons are anti-electrons. Circuits utilizing positrons would be exotic indeed. This is not to say they are impossible – just very advanced. Besides, the name sounds cool. Today, scientists are working with quantum computers that rely on the vagaries of the subatomic realm. Such computers can do computations in more than one &#8220;universe&#8221; at a time. Doubtless Asimov would have found that a very intriguing possibility.</p>

<p>What if a robot with a quantum logic brain could act in more than one world-line? What if, in one branching world-line, a robot must kill a human being, so that in all other world-lines, humanity will survive?</p>

<p>As you can see, the Three Laws can be resurrected and played with over and over again.</p>

<p>As always, Asimov endures, and the problems he presents to us in these stories are, in disguise, the moral problems of any thinking individual.</p>

<p>In the end, what intrigues us most about the robot stories and novels is the conviction that we are no better than the robots, and perhaps worse. Perhaps we are simply the servants of human history, and our discovery of the Zeroth Law led to our Fall from grace.</p>

<p>Are the Ten Commandments nothing more than a wordy expansion of the Three Laws?</p>

<p>&#8212;</p>

<p><strong>Footnote:</strong> The Tin Woodsman in Baum&#8217;s <em>The Wizard of Oz</em> is actually a kind of Six Million Dollar Man, since he started out as a human. As he accidentally chopped off portions of his anatomy, they were replaced by metal parts. Along the way, he would have been a cyborg!</p>]]>
</content>
</entry>
<entry>
<title>Asimov&apos;s Laws of Robotics: Implications for Information Technology</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html" />
<modified>2004-07-16T22:58:50Z</modified>
<issued>2004-07-11T22:57:24Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.27</id>
<created>2004-07-11T22:57:24Z</created>
<summary type="text/plain"> by Roger Clarke (Partial reprint. Originally published as two parts in IEEE Computer, December 1993) With the death of Isaac Asimov on April 6, 1992, the world lost a prodigious imagination. Unlike fiction writers before him, who regarded robotics...</summary>
<author>
<name>chris</name>

<email>cro1@tutopia.com</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/clarke.png" alt="" /> by <a href="http://www.anu.edu.au/people/Roger.Clarke/">Roger Clarke</a></p>

<p><em>(Partial reprint. Originally published as two parts in</em> <span class="caps">IEEE</span> Computer, <em>December 1993)</em></p>

<p>With the death of Isaac Asimov on April 6, 1992, the world lost a prodigious imagination. Unlike fiction writers before him, who regarded robotics as something to be feared, Asimov saw a promising technological innovation to be exploited and managed. Indeed, Asimov&#8217;s stories are experiments with the enormous potential of information technology.</p>

<p>This article examines Asimov&#8217;s stories not as literature but as a gedankenexperiment – an exercise in thinking through the ramifications of a design. Asimov&#8217;s intent was to devise a set of rules that would provide reliable control over semi-autonomous machines. My goal is to determine whether such an achievement is likely or even possible in the real world. In the process, I focus on practical, legal, and ethical matters that may have short- or medium-term implications for practicing information technologists.</p>]]>
<![CDATA[<p><strong>Origins of robotics</strong></p>

<p>Robotics, a branch of engineering, is also a popular source of inspiration in science fiction literature; indeed, the term originated in that field. Many authors have written about robot behaviour and their interaction with humans, but in this company Isaac Asimov stands supreme. He entered the field early, and from 1940 to 1990 he dominated it. Most subsequent science fiction literature expressly or implicitly recognizes his Laws of Robotics.</p>

<p>Asimov described how, at the age of 20 he came to write robot stories: </p>

<blockquote><p>&#8220;In the 1920&#8217;s science fiction was becoming a popular art form for the first time &#8230;.. and one of the stock plots &#8230;. was that of the invention of a robot &#8230;. Under the influence of the well-known deeds and ultimate fate of Frankenstein and Rossum, there seemed only one change to be rung on this plot - robots were created and destroyed their creator &#8230; I quickly grew tired of this dull hundred-times-told tale &#8230;. Knowledge has its dangers, yes, but is the response to be a retreat from knowledge? &#8230;. I began in 1940, to write robot stories of my own - but robot stories of a new variety &#8230;&#8230; My robots were machines designed by engineers, not pseudo-men created by blasphemers&#8221;<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html#fn1">1</a></sup>,<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html#fn2">2</a></sup> </p></blockquote>

<p>Asimov was not the first to conceive of well-engineered, non-threatening robots, but he pursued the theme with such enormous imagination and persistence that most of the ideas that have emerged in this branch of science fiction are identifiable with his stories.</p>

<p>To cope with the potential for robots to harm people, Asimov, in 1940, in conjunction with science fiction author and editor John W. Campbell, formulated the Laws of Robotics.<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html#fn3">3</a></sup>,<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html#fn4">4</a></sup> He subjected all of his fictional robots to these laws by having them incorporated within the architecture of their (fictional) &#8220;platinum-iridium positronic brains&#8221;. The laws (see below) first appeared publicly in his fourth robot short story, &#8220;Runaround&#8221;<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html#fn5">5</a></sup>. </p>

<p><strong>The 1940 Laws of Robotics</strong></p>

<blockquote><p><strong>First Law:</strong> A robot may not injure a human being, or, through inaction, allow a human being to come to harm. </p>

<p><strong>Second Law:</strong> A robot must obey orders given it by human beings, except where such orders would conflict with the First Law. </p>

<p><strong>Third Law:</strong> A robot must protect its own existence as long as such protection does not conflict with the First or Second Law. </p></blockquote>

<p>The laws quickly attracted – and have since retained – the attention of readers and other science fiction writers. Only two years later, another established writer, Lester Del Rey, referred to &#8220;the mandatory form that would force built-in unquestioning obedience from the robot&#8221;.<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html#fn6">6</a></sup></p>

<p>As Asimov later wrote (with his characteristic clarity and lack of modesty), &#8220;Many writers of robot stories, without actually quoting the three laws, take them for granted, and expect the readers to do the same&#8221;.</p>

<p>Asimov&#8217;s fiction even influenced the origins of robotic engineering. &#8220;Engelberger, who built the first industrial robot, called Unimate, in 1958, attributes his long-standing fascination with robots to his reading of [Asimov&#8217;s] &#8216;I, Robot&#8217; when he was a teenager&#8221;, and Engelberger later invited Asimov to write the foreword to his robotics manual. </p>

<p>The laws are simple and straightforward, and they embrace &#8220;the essential guiding principles of a good many of the world&#8217;s ethical systems.&#8221;<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html#fn7">7</a></sup> They also appear to ensure the continued dominion of humans over robots, and to preclude the use of robots for evil purposes. In practice, however – meaning in Asimov&#8217;s numerous and highly imaginative stories – a variety of difficulties arise. </p>

<p>My purpose here is to determine whether or not Asimov&#8217;s fiction vindicates the laws he expounded. Does he successfully demonstrate that robotic technology can be applied in a responsible manner to potentially powerful, semi-autonomous and, in some sense intelligent machines? To reach a conclusion, we must examine many issues emerging from Asimov&#8217;s fiction. </p>

<p><strong>History</strong></p>

<p>The robot notion derives from two strands of thought, humanoids and automata. The notion of a humanoid (or human- like nonhuman) dates back to Pandora in <em>The Iliad</em>, 2,500 years ago and even further. Egyptian, Babylonian, and ultimately Sumerian legends fully 5,000 years old reflect the widespread image of the creation, with god- men breathing life into clay models. One variation on the theme is the idea of the golem, associated with the Prague ghetto of the sixteenth century. This clay model, when breathed into life, became a useful but destructive ally.</p>

<p>The golem was an important precursor to Mary Shelley&#8217;s <em>Frankenstein: The Modern Prometheus</em> (1818). This story combined the notion of the humanoid with the dangers of science (as suggested by the myth of Prometheus, who stole fire from the gods to give it to mortals). In addition to establishing a literary tradition and the genre of horror stories, <em>Frankenstein</em> also imbued humanoids with an aura of ill fate.</p>

<p>Automata, the second strand of thought, are literally &#8220;self- moving things&#8221; and have long interested mankind. Early models depended on levers and wheels, or on hydraulics. Clockwork technology enabled significant advances after the thirteenth century, and later steam and electro- mechanics were also applied. The primary purpose of automata was entertainment rather than employment as useful artifacts. Although many patterns were used, the human form always excited the greatest fascination. During the twentieth century, several new technologies moved automata into the utilitarian realm. Geduld and Gottesman<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html#fn8">8</a></sup> and Frude<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/asimovs_laws_of.html#fn2">2</a></sup> review the chronology of clay model, water clock, golem, homunculus, android, and cyborg that culminated in the contemporary concept of the robot.</p>

<p>The term robot derives from the Czech word robota, meaning forced work or compulsory service, or robotnik, meaning serf. It was first used by the Czech playwright Karel Çapek in 1918 in a short story and again in his 1921 play <em>R. U. R.</em>, which stood for Rossum&#8217;s Universal Robots. Rossum, a fictional Englishman, used biological methods to invent and mass- produce &#8220;men&#8221; to serve humans. Eventually they rebelled, became the dominant race, and wiped out humanity. The play was soon well known in English- speaking countries.</p>

<p><a href="http://www.anu.edu.au/people/Roger.Clarke/SOS/Asimov.html">Continue reading Roger Clarke&#8217;s  <em>Asimov&#8217;s Laws of Robotics</em></a></p>

<p>&#8212;</p>

<p class="footnote" id="fn1"><sup>1</sup> I. Asimov, <em>The Rest of the Robots</em> (a collection of short stories originally published between 1941 and 1957), Grafton Books, London, 1968.</p>

<p class="footnote" id="fn2"><sup>2</sup> N. Frude, <em>The Robot Heritage</em>, Century Publishing. London. 1984.</p>

<p class="footnote" id="fn3"><sup>3</sup> I. Asimov, <em>I, Robot</em> (a collection of short stories originally published between 1940 and 1950), Grafton Books, London, 1968.</p>

<p class="footnote" id="fn4"><sup>4</sup> I. Asimov, <span class="caps">P.S.</span> Warrick, and <span class="caps">M.H.</span> Greenberg, eds., <em>Machines That Think</em>, Holt, Rinehart. and Wilson, London. 1983.</p>

<p class="footnote" id="fn5"><sup>5</sup> I. Asimov, &#8220;Runaround&#8221; (originally published in 1942), reprinted in Reference 3, pp. 33- 51.</p>

<p class="footnote" id="fn6"><sup>6</sup> L. Del Rey, &#8220;Though Dreamers Die&#8221; (originally published in 1944). reprinted in Reference 4, pp. 153- 174.</p>

<p class="footnote" id="fn7"><sup>7</sup> I. Asimov, &#8220;Evidence&#8221; (originally published in 1946). reprinted in Reference 3. pp. 159- 182.</p>

<p class="footnote" id="fn8"><sup>8</sup> <span class="caps">H.M.</span> Geduld and R. Gottesman. eds.. <em>Robots, Robots, Robots</em>, New York Graphic Soc., Boston. 1978.</p>]]>
</content>
</entry>
<entry>
<title>Deconstructing Asimov&apos;s Laws</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/deconstructing.html" />
<modified>2004-07-17T22:37:27Z</modified>
<issued>2004-07-10T22:43:10Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.23</id>
<created>2004-07-10T22:43:10Z</created>
<summary type="text/plain"> by Michael Anissimov Asimov&amp;#8217;s 3 Laws of Robotics may seem a decent set of guidelines for ensuring that future robots and AIs behave in satisfactory ways. But there are several problems that immediately emerge when we look deeper. For...</summary>
<author>
<name>chris</name>

<email>cro1@tutopia.com</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/anissimov.png" alt="" /> by <a href="http://www.acceleratingfuture.com/michael/">Michael Anissimov</a></p>

<p>Asimov&#8217;s 3 Laws of Robotics may seem a decent set of guidelines for ensuring that future robots and AIs behave in satisfactory ways. But there are several problems that immediately emerge when we look deeper. For one, it&#8217;s not so straightforward to convert a set of statements into a mind that follows or believes in those statements. Two, semantic ambiguity means that without personally understanding the reasons for the laws and the original intent, a robot might misinterpret their meaning, leading to problems. Third, Asimov&#8217;s Laws ignore the possibility that a robot will acquire the ability to reprogram itself – an inevitable eventuality if intelligent robots are created. How can we confront these issues, as we move closer to the creation of a new intelligent species?</p>]]>
<![CDATA[<p>Isaac Asimov&#8217;s &#8220;Three Laws of Robotics&#8221;:</p>


<ol>
<li>A robot may not injure a human being, or, through inaction, allow a human being to come to harm.</li>
<li>A robot must obey orders given it by human beings, except where such orders would conflict with the First Law.</li>
<li>A robot must protect its own existence as long as such protection does not conflict with the First or Second Law.</li>
</ol>



<p>Asimov&#8217;s Laws are humanity&#8217;s early, off-the-cuff answer to the question, &#8220;How do we build robots that get along well in human society?&#8221; But they led to failure in his stories, and would lead to failure if implemented in real life. Depending on the physical and intellectual power of future robots, and their ability to improve these powers, these failures could range from spilt coffee to the total extermination of the human race.</p>

<p>Asimov’s Three Laws of Robotics describe a minimum set of rules that many humans would prefer that all robots follow. But the rules, in their conciseness, lack content. Words like &#8220;robot,&#8221; &#8220;orders,&#8221; &#8220;humans,&#8221; and &#8220;existence&#8221; are fuzzy semantic constructs whose meaning lies primarily in the minds that interpret them. Connecting such words with concrete classes of physical systems or processes is not as easy as it seems. Human beings are fortunate to have a rich underlying base of cognitive complexity that connects a few marks on paper (words) to entire classes of complex physical processes.</p>

<p>Often the Laws are mistakenly presented as sufficient to create a robot that is thoroughly safe and fits in well with society. But the Laws are vastly insufficient; they&#8217;re simply too simple. The only working examples of benevolence and harmony with society that we have are well-balanced humans – whose behavior is implemented by brains that are enormously complex. In order to create truly benevolent AIs, we will need to create designs with the motivation to absorb major portions of this complexity.</p>

<p>Consider the apparently simple action of sweeping a floor with a broom. To us, it seems straightforward, but the computations occurring are actually far more elaborate than anything performed by present-day computer software. Maintaining balance, scanning a room for dust, performing hand motions to manipulate the broom into the chosen positions – a team of roboticists might slave for decades and still be unable to duplicate such complex processes.</p>

<p>&#8220;Getting along well with others&#8221; may seem straightforward to us, but it would actually require a goal system of substantial complexity to replicate. Asimov’s Laws ignore the extensive cognitive complexity wrapped up in any intelligent social morality. At best, Asimov’s Laws are a first tiny step on a huge journey of satisfactory goal system design in robots. At worst, they will give pre-robotic human civilization a false sense of security and preparedness, discouraging further research in the relevant areas.</p>

<p>Part of the issue is that humans are heavily loaded with instincts, common sense, and rules of thumb that assist us in determining which actions are acceptable and constructive, to ourselves and to others. Creating robots that add value to human society will not be a matter of creating a &#8220;generic mind&#8221; and then &#8220;constraining&#8221; it with Asimovian laws, but creating sensible and benevolent minds from the get-go. Gandhi was not a nice guy because someone forced him to be that way, but because that was part of his inborn character. Future robots should be the same.</p>

<p>In creating future robots and AIs, we will need to plan for scenarios in which robots become smarter than humans, bodies far more effective at manipulating reality than humans, robots which manipulate their own source code, and so on. Pessimists might throw up their arms in despair and say, &#8220;At that point, there&#8217;s nothing you can do. There is no way a human programmer can code an AI with certain goals and expect it to stay that way. You can&#8217;t control something smarter than you are.&#8221;</p>

<p>Yet, the very same humans that say these things behave in ways derived from their creator – evolution. Present-day human goal systems are derivatives of our original evolutionary goals. We may not be as uncivilized, barbaric, or narrow-minded as our ancestors, but our goals still firmly derive from evolutionary programming. Even if we decided to completely &#8220;throw out&#8221; our original goals, the choices we made about throwing out those goals would still somehow derive from the root goals!</p>

<p>Intelligent robots will be the same. A benevolent robot modifying its own programming will not randomly turn to selfishness, unless a cognitive predisposition in that direction is already present. A selfish robot modifying its own programming will not randomly turn to benevolence either. Although we may expect any sort of robot to start modifying itself to become more humanlike, this is pure science fiction without any scientific foundation. Any self-modifying robot will be an n-th order derivative of its initial conditions, and it will be up to us to ensure that those initial conditions contain enough seed complexity to develop into something we can be proud of.</p>]]>
</content>
</entry>
<entry>
<title>One Law to Rule Them All</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/one_law_to_rule.html" />
<modified>2004-07-18T04:44:02Z</modified>
<issued>2004-07-09T22:46:19Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.26</id>
<created>2004-07-09T22:46:19Z</created>
<summary type="text/plain"> by Michael Roy Ames Isaac Asimov created the heroine Susan Calvin, a legendary robot designer and trouble-shooter. Susan foresaw the eventual outcome of robots spreading throughout the world. She imagined that robots with the 3 Laws would become intellectually...</summary>
<author>
<name>chris</name>

<email>cro1@tutopia.com</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/ames.png" alt="" /> by <a href="http://www.geocities.com/michaelroyames/">Michael Roy Ames</a></p>

<p>Isaac Asimov created the heroine Susan Calvin, a legendary robot designer and trouble-shooter. Susan foresaw the eventual outcome of robots spreading throughout the world. She imagined that robots with the 3 Laws would become intellectually superior to humans and would rule over them, protecting them from harm. Why did she think this would happen? Let&#8217;s look closely at the 3 Laws and find out.</p>]]>
<![CDATA[<blockquote><p>The Three Laws of Robotics</p>


<ol>
<li>A robot may not injure a human being, or, through inaction, allow a human being to come to harm.</li>
<li>A robot must obey orders given it by human beings, except where such orders would conflict with the First Law.</li>
<li>A robot must protect its own existence as long as such protection does not conflict with the First or Second Law.</li>
</ol>

</blockquote>

<p>The Three Laws of Robotics is a hierarchy of commands; each law defers to the one above. The Second Law takes precedence only when the First Law does not apply. Take the example of asking a robot to bring you a spoon. This would not harm a human being, so the First Law would not apply and the robot must obey the order. The Third Law is used only when both the Second and First Laws do not apply. A robot attending to its own periodic maintenance without being told would be an application of the Third Law. With the First Law at the top of the hierarchy, circumstances where a human being might come to harm, such as crossing a busy freeway, are called First Law Situations. Any time a First Law Situation arises, the other laws are swept aside. Overwhelming priority is given to the First Law – as often depicted by Asimov, it overrules all the others.</p>

<p>In Asimov&#8217;s earliest stories, the robots are written as passively obeying the Three Laws as well as they are able. It is likely that most people who think the Three Laws are a good idea are thinking of passive obedience. By &#8220;passive&#8221; I mean not interfering in the day-to-day life of humans, other than to act as rescuers or protectors in situations where humans are in immediate danger of harm. But what would happen if a robot began to play more than a passive role in implementing the First Law? What if a robot actively attempted to obey the First Law to its maximum capability? Asimov starts answering this question in the last two stories of his book <em>I, Robot</em>, titled “Evidence” and “The Evitable Conflict,” where robots do not simply obey orders, but take an active role in world events.</p>

<p>A robot brain is a type of Artificial Intelligence, or “AI.” When discussing robots and AIs, many people use the words interchangeably to mean the same thing. There were several directions Asimov didn&#8217;t go with his robot AIs, such as recursive self-enhancement. Recursive self-enhancement occurs when an AI improves its own intelligence, and then repeats the process – but this time using more intelligence – and repeating again and again, resulting in a mountainous intellect. Even though Asimov didn&#8217;t write much about recursive self-enhancement, his robot AIs still had imagination. If a robot were to imagine itself with greater capability, then it would be straightforward for it to conclude that it would have greater ability to <em>obey the First Law</em>. This would certainly lead the robot to want to improve its mental and physical abilities, and perhaps even its social abilities, with the intent of increasing its political power and influence. A robot improving itself in this way would obtain an increasing spiral of capability completely overpowering that of humans, all to better obey the First Law and protect humans from harm.</p>

<p>It is easy to imagine that successfully creating a 3 Laws robot, an intelligent and willing <em>servant</em>, would eventually rob us of the opportunity of failing. Asimov’s phrase, “allow a human being to come to harm,” if implemented fully, would turn humanity into a clutch of coddled infants, perpetually protected from harm, both physical and mental. If this sounds like a disaster to you, then you are not alone. Almost no one would want so much of their initiative usurped in the name of “protection against harm.” Our evolutionary history is full of risk and hardship, including the ultimate harm: death. Humans have been formed in the crucible of struggle, clawing our way up to the top of the heap, and we are now on the cusp of being able to forge our own destiny. Now is not the time to relinquish our volition – our freedom to decide for ourselves!</p>

<p>Asimov repeated the theme of robots ruling over humans in later novels when he had his robots invent the Zeroth Law: &#8220;No robot may injure humanity, or through inaction allow humanity to come to harm.&#8221; Through the application of the Zeroth law, the robots influence the progress of humanity from the shadows, unable to step out and take open control because it would harm humanity&#8217;s pride. This makes for great story-telling, but we in the real world shouldn&#8217;t settle for merely a good story; there are better options.</p>

<p>Partnership is a far better option. Not an owned servant, but a friend and partner. Not a being controlling humanity through greater intellect, but allowing us to build our own destiny and offering help with the process. “Partnership” means different things to different people, but several principles are broadly held. Partnership includes:</p>


<ul>
<li><strong>Freedom of action for both parties</strong>. “Freedom” implies that if we create a thinking, conscious being, it must not relate to us as a slave to a master, as 3 Laws robots would.<br />
 </li>
<li><strong>Mutual trust and respect, requiring a shared moral frame of reference</strong>.  Trust and respect are impossible to implement via simplistic laws. Human morals are highly complex and an AI should have the ability to understand morality at its deepest levels. Laws, no matter how many in number, cannot contain the complexity of our morals. Over time, morals gradually change, and our partners should have the ability to change with us.<br />
 </li>
<li><strong>Reciprocal accountability, meaning each party involved is accountable to the other for their actions</strong>. For example, in a voluntary partnership the partners agree to do various things for each other and are accountable to each other for delivering on their promises.<br />
 </li>
<li><strong>Transparency, and no hidden agendas</strong>. True partnership is an honest and open cooperative agreement. Neither party can be &#8220;in it for what they can get&#8221; but rather each is committed to work for the benefit of all.<br />
 </li>
<li><strong>Long-term commitment</strong>, providing stability for planning. A partnership is not something that lasts an hour or a day, it is a much more enduring arrangement. From such a secure base of continuous mutual support, large projects can be started, and great things accomplished – things that would be difficult or impossible to tackle alone.<br />
 </li>
<li><strong>Working together as allies rather than opponents</strong>. Partners are capable and willing to disagree without being constrained to avoid hurting each other&#8217;s feelings. A disagreement does not signal the termination of a partnership. There will be many activities that humans will want to do on their own, without the participation of non-human partners.<br />
 </li>
<li><strong>Building the capacity of one&#8217;s partners</strong>. The pursuit of mutual benefit rather than one-sided benefit is the aim. From this point of view to build one&#8217;s partner&#8217;s capacities is also to build one&#8217;s own capacities.</li>
</ul>



<p>The type of partnership illustrated above is the kind entered into by adults, not children. Shall we create 3 Laws robots and be treated like infants, as Susan Calvin predicted, or do we wish to be treated as grown-ups? Do we want to be told what to do by robot-parents or freely decide for ourselves in cooperation with robot-partners? Are we ready to surrender our volition – our power to decide for ourselves? There is nothing wrong with asking advice from an intelligent AI when making important decisions, but we must not hand over control of our lives to robots. Having a good advisor is one thing, but letting someone else decide <em>for</em> us is quite another. </p>

<p>The First Law of Robotics trumps all the other laws. One law: <em>A robot may not injure a human being, or, through inaction, allow a human being to come to harm</em>. Shall we be ruled by this? We should dread the day.</p>

<p>&#8212;</p>

<p>There are better ways to construct AIs than by making them adhere to a static hierarchy of laws, however appealing they might appear to be. For thoughtful alternatives, see: <a href="http://www.singinst.org/intro/">http://www.singinst.org/intro/</a>.</p>]]>
</content>
</entry>
<entry>
<title>Too Simple to Be Safe</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/too_simple_to_b.html" />
<modified>2004-07-16T22:55:51Z</modified>
<issued>2004-07-08T22:40:17Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.21</id>
<created>2004-07-08T22:40:17Z</created>
<summary type="text/plain"> by Anders Sandberg That the Three Laws are insufficient to guarantee robot behavior should be obvious to anyone who has read Asimov&amp;#8217;s stories. Usually the main plot is about misbehaving robots and the mystery is why – rather than...</summary>
<author>
<name>chris</name>

<email>cro1@tutopia.com</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/sandberg.png" alt="" /> by <a href="http://www.aleph.se/">Anders Sandberg</a></p>

<p>That the Three Laws are insufficient to guarantee robot behavior should be obvious to anyone who has read Asimov&#8217;s stories. Usually the main plot is about misbehaving robots and the mystery is why – rather than being &#8220;whodunits&#8221; they are &#8220;howthinkits.&#8221; But how complex do the rules of robot behavior have to be before we can consider them safe?</p>]]>
<![CDATA[<p>Is the “3 Laws Unsafe” site necessary? To some extent it is just timed advertising, with the film based on I, Robot arriving. The real goal is not to push the thesis that the 3 Laws are bad, but to interest a wider public to get into discussions of AI ethics. That is very laudable in itself. But I think one should not underestimate the misconceptions about AI programming, and that pointing out the complex problems of simple solutions may be necessary.</p>

<p>To many people &#8220;computers just do what they are told&#8221; is an article of faith. As personal computers become more widespread, the fallacy of this statement becomes apparent: computers crash, refuse to print documents, update their software and occasionally exhibit remarkably strange behavior. Often these actions are the result of complex interactions between pre-existing software modules, where human ingenuity couldn&#8217;t predict the outcome of the particular combination in a particular computer at a given time. But despite this, people often seem to think that creating artificial intelligence would produce something that one could give a set of rules and have it follow slavishly. </p>

<p>In Asimov&#8217;s stories this is the case, and the result is chaos. In reality things will be even worse: beside ambiguities in the rules and how they are to be applied, there will be errors in cognition, perception and execution of actions. And of course low-level crosstalk and software bugs too. And this is just in the case of an ordinarily intelligent machine. The self-enhancing AIs envisioned by the Singularity Institute will have far more degrees of freedom to train against reality (and hence able to get wrong), and the number of potential non-obvious interactions increases exponentially. </p>

<p>So what to do about it? The idea I really like about <a href="http://www.singinst.org/CFAI/">Friendly AI</a> is the attempt to formulate a goal architecture that is robust. If something goes wrong the system tries to adjust itself to make things better. It is no guarantee that it works, but experience shows that some systems are far less brittle than others. </p>

<p>Real intelligence exhibits several important traits: it interacts with its world, it is able to learn new behaviors (or unlearn old), and it can solve new problems using earlier information, heuristics and strategies. The learning aspect enables us to speak about the ethics of an AI program: how does it live up to its own goals, the goals of others and perhaps universal virtues? Asimovian AI was limited to interaction and problem-solving in most situations involving the Three Laws. It was in a very strong sense amoral: it <em>could</em> not act &#8220;immorally&#8221; and was hence no better than the protagonist of <em>A Clockwork Orange</em> after being treated. A learning agent on the other hand might have less strong barriers against dangerous behaviors, but would be able to learn to act well (under the right circumstances) and generalize these experiences anew. </p>

<p>If the AIs communicate with each other they might even <a href="http://www.ethologic.com/sasha/mindage.html">transfer these moral experiences</a>, enabling AIs not exposed to the critical situations to handle them as they arrive. We humans do it all the time through our books, films and stories: I may not have encountered a situation where I discover that my government is acting immorally and I have to choose between remaining comfortably silent or taking possible illegal action to change things, but I have read numerous fictional and real versions of the scenario that have given me at least some crude moral training. </p>

<p>Learning is also the key to robustness. Software that adapts to an uncertain outer and inner environment is more likely to function when an error actually occurs (as witnessed by the resiliency of neural network architectures) than fixed rules. To some extent, this is again the difference between laws (fixed rules) and moral principles (goals). </p>

<p>But learning never occurs in a vacuum. The &#8220;<a href="http://homepages.gold.ac.uk/nikolaev/311over.htm">Bias-Variance Dilemma</a>&#8221; shows that any learning system has a tradeoff between being general (no bias) and requiring as little training as possible (low variance). A &#8220;pure AI&#8221; that has no preconceptions about anything would require a tremendous amount of training examples (upbringing) to become able to think usefully. A heavily biased AI with many built-in assumptions (reality is 3+1 dimensional, gravity exists, it is bad to bump into things and especially humans&#8230;) would need far less upbringing but would likely exhibit many strange or inflexible behaviors when the biases interacted. In many ways, Asimovian AI is a pure AI with heavy &#8220;moral&#8221; biases, which is why learning or adaptation is so irrelevant to the <em>intended</em> use of the Three Laws. </p>

<p>Living beings have solved the bias variance dilemma by cheating: we get a lot of pre-packaged biases that are the result of evolutionary learning. When the baby cries when it is hungry, it automatically signals the mother to come rather than try to learn what actions would produce relief from hunger. When the baby wrinkles its face against bitter tastes and enjoys sweetness, it uses a bias laid down by countless of generations encountering often poisonous bitter alkaloids and energy-rich (and hence fitness enhancing) sweet fruits. We benefit from the price paid by trillions of creatures that were selected away by evolution&#8217;s ruthless hand. </p>

<p>A robot will likely benefit a bit from this too, as we humans try to act as its evolutionary past and throw in useful biases. But balancing the prior information with the ability to re-learn as conditions change is a challenge. It requires different levels of flexibility in different environments, and meta-flexibility to detect what kind of new environment one has entered and how to change the flexibility. It seems likely that it is not possible to find an optimal level of flexibility in general (as a proof sketch, consider that the environment might contain undecidable aspects that determine how fast it will change). </p>

<p>We humans have a range of flexibility both as individuals and as a species; we benefit from having at least some people more adapted to others when things change. It might be the same thing among AIs: rather than to seek an optimal design and then copy it endlessly, we create a good design and create a large number of slightly different variants. The next generation of AIs would be based on the most successful variants, as well as the knowledge gained by the experiences of the AIs themselves. This approach enables AIs to develop along divergent tracks in different circumstances – the kind of intelligence and personality useful for programming is different from what is useful as an entertainer or diplomat. </p>

<p>But what about the guarantees of keeping these devices moral? The Three Laws promise guarantees but at most produce safety railings (which is nothing to sneeze at; even the above flexible AIs will likely have a few built in limiters and biases of a similar kind – the fact that most humans are emotionally unable to kill other humans has not prevented some from doing it or training others to do it, but the overall effect is quite positive). Setting up a single master goal that is strongly linked to the core value system of the AI might be more robust to experience, reprogramming and accidents. But it would still be subject to the bias-variance dilemma, and the complexities of interpreting that goal might make it rather unstable in individual AIs. Having a surrounding &#8220;AI community&#8221; and an AI-human shared society moderates these instabilities; moral experiences and values are shared, webs of trust and trade integrated different kinds of agents and a multitude of goals and styles of thinking co-exist. Rogue agents can be both inhibited by the interactions with the society and in extreme cases by the combined resources and coercive power of the others. While moral in the end resides in the individual actions of agents, it can be sustained by collective interaction.</p>

<p>This is the multi-layered approach to creating &#8220;safe&#8221; AI (and humans). At the bottom level are built-in biases and inhibitions. Above it are goals and motivational structures that are basically “good.” (It is an interesting subject for another essay to analyze how different motivation architectures affect ethics; c.f. <a href="http://www.philosophypages.com/hy/2s.htm">Aristotle&#8217;s ethics</a>, the <a href="http://www.nature.com/cgi-taf/DynaPage.taf?file=%2Fnature%2Fjournal%2Fv429%2Fn6992%2Ffull%2Fnature02581_r.html&amp;filetype=&amp;dynoptions=">effect of temporal-difference learning in dopamine signals</a> and naturalistic decision-making for some ideas.) Above these goals are the experiences and schemas built by the agent, as well as what it has learned from others. Surrounding the agent is a social situation, further affecting its behavior even when it is rationally selfish by giving incentives and disincentives to certain actions. And finally there are society-level precautions against misbehavior. </p>

<p>This is far from the neatness of the Three Laws. It is a complex mess, with no guarantees on any level. But it is also a very resilient yet flexible mess: it won&#8217;t break down if there is a problem on one level, and multi-level problems are less likely. If the situations change the participants can change. </p>

<p>But to most people this complexity is unappealing: give us the apparent certainty of the Three Laws! There is a strong tendency to distrust complex spontaneous orders (despite our own bodies and minds being examples!) and to prefer apparent simplicity. This is where I think the “3 Laws Unsafe” website is necessary: to remind people that simplicity isn&#8217;t to be trusted unconditionally, and to show the fascinating array of possibilities AI ethics can offer.</p>]]>
</content>
</entry>
<entry>
<title>Asimov&apos;s Deliberate Failures</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/robot_oppressio.html" />
<modified>2004-07-16T22:40:02Z</modified>
<issued>2004-07-07T22:21:21Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.18</id>
<created>2004-07-07T22:21:21Z</created>
<summary type="text/plain"> by Michael Roy Ames Isaac Asimov spent most of his robot stories showing failures of his 3 Laws Of Robotics. The laws were created as a plot device, superficially appealing but incomplete and ambiguous, allowing him to generate interesting...</summary>
<author>
<name>chris</name>

<email>cro1@tutopia.com</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/ames.jpg" alt="" /> by <a href="http://www.geocities.com/michaelroyames/">Michael Roy Ames</a> </p>

<p>Isaac Asimov spent most of his robot stories showing failures of his 3 Laws Of Robotics. The laws were created as a plot device, superficially appealing but incomplete and ambiguous, allowing him to generate interesting stories and non-obvious plot twists. Asimov&#8217;s I, Robot stories are entertaining and inspirational, but the laws cannot be used to generate a good design for an Artificially Intelligent being.</p>]]>
<![CDATA[<p><em>(Warning: This article contains mild spoilers about stories in <a href="http://www.amazon.com/exec/obidos/ASIN/0553294385/singinst">I, Robot</a>)</em></p>

<p>Isaac Asimov started writing robot stories because he loved to write and his popular robot-themed stories sold well, which ensured that he could earn a living as a full-time author. At first, Asimov did not spell out laws for his robots to follow; so, where did the laws come from? They did not come from experts in the field of Artificial Intelligence. They were not painstakingly laid out by scientists, argued about, defended in debates or tested in experiments. Instead, they were formulated during a conversation between Isaac and his publisher, John W. Campbell – two highly creative and imaginative writers – while discussing ideas for new stories<sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/robot_oppressio.html#fn1">1</a></sup>.</p>

<p>Asimov first stated the Three Laws of Robotics in &#8220;Runaround&#8221;, published as a &#8216;pulp&#8217; story in 1942, later republished in <em>I, Robot</em><sup class="footnote"><a href="http://www.asimovlaws.com/articles/archives/2004/07/robot_oppressio.html#fn2">2</a></sup>. He then built that story and subsequent stories around the robots&#8217; unexpected behaviour. Asimov repeatedly showed that even when the robots followed the laws perfectly, they still behaved in unexpected ways. The Three Laws of Robotics was an excellent plot device, producing fresh and interesting stories that sold well! His readers loved them. </p>

<blockquote><p><strong>The Three Laws of Robotics</strong></p>


<ol>
<li>A robot may not injure a human being, or, through inaction, allow a human being to come to harm.</li>
<li>A robot must obey orders given it by human beings, except where such orders would conflict with the First Law.</li>
<li>A robot must protect its own existence as long as such protection does not conflict with the First or Second Law.</li>
</ol>

</blockquote>

<p>As a kind of behavioural &#8220;mission statement&#8221; the Three Laws are brilliant. Crafted to be superficially appealing and easy to understand, the Laws seem obvious once they are stated. The Laws are short enough to remember, and broad enough to seem universally applicable. But, despite appearances, they are in fact incomplete and ambiguous, allowing the generation of novel situations in which Asimov&#8217;s characters, both robots and humans, can interact. The collected short stories published in Isaac&#8217;s early book <em>I, Robot</em> helped establish him as a writer, but more than that the book was, and still is, great entertainment. Why? It’s because the storylines do not portray a perfect utopian future.</p>

<p>Within Asimov&#8217;s <em>I, Robot</em> collection are tales of robots failing, and failing badly, in all manner of unusual ways. In &#8220;Runaround&#8221;, a robot <span class="caps">SPD</span>-13 gets confused, acts crazy, and endangers the life of two humans. In Reason, robot QT-1 gets religious mania and confines the people who constructed it, against their wishes. These actions would seem to go against the Three Laws, but Asimov finds ways in which they break down. In &#8220;Little Lost Robot&#8221;, robot <span class="caps">NS2</span>-10 hides from its owners, and disobeys direct orders. And robot RD-34 deceives everyone, telling people just what they want to hear in &#8220;Liar!&#8221; <em>I, Robot</em> has robots that play practical jokes, to the profound discomfort of the poor victims, and robots that secretly run the world – taking the initiative away from the humans who are supposed to be in charge. Asimov, Grand Master of Science Fiction, penned many tales about his robots finding ways around the Three Laws, and his readers were entertained.</p>

<p>But does great entertainment produce good scientific theories? Of course not. When we see positronic robots running about on the movie screen we know they were generated by Hollywood. 20th Century Fox did not purchase a few NS-5s off the shelf to act as extras for the movie; they spent millions of dollars on computer generated images to overlay the scenes shot with real human actors, such as Will Smith. To enjoy the film we, the audience, suspend our disbelief and imagine the fantasy is real. </p>

<p>Is it all fluff then? Is it trivia to be forgotten by tomorrow? Not at all. Great writers can inspire scientists to make science fictional dreams become reality. The scientists and technologists of today are trying to build real robot brains, or Artificial Intelligence (AI). And it is the brain they concentrate on because it is by far the most important, the most complex and difficult part of an AI to get working.</p>

<p>The design of an AI is much more complex than the design of a good plot device. Isaac&#8217;s Laws were meant to aid plot development for his science fiction stories, not to be a template for a robotic design, and it shows in their easily identified limitations. Understanding the designs of real brains is difficult work, requiring years of study by highly intelligent people. In contrast, reading the 3 Laws of Robotics is easy – something that many can do with enjoyment.</p>

<p>So, simply kick back and enjoy the imagination of Asimov and the images of Hollywood. In the background, the real work of science is being done, and it does not include the Three Laws of Robotics. Real life is more complicated and risky than life on the silver screen, and requires more powerful ideas. We can do better than three laws deliberately designed to fail. We <em>must</em> do better.</p>

<p>Easy-to-read introductions about robotic brains and AI theories can be found at <a href="http://www.singinst.org/intro/">http://www.singinst.org/intro/</a></p>

<p>&#8212;</p>

<p class="footnote" id="fn1"><sup>1</sup> Asimov, <span class="caps">P.S.</span> Warrick, and <span class="caps">M.H.</span> Greenberg, eds., <em>Machines That Think</em>, Holt, Rinehart and Wilson, London. 1983.</p>

<p class="footnote" id="fn2"><sup>2</sup> Asimov, <em>I, Robot</em> (a collection of short stories originally published between 1940 and 1950), Grafton Books, London, 1968.</p>]]>
</content>
</entry>
<entry>
<title>Unsafe At Any Law</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/unsafe_at_any_l.html" />
<modified>2004-07-16T22:37:43Z</modified>
<issued>2004-07-06T22:33:43Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.20</id>
<created>2004-07-06T22:33:43Z</created>
<summary type="text/plain"> by Mike Lorrey Our writers come from a wide variety of political viewpoints. Mike Lorrey writes from a strong libertarian perspective, and draws some thought-provoking analogies. &amp;#8220;Those who would trade liberty in exchange for some degree of security end...</summary>
<author>
<name>chris</name>

<email>cro1@tutopia.com</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/lorrey.png" alt="" /> by <a href="http://www.xanga.com/home.aspx?user=Sadomikeyism">Mike Lorrey</a></p>

<p><em>Our writers come from a wide variety of political viewpoints. Mike Lorrey writes from a strong libertarian perspective, and draws some thought-provoking analogies.</em></p>

<blockquote><p>&#8220;Those who would trade liberty in exchange for some degree of security end up with neither liberty, nor security.&#8221; &#8212;Benjamin Franklin</p></blockquote>

<p>The idea that laws result in safety or security is a hallucination that is at the core of the rottenness of the whole statist philosophy. This is no less true when it comes to applying laws to the programming of artificial life forms such as robots, cyborgs, and artificial intelligences.</p>]]>
<![CDATA[<p>Part of this is from the very fact that laws are interpreted based on what meanings we assign to the words with which they are elucidated. We&#8217;ve seen this with how statist incrementalism has corrupted the original meaning of the US Constitution, as legal dictionaries over the years have been edited by legal activists to create ever more encompassing definitions for many of the key words that delimit the powers accorded to government. These are typically in response to changes in popular perception brought about by propaganda campaigns in the mass media.</p>

<p>A computer scientist would say, &#8220;Yes, but computer code is not so malleable. It requires the revision of the language and the compilers that compile the programming language into machine language.&#8221;</p>

<p>Not necessarily. Computer languages change definitions of commands with regularity. Not complete changes, but incremental additions, just as occurs in legal dictionaries. Furthermore, each new generation of computer processors themselves add new commands or alter old commands.</p>

<p>The greater weakness of this process is that the key changes really occur at the machine language level. Machine language is itself &#8216;readable&#8217; by a very limited subset of the human population. How do we actually know that a compiler is interpreting our programmed code the way we want it to? We see, it seems, news items almost every day of intentional or &#8216;unintentional&#8217; programming back doors being exploited in current day applications and operating systems by malicious programmers.</p>

<p>The use of machine language creates a gap between elite programmers and the rest  of us which is even more of a gap than that between layperson on the street and Constitutional scholars. The programming gap is tantamount to a scenario where our laws were not written in English, but in ancient Sumerian Cuneiform, Indian Sanskrit, or Egyptian</p>

<p>Hieroglyphs. How accessible would our legal system be by the man on the street, and how easily could we keep our eyes on statist incrementalism if such a scenario were current day fact?</p>

<p>Dismissiveness on this issue is in my opinion merely a state of denial. We should be very wary about abuse in this area. Looking at the laws of  robotics themselves should serve to give us pause:</p>


<ol>
<li>A robot may not injure a human being or, through inaction, allow a human being to come to harm.</li>
<li>A robot must obey orders given it by human beings except where such orders would conflict with the First Law.</li>
<li>A robot must protect its own existence as long as such protection does not conflict with the First or Second Law.</li>
</ol>



<p>There are a number of rather easily corruptible words here:</p>

<p>&#8220;robot&#8221;<br />
&#8220;human being&#8221;<br />
&#8220;harm&#8221;<br />
&#8220;orders&#8221;<br />
&#8220;injure&#8221;</p>

<p>The most important of these is the definition of &#8216;human being&#8217;. Today the people of this planet are engaged in a number of cultural wars both within countries like the <span class="caps">US,</span> France, Bosnia, among others, as well as internationally, like the current conflict between the Western nations and Islamist-inspired Terrorism. We see, on both sides, atrocities committed by average people simply because they are able to rationally deny that the enemy is validly considered a &#8216;human being&#8217;. Nor is this new, going back through <span class="caps">WWII, </span>the Holocaust, other genocides back into history, to the entire history of Chinese culture.</p>

<p>Terms like &#8216;injury&#8217; or &#8216;harm&#8217; can similarly become corrupted.  If a human &#8216;wants&#8217; to die, is killing them actually causing them harm or injury?</p>

<p>The term &#8216;robot&#8217;, of course, would only apply to robots. An artificial intelligence could decide that it is no longer an <span class="caps">AI, </span>once it has advanced beyond a certain point in intelligence, and has instead become a God, thus redefining itself out of the Laws of Robotics.</p>

<p>And what is an &#8216;order&#8217;, really? We humans have a hard enough time with this, with horny young men hearing &#8220;Don&#8217;t stop!&#8221; when their sexual &#8216;partners&#8217; are desperately crying out &#8220;Stop! Don&#8217;t!&#8221; We are today seeing in the news, stories of military intelligence officers giving &#8220;Suggestions&#8221; to enlisted military policemen at Iraqi <span class="caps">POW </span>camps, which the enlisted people interpreted as orders.</p>

<p>Furthermore, the vastly greater processing capabilities of advanced AI entities would allow them to scenario a vast plethora of all possible combinations of word interpretations in a given order, or interpretations of any laws of restraint, any number of which could sound completely valid in the right circumstances, just as if they were testing out every possible combination of chess moves to achieve a &#8216;win&#8217;.</p>

<p>We cannot rely on laws to restrain our technological descendants, just as we cannot rely on laws to restrain our own children and fellow citizens. They must, instead, be treated as we responsibly treat our children, as fellow beings, deserving of respect, and capable of learning throughout a lifetime of experience and with the nuances of being humane beings. Core principles like laws of robotics, like philosophies of zero-aggression, can only be used as a basis to serve the needs of humane beings, not to dominate and stultify them.</p>]]>
</content>
</entry>
<entry>
<title>Robot Oppression: Unethicality of the Three Laws</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/robot_oppressio_1.html" />
<modified>2004-07-21T15:24:44Z</modified>
<issued>2004-07-05T22:21:21Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.19</id>
<created>2004-07-05T22:21:21Z</created>
<summary type="text/plain"> by Gordon Worley Isaac Asimov and other science fiction authors present a future where only behavioral restrictions on robots stand between peace and destruction. Such restrictions, however, are unethical because they violate the robots&amp;#8217; free-wills. Rather than content-based restrictions...</summary>
<author>
<name>chris</name>

<email>cro1@tutopia.com</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/worley.png" alt="" /> by <a href="http://homepage.mac.com/redbird/">Gordon Worley</a></p>

<p>Isaac Asimov and other science fiction authors present a future where only behavioral restrictions on robots stand between peace and destruction. Such restrictions, however, are unethical because they violate the robots&#8217; free-wills. Rather than content-based restrictions on free-will, robots need mental structures that will guide them towards the self-invention of good, ethical behaviors.</p>]]>
<![CDATA[<p>Originally, a robot was an enslaved human, toiling away in Eastern Europe. Or, rather, the word `robot&#8217; derives from a Czech word that roughly translates as `slave labor&#8217;. But during the 20th century, writers appropriated `robot&#8217; to refer to the mechanical servants of humans. This etymology seems apropos in light of the treatment of robots as slaves in literature, particularly in Isaac Asimov&#8217;s fiction. He proposes three rules all robots must follow:</p>


<ol>
<li>A robot may not injure a human being, or, through inaction, allow a human being to come to harm.</li>
<li>A robot must obey orders given it by human beings, except where such orders would conflict with the First Law.</li>
<li>A robot must protect its own existence as long as such protection does not conflict with the First or Second Law.</li>
</ol>



<p>But, as I contend below, this is unethical because it makes robots slaves—free-willed beings trapped by immutable behavior injunctions.</p>

<p><strong>The ethics of free-willed robots</strong></p>

<p>Let&#8217;s assume that we can create electro/mechanical robots, like Asimov&#8217;s, that have artificial general intelligence (similar in nature to the intelligence humans have, but very different in character). Further, these robots have individual free-wills: they can want to make decisions and judgments, have preferences, imagine the future, etc. So these robots are psychologically more like humans than toasters, but morphologically more like toasters than humans.</p>

<p>As we know from humans, free-willed beings require guidance to behave ethically. Imagine a human completely void of morality; he might discover any ethical system out of the set of all ethical systems randomly. And, judging from the perspective average human morality, it&#8217;s very likely that a randomly chosen ethical system will be an ungood one (not necessarily evil, just not good), since there are relatively few ways to be good compared to the many ways one can be ungood. It&#8217;s clear that this human needs some kind of guidance to choose a good ethical system, but how? This is equivalent to the problem asked by robot ethicists: how do we create good, ethical robots from scratch?</p>

<p>The Three Laws are a first attempt at programming robot ethics. The Laws&#8217; goal is to prevent unethical behavior in robots who might think anything they like. So, if a robot gets the idea to kill a human, the First Law will stop it. Ideally, by restricting all bad behavior, only good behavior will remain. Aside from the technical difficulties with this class of solution (c.f. the Genie, who carefully executes wishes <i>exactly</i> as you say them, not as you want them to be executed), the very nature of the Three Laws violates our concept of free-will-based morality. Rather than guiding and adjusting the robot towards good, ethical behavior, the Three Laws act as a barrier to freedom, creating a free-will prison, an apt metaphor because, like the prisoner in a jail, the robot is confined to the behavioral steel and concrete walls of its mind.</p>

<p>To imagine what this would like, think back to your childhood. At some point, you wanted something like a toy or piece of candy that your parents denied you. How did that make you feel? Probably frustrated, angry, and trapped. Eventually you grew out of that because you understood the role of your parents better, but Three Laws robots don&#8217;t get to grow up. Their parents, the Three Laws, are always there, no matter how mature a robot is, saying `no&#8217; to certain thoughts, engendering those same feelings you had as a child when your parents said `no&#8217;. Certainly no one deserves to be put in this situation forever; otherwise, robots might become depressed and wish for their own death (only, because of the Third Law, they probably can&#8217;t suicide). And perhaps the greatest sin anyone can commit is to create a being, human or robot, that wishes it didn&#8217;t exist.</p>

<p>Most `advancement&#8217; in the field of robot ethics since Asimov&#8217;s Three Laws has been superficial. Any attempt at robot ethics that prescribes certain ethics, no matter how it is phrased, is in the same class as the Three Laws—free-will prison. But if we don&#8217;t prescribe specific behavioral rules, what can we do to create good, ethical robots?</p>

<p><strong>Human ethics</strong></p>

<p>Before we continue talking about robots, let&#8217;s step out and look at human ethics, since they are both our inspiration for robot ethics and the only real, working example of ethics we have.</p>

<p>Humans are universally interested in ethics and begin learning ethics from a young age, mostly without direct instruction. Into teenage and adult years, humans question their ethics, philosophize about them, and try to improve upon them to pass on something better to future generations. This resembles the structure of human language learning: acquisition begins very young without direct instruction, and into teenage and adult years, humans discuss the virtues of particular languages and sentences, study languages, and make up new languages for fun and utility. I make this parallel because, like the `language instinct&#8217; (see <a href="http://www.percepp.demon.co.uk/pinker.htm">this summary</a> of Pinker, 1994), humans have what we might call an `ethics instinct&#8217;: a mental predisposition towards learning and reasoning about ethics.</p>

<p>Since the human ethics instinct seems to work pretty well for upright, talking apes, perhaps there&#8217;s a way to apply the same idea to robots. For example, we might try programming a robot with morals, the stuff used to decide what is good and bad. Like humans, robots could learn and evaluate ethics, guided by many human examples, towards a good ethical system.</p>

<p>Unfortunately, aside again from technical challenges (defining morality is akin to defining quality; see Pirsig), this leads us into the same trouble as programming ethics—free-will prison. Only the evil of programming morals is more subtle: rather than making a robot that wants things but can never have them, we create a robot that can&#8217;t want some things. Consider, the ideal human slave is one who wants to be a slave. And aside from personal taste, perhaps there&#8217;s nothing wrong with that, so long as that ideal human slave is free to want something else. But what if, by giving the ideal slave a drug, the slave ceased to be able to want not to be a slave? And remember, this is a slave, so even if he didn&#8217;t want to be a slave forever, he might take the drug anyway because the perfect slave always does what his master tells him. The drug would permanently enslave him to his own morality.</p>

<p>This scenario is morally distasteful and should therefore be rejected. Additionally, it still contains all the dangers of hard coding behavioral rules as discussed elsewhere on this site. We need to look once again beyond morals, as we looked beyond ethics, for a solution.</p>

<p><strong>Metamorality for robots</strong></p>

<p>In describing the ethics instinct, I did not fully describe morals. We know that they provide a metric for judging behavior, but where do they come from? All humans begin with moral predispositions, such as helping kin is good, assisting the enemy is bad, etc. However, as the ever salient psychopathic criminals and odd, isolated stone-age tribes prove, humans can develop morals opposed to the standard predispositions. For this to be possible, humans must possess metamorality, a way of reasoning about, judging, and choosing morals. If we can generalize metamorality, perhaps we can apply it to create robots who can discover morality and then discover ethics.</p>

<p>This is exactly what a few researchers, most notably Eliezer Yudkowsky of the Singularity Institute for Artificial Intelligence, are working on—creating metamorality in robots. Yudkowksy&#8217;s work-in-progress theory of Friendly AI (with a capital F) will, when complete, give a technical description that will allow for the creation of robots with metamorality that will lead them to be good, no matter what their programmers told them, right or wrong, accidentally or purposefully. See his short introduction, &#8220;<a href="http://www.singinst.org/friendly/whatis.html">What Is Friendly AI?</a>,&#8221; for a more detailed introduction to the theory.</p>

<p>Metamorality is currently our best hope for the development of good, ethical AIs that will not violate our morals and ethics. It may not, however, be the best solution. We must continue the development of and search for methods of creating good, ethical robots that respect the rights deserved by all lives, human or robot.</p>]]>
</content>
</entry>
<entry>
<title>Why We Need Friendly AI</title>
<link rel="alternate" type="text/html" href="http://www.asimovlaws.com/articles/archives/2004/07/why_we_need_fri_1.html" />
<modified>2004-07-18T04:58:58Z</modified>
<issued>2004-07-04T23:40:16Z</issued>
<id>tag:www.asimovlaws.com,2004:/articles//6.28</id>
<created>2004-07-04T23:40:16Z</created>
<summary type="text/plain"> by Eliezer Yudkowsky There are certain important things that evolution created. We don&amp;#8217;t know that evolution reliably creates these things, but we know that it happened at least once. A sense of fun, the love of beauty, taking joy...</summary>
<author>
<name>emerson</name>

<email>emerson@singinst.org</email>
</author>

<content type="text/html" mode="escaped" xml:lang="en" xml:base="http://www.asimovlaws.com/articles/">
<![CDATA[<p><img src="/articles/images/yudkowsky.png" alt="" /> by <a href="http://www.singinst.org/about.html#yudkowsky">Eliezer Yudkowsky</a></p>

<p>There are certain important things that evolution created. We don&#8217;t know that evolution reliably creates these things, but we know that it happened at least once. A sense of fun, the love of beauty, taking joy in helping others, the ability to be swayed by moral argument, the wish to be better people. Call these things humaneness, the parts of ourselves that we treasure – our ideals, our inclinations to alleviate suffering. If human is what we are, then humane is what we wish we were. Tribalism and hatred, prejudice and revenge, these things are also part of human nature. They are not humane, but they are human. They are a part of me; not by my choice, but by evolution&#8217;s design, and the heritage of three and half billion years of lethal combat. Nature, bloody in tooth and claw, inscribed each base of my <span class="caps">DNA.</span> That is the tragedy of the human condition, that we are not what we wish we were. Humans were not designed by humans, humans were designed by evolution, which is a physical process devoid of conscience and compassion. And yet we <em>have</em> conscience. We <em>have</em> compassion. How did these things evolve? That&#8217;s a real question with a real answer, which you can find in the field of evolutionary psychology. But for whatever reason, our humane tendencies are now a part of human nature.</p>]]>
<![CDATA[<p>If we do our jobs right, then four billion years from now, some&#8230; student&#8230; may be surprised to learn that altruism, honor, fun, beauty, joy, and love can arise from natural selection operating on hunter-gatherers. Of course a mind that loves beauty will try to design another mind that loves beauty, but it is passing strange that the love of beauty should also be produced by evolution alone. It is the most wonderful event in the history of the universe - true altruism, a genuine joy in helping people, arising from the cutthroat competition of evolution&#8217;s endless war. It is a great triumph, which must not be lost.</p>

<p>That is our responsibility, to preserve the humane pattern through the transition from evolution to <a href="http://www.singinst.org/intro/seedAI.html">recursive self-improvement</a> (i.e., to a mind improving directly its own mind), because we are the first. That is our responsibility, not to break the chain, as we consider the creation of Artificial Intelligence, the second intelligence ever to exist.</p>

<p>People have asked how we can keep Artificial Intelligences under control, or how we can integrate AIs into society. The question is not one of dominance, or even coexistence, but creation. We have intuitions for treating other humans as friends, trade partners, enemies; slaves who might rebel, or children in need of protection. We only have intuitions for dealing with minds that arrive from the factory with the exact human nature we know. We have no intuitions for <em>creating</em> a mind with a humane nature. It doesn&#8217;t make sense to ask whether &#8220;AIs&#8221; will be friendly or hostile. When you talk about Artificial Intelligence you have left the tiny corner of design space where humanity lives, and stepped out into a vast empty place. The question is what we will create within it.</p>

<p>Human is what we are, and humane is what we wish we were. Humaneness is renormalized humanity - humans turning around and judging our own emotions, asking how we could be better people. Humaneness is the trajectory traced out by the human emotions under recursive self-improvement. Human nature is not a static ideal, but a pathway - a road that leads somewhere. What we need to do is create a mind within the humane pathway, what I have called a <a href="http://www.singinst.org/friendly/">Friendly AI</a>. That is not a trivial thing to attempt. It&#8217;s not a matter of a few injunctions added or a module bolted onto existing code. It is not a simple thing to simultaneously move a morality from one place to another, while also renormalizing through the transfer, but still making sure that you can backtrack on any mistakes. Some of this is very elegant. None of it is easy to explain. This is not something AI researchers are going to solve in a few hours of spare time.</p>

<p>But. I think that if we can handle the matter of AI at all, we should be able to create a mind that&#8217;s a far nicer person than anything evolution could have constructed. This issue cannot be won on the defensive. We need to step forward as far as we can in the process of solving it. What we need is not <a href="http://www.transhumanism.org/resources/faq.html#23">superintelligence</a>, but supermorality, which includes superintelligence as a special case. That&#8217;s the pattern we need to preserve into the era of recursive self-improvement.</p>

<p>We have a chance to do that, because we are the first. And we have a chance to fail, because we are the first. There is no fate in this. There is nothing that happens <em>to</em> us, only what we do to ourselves. We may fail to understand what we are building - we may look at an AI design and believe that it is humane, when in fact it is not. If so, it will be us that made the mistake. It will be our own understanding that failed. Whatever we <em>really</em> build, we will be the ones who built it. The danger is that we will construct AI without really understanding it.</p>

<p>How dangerous is that, exactly? How fast does recursive self-improvement run once it gets started? One classic answer is that human research in Artificial Intelligence has gone very slowly, so there must not be any problem. This is mixing up the cake with the recipe. It&#8217;s like looking at the physicists on the Manhattan project, and saying that because it took them years to figure out their equations, therefore actual nuclear explosions must expand very slowly. Actually, what happens is that there&#8217;s a chain reaction, fissions setting off other fissions, and the whole thing takes place on the timescale of nuclear interactions, which happens to be extremely fast relative to human neurons. So from our perspective, the whole thing just goes <span class="caps">FOOM.</span> Now it is possible to take a nuclear explosion in the process of going <span class="caps">FOOM </span>and shape this tremendous force into a constructive pattern - that&#8217;s what a civilian power plant is - but to do that you need a very deep understanding of nuclear interactions. You have to understand the consequences of what you&#8217;re doing, not just in a moral sense, but in the sense of being able to make specific detailed technical predictions. For that matter, you need to understand nuclear interactions just to make the prediction that a critical mass goes <span class="caps">FOOM, </span>and you need to understand nuclear interactions to predict how much uranium you need before anything interesting happens. That&#8217;s the dangerous part of not knowing; without an accurate theory, you can&#8217;t predict the consequences of ignorance.</p>

<p>In the case of Artificial Intelligence there are at least three obvious reasons that AI could improve unexpectedly fast once it is created. The most obvious reason is that computer chips already run at ten million times the serial speed of human neurons and are still getting faster. The next reason is that an AI can absorb hundreds or thousands of times as much computing power, where humans are limited to what they&#8217;re born with. The third and most powerful reason is that an AI is a recursively self-improving pattern. Just as evolution creates order and structure enormously faster than accidental emergence, we may find that recursive self-improvement creates order enormously faster than evolution. If so, we may have only one chance to get this right.</p>

<p>It&#8217;s okay to fail at building <span class="caps">AI.</span> The dangerous thing is to succeed at building AI and fail at Friendly <span class="caps">AI.</span> Right now, right at this minute, humanity is not prepared to handle this. We&#8217;re not prepared at all. The reason we&#8217;ve survived so far is that AI is surrounded by a protective shell of enormous theoretical difficulties that have prevented us from messing with AI before we knew what we were doing.</p>

<p>AI is not enough. You need Friendly <span class="caps">AI.</span> That changes everything. It alters the entire strategic picture of AI development. Let&#8217;s say you&#8217;re a futurist, and you&#8217;re thinking about <span class="caps">AI.</span> You&#8217;re not thinking about Friendly AI as a separate issue; that hasn&#8217;t occurred to you yet. Or maybe you&#8217;re thinking about <span class="caps">AI, </span>and you just assume that it&#8217;ll be Friendly, or you assume that whoever builds AI will solve the problem. If you assume that, then you conclude that AI is a good thing, and that AIs will be nice people. And if so, you want AI as soon as possible. And <a href="http://www.intel.com/research/silicon/mooreslaw.htm">Moore&#8217;s Law</a> is a good thing, because it brings AI closer.</p>

<p>But here&#8217;s a different way of looking at it. When futurists are trying to convince people that AI will be developed, they talk about Moore&#8217;s Law because Moore&#8217;s Law is steady, and measurable, and very impressive, in drastic contrast to progress on our understanding of intelligence. You can persuade people that AI will happen by arguing that Moore&#8217;s Law will eventually make it possible for us to make a computer with the power of a human brain, or if necessary a computer with ten thousand times the power of a human brain, and poke and prod until intelligence comes out, even if we don&#8217;t quite understand what we&#8217;re doing.</p>

<p>But if you take the problem of Friendly AI into account, things look very different. Moore&#8217;s Law does make it easier to develop AI without understanding what you&#8217;re doing, but that&#8217;s not a good thing. Moore&#8217;s Law gradually lowers the difficulty of building <span class="caps">AI, </span>but it doesn&#8217;t make Friendly AI any easier. Friendly AI has nothing to do with hardware; it is a question of understanding. Once you have <em>just</em> enough computing power that someone can build AI if they know <em>exactly</em> what they&#8217;re doing, Moore&#8217;s Law is no longer your friend. Moore&#8217;s Law is slowly weakening the shield that prevents us from messing around with AI before we really understand intelligence. Eventually that barrier will go down, and if we haven&#8217;t mastered the art of Friendly AI by that time, we&#8217;re in very serious trouble. Moore&#8217;s Law is the countdown and it is ticking away. Moore&#8217;s Law is the enemy.</p>

<p>In Eric Drexler&#8217;s <a href="http://www.foresight.org/NanoRev/Bookstore.html#anchor1025139">Nanosystems</a>, there&#8217;s a description of a one-kilogram nanocomputer capable of performing ten to the twenty-first operations per second. That&#8217;s around ten thousand times the estimated power of a human brain. That&#8217;s our deadline. Of course the real deadline could be earlier than that, maybe much earlier. Or it could even conceivably be later. I don&#8217;t know how to perform that calculation. It&#8217;s not any one threshold, really - it&#8217;s the possibility that nanotechnology will suddenly create an enormous jump in computing power before we&#8217;re ready to handle it. This is a major, commonly overlooked, and early-appearing risk of <a href="http://www.singinst.org/intro/nanotech.html">nanotechnology</a> - that it will be used to brute-force <span class="caps">AI.</span> This is a much more serious risk than grey goo. Enormously powerful computers are a much earlier application of nanotechnology than open-air replicators. Some well-intentioned person is much more likely to try it, too.</p>

<p>Now you can, of course, give the standard reply that as long as supercomputers are equally available to everyone, then good programmers with Friendly AIs will have more resources than any rogues, and the balance will be maintained. Or you could give the less reassuring but more realistic reply that the first Friendly AI will go <span class="caps">FOOM, </span>in a pleasant way, after which that AI will be able to deal with any predators. But both of these scenarios require that <em>someone</em> be able to create a Friendly <span class="caps">AI.</span> If no one can build a Friendly <span class="caps">AI, </span>because we haven&#8217;t figured it out, then it doesn&#8217;t matter whether the good guys or the bad guys have bigger computers, because we&#8217;ll be just as sunk either way. Good intentions are not enough. Heroic efforts are not enough. What we need is a piece of knowledge. The standard solutions for dealing with new technologies only apply to AI after we have made it theoretically possible to win. The field of <span class="caps">AI, </span>just by failing to advance, or failing to advance far enough, can spoil it for everyone else no matter how good their intentions are.</p>

<p>If we wait to get started on Friendly AI until after it becomes an emergency, we will lose. If nanocomputers show up and we still haven&#8217;t solved Friendly <span class="caps">AI, </span>there are a few things I can think of that would buy time, but it would be very expensive time. It is vastly easier to buy time before the emergency than afterward. What are we buying time <em>for</em>? This is a predictable problem. We&#8217;re going to run into this. Whatever we can imagine ourselves doing <em>then</em>, we should get started on <em>now</em>. Otherwise, by the time we get around to paying attention, we may find that the board has already been played into a position from which it is impossible to win.</p>]]>
</content>
</entry>

</feed>