<h1 id="distribution">8.3 Distribution</h1>
<p>In this section, we discuss three main dimensions along which aspects of AI
systems may be distributed:</p>
<ol>
<li><p>Benefits and costs of AI: whether the benefits and costs of AI will be evenly or unevenly shared across society.</p></li>
<li><p>Access to AI: whether advanced AI systems will be kept in the hands of a small group of people, or whether they will be widely accessible to the general public.</p></li>
<li><p>Power of AI systems: whether in the long run, there will be a few highly sophisticated AIs with vastly more power than the rest, or many highly capable AIs.</p></li>
</ol>
<h2 id="distribution-of-costs-and-benefits-of-ai">8.3.1 Distribution of Benefits and Costs of AI</h2>
<p>The distribution of the costs and benefits from AI will ultimately depend on both market forces and governance decisions. It is possible that companies developing AI will receive most of the economic gains, while automation could dramatically reduce economic opportunities for others. Governments may need to adopt new approaches to redistribution to ensure that, even if only a small number of people directly gain wealth from AIs, this wealth is eventually shared more broadly among the population.</p>
<p><strong>AI might substitute for labor on a large scale.</strong> Researchers have estimated that many jobs are susceptible to partial or full replacement by AI, depending on what tasks those jobs involve and which of these can be automated with AI. Unlike in the past, even knowledge-based jobs such as writing and programming are at risk due to advancements in AI such as large language models, robotics, and computer vision. In the U.S., for instance, the advent of self-driving cars could potentially displace 5 million people who drive for a living, and advancements in robotics might threaten the jobs of the 12 million people employed in manufacturing. Since software is cheap to duplicate, AI presents firms around the globe with a low-cost and scalable alternative to using human labor.</p>
<p><strong>Past revolutions have relocated employment, not destroyed it.</strong> A common counterargument brought up in discussions about the potential impact of AI on employment draws on historical evidence of technological revolutions. On the one hand, automation negatively impacts employment and wages through the <em>displacement effect</em>, as human labor is substituted with technology. On the other hand, automation has a positive economic impact through the <em>productivity effect</em>, as new tasks or industries require human labor <span class="citation" data-cites="acemoglu2020robots">[4]</span>. Historically, the productivity effect has dominated and the general standard of living has increased. Consider the Industrial Revolution: mechanized production methods displaced artisanal craftspeople, but eventually led to new types of employment opportunities in factories and industries that hadn't previously existed. Similarly, access to computers has automated away many manual and clerical jobs like data entry and typing, but has also spawned many more new professions. Technological shifts have historically led to increases in employment opportunities and wages. Therefore, while these changes were disruptive for certain professions, they ultimately led to a shift in employment rather than mass unemployment. This phenomenon, called creative destruction, describes how outdated industries and jobs are replaced by new, often more efficient ones. Similarly, transformative technologies can also augment workers, as capital does in the standard economic view, rather than replace them. If AIs serve as gross complements to human labor, this may drive up wage growth rather than increase inequality.</p>
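<p>To make the distinction between substitution and complementarity concrete, the short sketch below uses a CES (constant elasticity of substitution) production function to compute labor's share of income as AI capital grows. This is an illustrative toy calculation, not a model from the text: the functional form, the parameter values, and the interpretation of K as "AI capital" are all assumptions chosen only to show the two regimes.</p>
<pre><code># Illustrative sketch (assumptions, not the text's model): under a CES
# production function Y = (a*K^rho + (1-a)*L^rho)^(1/rho) with elasticity
# of substitution sigma = 1/(1-rho), labor's share of income shrinks as
# AI capital K grows when sigma is above 1 (gross substitutes) and grows
# when sigma is below 1 (gross complements). Parameter values are arbitrary.

def labor_share(K, L, a, rho):
    """Share of output paid to labor under CES with competitive factor markets."""
    return (1 - a) * L**rho / (a * K**rho + (1 - a) * L**rho)

L, a = 1.0, 0.5
for rho, regime in [(0.5, "gross substitutes (sigma = 2.0)"),
                    (-1.0, "gross complements (sigma = 0.5)")]:
    shares = [round(labor_share(K, L, a, rho), 3) for K in (1, 10, 100, 1000)]
    print(regime, shares)
</code></pre>
<p>Under these toy assumptions, the substitutes case sends labor's share toward zero as AI capital accumulates, while the complements case sends it toward one, which is the intuition behind the claim that complementarity would support wages rather than widen inequality.</p>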
<p><strong>In the near term, AI could boost employment.</strong> We may see an increase in job opportunities that involve managing or checking the outputs of AI systems. Given the likely penetration of AI into all sectors of the economy, this could lead to a significant creation of new positions, not dissimilar to how IT services are required across industries.</p>
<p>If automation increases purchasing power due to decreasing prices of automated goods, this could increase demand for human-provided goods and services by enough to partly or fully offset job losses from automation. As wealth increases, people may have more disposable income, potentially spurring job growth in sectors like hospitality, recreation, and mental health services. Automation can also lower the costs of goods and services, increasing demand. For example, automation may expedite tedious aspects of the legal profession, decreasing the cost of legal services. Legal services might then become more affordable, increasing demand and creating more work for legal professionals.</p>
<p><strong>However, human-level AI might destroy employment.</strong> Human-level AI (HLAI), by definition, is capable of doing every task that humans can do, at least as cheaply. The implication of achieving HLAI is that human labor would no longer be valuable or necessary. This would represent a fundamental difference from past technological advancements that both helped and displaced human workers. Conventional policies to address job losses from automation, like worker retraining programs, would be meaningless in a world where there are no jobs to retrain for. At that point, economic gains from automation would be likely to accrue to a small handful of people or organizations that control AIs. Human innovation has historically created new jobs that would have seemed inconceivable just decades earlier. However, HLAI would also possess the ability to invent new economically valuable tasks, possibly more quickly than humans can think and innovate. The idea that there are always jobs for humans to do, while historically true, is not a law of nature <span class="citation"
data-cites="harari2016homo">[6]</span>.</p>
<p>Historically, routine tasks have been the primary target for automation. As AIs approach human-level intelligence, cognitive nonroutine jobs <span class="citation" data-cites="dvorkin2017growing">[5]</span>, which require education and are typically high-paying, would also become automated. Programmers, researchers, and artists are already augmenting their productivity using large language models, which will likely continue to become more capable. One way the future of work could play out is that a shrinking group of high-skilled workers who excel at managing or using AIs, or who offer otherwise exceptional skills, remain employed, while the vast majority of people become unemployable.</p>
<p><strong>An autonomous economy could operate without human labor.</strong> The transition to a fully automated economy may not stem from one defining moment, but from an accumulation of small economic decisions. Imagine the following story: as AIs become increasingly capable, humans delegate an increasing number of business tasks to them. Initially, they handle emails and scheduling, but soon manage organizational accounting and communications with clients and partners. The AIs can be trained and deployed quickly and make fewer errors than humans. Over time and due to competitive pressures, businesses feel the need to automate more of their operations, and AIs begin handling complex cognitive tasks like strategy, innovation, and leadership. Soon, the economy is mostly made up of AIs communicating with other AIs, operating at a pace and complexity that humans can't keep up with, further motivating the automation of all economic activity. Eventually, the economy is able to operate without need for human contribution or oversight. Humans, on the other hand, have become reliant on AIs for basic needs, societal governance, and even social interactions. Humans have little reason to learn, work, or aspire to create, and their survival depends on the beneficence of AIs.</p>
<p><strong>Excessive automation could lead to social decay.</strong> If the majority of the population became unemployed, they would have lost their sources of income, causing unprecedented levels of poverty and inequality (see section 6.4 Economic Engine for further discussion). Automation may also reduce social wellbeing through the loss of meaning or enfeeblement: the loss of abilities and values that make human life worthwhile. Overcoming challenges through effort is an essential way that humans find purpose and meaning, and this often comes through work or learning. Absent the need to cultivate skills for work, humans may shy away from challenges to pursue instant gratification. Personal achievement and growth are deeper forms of wellbeing than momentary pleasure. In an autonomous economy, areas of life like personal expression, communication, and emotional resilience may get delegated to the domain of AI assistants, and people may lose the ability to, or never learn to, independently process their emotions or connect with other humans.</p>
<p><strong>Wages could collapse if full automation is reached.</strong> Some researchers have argued that under plausible assumptions, we might see wages continue to grow for a significant period of time, but that at high levels of automation wages would collapse. Under this model, as AI systems improve, an increasing share of the tasks performed by human beings would be automated. Eventually automation would surpass a certain critical threshold where labor is no longer scarce, even if some tasks are still performed by humans. Once this threshold is passed and capital can fully substitute for labor, wages would decline rapidly.</p>
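<p>The sketch below illustrates this qualitative pattern with a toy task-based model, not the specific model the researchers propose. Output is assumed to be a Cobb-Douglas aggregate over a continuum of tasks; a fraction beta of tasks is automated and shares a fixed capital stock K, while the remaining tasks share a fixed labor supply L. The closed-form wage, the parameter values, and the assumption that capital is abundant but fixed are all illustrative choices.</p>
<pre><code># Toy illustration (assumptions, not the model referenced in the text):
# with Cobb-Douglas output over tasks, a fixed capital stock K spread over
# the automated fraction beta of tasks, and a fixed labor supply L spread
# over the rest, the competitive wage is w(beta) = (K*(1-beta)/(beta*L))**beta.
# When capital is abundant, wages first rise with automation (productivity
# effect) and then collapse as beta approaches 1 and labor stops being scarce.

K, L = 10_000.0, 1.0   # arbitrary: capital abundant relative to labor

def wage(beta):
    return (K * (1.0 - beta) / (beta * L)) ** beta

for beta in (0.10, 0.50, 0.90, 0.99, 0.999, 0.9999):
    print(f"automated share {beta:7.4f}   wage {wage(beta):10.2f}")
</code></pre>
<p>With these numbers, the wage rises from about 3 at 10 percent automation to several hundred at 90 percent, then falls back to roughly its pre-automation level by 99.99 percent automation and collapses toward zero as automation becomes complete, mirroring the rise-then-collapse pattern described above.</p>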
<p>Governments may seek to redistribute wealth from the owners of AI systems to the rest of the population in order to address poverty and inequality. If this led to people being free to spend their time on what they most value, this could be a positive change, provided we can address challenges around purpose and enfeeblement in a world without work. Relevant policies are discussed further in the Governance chapter.</p>
<h2 id="distribution-of-access-to-ais">8.3.2 Distribution of Access to
AIs</h2>
<p>Issues of access to AI are closely related to the question of distribution of costs and benefits. Some have argued that if access to AI systems is broadly distributed across society rather than concentrated in the hands of a few companies, the benefits of AI would be more evenly shared across society. Here, we will discuss broader access to AI through open source models, and narrower access through restricted models, as well as striking a balance between the two through structured access. We will examine the safety implications of each level of access.</p>
<h3 id="levels-of-access">Levels of Access</h3>
<p><strong>Restricted AI models concentrate power.</strong> Restricted
AI models are those that can only be used by a small group of people.
They may, for example, be exclusively accessible to people working in
private companies, government agencies, or a small group of people with
national security clearances. Restricted models cannot be used by the
general public.</p>
<p>While only some AIs may be restricted, it is possible that a
significant number of highly capable AIs will be tightly restricted. If
all AIs, or at least the most powerful ones, are restricted models, then
power would be concentrated in the hands of a small number of people.
This could
increase the risk of value lock-in, where the values of that small group
of people would be promoted and perpetuated, potentially irreversibly,
even if they did not adequately represent the values and interests of
the larger human population.</p>
<p><strong>Open-source AI models increase the risks of malicious
use.</strong> Open-source AI models are those that are freely available
for anyone to use. There are no restrictions on what people can use them
for, or how users can modify them. These AI systems would proliferate
irreversibly. Since open-source models are, by definition, accessible to
anyone who can run them without restrictions on how they can be used,
there is an increased probability of malicious use.</p>
<p><strong>If information security is poor, all AI systems are
effectively open-source.</strong> Robust cybersecurity will be required
to prevent unintended users from accessing powerful AIs. Inadequate
protections will mean that AIs are implicitly open-source even if they
are not intended to be, because they will likely be leaked or
stolen.</p>
<p><strong>Structured access.</strong> One possible option for striking
a balance between keeping AIs completely private and making them fully
open-source would be to adopt a <em>structured access approach</em>.
This is where the public can access an AI, but with restrictions on what
they can use it for and how they can modify it. There may also be
restrictions on who is given access, with “Know Your Customer” policies
for verifying users’ identities. In this scenario, the actor controlling
the AI has ultimate authority over who can access it, how they can
access it, what they can use it for, and if and how they can modify it.
They can also grant access selectively to other developers to integrate
the AI within their own applications, with consideration of these
developers’ safety standards.</p>
<p>
One practical way of implementing structured access is to have users
access an AI via an application programming interface (API). This
indirect usage facilitates controls on how the AI can be used and also
prevents users from modifying it. The rollout of GPT-3 in 2020 is an
example of this style of structured access: the large language model was
stored in the cloud and available for approved users to access
indirectly through a platform controlled by OpenAI.</p>
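<p>As a concrete illustration of what structured access can look like in code, the sketch below shows a minimal, hypothetical API gateway that verifies a user's identity, checks an allow-list of permitted use cases, and logs every query, while the model weights never leave the provider. All class names, policies, and checks here are invented for illustration; this is not the API of any particular provider.</p>
<pre><code># Hypothetical sketch of structured access enforced at an API layer.
# Every name and policy here is an illustrative assumption, not a real
# provider's implementation.
from dataclasses import dataclass, field

@dataclass
class User:
    user_id: str
    kyc_verified: bool                   # "Know Your Customer" identity check
    allowed_use_cases: set = field(default_factory=set)

class StructuredAccessGateway:
    """Mediates all access to the model: weights stay with the provider."""

    def __init__(self, model_fn, audit_log):
        self.model_fn = model_fn         # model runs server-side only
        self.audit_log = audit_log       # enables monitoring for misuse

    def query(self, user, use_case, prompt):
        if not user.kyc_verified:
            raise PermissionError("identity not verified")
        if use_case not in user.allowed_use_cases:
            raise PermissionError("use case not permitted: " + use_case)
        self.audit_log.append((user.user_id, use_case, prompt))
        return self.model_fn(prompt)     # user never touches the weights

# Usage with a stand-in "model"
log = []
gateway = StructuredAccessGateway(lambda p: "[model output for: " + p + "]", log)
alice = User("alice", kyc_verified=True, allowed_use_cases={"summarization"})
print(gateway.query(alice, "summarization", "Summarize this report."))
</code></pre>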
<h3 id="openness-norms">Openness Norms</h3>
<p>Traditionally, the norm in academia has been for research to be
openly shared. This allows for collaboration between researchers in a
community, enabling faster development. While openness may be a good
default position, there are certain areas where it may be appropriate to
restrict information sharing. We will now discuss the circumstances
under which these restrictions might be justified and their relevance to
AI development.</p>
<p><strong>There are historical precedents for restricting information
sharing in dual-use research.</strong> Dual-use technologies are those
that can be used for both beneficial and harmful purposes. It is not a
new idea that information about the development of such technologies
should not be widely shared. In the 1930s, publication of research on
the nuclear chain reaction, which could be used for both nuclear power
and nuclear weapons, prompted a Nazi program developing the latter. The
Manhattan Project was then conducted in secrecy to avoid enemy
intelligence learning of any breakthroughs. Biotechnology has seen
debates about the appropriate level of openness, with concerns around
the publication of papers detailing potential methods for creating
dangerous pathogens, which might in the future be used as bioweapons.</p>
<p>
Powerful AI would be a dual-use technology and there is therefore a need
for serious consideration of who can be trusted with it. Absolute
openness means implicitly trusting anyone who has the necessary hardware
to use AIs responsibly. However, there could in future be many people
with sufficient means to deploy AIs, and it might only take one person
with malicious intent to cause a catastrophe.</p>
<p><strong>Technological progress may be too fast for regulations to
keep up.</strong> Another reason for restricting information sharing is
the pacing problem, where technological progress happens too quickly for
policymakers to devise and implement robust controls on a technology’s
use. This means that we cannot rely on regulations and monitoring to
prevent misuse in an environment where information that could enable
misuse is being openly shared.</p>
<p><strong>It may be difficult to predict the type of research that is
potentially dangerous.</strong> Within AI research, there are different
kinds of information, such as the model weights themselves and the
methods of building the system. There have been cases where the former
has been restricted for safety reasons but the latter openly shared.
However, it seems feasible that information on how to build dangerous
AIs could also be used to cause harm.</p>
<p>
Moreover, it can be difficult to predict exactly how insights might be
used and whether they are potentially dangerous. For instance, the
nuclear reactor, which could help society create more sustainable
energy, was instrumental in developing a cheaper version of the atomic
bomb. It is possible that AIs designed for seemingly harmless tasks
could be used to propel the advancement of potentially dangerous AIs. We
may not be able to predict every way in which technologies that are
harmless in isolation might combine to become hazardous.</p>
<p><strong>Since there are costs to restrictions, it is worth
considering when they are warranted.</strong> Any interventions to
mitigate the risk of misuse of AIs are likely to come at a cost, which
may include users’ freedom and privacy, as well as the beneficial
research that could be accelerated by more open sharing. It is therefore
important to think carefully about which kind of restrictions are
justified, and in which scenarios.</p>
<p>
It might, for example, be worth comparing the number of potential
misuses and how severe they would be with the number of positive uses
and how beneficial they would be. Another factor that could be taken
into account is how narrowly targeted an intervention could be, namely
how accurately it could identify and mitigate misuses without
interfering with positive uses.</p>
<p>
Restrictions on the underlying capabilities of an AI (or the
infrastructure supporting these) tend to be more general and less
precisely targeted than interventions implemented downstream. The latter
may include restrictions on how a user accessing an AI indirectly can
use it, as well as laws governing its use. However, upstream
restrictions on capabilities or infrastructure may be warranted under
specific conditions. They may be needed if interventions at later stages
are insufficient, if the dangers of a capability are particularly
severe, or if a particular capability lends itself much more to misuse
than positive use.</p>
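<p>One rough way to operationalize this comparison is sketched below: weigh the expected harm a restriction would avert against the expected benefit it would forgo, scaled by how precisely the intervention is targeted. The numbers and the simple linear weighting are placeholder assumptions, intended only to make the trade-off concrete.</p>
<pre><code># Rough sketch of the comparison described above; every number is a
# placeholder assumption rather than an estimate from the text.

def net_value_of_restriction(misuses, benefits, misuse_coverage, benefit_collateral):
    """misuses / benefits: lists of (probability, magnitude) pairs.
    misuse_coverage: fraction of expected misuse harm the restriction prevents.
    benefit_collateral: fraction of expected benefits it blocks as a side effect.
    A narrowly targeted intervention has high coverage and low collateral."""
    expected_harm = sum(p * m for p, m in misuses)
    expected_benefit = sum(p * m for p, m in benefits)
    return misuse_coverage * expected_harm - benefit_collateral * expected_benefit

misuses = [(0.01, 1000.0), (0.10, 50.0)]   # rare-but-severe plus common-but-mild
benefits = [(0.90, 100.0)]

# A targeted downstream restriction vs. a blunt upstream one
print("targeted:", net_value_of_restriction(misuses, benefits, 0.9, 0.1))
print("blunt:   ", net_value_of_restriction(misuses, benefits, 0.95, 0.6))
</code></pre>
<p>Under these placeholder numbers, the narrowly targeted restriction comes out positive while the blunt upstream one does not, matching the observation that upstream restrictions are warranted only when the stakes are especially severe or later interventions are insufficient.</p>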
<h3 id="risks-from-open-vs-controlled-models">Risks From Open vs
Controlled Models</h3>
<p>Open models would enable dangerous members of the general public to
engage in harmful activities. Tightly controlled models exacerbate the
risk that their creators, or elites with special access, could misuse
them with impunity. We will examine each possibility.</p>
<p><strong>Powerful, open AIs lower the barrier to entry for many
harmful activities.</strong> There are multiple ways in which
sophisticated AIs could be harnessed to cause widespread harm. They
could, for example, lower the barrier to entry for creating biological
and chemical weapons, conducting cyberattacks like spear phishing on a
large scale, or carrying out severe physical attacks using lethal
autonomous weapons. Individuals or non-state actors wishing to cause
harm might adapt powerful AIs to harmful objectives and unleash them, or
use them to generate a deluge of convincing disinformation that
undermines trust and creates a more fractured society.</p>
<p><strong>More open AI models increase the risk of bottom-up
misuse.</strong> Although the majority of people do not seek to bring
about a catastrophe, there are some who do. It might only take one
person pursuing malicious intentions with sufficient means to cause a
catastrophe. The more people who have access to highly sophisticated
AIs, the more likely it is that someone will try to use them to pursue
a negative outcome. This would be a case of <em>bottom-up misuse</em>,
where a member of the general public leverages technology to cause
harm.</p>
<p><strong>Some AI capabilities may be skewed in favor of offense over
defense.</strong> It could be argued that AIs can also be used to
improve defenses against these various threats. However, some misuse
capabilities may be skewed in favor of offense rather than defense. For
example, it may be much easier to create and release a deadly pathogen
than to control it or come up with cures or vaccines. Even if an AI were
to facilitate faster vaccine development, a bioweapon could still cause
a great deal of harm in a short timeframe, leading to many deaths before
a vaccine could be developed and rolled out.</p>
<p><strong>Releasing highly capable AIs to the public may entail a risk
of black swans.</strong> Although numerous risks associated with AIs
have been identified, there may be more that we are unaware of. AIs
themselves might even discover more technologies or ways of causing harm
than humans have imagined. If this possibility were to result in a black
swan event (see section 4.7 for a deeper
discussion of black swans), it would likely favor offense over defense,
at least to begin with, as decision makers would not immediately
understand what was happening or how to counteract it.</p>
<p><strong>More tightly controlled models increase the risk of top-down
misuse.</strong> In contrast with bottom-up misuse by members of the
public, <em>top-down misuse</em> refers to actions taken by government
officials and elites to pursue negative outcomes. If kept in the hands
of a small group of people, powerful AIs could be used to lock in those
people’s values, without consideration of the interests of humanity more
broadly. Powerful AIs could also increase governments’ surveillance
capacity, potentially facilitating democratic backsliding or
totalitarianism. Furthermore, AIs that can quickly generate large
quantities of convincing content and facilitate large-scale censorship
could hand much greater control of media narratives to people in power.
In extreme cases, these kinds of misuse by governments and elites could
enable the establishment of a very long-lasting or permanent dystopian
civilization.</p>
<h2 id="distribution-of-power-among-ais">8.3.1 Distribution of Power Among
AIs</h2>
<p>The final dimension we will consider is how power might be distributed among advanced AI systems. Assuming that we reach a world with AI systems that generally surpass human capabilities, how many such systems should we expect there to be? We will contrast two scenarios: one in which a single AI has enduring decisive power over all other AIs and humans, and one in which there are many different powerful AIs. We will look at the factors that could make each situation more likely to emerge, the risks we are most likely to face in each case, and the kinds of policies that might be appropriate to mitigate them.</p>
<h3 id="ai-singleton">AI Singleton</h3>
<p>One possible future scenario is the emergence of an AI singleton: an
AI with vastly greater power than all others, to the extent that it can
permanently secure its power over the others <span class="citation"
data-cites="bostrom2006singleton">[1]</span>.</p>
<p><strong>A monopoly on AI could make a single powerful AI more
likely.</strong> One factor affecting the number of powerful AIs that
emerge is the number of actors that can independently develop AIs of
similar capabilities. If a single organization, whether a government or
a corporation, were to achieve a monopoly on the development of highly
sophisticated AI, this would increase the likelihood of a single AI
emerging with decisive and lasting power over all individuals.</p>
<p><strong>A fast takeoff could make a single powerful AI more
likely.</strong> If an AI were to undergo a fast takeoff, where its
capabilities suddenly grew to surpass other intelligences, then it could
prevent other existing AIs from going through the same process. Such an
AI might be motivated to destroy any potential threats to its power and
secure permanent control, ensuring it could pursue its goals unimpeded.
On the other hand, if intelligence were to progress more gradually, then
there would not be a window of time where any single AI was sufficiently
more powerful than the others to halt their further development. Note,
however, that a fast takeoff does not necessitate one AI becoming a
permanent singleton. That is because AIs may still be vulnerable even if
they are extremely powerful. Simple structures can take down more
complex structures; just as humans are vulnerable to pathogens and
chemical weapons, simpler AIs (or humans) might be able to counteract
more capable AIs.</p>
<p><strong>An AI singleton could reduce competitive pressures and solve
collective action problems.</strong> If an AI singleton were to emerge,
the actor in control of it would not face any meaningful competition
from other organizations. In the absence of competitive pressures, they
would have no need to try to gain an advantage over rivals by rushing
the development and deployment of the technology. This scenario could
also reduce the risk of collective action problems in general. Since one
organization would have complete control, there would be less potential
for dynamics where different entities chose not to cooperate with one
another (as discussed in the previous chapter <span>Collective Action Problems</span>),
leading to a negative overall outcome.</p>
<p><strong>An AI singleton increases the risk of single points of
failure.</strong> In a future scenario with only one superintelligent
AI, a failure in that AI could be enough to cause a catastrophe. If, for
instance, it were to start pursuing a dangerous goal, then it might be
more likely to achieve it than if there were other similarly powerful
AIs that could counteract it. Similarly, if a human controlling an AI
singleton would like to lock in their values, they might be able to do
so unopposed. Therefore, an AI singleton could represent a single point
of failure.</p>
<p><strong>An AI singleton could increase the risk of human
disempowerment.</strong> If there were just one superintelligent AI and
it sought to capture global power, it would not have to overpower other
superintelligent AIs in order to do so. If, instead, there were multiple
powerful AIs, humans might be able to cooperate with those that were
more willing to cooperate with humans. However, an AI singleton would
have little reason to cooperate with humans, as it would not face any
competition from other AIs. This scenario would therefore increase the
risk of disempowerment of humanity.</p>
<h3 id="diverse-ecosystem-of-ais">Diverse Ecosystem of AIs</h3>
<p>An alternative possibility is the emergence of a diverse ecosystem of
similarly capable AIs, in which no single agent is significantly and
sustainably more powerful than all the others combined. An AI singleton
might also not occur if there is turnover amongst the most powerful AIs
due to the presence of vulnerabilities. Just as human empires rise and
fall, AIs may gain and lose power to others.</p>
<p><strong>Declining development costs could make multiple AIs more
likely.</strong> If the costs associated with developing AIs diminish
considerably over time, then more actors will be able to develop AIs
independently of one another. Also, if there aren’t increasing returns
from intelligence in many economic niches, then many businesses will
settle for AIs that are just capable enough for their needs. For instance, an AI intended to
cook fast food may not benefit from knowing advanced physics. This
increases the probability of a future where multiple AIs coexist.</p>
<p><strong>A diverse ecosystem of AIs might be more stable than a single
superintelligence.</strong> There are reasons to believe that a diverse
ecosystem of AIs would be more likely to establish itself over the long
term than a single superintelligence. The general principle that
variation improves resilience has been observed in many systems. In
agriculture, planting multiple varieties of crops reduces the risk that
all of them will be lost to a single disease or pest. Similarly, in
finance, having a wide range of investments reduces the risk of large
financial losses. Essentially, a system comprising many entities is less
vulnerable to collapsing if a single entity within it fails.</p>
<p>
There are multiple additional advantages that a diverse ecosystem of AIs
could have over a single superintelligence. Variation within a
population means that individuals can specialize in different skills,
making the group as a whole better able to achieve complex goals that
involve multiple different tasks. Such a group might also be generally
more adaptable to different circumstances, since variation across
components could offer more flexibility in how the system interacts with
its environment. The “wisdom of the crowds” theory posits that groups
tend to make better decisions collectively than any individual member of
a group would make alone. This phenomenon could be true of groups of
AIs. For all these reasons, a future involving a diverse ecosystem of
AIs may be more able to adapt and endure over time than one where a
single powerful AI gains decisive power.</p>
<p><strong>Diverse AIs could remove single points of failure.</strong>
Having multiple diverse AIs could dilute the negative effects of any
individual AI failing to function as intended. If each AI were in charge
of a different process, then they would have less power to cause harm
than a single AI that was in control of everything. Additionally, if a
malicious AI started behaving in dangerous ways, then the best chance of
preventing harm might involve using similarly powerful AIs to counteract
it, such as through the use of “watchdog AIs” tasked with detecting such
threats. In contrast with a situation where everything relies on a
single AI, a diverse ecosystem reduces the risk of single points of
failure.</p>
<p><strong>Multi-agent dynamics could lead to selfish traits.</strong>
Having a group of diverse AIs, as opposed to just one, could create the
necessary conditions for a process of evolution by natural selection to
take effect (for further detail, see section 7.5). This might cause AIs to evolve
in ways that we would not necessarily be able to predict or control. In
many cases, evolutionary pressures have been observed to favor selfish
traits in biological organisms. The same mechanism might promote AIs
with undesirable characteristics.</p>
<p><strong>Diverse AIs could increase the risk of unanticipated
failures.</strong> A group of AIs interacting with one another would
form a complex system, and could therefore exhibit collective emergent
properties that could not be predicted from understanding the behavior
of just one. A group of AIs might therefore increase the risk of black
swan events (see section 4.7 for details). Additionally,
interactions between AIs could form feedback loops, increasing the
potential for rapid downward spirals that are difficult to intervene in
and stop. A group of powerful AIs in control of multiple military processes
could, for example, present a risk of a flash war (as discussed in
section 1.3), resulting from a feedback loop of adversarial
reactions.</p>
<p><strong>Diverse AI ecosystems could exhibit failure modes of AI
singletons.</strong> If multiple AI systems collude with one another, or
if inequality amongst AIs is significant such that one or a few are much
more powerful than others, risks will mirror those of an AI singleton.
We will examine why collusion and inequality may occur, and the
implications.</p>
<p><strong>Multiple AIs may or may not collude.</strong> It has been
proposed that if there were multiple highly capable AIs, they would
collude with one another, essentially acting as a single powerful AI
<span class="citation" data-cites="Drexler2019">[2]</span>. This is not
inevitable. The risk of collusion depends on the exact environmental
conditions.</p>
<p>Some circumstances that make collusion more likely and more successful
include (a simple repeated-game sketch after the two lists below
illustrates the first of these factors):</p>
<ol>
<li><p>A small number of actors being involved.</p></li>
<li><p>Collusion being possible even if some actors cease to
participate.</p></li>
<li><p>Colluding actors being similar, for example in terms of their
characteristics and goals.</p></li>
<li><p>Free communication between actors.</p></li>
<li><p>Iterated actions, where each actor can observe what another has
done and respond accordingly in their next decision.</p></li>
<li><p>All actors being aware of the above circumstances.</p></li>
</ol>
<p>Conversely, some circumstances that impede collusion include:</p>
<ol>
<li><p>A large number of actors being involved.</p></li>
<li><p>A requirement for every single actor to participate in order for
collusion to succeed.</p></li>
<li><p>Dissimilarity among colluders, perhaps having different histories
and conflicting goals.</p></li>
<li><p>Limited communication between actors.</p></li>
<li><p>Processes involving only one step where actors cannot observe
what other actors have done and respond in a future action.</p></li>
<li><p>Uncertainty about the above circumstances.</p></li>
</ol>
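<p>The first factor, the number of actors, can be illustrated with a standard repeated-game benchmark from economics: symmetric price-setting agents who split a collusive payoff each round and revert to competition forever if anyone defects (grim-trigger strategies). In that benchmark, collusion is sustainable only if each agent's discount factor is at least 1 - 1/n, so more actors demand more patience. Applying it to AIs is an analogy and an assumption on our part, not a result from the text.</p>
<pre><code># Standard repeated-game benchmark (symmetric price competition with
# grim-trigger strategies), applied to AI agents only by analogy: each of
# n colluders earns 1/n of the collusive payoff per round, a defector
# grabs the whole payoff once and earns nothing afterwards, so collusion
# holds only if the discount factor is at least 1 - 1/n.

def critical_discount_factor(n_agents):
    """Minimum patience needed to sustain collusion among n symmetric agents."""
    return 1.0 - 1.0 / n_agents

for n in (2, 5, 20, 100):
    print(f"{n:4d} agents: collusion requires a discount factor of at least "
          f"{critical_discount_factor(n):.3f}")
</code></pre>
<p>Two agents can sustain collusion with a discount factor of 0.5, while a hundred agents need 0.99, illustrating why a small number of similar, freely communicating actors engaged in iterated interactions is the easier setting for collusion.</p>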
<p><strong>Power among AIs may be distributed unequally.</strong> The
power of AI systems may follow a long-tail distribution, analogous to
the distribution of wealth among humans in the US. It is therefore
important to note that even if we have many diverse AIs of similar
capabilities, power may still end up being concentrated in just a few
that have a slight edge, and the impact of AI may be largely determined
by only a few. There are situations short of an AI singleton where power
is mainly concentrated in one or a few AIs.</p>
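<p>To see how quickly a long-tail distribution concentrates power, the sketch below samples "power" for a population of hypothetical AIs from a Pareto distribution, a common stand-in for long-tailed quantities such as wealth. The population size and the shape parameter are arbitrary assumptions used only for illustration.</p>
<pre><code># Illustrative only: draw "power" for 10,000 hypothetical AIs from a
# Pareto (long-tail) distribution and measure how concentrated it is.
# The shape parameter alpha = 1.2 is an arbitrary assumption; smaller
# alpha means a heavier tail and therefore more concentration.
import random

random.seed(0)
alpha = 1.2
powers = sorted((random.paretovariate(alpha) for _ in range(10_000)), reverse=True)

total = sum(powers)
top_one_percent = sum(powers[:100])
print(f"share of total power held by the top 1%: {top_one_percent / total:.1%}")
</code></pre>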
<h3 id="conclusions"> Conclusions about Distribution</h3>
<p>In this section, we examined concerns about how the effects of AI are distributed across society. First, we considered how the costs and benefits of AIs might be distributed, including societal-scale risks. In the short term, automation can lead to economic growth, but if human-level AI is developed, this would result in the effective end of human labor. In such a world, people may struggle to find purpose and become dependent on AIs.</p>
<p>Second, we examined varying levels of access to AIs from open source to highly restricted private models. Since AI systems can be used in dangerous ways, traditional openness norms in research likely need to be reconsidered. Both centralization and decentralization of access to AIs carry risks.</p>
<p>Third, we discussed how we should expect to see power distributed across AI systems. Will we eventually see a singleton with decisive and lasting power, a diverse ecosystem of AIs with varying degrees of power, or something in between? These questions could have important implications both for the questions about access and benefits discussed above, as well as for how we go about managing risks from AI.</p>
<p>In the rest of this chapter, we turn to discussing various strategies for governing AI and mitigating some of the risks it may pose. We explore how various stakeholders can contribute to good governance and effective risk management within companies developing AI systems, at the level of national policy and regulation, and in international cooperation between different states and AI developers. We conclude by considering the role of controls over inputs to AI systems as a means of governing AI and managing some of its risks, focusing on the role of computing hardware as the most "governable" of the inputs used for modern AI systems.</p>
<br>
<br>
<h3>References</h3>
<div id="refs" class="references csl-bib-body" data-entry-spacing="0"
role="list">
<div id="ref-bostrom2006singleton" class="csl-entry" role="listitem">
<div class="csl-left-margin">[1] N.
Bostrom, <span>“What is a singleton?”</span> <em>Linguistic and
Philosophical Investigations</em>, vol. 5, no. 2, pp. 48–54, 2006.</div>
</div>
<div id="ref-Drexler2019" class="csl-entry" role="listitem">
<div class="csl-left-margin">[2] K.
E. Drexler, <span>“Reframing superintelligence: Comprehensive AI
services as general intelligence,”</span> Future of Humanity Institute,
2019.</div>
</div>
<div id="ref-brynjolfsson2022turing" class="csl-entry" role="listitem">
<div class="csl-left-margin">[3] E.
Brynjolfsson, <span>“The turing trap: The promise & peril of
human-like artificial intelligence,”</span> <em>Daedalus</em>, vol. 151,
no. 2, pp. 272–287, 2022, Available: <a
href="https://digitaleconomy.stanford.edu/news/the-turing-trap-the-promise-peril-of-human-like-artificial-intelligence/">https://digitaleconomy.stanford.edu/news/the-turing-trap-the-promise-peril-of-human-like-artificial-intelligence/</a></div>
</div>
<div id="ref-acemoglu2020robots" class="csl-entry" role="listitem">
<div class="csl-left-margin">[4] D.
Acemoglu and P. Restrepo, <span>“Robots and jobs: Evidence from US labor
markets,”</span> <em>Journal of political economy</em>, vol. 128, no. 6,
pp. 2188–2244, 2020.</div>
</div>
<div id="ref-dvorkin2017growing" class="csl-entry" role="listitem">
<div class="csl-left-margin">[5] M.
Dvorkin and H. Shell, <span>“The growing skill divide in the US labor
market,”</span> <em>Federal Reserve Bank of St. Louis: On the Economy
Blog</em>, vol. 18, 2017.</div>
</div>
<div id="ref-harari2016homo" class="csl-entry" role="listitem">
<div class="csl-left-margin">[6] Y.
N. Harari, <em>Homo deus: A brief history of tomorrow</em>. Random
House, 2016.</div>
</div>
<div id="ref-hendrycks2023overview" class="csl-entry" role="listitem">
<div class="csl-left-margin">[7] D.
Hendrycks, M. Mazeika, and T. Woodside, <span>“An Overview of
Catastrophic AI Risks.”</span> 2023. Available: <a
href="https://arxiv.org/abs/2306.12001">https://arxiv.org/abs/2306.12001</a></div>
</div>
<div id="ref-acemoglu2020does" class="csl-entry" role="listitem">
<div class="csl-left-margin">[8] D.
Acemoglu, A. Manera, and P. Restrepo, <span>“Does the US tax code favor
automation?”</span> National Bureau of Economic Research, 2020.</div>
</div>
<div id="ref-acharya2016end" class="csl-entry" role="listitem">
<div class="csl-left-margin">[9] V.
V. Acharya, D. Anginer, and A. J. Warburton, <span>“The end of market
discipline? Investor expectations of implicit government
guarantees,”</span> <em>Investor Expectations of Implicit Government
Guarantees (May 1, 2016)</em>, 2016.</div>
</div>
</div>