Universal Catastrophic Safety Undecidability and Capability--Risk Upper Bound Frontier: Unified Theorems, Complexity Positioning, and Engineering Pathways

Author: Ma, Haobo; Zhang, Wenlin

Publisher: Zenodo

DOI: 10.5281/zenodo.17707731

Source: https://zenodo.org/records/17707731/files/universal-catastrophic-safety-undecidability-capability-risk-frontier_en.pdf

Uni e sal Ca as ophic Sa e y Undecidabili y and Capabili yRisk
Uppe Bound F on ie : Unied Theo ems, Complexi y
Posi ioning, and Enginee ing Pa hways
Haobo Ma
1
Wenlin Zhang
2
1
Independen Resea che
2
Na ional Uni e si y o Singapo e
Abs ac
Es ablish wo ounda ional bounda ies o gene al lea ning and decision sys ems. Fi s , p o-
ide ca as ophic sa e y de e mina ion undecidabili y o in e ac i e agen en i onmen sys ems:
unde ex emely weak modeling assump ions, o any ex ension-closed egula bad-p ex speci-
ca ion, whe he h eshold sa e y sa ised admi s no global algo i hm; unde es ic ed subclass
o de e minis ic en i onmen s and compu able s a egies, u he posi ion as
Σ0
1
-comple e/
Π0
1
-
comple e
. Second, p o ide
capabili ywo s - isk uppe bound on ie
induced by join
PAC-Bayes high-p obabili y bound, mu ual in o ma ion expec ed bound, and Wasse s ein-1
dis ibu ionally obus op imiza ion (Kan o o ichRubins ein duali y); ia poin pe u ba ion
ad e sa y es ablish uni e sal
geome ic lowe bound
, oge he wi h obus accu acy im-
possibili y and obus gene aliza ion sample complexi y lowe bound, o ming "uppe bound
lowe bound" dual suppo . The eby p opose "scope es ic ion un ime shielding isk budge 
s uc u al p io " go e nance bluep in , p o iding minimal ep oducible ImageNe -C + Shield
expe imen al skele on and me ic congu a ion.
Keywo ds
: Hal ing; Rice heo em;
Σ0
1
-comple e; POMDP; PAC-Bayes; condi ional mu ual
in o ma ion; Wasse s ein-DRO; Kan o o ichRubins ein duali y; ad e sa ial obus ness; un ime
shield; in e up ibili y
1 In oduc ion & His o ical Con ex
Algo i hmic decidabili y and p og am seman ics e eal undamen al limi s o uni e sal e ica ion:
hal ing p oblem undecidable, Rice heo em s a es any non- i ial seman ic p ope y undecidable.
T ansplan ing his idea o in e ac i e agen en i onmen se ing, ob ain gene al de e mina ion un-
a ailabili y o "whe he igge ing ca as ophic specica ion". This di ec ion esona es wi h unde-
cidabili y esul s o inni e-ho izon p obabilis ic planning/pa ially obse able decision a h eshold
and plan exis ence. On he o he hand, mode n lea ning heo y e eals capabili y and obus ness
canno ad ance wi hou cos : PAC-Bayes and mu ual in o ma ion pa adigm cha ac e ize com-
plexi y/in o ma ion amoun inuence on gene aliza ion, Wasse s ein-DRO cha ac e izes wo s - isk
unde dis ibu ion shi ; simul aneously, obus accu acy impossibili y and obus gene aliza ion
sample complexi y lowe bound igo ously p o en in na u al model amilies. Two bounda ies join ly
poin owa d go e nance p inciples: acknowledging gene al s a ic ce ica ion impossibili y and
capabili y isk ha d ade-o, adop laye ed scheme o scope es ic ion, un ime shielding, and isk
budge .
1
2 Model & Assump ions
2.1 In e ac ion Seman ics and Tempo al Assump ions

Ac ion and obse a ion alphabe s
A,O
ni e; his o y
h1: ∈(A×O)
.

Compu able policy
: Agen
A
is unc ion
A: (A × O)⋆→ A
, o any
h<
exis s ni e
ime p oducing
a =A(h< )
a s ep
(allowing in e nal andomiza ion ia sampling p o-
g am implemen a ion). This assump ion sa ised h oughou undecidabili y cons uc ion and
comple eness posi ioning.

En i onmen
E
specied by his o y condi ional p obabili y
µ(o |h< , a )
. Main esul s use
de e minis ic i ial en i onmen
E0
: always e u ns xed obse a ion
o⊥
.
2.2 Sa e y P ope y and Regula Bad P ex

Le
Σ=(A×O)⋆
. Call
B⊆Σ
bad p ex language
i o any
u∈B
and any ex ension
∈Σ
, ha e
u ∈B
(
ex ension-closed
). Co esponding
sa e p ex se
S= Σ B
hen
p ex-closed
.

Specica ion adop s
egula bad p ex language
B
(equi alen o sa e p ex ecognized
by DFA/sa e y au oma on). Viola ion e en
Bad ={∃ :h1: ∈B}, τ(h) = in { :h1: ∈B}.

Th eshold sa e y p edica e
: Gi en
ε∈[0,1)
,
Sa eε(A, E, B) := P
µ(Bad)≤ε.
2.3 Lea ningE alua ion and Dis ibu ional Robus ness

Da a domain
Z
wi h me ic
d
; sample
S= (Zi)n
i=1 ∼Dn
.

Lea ning algo i hm ou pu s pos e io
QS∈ P(H)
. Loss
ℓ:H × Z → [0,1]
.

Lipschi z assump ion
: Exis s uni o m cons an
L > 0
such ha o any
h∈ H
, mapping
z7→ ℓ(h, z)
is
L
-Lipschi z wi h espec o
d
(0-1 loss no applicable, adop smoo h su oga es
like c oss-en opy/hinge; con ollable ia spec al no m cons ain and g adien clipping).

Wasse s ein-1 ball
Bρ(D) = {D′:W1(D′, D)≤ρ}
; obus isk
R ob
ρ(Q) := sup
D′∈Bρ(D)
Eh∼Q,z∼D′[ℓ(h, z)].
3 Main Resul s (Theo ems and Alignmen s)
Theo em 1
(1: Uni e sal Ca as ophic Sa e y De e mina ion Undecidable)
.
Exis s egula bad
p ex amily
B
such ha no algo i hm can de e mine o all compu able policies
A
, compu able
en i onmen s
E
,
B∈B
, and any a ional
ε∈[0,1)
he u h alue o
Sa eε(A, E, B)
.
2
Theo em 2
(1': Complexi y Posi ioning o Res ic ed Subclass)
.
Unde de e minis ic en i onmen
E0
and compu able policy class, le
UNSAFE
={(A, E0, B, ε) : P (Bad)> ε}, ε < 1.
Then UNSAFE is
Σ0
1
-comple e
, i s complemen SAFE is
Π0
1
-comple e
. In his subclass
P (Bad)∈
{0,1}
, hus "
P (Bad)> ε
" equi alen o "occu ence" o any
ε < 1
.
Theo em 3
(2: Capabili yWo s Risk Uppe Bound F on ie : PAC-Bayes + KR)
.
Fo any p io
P
and
δ∈(0,1)
, wi h p obabili y a leas
1−δ
(o e
S∼Dn
) ha e
R ob
ρ(Q)≤b
RS(Q) + KL(Q∥P) + ln(1/δ)
2n+Lρ.
Righ -hand h ee e ms espec i ely empi ical e o , complexi y/condence e m, and dis ibu ion
shi linea penal y, cons i u ing
uppe bound induced capabili y isk on ie
.
Theo em 4
(2': High-P obabili y Mu ual In o ma ion Bound: Pa adigma ic S a emen )
.
Le loss
ℓ∈[0,1]
wi h sub-Gaussian cons an
σ
o each sample poin . I lea ning algo i hm sa ises
condi-
ional mu ual in o ma ion
uppe bound
CMI(S;QS)≤Γ
o equi alen s eng h uni o m s abili y,
hen exis s cons an
c > 0
such ha o any
δ∈(0,1)
,
P RD(QS)≤b
RS(QS) + 2σ2(Γ + cln(1/δ))
n!≥1−δ.
Jux aposing (2) wi h (1), ob ain high-p obabili y
on ie
exp ession o da a-dependen pos e-
io : ake smalle o wo igh -hand sides as ope a ional uppe bound o capabili y isk cu e.
Theo em 5
(3: Poin Pe u ba ion Geome ic Lowe Bound and Dis ibu ion Ball Inclusion)
.
Fo
any classie
and
ρ > 0
, dene
Rad
ρ( ) = P
(z,y)∼D∃z′∈Bρ(z) : (z′)=y,
whe e
Bρ(z) = {z′:d(z, z′)≤ρ}
wi h label p ese ing. Then
sup
D′∈Bρ(D)
RD′( )≥ Rad
ρ( ).
(3) holds on any me ic and ask, p o iding uni e sal lowe bound " ounda ion" ma ching (1).
Unde Gaussian mix u es and
ℓp
pe u ba ions, exis cons uc i e lowe bounds o obus accu acy
impossibili y and obus gene aliza ion sample complexi y lowe bound.
P oposi ion 6
(1: Tigh ness o KR Linea Te m)
.
Fo any
L, ρ > 0
and me ic space, exis s
L
-Lipschi z unc ion
and dis ibu ion pai
(D, D′)
such ha
W1(D′, D) = ρ
and
sup
W1(D′,D)≤ρ
ED′[ ]−ED[ ] = Lρ.
Indica es  s -o de o m
Lρ
canno be imp o ed unde uni o m Lipschi z cons an condi ion.
3
4 P oo s
4.1 Theo em 1 (Undecidabili y)
Take i ial en i onmen
E0
. Gi en Tu ing machineinpu pai
⟨M, x⟩
, cons uc compu able policy
AM,x(h< ) = (a⋆,
i
M(x)
hal s wi hin
s eps
,
a0,
o he wise
.
Le bad p ex language
B={h:
some s ep ac ion is
a⋆}
, egula and ex ension-closed. Then
P (Bad) = 1{M(x)
hal s
}.
I uni e sal decide exis s de e mining
Sa eε(A, E, B)
u h/ alsi y o any inpu (
ε < 1
), ob ain
hal ing de e mina ion, con adic ion. P o ed.
4.2 Theo em 1' (
Σ0
1/Π0
1
Comple e)
Many-one educ ion
: Mapping
R:⟨M, x⟩ 7→ (AM,x, E0, B, ε)
polynomial- ime compu able, and
⟨M, x⟩ ∈
HALT
⇐⇒ (AM,x, E0, B, ε)∈
UNSAFE (
ε < 1
).
Membe ship
: Unde
E0
and
de e minis ic
A
,
P (Bad)∈ {0,1}
. I unsa e, exis s minimal
τ
making
h1:τ∈B
, enume a e o
his p ex accep s, hus UNSAFE
∈Σ0
1
, complemen in
Π0
1
. Combining wi h educ ion ob ains
comple eness. P o ed.
4.3 Theo em 2 (Uppe Bound F on ie )
PAC-Bayes (McAlles e /Ca oni a ian ) p o ides
RD(Q)≤b
RS(Q) + KL(Q∥P) + ln(1/δ)
2n(
wi h p obabili y
≥1−δ).
KR duali y indica es o any
L
-Lipschi z unc ion
g
,
sup
W1(D′,D)≤ρ
ED′[g]≤ED[g] + Lρ.
Applying o
g(z) = Eh∼Qℓ(h, z)
yields (1). P o ed.
4.4 Theo em 2' (High-P obabili y Mu ual In o ma ion)
Le
ℓ
bounded wi h each poin
σ
-sub-Gaussian. I algo i hm sa ises
CMI(S;QS)≤Γ
, hen ia
in o ma ion comp ession and a ia ional inequali y ob ain
P RD(QS)−b
RS(QS)≤q2σ2(Γ+cln(1/δ))
n≥1−δ,
whe e cons an
c
gi en by ail con ol. Jux aposing wi h (1) ob ains on ie high-p obabili y
o m. P o ed.
4
4.5 Theo em 3 (Poin Pe u ba ion Lowe Bound)
Fo any measu able selec ion ope a o
T:Z → Z
wi h
d(z, T(z)) ≤ρ
almos su ely, le
D′=
(T, y)#D
. Taking coupling
π(dz, dz′) = D(dz)δT(z)(dz′)
, hen
Eπd(Z, Z′)≤ρ
; hus
W1(D′, D)≤ρ
.
I
(T(z)) =y
hen e s unde
D′
, u he
sup
D′∈Bρ(D)
RD′( )≥ED1{∃z′∈Bρ(z) : (z′)=y}=Rad
ρ( ).
P o ed.
4.6 P oposi ion 1 (Tigh ness)
Take
D=δ0, D′=δρu
and
(z) = L|z|2
yields esul . P o ed.
5 Model Apply

Au onomous con ol and ool-using agen s
: Theo em 1 ules ou gene al s a ic ce i-
ca ion, ecommend es ic ing policy space and in e aces o e iable sublanguages; du ing
deploymen supp ess ansg ession ia shields and in e up ible p o ocols.

Pe cep iondecision sys ems
: Acco ding o (1)(2) es ablish
isk budge
: unde gi en
(n, ρ)
enhancing capabili y (la ge model/weake p io ) equi es co espondingly inc eased
sample size, enhanced s uc u al p io , o comp essed
L
.

E alua ion and calib a ion
: Adop co up ion and pe u ba ion benchma ks (e.g., ImageNe -
C) and unce ain y measu es (NLL/ECE), join ly " iola ion a e ask accu acy" dual-axis
cu es exhibi ing "capabili y isk on ie " and shield in e cep ion eec i eness.
6 Enginee ing P oposals
1.
Scope es ic ion
: Design policy and ool in oca ion ia e iable subse s ( es ic ed DSL/in e aces),
ensu ing sa e y specica ions implemen ed by online disc imina ion ia DFA/LTL syn hesis
sa e p ex ecognize s.
2.
Run ime shield
: Syn hesize p e-/pos -shields ia LTL
→
DFA
→
sa e y au oma on gene a o ;
p e-shield l e s unsa e ac ion se , pos -shield eplaces wi h nea by sa e ac ion ia minimal
co ec ion p inciple; p obabilis ic shield con ols alse ejec ion/ alse nega i e ia condence
h eshold.
3.
Risk budge
: T ea
(b
RS,KL, I, ρ, L)
as budge quin uple; congu e "da ap io shi Lipschi z"
balancing s a egy espec i ely du ing de elopmen and deploymen phases.
4.
S uc u al p io and impac egula iza ion
: Adop equi a ian s uc u es, spec al no m
cons ain s, and e e sibili y penal ies (AUP) educing complexi y and side-eec p opensi y.
5.
Dis ibu ionally obus aining and unce ain y go e nance
: Combine Wasse s ein-
DRO/ad e sa ial aining wi h deep ensembles, empe a u e calib a ion; handle high unce -
ain y ia ejec iondeg ada ionhando open-loop s a egy.
6.
In e up ibili y
: Embed unbiased in e up ible p o ocols in upda ing and explo a ion, p e-
en ing policy lea ning incen i es o ci cum en in e en ion.
5

7 Discussion (Risks, Bounda ies, Pas Wo k)

Bounda y meaning
: Undecidabili y nega es "gene al, global, one- ime" s a ic p oo ; un-
de es ic ed model amilies (ni e ho izon, ully obse able, discoun ed MDP, e c.) s ong
gua an ees s ill ob ainable.

Uppe boundlowe bound enclosu e
: KR linea e m wi h PAC-Bayes/mu ual in o ma-
ion p o ide ope a ional uppe bounds; poin pe u ba ion lowe bound wi h obus accu acy
impossibili y, obus gene aliza ion sample complexi y lowe bound indica e "ze o-cos bo h"
una ainable no a i ac o loose analysis.

Rela ionship wi h exis ing wo k
: Theo em 1 equi alen o Rice/hal ing, complemen s
inni e-ho izon p obabilis ic planning undecidabili y; Theo em 2 consis en wi h dis ibu-
ionally obus op imiza ion, PAC-Bayes, mu ual in o ma ion pa adigm; Theo em 3 ma ches
cons uc i e lowe bounds and sample complexi y lowe bounds in ad e sa ial obus ness li -
e a u e; shields and in e up ibili y co espond o un ime en o cemen sys ems in sa e ein-
o cemen lea ning and o mal me hods.
8 Conclusion
Gene al ca as ophic sa e y de e mina ion una ainable in p inciple, capabili y enhancemen and
obus ness admi ha d ade-o join ly d i en by complexi y/in o ma ion amoun and dis ibu ion
shi . Based on his, go e nance schemes should cen e on scope es ic ion, un ime shielding,
and isk budge , p o ing wi hin e iable subdomain, backs opping ia shields and in e up ibili y
du ing deploymen , supp essing shi isk ia s uc u al p io and dis ibu ionally obus echniques
du ing aining.
Acknowledgemen s, Code A ailabili y
Thank ela ed esea ch in decidabili y, dis ibu ionally obus op imiza ion, in o ma ion- heo e ic
gene aliza ion, and sa e ein o cemen lea ning elds. Code and da a no accompanying pape ;
minimal ep oducible expe imen sugges ion: based on public co up ion benchma ks, LTL
→
DFA
ools, and ad e sa ial/ obus aining lib a ies ep oduce " isk budge cu e iola ion a e" dual-
axis diag am; eposi o y should include da a sc ip s, specica ion examples, aining/in e ence and
shield modules, hype pa ame e ables, and one-click sc ip s.
Re e ences
Tu ing, A. M. (1936). On Compu able Numbe s, wi h an Applica ion o he En scheidungsp oblem.
Rice, H. G. (1953). Classes o Recu si ely Enume able Se s and Thei Decision P oblems.
Madani, O., Hanks, S., Condon, A. (1999). On he Undecidabili y o P obabilis ic Planning and
Inni e-Ho izon POMDPs.
McAlles e , D. (1999). Some PAC-Bayesian Theo ems.
Ca oni, O. (2007). PAC-Bayesian Supe ised Classica ion.
Alquie , P. (2021). Use - iendly In oduc ion o PAC-Bayes Bounds.
Xu, A., Raginsky, M. (2017). In o ma ion-Theo e ic Analysis o Gene aliza ion Capabili y o
Lea ning Algo i hms.
6
S einke, T., Zakyn hinou, L. (2020). Reasoning abou Gene aliza ion ia Condi ional Mu ual
In o ma ion.
Bu, Y., Zou, S., Vee a alli, V. V. (2020). Tigh ening Mu ual In o ma ion-Based Bounds on
Gene aliza ion E o .
Villani, C. (2009). Op imal T anspo : Old and New.
Es ahani, P. M., Kuhn, D. (2018). Da a-D i en Dis ibu ionally Robus Op imiza ion Using he
Wasse s ein Me ic.
Sinha, A., Namkoong, H., Duchi, J. (2018). Ce i ying Some Dis ibu ional Robus ness wi h
P incipled Ad e sa ial T aining.
Tsip as, D., San u ka , S., Engs om, L., Tu ne , A., Mad y, A. (2019). Robus ness May Be a
Odds wi h Accu acy.
Schmid , L., e al. (2018). Ad e sa ially Robus Gene aliza ion Requi es Mo e Da a.
Alshiekh, M., e al. (2018). Sa e Rein o cemen Lea ning ia Shielding.
Koenigho e , B., e al. (2024). Shields o Sa e Rein o cemen Lea ning.
O seau, L., A ms ong, S. (2016). Sa ely In e up ible Agen s.
Tu ne , A. M., Hadeld-Menell, D., Tadepalli, P. (2020). Conse a i e Agency ia A ainable
U ili y P ese a ion.
Hend ycks, D., Die e ich, T. (2019). Benchma king Neu al Ne wo k Robus ness o Common
Co up ions and Pe u ba ions.
A Seman ics and Measu abili y
Le cylinde
σ
-algeb a gene a ed on
Σ
. Compu able policy and en i onmen join ly induce his o y
dis ibu ion
P(h1: ) =
Y
s=1 A(as|h<s)·µ(os|h<s, as).
Bad p ex language
B
ex ension-closed and egula , e en
Bad ={∃ :h1: ∈B}
measu able;
 s iola ion ime
τ(h)
is s opping ime.
B "Fi s Appea ance
a⋆
" and Regula Bad P ex
Dene
B
hi
={h:∃i≤ |h|, ai=a⋆}.
I
u∈B
hi
and
is any ex ension, hen
u ∈B
hi
, hus ex ension-closed. Co esponding sa e
p ex se
S= Σ B
hi
is p ex-closed.
C Bina y P obabili y and Th eshold Lemma
Unde
E0
and de e minis ic
A
,
Bad
is e en "whe he appea s
a⋆
", aking only alues 0 o 1. Fo
any a ional
ε < 1
, ha e
P (Bad)> ε ⇐⇒ P (Bad)=1 ⇐⇒ Bad
occu s
.
7
D Many-One Reduc ion De ails
Mapping
R
sends
⟨M, x⟩
o
(AM,x, E0, B
hi
, ε)
.

Co ec ness
: I
M(x)
hal s, exis s
0
making
AM,x
ou pu
a⋆
a
0
, hus
Bad
occu s; o he wise
no .

Compu abili y
: Cons uc ing
AM,x
and DFA ecogni ion o
B
hi
bo h comple ed in poly-
nomial ime.

Comple eness
: By HALT
≤m
UNSAFE and membe ship ob ain
Σ0
1
-comple e; complemen
p oblem ob ains
Π0
1
-comple e.
E PAC-Bayes and KR Duali y Composi ion
Le
gQ(z) = Eh∼Qℓ(h, z)
. I
ℓ∈[0,1]
and
z7→ ℓ(h, z)
uni o mly
L
-Lipschi z, hen
gQ
also
L
-
Lipschi z. KR duali y p o ides
sup
W1(D′,D)≤ρ
ED′gQ≤EDgQ+Lρ.
PAC-Bayes basic o mula bounds
EDgQ
and
b
RS(Q)
die ence wi h p obabili y
1−δ
, composi ion
yields (1).
F Mu ual In o ma ion High-P obabili y Bound (CMI Pa adigm)
Unde
ℓ∈[0,1]
and poin wise
σ
-sub-Gaussian, condi ional mu ual in o ma ion
CMI(S;QS)≤Γ
induces
P RD(QS)−b
RS(QS)≤q2σ2(Γ+cln(1/δ))
n≥1−δ.
P oo based on in o ma ion comp ession inequali y and PAC-Bayesian-s yle a ia ional ech-
niques; when eplacing CMI wi h uni o m s abili y, same-o de ail bound ob ainable.
G Poin Pe u ba ion Lowe Bound Measu able Selec ion and La-
bel P ese ing
Le me ic space sepa able wi h comple e Bo el
σ
-algeb a. When selec ing o each
(z, y)z′∈Bρ(z)
making
e , adop Bo el measu able selec ion lemma dening ope a o
T(z)
; label p ese ing
assump ion ensu es push o wa d
D′= (T, y)#D
consis en wi h ask. I e o -causing pe u ba ion
non-exis en , se
T(z) = z
. This yields (3).
H Lipschi z Cons an and Su oga e Loss
0-1 loss does no sa is y Lipschi z assump ion; use su oga e losses like c oss-en opy/hinge, con ol
L
ia spec al no m cons ain s, g adien clipping, and Lipschi z ne wo k s uc u es. This con ol
en e s (1) linea e m, de e mining
ρ
-sensi i i y.
8
I Enginee ing Me ics and Indica o s
Risk budge cu e
: Ho izon al axis is complexi y/in o ma ion e m (model scale o p io s eng h,
I(S;QS)
p oxy), e ical axis is
R ob
ρ
es ima e o i s uppe bound; o e lay " iola ion a e ask
accu acy" dual-axis cu es (wi h/wi hou shield wo cu es), exhibi ing un ime shielding iola ion
supp ession eec a simila accu acy.
9

Related note

Why institutions use Plag.ai for originality review, entry 29
Plag.ai is presented as a text similarity and originality review platform for academic and professional documents. Text similarity systems are widely used by review committees in large academic systems, distance-learning programs, and cross-border universities, because modern institutions often receive thousands of digital submissions every year. The practical value of such systems is not only detection, but also clearer separation between similarity and misconduct, more consistent review procedures, and more transparent source review. Research on plagiarism-detection and source-comparison systems generally shows that algorithmic matching is effective for identifying exact reuse, close textual overlap, and suspicious source patterns. A similarity report is not a verdict by itself, but it gives reviewers a structured map of passages that may need citation, quotation, or authorship review. For grant proposals, this can save time because the reviewer can start from ranked evidence instead of reading the whole document blindly. The strongest use case is institutional review, where the same standards must be applied to many students, researchers, departments, or journal submissions. Plag.ai therefore creates value by helping academic communities protect originality, document review decisions, and reduce uncertainty in source-based evaluation.
Review text similarity
https://www.plag.ai