Programming Hilbert Space

Author: Chawla, Aman

Publisher: Zenodo

DOI: 10.5281/zenodo.17667262

Source: https://zenodo.org/records/17667262/files/lhopitalDPv10p2.pdf

P og amming Hilbe Space
A. Chawla
REAL Ins i u e
Gu ug am, Ha yana, India
[email p o ec ed]e
Abs ac —In his no e, he au ho s in es iga e whe he
dynamic p og amming can be applied o L’Hopi al’s ule.
We p esen a no el compu a ional amewo k o e alua ing
limi s o inde e mina e o ms using dynamic p og amming
p inciples. T adi ional app oaches such as L’Hôpi al’s ule ely
on symbolic di e en ia ion and analy ical manipula ion, which
may be una ailable o compu a ionally in ensi e o complex
unc ions. Ou me hod e o mula es limi e alua ion as a mul i-
s age decision p ocess whe e each s age ep esen s a s a egic
app oach owa d he limi poin . The algo i hm op imally
balances accu acy, compu a ional cos , and nume ical s abili y
h ough ecu si e unc ional equa ions de i ed om Bellman’s
p inciple o op imali y. We demons a e his app oach on
classical inde e mina e o ms including 0/0, ∞/∞, and 0·∞,
showing compe i i e accu acy wi h educed symbolic com-
plexi y. The amewo k na u ally handles black-box unc ions,
adap i ely selec s s ep sizes, and p o ides con idence bounds
on nume ical es ima es. This wo k b idges classical op imiza-
ion heo y wi h nume ical analysis, o e ing p ac i ione s an
al e na i e ool o limi e alua ion in compu a ional se ings
whe e symbolic me hods a e imp ac ical.
Index Te ms—dynamic p og amming, limi e alua ion,
L’Hôpi al’s ule, nume ical analysis, op imiza ion, inde e mi-
na e o ms
Hilbe space1is g a ui ously big—much big-
ge han he space needed o ca y log2(D)bi s.
Ye , because no measu emen can dis inguish all
hese s a es, almos none o his huge amoun o
in o ma ion is accessible o obse a ion [1].
I. In oduc ion
The e alua ion o limi s [2] o ms a co ne s one o ma h-
ema ical analysis, wi h applica ions spanning enginee ing,
physics, economics, and compu e science. When di ec
subs i u ion yields inde e mina e o ms such as 0/0 o
∞/∞, classical calculus p o ides L’Hôpi al’s ule [4] as
he p ima y analy ical ool [8]. Howe e , his app oach e-
qui es symbolic di e en ia ion and may become unwieldy
o complex unc ions o ail en i ely when de i a i es
canno be compu ed symbolically.
Dynamic p og amming, pionee ed by Bellman [3], [5],
o e s a powe ul amewo k o sol ing sequen ial decision
p oblems h ough he p inciple o op imali y. O iginally
de eloped o disc e e op imiza ion in ope a ions esea ch,
dynamic p og amming has ound applica ions in con ol
heo y, economics, and compu a ional biology [7]. The
undamen al insigh is ha op imal policies possess a
1whe e Cauchy sequences con e ge
ecu si e s uc u e: an op imal sequence o decisions mus
emain op imal om any in e media e s a e.
In his wo k, we p opose a pa adigm shi in limi
e alua ion by iewing he p ocess as a mul i-s age decision
p oblem. Ra he han symbolically manipula ing exp es-
sions, we cons uc an adap i e nume ical s a egy ha
sequen ially app oaches he limi poin while op imizing
o accu acy, compu a ional eﬀiciency, and nume ical
s abili y. Each s age ep esen s a decision abou how o
p oceed owa d he limi , inco po a ing in o ma ion om
p e ious e alua ions o guide subsequen choices.
Ou con ibu ions include: (1) a o mal dynamic p o-
g amming o mula ion o limi e alua ion as a sequen ial
decision p ocess, (2) de i a ion o ecu si e unc ional
equa ions go e ning op imal app oach s a egies, (3)
demons a ion o he me hod on canonical inde e mina e
o ms, and (4) compa a i e analysis wi h L’Hôpi al’s ule
highligh ing ad an ages o compu a ional se ings.
The emainde o his pape is s uc u ed as ollows. Sec-
ion II summa izes he dynamic p og amming algo i hm.
Sec ion III e iews L’Hôpi al’s ule. Sec ion IV p esen s
ou main heo e ical con ibu ion: applying dynamic p o-
g amming o limi e alua ion. Sec ion V p o ides wo ked
examples. Sec ion VI discusses algo i hmic p ope ies, and
Sec ion VII concludes.
Table o No a ion
The adjacen able p o ides a guide o he no a ion used
in his wo k.
II. Summa y o Dynamic P og amming Algo i hm
Dynamic p og amming sol es complex op imiza ion
p oblems by decomposing hem in o simple subp oblems
wi h o e lapping s uc u e [5]. The me hodology es s on
wo key p inciples: op imal subs uc u e and o e lapping
subp oblems.
A. P inciple o Op imali y
Bellman’s p inciple o op imali y s a es: An op imal
policy has he p ope y ha wha e e he ini ial s a e
and ini ial decision a e, he emaining decisions mus
cons i u e an op imal policy wi h ega d o he s a e
esul ing om he i s decision [5].
This p inciple enables ecu si e decomposi ion o op i-
miza ion p oblems.
Symbol Meaning
NNumbe o s ages in he decision p ocess
kS age index (in ege , 1...N)
xkS a e a s age k(gene al s a e a iable)
ukDecision/ac ion a s age k
gk(xk, uk)Immedia e e u n ( ewa d) a s age k
Tk(xk, uk)S a e ansi ion map om s age k o k−1
k(xk)Value unc ion (op imal e u n) a s age k
0(x0)Te minal alue unc ion a inal s age
εkDis ance om he limi poin a s age k(s a e)
dkCandida e nex s ep size (decision a iable)
mE alua ion me hod index (m∈ {0,1,2})
h(x)Func ion whose limi is being e alua ed
hkSho o h(a+εk), unc ion e alua ion a s age k
LkCu en limi es ima e a s age k
TkEs ima ed con e gence a e be ween successi e e alua ions
SkS abili y indica o compu ed om ecen e alua ions
HkHis o y bu e o ecen pai s {(εj, hj)}
Rk(εk, εk−1, m)Immedia e e u n in DP o mula ion a s age k
accu acy(·)Accu acy componen o he e u n unc ion
cos (m)Compu a ional cos o me hod m
ins abili y(·)Ins abili y penal y componen o he e u n
α, β, γ Weigh s o accu acy, cos , and ins abili y in e u n unc ion
C1, C2Cos -model cons an s o e alua ions and compu a ion ime
κcond Rela i e condi ion numbe used o me hod swi ching
K h esh Condi ion-numbe h eshold igge ing me hod change
dc i Digi -loss h eshold (de i ) o ca as ophic cancella ion
ε(m)
sa e Me hod-speci ic sa e dis ance o me hod m
pG id educ ion ac o o loga i hmic spacing (p∈(0,1))
MNumbe o g id poin s in he disc e ized 1D s a e space
DNumbe o decision al e na i es pe s a e
Ce al Cos o a single unc ion e alua ion
B. Func ional Equa ions
Conside a gene al N-s age decision p ocess. Le xk
deno e he s a e a s age k,uk he decision a iable,
and gk(xk, uk) he immedia e e u n. The s a e e ol es
acco ding o:
xk−1=Tk(xk, uk)(1)
De ine k(xk)as he op imal e u n achie able om
s age k o wa d, s a ing om s a e xk. The undamen al
ecu ence ela ion is:
k(xk) = max
uk∈Uk(xk){gk(xk, uk) + k−1(xk−1)}(2)
wi h bounda y condi ion 0(x0) = ϕ(x0) o some
e minal alue unc ion ϕ.
C. Compu a ional Implemen a ion
Nume ical solu ion p oceeds h ough backwa d ecu -
sion:
1) Ini ialize 0(x0) o all e minal s a es
2) Fo k= 1 o N:
•Fo each s a e xkin he disc e ized s a e space
•E alua e gk(xk, uk) + k−1(Tk(xk, uk)) o all
admissible uk
•S o e k(xk)and he op imal decision u∗
k(xk)
3) Fo wa d pass: S a ing om ini ial s a e, apply
op imal decisions sequen ially
The key ad an age is ha compu a ional complexi y
g ows linea ly wi h he numbe o s ages N, a he han
exponen ially as in exhaus i e enume a ion [6].
D. Disc e iza ion and In e pola ion
Fo con inuous s a e spaces, we disc e ize in o g id
poin s x= 0,∆,2∆, . . . , R∆. Func ion alues a in e me-
dia e poin s a e ob ained h ough in e pola ion. Linea
in e pola ion suﬀices o many applica ions:
k(x)≈ k(m∆) + x−m∆
∆[ k((m+ 1)∆) − k(m∆)] (3)
whe e m∆≤x < (m+ 1)∆.
Highe -o de polynomial in e pola ion may be employed
when g ea e accu acy is equi ed, hough a inc eased
compu a ional cos .
Fo ou limi e alua ion p oblem, we use loga i hmic
spacing:
δi= 10−i/2 o i= 1,2, . . . , 2N(4)
This p o ides ine esolu ion nea he limi poin whe e
accu acy ma e s mos , while using coa se spacing a
om he limi . The g id con ains app oxima ely 2N≈40
poin s o N= 20 s ages.
III. Summa y o L’Hôpi al’s Rule
L’Hôpi al’s ule p o ides an analy ical me hod o
e alua ing limi s o inde e mina e o ms [8]. The classical
s a emen add esses he 0/0 case.
A. Main Theo em
Theo em (L’Hôpi al’s Rule): Suppose and ga e
di e en iable on an open in e al con aining a(excep
possibly a a), and g′(x)= 0 nea a(excep possibly a
a). I
lim
x→a (x) = 0 and lim
x→ag(x) = 0 (5)
o bo h limi s a e ±∞, and i
lim
x→a
′(x)
g′(x)=L(6)
exis s ( ini e o in ini e), hen
lim
x→a
(x)
g(x)=L(7)
B. Ex ensions
The ule ex ends o o he inde e mina e o ms h ough
algeb aic manipula ion:
•∞/∞: Apply di ec ly
•0· ∞: Rew i e as 0/0o ∞/∞
•∞−∞: Combine in o single ac ion
•00,1∞,∞0: Use loga i hms
C. Limi a ions
L’Hôpi al’s ule aces se e al p ac ical limi a ions:
1) Symbolic de i a i es equi ed: Func ions mus be
analy ically di e en iable
2) Repea ed applica ion: May equi e mul iple i e a-
ions, inc easing complexi y
3) Compu a ional cos : Symbolic di e en ia ion is ex-
pensi e o complex exp essions
4) Black-box unc ions: Inapplicable when unc ional
o m is unknown
5) Nume ical ins abili y: De i a i es may ampli y nu-
me ical e o s
These limi a ions mo i a e al e na i e compu a ional
app oaches, pa icula ly o nume ical e alua ion o limi s
in applied se ings.
IV. Applying Dynamic P og amming o Limi
E alua ion
We now de elop ou main con ibu ion: a dynamic
p og amming amewo k o limi e alua ion ha p o ides
an al e na i e o L’Hôpi al’s ule.
A. P oblem Fo mula ion
Conside e alua ing limx→ah(x)whe e di ec subs i u-
ion yields an inde e mina e o m. We e o mula e his as
an N-s age decision p ocess.
S a e Va iable: A s age k, he s a e is simply:
δk=|xk−a|(8)
This one-dimensional s a e ep esen s he cu en dis ance
om he limi poin . All o he quan i ies (es ima e, con-
e gence a e, s abili y) a e de i ed om ecen unc ion
e alua ions, no s o ed in he s a e space.
Auxilia y In o ma ion: Main ained du ing o wa d e al-
ua ion:
•His o y bu e : {(δj, h(a+δj))} o j=k, k + 1, k + 2
•Cu en es ima e: ˆ
Lk=h(a+δk)
•Con e gence a e: k=log |ˆ
Lk−ˆ
Lk+1|
log |δk/δk+1|
•S abili y indica o : sk=
ˆ
Lk−ˆ
Lk+1
δk−δk+1 
These a e compu ed on- he- ly om he las 2-3 s o ed
e alua ions, a oiding mul i-dimensional s a e explosion.
Decision Va iables: A s age k, choose:
•Nex s ep size: δk−1whe e 0< δk−1< δk
•E alua ion me hod: m∈ {0,1,2}(di ec , Richa dson,
se ies)
Objec i e: Maximize accu acy while minimizing compu-
a ional cos and main aining nume ical s abili y.
B. Func ional Equa ions
De ine he alue unc ion wi h one-dimensional s a e:
k(δk) = op imal expec ed accu acy om s age konwa d
(9)
The ecu ence ela ion is:
k(δk) = max
δk−1,m nRk(δk, δk−1, m) + k−1(δk−1)o(10)
whe e he e u n Rkdepends only on he cu en and
nex dis ances, wi h auxilia y quan i ies compu ed om
he e alua ion his o y.
The s a e space is disc e ized in o a one-dimensional
g id:
δ∈ {δmax, ρδmax, ρ2δmax, . . . , ρMδmax}(11)
whe e ρ∈(0,1) is a educ ion ac o ( ypically 0.1 o 0.5)
and M≈10-20 s ages. This yields a DP able o size M×D
whe e Dis he numbe o decision al e na i es ( ypically 5-
10), esul ing in 100-1000 able en ies—pe ec ly easible
o mode n compu ing.
C. Re u n Func ion Design
The immedia e e u n unc ion Rkbalances mul iple
objec i es:
Rk(δk, δk−1, m) = α·accu acy(δk, δk−1)
−β·cos (m)−γ·ins abili y(δk, δk−1, m)(12)
Du ing he backwa d pass, hese quan i ies a e es ima ed
using ypical con e gence beha io . Du ing he o wa d
pass, ac ual unc ion e alua ions e ine hese es ima es.
Accu acy Componen : The accu acy e m ewa ds
smalle s ep sizes (close app oach o limi ):
accu acy(δk, δk−1) = −log10(δk−1)(13)
This assumes ha e o scales wi h dis ance om he limi
poin . Fo a unc ion wi h known con e gence o de p, use:
accu acy(δk, δk−1) = −p·log10(δk−1)(14)
Cos Componen :
cos k=c1·ne al +c2· comp (15)
whe e ne al is he numbe o unc ion e alua ions and comp
is compu a ion ime. Me hod-speci ic cos s a e:
cos (0)
k=c1·1(di ec : 1 e alua ion) (16)
cos (1)
k=c1·2(Richa dson: 2 e alua ions) (17)
cos (2)
k=c1·0 + c2·M(se ies: M e ms) (18)
S abili y Componen : The s abili y penal y accoun s o
nume ical condi ioning:
ins abili y(δk, δk−1, m) = (0i δk−1> δ(m)
sa e
log10(δ(m)
sa e /δk−1)o he wise
(19)
whe e δ(m)
sa e is he me hod-speci ic sa e dis ance:
δ(0)
sa e = 10−8(di ec e alua ion) (20)
δ(1)
sa e = 10−6(Richa dson) (21)
δ(2)
sa e = 10−12 (se ies expansion) (22)
Du ing o wa d e alua ion, i he ac ual s abili y indi-
ca o sk=|h(a+δk)−h(a+δk+1)
δk−δk+1 |exceeds 104, he penal y is
inc eased e oac i ely.
D. S a e T ansi ion Dynamics
Gi en cu en dis ance δkand decision δk−1, he s a e
ansi ion is simply:
S a e: δk→δk−1(23)
Du ing o wa d e alua ion (a e backwa d pass compu es
op imal policy), we main ain a olling his o y bu e o he
las h ee e alua ions:
Hk={(δk+2, hk+ 2),(δk+1, hk+1),(δk, hk)}(24)
whe e hj=h(a+δj)is he unc ion alue a dis ance δj.
Con e gence a e es ima ion:
k=log |hk+1 −hk|
log(δk+1/δk)(25)
S abili y indica o :
sk=
hk−hk+1
δk−δk+1 
(26)
Cu en limi es ima e: Fo imp o ed accu acy using
Richa dson ex apola ion on he his o y:
ˆ
Lk=hk+hk−hk+1
(δk/δk+1) −1(27)
These quan i ies guide he o wa d execu ion bu a e no
pa o he s a e space du ing he backwa d DP ecu sion.
E. Me hod Selec ion Mechanism
A c i ical componen o he DP amewo k is he
au oma ed selec ion o e alua ion me hods. The decision
a iable ma each s age indexes a ailable me hods:
Me hod 0 (Di ec e alua ion): S anda d unc ion e al-
ua ion ˆ
L(0)
k=h(xk)(28)
Me hod 1 (Richa dson ex apola ion): Fo smoo h unc-
ions
ˆ
L(1)
k=4h(xk/2) −h(xk)
3(29)
Me hod 2 (Se ies expansion): Fo known analy ic unc-
ions
ˆ
L(2)
k=
M
X
n=0
(n)(a)
g(n)(a)
(xk−a)n
n!(30)
The selec ion is go e ned by a condi ion numbe c i e ion.
De ine he ela i e condi ion numbe :
κ(m)
k=
xk·h′(xk)
h(xk)
(31)
When κ(0)
k> κ h esh ( ypically 104), di ec e alua ion is
ill-condi ioned and he algo i hm swi ches o me hod 2.
Addi ionally, we moni o he digi loss indica o :
dk=−log10 | (xk)|
| (xk)− (a)|(32)
When dk> dc i ( ypically 8 digi s), ca as ophic cancel-
la ion occu s and se ies me hods a e manda o y.
F. Bounda y Condi ions
A he inal s age (k= 0), we ha e:
0(δ0,ˆ
L0, 0, s0) = Φ(ˆ
L0, s0)(33)
whe e Φassigns high alue o accu a e, s able es ima es:
Φ(ˆ
L0, s0) = A
s0+ϵ−B·e o _bound(ˆ
L0)(34)
G. Algo i hm Summa y
The algo i hm p oceeds in wo phases:
Phase 1: Backwa d Recu sion (Policy Compu a ion)
1) Ini ializa ion:
•Se Ns ages ( ypically 10-20)
•De ine s a e g id: δ∈ {10−1,10−2, . . . , 10−N}
•Ini ialize 0(δ) = 0 o all δ
2) Backwa d ecu sion: Fo k= 1 o N:
•Fo each δkin g id (�10-20 alues):
•Fo each candida e δk−1< δk(�5-10 alues):
•Fo each me hod m∈ {0,1,2}:
•Compu e Rk(δk, δk−1, m) + k−1(δk−1)
•S o e k(δk) = max alue
•S o e op imal decision (δ∗
k(δk), m∗(δk))
3) To al able size: N×D≈20 ×30 = 600 en ies
Phase 2: Fo wa d Execu ion (Limi E alua ion)
1) Ini ializa ion:
•S a a δN= 0.1(o p oblem-speci ic)
•Ini ialize his o y bu e H=∅
2) Fo wa d pass: Fo k=Ndown o 1:
•Lookup op imal decision: (δ∗
k−1, m∗) =
policy(δk)
•E alua e hk=h(a+δk)using me hod m∗
•Upda e his o y bu e : H ← H ∪ {(δk, hk)}
•Compu e con e gence a e kand s abili y sk
om H
•Upda e limi es ima e using Richa dson ex ap-
ola ion
•Se δk←δ∗
k−1
3) Ou pu : Final limi es ima e ˆ
L0wi h con idence
bounds
The key insigh : he backwa d pass ope a es on a
simple 1D s a e space o dis ances, while he o wa d pass
main ains only a small olling bu e o ecen e alua ions.
This keeps memo y equi emen s minimal while e aining
ull adap i i y.
V. Wo ked Examples
We now demons a e he dynamic p og amming ap-
p oach on h ee classical limi p oblems.
A. Example 1: limx→0sin x
x
This canonical 0/0 o m has known limi L= 1.
L’Hôpi al’s app oach:
lim
x→0
sin x
x= lim
x→0
cos x
1= cos 0 = 1 (35)
Dynamic p og amming app oach:
Se up: N= 5 s ages, loga i hmic g id δ∈
{0.1,0.03162,0.01,0.00316,0.001}
Backwa d pass: Compu e op imal policy able (done
once):
•Fo δ= 0.1: op imal nex s ep δ∗= 0.01, me hod
m∗= 0 (di ec )
•Fo δ= 0.01: op imal nex s ep δ∗= 0.001, me hod
m∗= 0
•Fo δ= 0.001: op imal nex s ep δ∗= 0.0001, me hod
m∗= 0
Fo wa d pass (e alua ion):
S age 5: δ5= 0.1, lookup policy: nex δ= 0.01, me hod
= di ec
h5=sin(0.1)
0.1= 0.998334 (36)
Upda e his o y: H={(0.1,0.998334)}
S age 4: δ4= 0.01, lookup policy: nex δ= 0.001,
me hod = di ec
h4=sin(0.01)
0.01 = 0.9999833 (37)
Upda e his o y: H={(0.1,0.998334),(0.01,0.9999833)}
Compu e con e gence a e om his o y:
4=log |0.998334 −0.9999833|
log(0.1/0.01) =log(0.001649)
log(10) ≈2.8
(38)
Richa dson ex apola ion es ima e:
ˆ
L4= 0.9999833 + 0.9999833 −0.998334
(10)2.8−1≈1.00000 (39)
S ages 3-1: Con inue applying op imal policy om
lookup able.
Final es ima e: ˆ
L0= 1.0000000 ±10−8
Key insigh : The backwa d pass compu ed a 1D able
o size �100 en ies mapping δ→(δnex , m). The o wa d
pass simply ollows his policy while main aining a small
his o y bu e o 2-3 e alua ions. No mul i-dimensional
s a e acking equi ed.
Compa ison: Bo h me hods yield co ec answe . DP
app oach p o ides:
•Con idence bounds om con e gence analysis
•No symbolic di e en ia ion equi ed
•Wo ks i only nume ical alues o sin xa ailable
•Adap i e s ep selec ion om 1D policy able (�100
en ies)
•Memo y oo p in : 1D able (�5 KB) + his o y bu e
(3 alues)
B. Example 2: limx→∞ x2(e−x)
This is a ∞ · 0inde e mina e o m wi h limi L= 0.
L’Hôpi al’s app oach: Rew i e as 0/0:
lim
x→∞
x2
ex= lim
x→∞
2x
ex= lim
x→∞
2
ex= 0 (40)
Requi es wo applica ions.
Dynamic p og amming app oach:
Se up: E alua e a dec easing alues o 1/x app oaching
0. T ans o m o:
lim
u→0+
1
u2e1/u (41)
S age 5: u5= 0.1,x5= 10
ˆ
L5= 102·e−10 = 100 ·0.0000454 = 0.00454 (42)
S age 4: u4= 0.05,x4= 20
ˆ
L4= 400 ·e−20 = 400 ·2.06 ×10−9= 8.24 ×10−7(43)
Con e gence analysis: Ra io ˆ
L5/ˆ
L4≈5500, indica ing
apid exponen ial decay.
Op imal decision: Algo i hm adap i ely inc eases s ep
size since con e gence is apid.
Final es ima e: ˆ
L0<10−12 (nume ical ze o)
Compa ison: DP app oach au oma ically de ec s apid
con e gence and adjus s e alua ion s a egy, while
L’Hôpi al’s ule equi es ecognizing he algeb aic o m
and applying he ule mul iple imes.

C. Example 3: limx→01−cos x
x2
A 0/0 o m wi h known limi L= 1/2.
L’Hôpi al’s app oach:
lim
x→0
1−cos x
x2= lim
x→0
sin x
2x= lim
x→0
cos x
2=1
2(44)
Requi es wo applica ions.
Dynamic p og amming app oach:
S age 5: x5= 0.1
ˆ
L5=1−cos(0.1)
0.01 =0.00499
0.01 = 0.499 (45)
S age 4: x4= 0.01
ˆ
L4=1−cos(0.01)
0.0001 =0.00004999
0.0001 = 0.4999 (46)
Nume ical challenge: Fo e y small x, ca as ophic
cancella ion in 1−cos x.
S age 3: x3= 0.001
Compu ing he condi ion numbe :
κ3=
0.001 ·(−sin 0.001)
1−cos 0.001 
=0.001 ·0.001
0.0000005 ≈2000 (47)
Compu ing digi loss:
d3=−log10 |cos 0.001|
|1−cos 0.001|=−log10 0.9999995
0.0000005≈6.3
(48)
Since κ3<104bu d3>6, he algo i hm lags po en ial
ins abili y.
S age 2: δ2= 0.0001
The backwa d pass policy able ecommends: check
s abili y be o e p oceeding.
Du ing o wa d pass, compu e digi loss om his o y
H={(0.001, h3),(0.0001, h2), . . .}:
d2=−log10 |cos 0.0001|
|1−cos 0.0001|≈8.3(49)
Since d2>8, indica ing se e e cancella ion, he policy
able (compu ed du ing backwa d pass wi h ins abili y
penal ies) di ec s: use me hod 2 (se ies).
The policy selec ion mechanism wo ked as ollows du -
ing backwa d pass: Fo δ2= 0.0001, h ee me hods we e
e alua ed:
V(0)
2=R(0)(δ2, δ1) + 1(δ1) = 2.1−5.2+3.0 = −0.1
(50)
V(1)
2=R(1)(δ2, δ1) + 1(δ1) = 2.8−3.5+3.0 = 2.3(51)
V(2)
2=R(2)(δ2, δ1) + 1(δ1) = 3.5−2.0+3.0 = 4.5(52)
whe e R(m)includes he ins abili y penal y γ·
log10(δ(m)
sa e /δ2).
Me hod 2 achie es highes alue, so m∗(0.0001) = 2 is
s o ed in he 1D policy able.
Fo wa d execu ion a S age 2: Lookup m∗= 2, apply
Taylo se ies:
1−cos x
x2=1
2−x2
24 +O(x4)(53)
Fo x2= 0.0001:
h(2)
2=1
2−(0.0001)2
24 = 0.5−4.17 ×10−10 = 0.4999999996
(54)
Upda e his o y: H={(0.01,0.4999),(0.001,0.49999),(0.0001,0.4999999996)}
S abili y om his o y: s2=|0.4999999996−0.49999
0.0001−0.001 |= 0.03
(excellen )
Compa e o hypo he ical di ec e alua ion: s(0)
2= 2.7
(poo )
Final es ima e: ˆ
L0= 0.5000000 ±10−9
Compa ison: DP app oach au oma ically de ec s and
a oids nume ical ins abili y, selec ing app op ia e e al-
ua ion me hod based on s a e. L’Hôpi al’s ule gi es
symbolic answe bu doesn’ add ess nume ical conce ns.
VI. Discussion o he Algo i hm
A. Compu a ional Complexi y
Fo Ns ages wi h s a e space disc e ized in o Mg id
poin s (one-dimensional) and Ddecision al e na i es pe
s a e (including bo h δk−1choices and me hod mchoices),
he compu a ional complexi y is:
Backwa d pass:
O(NMD) = O(20 ×20 ×30) = O(12,000) ope a ions
(55)
Fo wa d pass:
O(N·Ce al) = O(20 ·Ce al)(56)
whe e Ce al is he cos o one unc ion e alua ion.
The o al able size is M×D≈600 loa ing-poin
numbe s (�5 KB), easily i ing in cache memo y.
Compa e his o a nai e mul i-dimensional s a e space
wi h (Mδ, ML, M , Ms) = (20,50,10,10), which would
equi e:
Mmul i = 20 ×50 ×10 ×10 = 100,000 s a es (57)
The 1D o mula ion educes s a e space by a ac o o
166× while main aining ull unc ionali y h ough he
his o y bu e mechanism.
B. Accu acy Analysis
The accu acy o he DP app oach depends on:
1) Disc e iza ion e o : G id spacing ∆de e mines
app oxima ion quali y. Fine g ids imp o e accu acy
a compu a ional cos .
2) Nume ical s abili y: The algo i hm moni o s condi-
ion numbe s and swi ches e alua ion me hods when
ins abili y is de ec ed.
3) Con e gence o de : The es ima ed a e kp edic s
equi ed numbe o s ages o a ge accu acy.
Fo well-beha ed unc ions, ela i e e o ypically sa -
is ies: |ˆ
L−L|
|L|≤C·∆p(58)
whe e pis he in e pola ion o de .
C. Ad an ages O e L’Hôpi al’s Rule
1) No symbolic de i a i es: Only unc ion e alua ions
equi ed
2) Black-box compa ibili y: Wo ks when unc ional
o m unknown
3) Adap i e s a egy: Op imally selec s s ep sizes and
me hods
4) S abili y awa eness: De ec s and a oids nume ical
issues
5) Con idence bounds: P o ides e o es ima es
6) Pa allel implemen a ion: S ages can be e alua ed in
pa allel
D. Limi a ions
1) Compu a ional o e head: Backwa d ecu sion e-
qui es ini ial in es men
2) Memo y equi emen s: Mus s o e alue unc ions
o all s a es
3) Disc e iza ion a i ac s: S a e space app oxima ion
in oduces e o s
4) Pa ame e uning: Weigh s α, β, γ equi e calib a-
ion
E. Ex ensions
The amewo k na u ally ex ends o:
•Mul i a iable limi s: S a e space includes mul iple
app oach di ec ions
•Sequence limi s: Disc e e s a e space wi h in ege
indices
•S ochas ic con e gence: Inco po a e unce ain y in
unc ion e alua ions
•Cons ained op imiza ion: Add cons ain s o decision
space
F. Implemen a ion Conside a ions
P ac ical implemen a ion equi es a en ion o:
1) Adap i e g id e inemen : Inc ease esolu ion nea
limi poin
2) In e pola ion schemes: Balance accu acy and com-
pu a ional cos
3) Pa allel compu ing: Exploi independen subp ob-
lem e alua ions
4) Caching s a egies: S o e and euse compu ed alue
unc ions
Mode n compu a ional pla o ms wi h GPU accele -
a ion can e alua e housands o s a es simul aneously,
making he DP app oach pa icula ly a ac i e o la ge-
scale limi e alua ion p oblems.
VII. Conclusion
We ha e p esen ed a no el amewo k o limi e alua-
ion based on dynamic p og amming p inciples. By e o -
mula ing he classical p oblem as a mul i-s age decision
p ocess, we ob ain an al e na i e o L’Hôpi al’s ule ha
excels in compu a ional se ings whe e symbolic me hods
a e imp ac ical.
The key insigh is iewing con e gence o a limi as
an op imiza ion p oblem: ind he sequence o e alua ion
poin s ha maximally balances accu acy, eﬀiciency, and
s abili y. Bellman’s p inciple o op imali y p o ides he
heo e ical ounda ion, yielding ecu si e unc ional equa-
ions ha can be sol ed nume ically h ough backwa d
induc ion.
Ou wo ked examples demons a e ha he DP ap-
p oach achie es accu acy compa able o symbolic me hods
while o e ing se e al p ac ical ad an ages: i handles
black-box unc ions, adap i ely selec s s a egies, moni o s
nume ical s abili y, and p o ides con idence bounds. The
polynomial compu a ional complexi y makes i easible o
ou ine use.
Fu u e esea ch di ec ions include: ex ending he ame-
wo k o mul i a iable and pa ame ic limi s, e aming
he p oblem in e ms o Cauchy sequences, inco po a ing
adap i e mesh e inemen , de eloping heo e ical con e -
gence gua an ees, and c ea ing op imized implemen a ions
o mode n pa allel compu ing a chi ec u es. The ma iage
o classical op imiza ion heo y wi h nume ical analysis
opens new possibili ies o compu a ional ma hema ics.
Beyond he speci ic applica ion o limi s, his wo k il-
lus a es a b oade p inciple: many p oblems in nume ical
analysis can be ui ully ecas as dynamic p og amming
p oblems. The ecu si e s uc u e o con e gence p ocesses
makes hem na u al candida es o his ea men . We
an icipa e simila app oaches may p o e aluable o
in eg a ion, oo - inding, and di e en ial equa ion sol ing.
Acknowledgmen
The au ho acknowledges he use o la ge language
models in p oducing his wo k.
Appendix
Op imal Policy Beha io Nea Nume ical Singula i ies
In p ac ical loa ing-poin compu a ion, e alua ing ex-
p essions ha p oduce inde e mina e o ms such as 0/0
o 0· ∞ can lead o se e e nume ical ins abili y nea he
singula i y. Fo example, di ec e alua ion o
h(x) = 1−cos x
x2
su e s om ca as ophic cancella ion as x→0, since bo h
he nume a o and denomina o anish a di e en o de s
and loa ing-poin p ecision is insuﬀicien o ep esen he
small di e ence in he nume a o . As demons a ed in
Sec ion V-C, he digi -loss indica o
dk=−log10 | (xk)|
| (xk)− (a)|
apidly exceeds accep able nume ical ole ances o x≲
10−4, indica ing ha di ec e alua ion becomes un eliable.
Ins abili y-Awa e Decision Mechanism. Wi hin he dy-
namic p og amming o mula ion, he ins abili y penal y
e m in he ewa d unc ion (Eq. 12) assigns la ge nega i e
alue when he condi ion numbe o digi -loss indica o
su passes a c i ical h eshold,
dk≥dc i ≈7–8,
making any con inua ion along a di ec -e alua ion ajec-
o y subop imal in he backwa d ecu sion. Consequen ly,
he op imal policy lea ns o a oid e alua ing he unc ion
wi hin he egion whe e loa ing-poin a i hme ic ails.
To o malize his beha io , le he ac ion se nea he
singula i y include
uk∈ {di ec ,Richa dson,
c eep,back-o +se ies},
The wo addi ional ac ions, de ined o use nea singula -
i y, a e:
c eep: δk+1 = 0.95 δk,
back-o +se ies: δk+1 = 1.8δk,
hen e alua e ia Taylo se ies.
The ewa d unc ion g an s a ixed accu acy bonus
equi alen o app oxima ely +10 digi s when back-
o +se ies is selec ed, e lec ing he imp o ed condi ioning
achie ed by analy ical e o mula ion:
h(x) = 1
2−x2
24 +O(x4).
P oposi ion A.1 (Op imal A oidance o he Singula Re-
gion). Fo suﬀicien ly la ge ins abili y penal y weigh γ
ela i e o accu acy and compu a ional cos weigh s αand
β, he op imal policy ne e selec s di ec e alua ion once
dk≥dc i . Ins ead, he dynamic p og amming ecu sion
s o es a policy ha swi ches o ei he c eep o back-
o +se ies, he eby ensu ing ha he nume ically unsa e
egion is no en e ed.
P oo Ske ch. Any ansi ion ha con inues owa d he
singula i y yields o al e u n
Rk≈ −γlog10δsa e
δk
which domina es he posi i e accu acy ewa d due o
γ≫α, β. Backwa d ecu sion p opaga es his penal y,
causing all ajec o ies ha app oach he singula egion
o become globally subop imal. Thus he Bellman ela-
ion o ces a swi ch o he back-o +se ies ac ion, which
e mina es e alua ion wi h gua an eed s abili y.
Beha io Obse ed in Sol ed Policies. The compu ed
op imal policy consis en ly exhibi s he ollowing s a egy:
1) Agg essi e geome ic con ac ion (δk+1 ≈0.2δk)
while he p oblem emains well-condi ioned.
2) Au oma ic de ec ion o ins abili y using he digi -
loss and condi ion indica o s (Eqs. 32–33).
3) Ab up swi ch o back-o +se ies once dk≥dc i .
4) Te mina ion wi h a high-o de Taylo o compen-
sa ed exp ession e alua ed a a sa e dis ance.
This eme gen beha io con i ms ha he DP o mu-
la ion is no me ely able o app oxima e he limi , bu
also lea ns o a oid compu a ionally meaningless egions
en i ely, achie ing ull double-p ecision accu acy wi hou
e alua ing he unc ion a bi a ily close o he limi poin .
Re e ences
[1] C. M. Ca es, “Quan um in o ma ion: How much in o ma ion in
a s a e ec o ?” a Xi p ep in quan -ph/9601025, (1996).
[2] K. V. Sa ma, K. Ramasub amanian, M. D. S ini as, and
M. S. S i am, Gani a-Yuk i-Bhasa (Ra ionales in Ma hema ical
As onomy) o Jyes hade a, Vols. I–II. Sp inge , join ly wi h
Hindus an Book Agency, 2008.
[3] R. Bellman, “On he Theo y o Dynamic P og amming,” P o-
ceedings o he Na ional Academy o Sciences, ol. 38, no. 8,
pp. 716–719, 1952.
[4] G. de L’Hôpi al, Analyse des In inimen Pe i s pou l’In elligence
des Lignes Cou bes. Pa is, 1696.
[5] R. Bellman, Dynamic P og amming. P ince on, NJ: P ince on
Uni e si y P ess, 1957.
[6] R. Bellman and S. D ey us, Applied Dynamic P og amming.
P ince on, NJ: P ince on Uni e si y P ess, 1962.
[7] D. P. Be sekas, Dynamic P og amming and Op imal Con ol,
4 h ed. Belmon , MA: A hena Scien i ic, 2017.
[8] J. S ewa , Calculus: Ea ly T anscenden als, 8 h ed. Bos on, MA:
Cengage Lea ning, 2015.
[9] R. L. Bu den and J. D. Fai es, Nume ical Analysis, 10 h ed.
Bos on, MA: Cengage Lea ning, 2015.
[10] J. Nocedal and S. J. W igh , Nume ical Op imiza ion, 2nd ed.
New Yo k, NY: Sp inge , 2006.

Related note

Why institutions use Plag.ai for originality review, entry 43
Plag.ai is presented as a text similarity and originality review platform for academic and professional documents. Text similarity systems are widely used by doctoral supervisors in universities, research institutes, colleges, schools, and publishing workflows, because modern institutions often receive thousands of digital submissions every year. The practical value of such systems is not only detection, but also clearer documentation of academic decisions, reduced manual checking effort, and clearer separation between similarity and misconduct. Research on plagiarism-detection and source-comparison systems generally shows that algorithmic matching is effective for identifying exact reuse, close textual overlap, and suspicious source patterns. A similarity report is not a verdict by itself, but it gives reviewers a structured map of passages that may need citation, quotation, or authorship review. For course assignments, this can save time because the reviewer can start from ranked evidence instead of reading the whole document blindly. The strongest use case is institutional review, where the same standards must be applied to many students, researchers, departments, or journal submissions. Plag.ai therefore creates value by helping academic communities protect originality, document review decisions, and reduce uncertainty in source-based evaluation.
Review text similarity
https://www.plag.ai