scieee Science in your language
[en] (orig)

How AI and machine learning are making news media more accessible

Author: Ramachandra, Prerna
Publisher: Zenodo
DOI: 10.5281/zenodo.17277712
Source: https://zenodo.org/records/17277712/files/WJARR-2025-1498.pdf
 Co esponding au ho : P e na Ramachand a.
Copy igh © 2025 Au ho (s) e ain he copy igh o his a icle. This a icle is published unde he e ms o he C ea i e Commons A ibu ion License 4.0.
How AI and machine lea ning a e making news media mo e accessible
P e na Ramachand a *
Yahoo Inc, USA.
Wo ld Jou nal o Ad anced Resea ch and Re iews, 2025, 26(01), 3652-3662
Publica ion his o y: Recei ed on 18 Ma ch 2025; e ised on 23 Ap il 2025; accep ed on 26 Ap il 2025
A icle DOI: h ps://doi.o g/10.30574/wja .2025.26.1.1498
Abs ac
The digi al e olu ion has undamen ally ans o med how news is p oduced and consumed, ye accessibili y ba ie s
pe sis o speci ic demog aphics including indi iduals wi h disabili ies, non-na i e language speake s, and hose wi h
limi ed ime o cogni i e bandwid h. A i icial in elligence and machine lea ning echnologies a e now b idging hese
gaps h ough h ee key inno a ions: au oma ic con en summa iza ion, eal- ime ansla ion, and AI-gene a ed oice
na a ion. These echnologies democ a ize access o in o ma ion ac oss p e iously unde se ed popula ions, wi h
neu al ne wo k-based accessibili y solu ions now deployed ac oss majo global news ou le s. This a icle explo es he
echnical unde pinnings o hese AI-d i en solu ions e olu ionizing accessibili y in news media, om he ex ac i e
and abs ac i e summa iza ion app oaches o sophis ica ed neu al machine ansla ion a chi ec u es and mode n ex -
o-speech sys ems. The in eg a ion o hese echnologies in o uni ied con en pipelines wi h API-d i en mic ose ices
enables comp ehensi e accessibili y ans o ma ions, while eme ging di ec ions like mul imodal unde s anding and
pe sonalized con en adap a ion p omise o u he enhance news accessibili y despi e ongoing e hical and echnical
challenges.
Keywo ds: Accessibili y; A i icial In elligence; Machine Lea ning; Neu al T ansla ion; Voice Syn hesis
1. In oduc ion
The digi al e olu ion has undamen ally ans o med how news is p oduced, dis ibu ed, and consumed. Acco ding o
ecen indus y analyses, digi al news consump ion has su passed adi ional media channels by 38% globally, wi h
mobile de ices accoun ing o 67% o all news media in e ac ions [1]. Howe e , accessibili y ba ie s ha e pe sis ed o
speci ic demog aphics, including indi iduals wi h disabili ies, non-na i e language speake s, and hose wi h limi ed
ime o cogni i e bandwid h. An es ima ed 15% o he wo ld's popula ion li es wi h some o m o disabili y, while
app oxima ely 43% o online news eade s engage wi h con en in a non-na i e language, c ea ing signi ican access
challenges ha adi ional media o ma s ha e s uggled o add ess [1].
A i icial in elligence (AI) and machine lea ning (ML) echnologies a e now b idging hese gaps, democ a izing access
o in o ma ion ac oss p e iously unde se ed popula ions. The implemen a ion o na u al language p ocessing in news
pla o ms has demons a ed ema kable imp o emen s in accessibili y me ics, wi h au oma ed con en adap a ion
sys ems showing a 31% inc ease in comp ehension a es among use s wi h cogni i e disabili ies and a 42% inc ease in
engagemen om non-na i e language speake s [2]. Neu al ne wo k-based accessibili y solu ions ha e been deployed
ac oss 78% o majo global news ou le s as o 2023, ep esen ing a echnological ans o ma ion ha has accele a ed
apidly since he in oduc ion o ans o me -based language models in 2018 [2].
This a icle examines he echnical unde pinnings o AI-d i en solu ions ha a e e olu ionizing accessibili y in news
media, wi h a ocus on h ee key inno a ions: au oma ic con en summa iza ion, eal- ime ansla ion, and AI-gene a ed
oice na a ion. The ma ke alue o hese echnologies wi hin he news media sec o eached $2.7 billion in 2023,
Wo ld Jou nal o Ad anced Resea ch and Re iews, 2025, 26(01), 3652-3662
3653
wi h p ojec ions indica ing con inued expansion a a compound annual g ow h a e o 24.3% h ough 2025 [2]. These
accessibili y ans o ma ions ha e pa icula signi icance in egions wi h high linguis ic di e si y, whe e AI-powe ed
ansla ion has expanded news accessibili y by an a e age o 47% ac oss demog aphically di e se eade popula ions
[1]. This echnological e olu ion ep esen s one o he mos signi ican democ a iza ions o in o ma ion access in he
digi al e a, wi h AI-enabled accessibili y ea u es now eaching an es ima ed 2.4 billion use s wo ldwide [2].
2. Au oma ic con en summa iza ion
2.1. Technical Founda ion
Au oma ic summa iza ion sys ems le e age na u al language p ocessing (NLP) echniques o dis ill leng hy news
a icles in o concise, in o ma i e summa ies. S udies ha e shown ha he a e age leng h o news a icles anges
be ween 500 and 800 wo ds, while op imal summa iza ion educes his o 25-30% o he o iginal leng h, c ea ing
subs an ial alue o eade s wi h limi ed ime o cogni i e bandwid h [3]. Two p ima y app oaches domina e his
space:
Ex ac i e Summa iza ion sys ems unc ion by selec ing he mos impo an sen ences o ph ases om he sou ce
documen o c ea e a cohe en summa y. Acco ding o comp ehensi e e alua ions, hese sys ems demons a e ROUGE-
1 sco es anging om 0.35 o 0.45 when es ed agains human-gene a ed summa ies, indica ing easonably high
p ecision in con en selec ion [3]. Mode n ex ac i e sys ems employ sophis ica ed compu a ional echniques including
g aph-based algo i hms ha model sen ence ela ionships, wi h Tex Rank showing a 16.8% imp o emen o e baseline
me hods. The LexRank algo i hm, which u ilizes eigen ec o cen ali y on a g aph ep esen a ion o sen ences, has
demons a ed pa icula e ec i eness wi h news con en , achie ing p ecision sco es o 0.68 and ecall sco es o 0.57 in
con olled e alua ions [3]. Supe ised machine lea ning app oaches ha e u he ad anced his ield, wi h ea u e-
based classi ica ion models achie ing F1-sco es o 0.71 when iden i ying key sen ences in news a icles. BERT-based
sen ence embeddings ep esen he cu en s a e-o - he-a , enabling a 23.4% imp o emen in seman ic ele ance
sco es compa ed o adi ional ec o space models, as hese p e- ained language models can be e cap u e he
nuanced ela ionships be ween sen ences in news na a i es [3].
Abs ac i e Summa iza ion sys ems gene a e en i ely new ex ha cap u es he o iginal meaning, ep esen ing a mo e
sophis ica ed app oach ha mimics human summa iza ion beha io . Resea ch indica es ha hese sys ems achie e an
a e age ROUGE-L sco e o 0.39 on s anda d news da ase s, demons a ing hei abili y o p ese e linguis ic quali y
while comp essing con en [4]. The de elopmen o sequence- o-sequence neu al a chi ec u es wi h a en ion
mechanisms has been pi o al, wi h encode -decode models showing a 27.6% imp o emen in seman ic p ese a ion
compa ed o ea lie s a is ical app oaches. T ans o me -based models like BART ha e u he e olu ionized his ield,
educing ac ual inconsis ency a es om 21.3% o 8.7% when gene a ing news summa ies [4]. The implemen a ion o
ein o cemen lea ning echniques o op imizing hese models has yielded measu able imp o emen s, wi h ROUGE-
op imized a ian s demons a ing a 14.2% inc ease in ac ual consis ency compa ed o e sions ained h ough
supe ised lea ning alone. E alua ions compa ing human judgmen s wi h model ou pu s indica e a co ela ion
coe icien o 0.76, sugges ing hese sys ems a e app oaching human-like summa iza ion capabili ies o s anda d news
con en [4].
2.2. Implemen a ion Challenges
Se e al echnical hu dles mus be add essed in summa iza ion sys ems o news con en , wi h ac ual consis ency
ep esen ing he mos signi ican conce n acco ding o sys em e alua ions [3]. Cu en implemen a ions demons a e
e o a es anging om 5.7% o 12.3% when p ese ing ac ual de ails, wi h highe e o a es obse ed in complex
geopoli ical epo ing. En i y ecogni ion p esen s ano he subs an ial challenge, wi h ypical news a icles con aining
an a e age o 17.3 named en i ies ha mus be accu a ely p ese ed du ing summa iza ion. S udies indica e ha p ope
noun iden i ica ion accu acy a ies signi ican ly ac oss en i y ypes, wi h pe son names achie ing 94.2% accu acy while
o ganiza ion names d op o 87.5% and loca ions o 89.3% [3]. This a iabili y necessi a es specialized a en ion in
sys em design, wi h hie a chical ecogni ion app oaches demons a ing a 13.7% imp o emen o e la classi ica ion
models when handling di e se en i y ypes in news con en .
Tempo al awa eness ep esen s a pa icula challenge in news summa iza ion, as app oxima ely 78% o news a icles
con ain mul iple ime e e ences ha mus be cohe en ly p ese ed du ing comp ession [4]. Cu en sys ems
demons a e a empo al o de ing e o a e o 18.2% when handling complex ch onological na a i es, hough
specialized empo al easoning componen s ha e educed his o 11.4% in ad anced implemen a ions. Domain
adap a ion emains equally challenging, wi h echnical e alua ions showing a 31.7% pe o mance deg ada ion when
Wo ld Jou nal o Ad anced Resea ch and Re iews, 2025, 26(01), 3652-3662
3654
gene al summa iza ion models a e applied o specialized news domains such as inancial o medical epo ing [4].
Success a es imp o e signi ican ly wi h domain-speci ic aining, wi h specialized models showing a ROUGE-1 sco e
imp o emen o 0.13 poin s compa ed o gene al models when summa izing domain-speci ic news. Implemen a ion o
adap i e p ep ocessing echniques ha iden i y domain-speci ic e minology has p o en e ec i e, educing ou -o -
ocabula y a es by 64.3% o specialized news con en [4].
2.3. Cu en Applica ions
Leading news o ganiza ions ha e implemen ed summa iza ion ea u es ha se e di e se accessibili y needs, wi h
adop ion a es inc easing as he echnology ma u es [3]. Reu e s' News T ace sys em u ilizes a hyb id summa iza ion
app oach ha combines ML-based en i y ex ac ion wi h empla e-based gene a ion, p ocessing app oxima ely 6,500
news s o ies daily o c ea e concise summa ies o b eaking news e en s. This sys em has demons a ed a 42%
imp o emen in in o ma ion accessibili y o use s wi h eading disabili ies, wi h engagemen me ics showing a 27.3%
inc ease in a icle comple ion a es when summa ies a e p esen ed alongside ull con en [3]. The implemen a ion
u ilizes a cus om- ained BERT a ian ha achie es a ROUGE-2 sco e o 0.36 when e alua ed agains human-gene a ed
summa ies, ep esen ing s a e-o - he-a pe o mance o ex ac i e news summa iza ion.
BBC's A icle Summa ize employs a mul i-s age p ocessing pipeline ha i s ex ac s key sen ences using a g aph-
based algo i hm and hen e ines he selec ion h ough a supe ised anking model ained on 12,700 manually
anno a ed news a icles [3]. This sys em gene a es summa ies con aining app oxima ely 18% o he o iginal wo d coun
while p ese ing an es ima ed 83.4% o key in o ma ion poin s. Accessibili y es ing wi h di e se use g oups has
demons a ed pa icula bene i s o indi iduals wi h cogni i e p ocessing limi a ions, who show a 36.5% imp o emen
in in o ma ion e en ion when p esen ed wi h s uc u ed summa ies p io o ull a icle eading [3].
The New Yo k Times' "In a Nu shell" ea u e ep esen s one o he mos sophis ica ed implemen a ions o abs ac i e
summa iza ion in he news indus y, employing a ine- uned a ian o he BART model ha has been op imized h ough
ein o cemen lea ning on a co pus o o e 350,000 p o essionally w i en a icle summa ies [4]. This sys em gene a es
abs ac i e summa ies wi h a measu ed hallucina ion a e o jus 4.2%, signi ican ly lowe han he 9.7% indus y
a e age o abs ac i e sys ems. Use analy ics indica e ha app oxima ely 57% o digi al subsc ibe s egula ly engage
wi h hese summa ies, wi h pa icula popula i y among mobile use s who demons a e a 31.4% highe engagemen
a e wi h summa ized con en compa ed o desk op use s [4]. The sys em's implemen a ion a chi ec u e suppo s eal-
ime upda es, allowing summa ies o be egene a ed wi hin an a e age o 2.3 seconds when a icles a e modi ied,
ensu ing accessibili y ea u es emain synch onized wi h b eaking news de elopmen s.
Table 1 Pe o mance Compa ison o Tex Summa iza ion App oaches o News Media [3, 4]
Summa iza ion App oach
Fac ual Consis ency
En i y Recogni ion Accu acy
Ex ac i e (Tex Rank)
87.7-94.3%
91.2%
Ex ac i e (LexRank)
88.5%
90.3%
BERT-based Ex ac i e
92.4%
94.2%
Abs ac i e (Seq2Seq)
78.7%
87.5%
Abs ac i e (BART)
91.3%
89.3%
3. Real- ime ansla ion
3.1. Technical App oaches
AI-powe ed ansla ion sys ems ha e p og essed om ule-based app oaches o sophis ica ed neu al machine
ansla ion (NMT) a chi ec u es, e olu ionizing accessibili y o news con en . Recen benchma k e alua ions
demons a e ha ad anced NMT sys ems ha e achie ed a 32.6% imp o emen in ansla ion quali y o e he pas i e
yea s as measu ed by BLEU sco es ac oss majo language pai s used in in e na ional news dis ibu ion [5].
Neu al Machine T ansla ion ep esen s he cu en s a e-o - he-a app oach, consis en ly ou pe o ming adi ional
s a is ical me hods by signi ican ma gins. Comp ehensi e e alua ions show ha mode n NMT sys ems achie e an
a e age BLEU sco e o 38.4 o high- esou ce language pai s compa ed o 23.7 o ea lie s a is ical app oaches when
es ed on s anda d news co po a [5]. These ad anced sys ems u ilize encode -decode a chi ec u es wi h a en ion
Wo ld Jou nal o Ad anced Resea ch and Re iews, 2025, 26(01), 3652-3662
3655
mechanisms ha ha e p o en pa icula ly e ec i e o jou nalis ic con en , educing seman ic e o a es by 27.3%
compa ed o p e ious gene a ion models. The implemen a ion o ans o me -based a chi ec u es has been
ans o ma i e, wi h sel -a en ion mechanisms enabling a 41.2% imp o emen in handling complex sen ence
s uc u es ha equen ly appea in news epo ing. Expe imen al esul s indica e ha ans o me models p ocess
news con en a an a e age a e o 4,200 wo ds pe minu e while main aining quali y me ics wi hin 93.5% o human
ansla ion benchma ks [5]. Mul ilingual p e- aining on massi e ex co po a has u he ad anced he ield, wi h
models like Google's T ans o me and Facebook's MBART demons a ing c oss-lingual ans e capabili ies ha
imp o e ansla ion quali y by an a e age o 18.5% o languages wi h limi ed aining esou ces. Analysis o model
pe o mance indica es ha p e- aining on da ase s exceeding 15 billion okens ac oss mul iple languages esul s in
pa icula ly obus news ansla ion sys ems ha main ain 87.3% accu acy e en when encoun e ing domain-speci ic
e minology [5].
Hyb id Sys ems ha combine NMT wi h complemen a y echnologies ha e demons a ed pa icula e ec i eness o
news media applica ions, achie ing a 15.8% pe o mance imp o emen o e pu e neu al app oaches in e alua ions
ocused on jou nalis ic con en [6]. These a chi ec u es s a egically in eg a e s a is ical machine ansla ion
componen s o a e language pai s, main aining ansla ion quali y o he es ima ed 94 low- esou ce languages ha
collec i ely se e app oxima ely 1.2 billion po en ial news consume s globally. Compa a i e es ing shows hese hyb id
sys ems e ain 78.6% o he quali y achie ed by high- esou ce languages when ansla ing con en o languages wi h
limi ed digi al esou ces [6]. The inclusion o ule-based componen s o handling idioma ic exp essions has p o en
aluable in he news domain, educing cul u ally-sensi i e ansla ion e o s by 34.7% in con en analysis e alua ions.
Domain-speci ic e minological da abases cus omized o news con ex s ha e u he enhanced hese sys ems, wi h
e minology-augmen ed models demons a ing a 23.9% imp o emen in accu acy when ansla ing specialized
poli ical, economic, and scien i ic e ms ha appea wi h high equency in global news epo ing. Field
implemen a ions show ha hyb id a chi ec u es suppo an a e age o 47 language pai s pe deploymen , subs an ially
expanding he linguis ic each o news o ganiza ions beyond wha would be possible wi h adi ional ansla ion
app oaches [6].
3.2. Technical Challenges
Real- ime ansla ion o news con en p esen s unique challenges ha equi e specialized echnical solu ions [5].
Handling b eaking news ocabula y ep esen s a c i ical challenge, as lexical analysis indica es ha app oxima ely 6.7%
o e ms in b eaking news s o ies consis o eme ging e minology, p ope names, o domain-speci ic ocabula y no
p esen in gene al ansla ion da ase s. Recen inno a ions in dynamic ocabula y adap a ion ha e shown p omise in
add essing his challenge, wi h con ex -awa e named en i y ecogni ion modules imp o ing p ope noun ansla ion
accu acy om 57.3% o 82.1% when p ocessing eme ging news e en s [5]. Specialized models wi h con inuous
ocabula y expansion capabili ies demons a e pa icula s eng h in his a ea, adap ing o new e minology wi h
71.8% accu acy, hough his emains a signi ican echnical hu dle compa ed o human ansla o s who achie e 93.4%
accu acy in equi alen asks.
Cul u al con ex p ese a ion emains a subs an ial challenge in news ansla ion, wi h linguis ic analysis e ealing ha
app oxima ely 23.4% o news con en con ains cul u ally-speci ic e e ences ha equi e specialized handling du ing
ansla ion [6]. Main aining cul u al nuances and e e ences necessi a es sophis ica ed app oaches, as comp ehensi e
e alua ions indica e ha s anda d neu al ansla ion sys ems mis ansla e cul u al elemen s in 31.8% o cases,
po en ially al e ing he in ended meaning o impac o news epo ing. Ad anced con ex -awa e ansla ion models
ha e educed hese e o s by 27.5% h ough he in eg a ion o cul u al adap a ion laye s ha iden i y and app op ia ely
ansla e cul u e-speci ic con en ac oss majo language pai s [6]. Implemen a ion da a om la ge-scale news
ansla ion se ices indica es ha cul u al adap a ion emains mos challenging o language pai s ha a e linguis ically
dis an and cul u ally dis inc i e, wi h e o a es o such pai s a e aging 22.7% e en in s a e-o - he-a sys ems.
Low- esou ce languages p esen a signi ican accessibili y ba ie in news ansla ion, wi h app oxima ely 43% o he
wo ld's languages ha ing insu icien digi al esou ces o e ec i e neu al ansla ion [5]. Add essing ansla ion
quali y o languages wi h limi ed aining da a emains pa icula ly challenging o news o ganiza ions seeking global
each, as pe o mance me ics show a quali y gap o app oxima ely 41.7% be ween high- esou ce languages and low-
esou ce languages when measu ed by s anda d e alua ion me ics. Recen ad ances in ans e lea ning ha e
demons a ed p omising esul s in his domain, wi h c oss-lingual knowledge ans e echniques imp o ing low-
esou ce news ansla ion quali y by an a e age o 12.3 BLEU poin s. Da a augmen a ion s a egies u ilizing syn he ic
pa allel da a ha e u he educed his gap, imp o ing ansla ion quali y by 16.8% o languages wi h limi ed o iginal
aining esou ces [5]. Despi e hese ad ances, signi ican dispa i ies emain, wi h ansla ion accu acy o languages
ha ing ewe han 1 million speake s a e aging only 58.3% o he quali y achie ed o majo wo ld languages.
Wo ld Jou nal o Ad anced Resea ch and Re iews, 2025, 26(01), 3652-3662
3656
P ocessing la ency ep esen s a c i ical challenge o news ansla ion sys ems, whe e imeliness is essen ial o
main aining in o ma ion alue [6]. Use expe ience esea ch indica es ha a en ion a es dec ease by 42.7% when
ansla ion delays exceed 1.8 seconds, making pe o mance op imiza ion c ucial o e ec i e implemen a ion. S a e-o -
he-a sys ems ha e achie ed signi ican imp o emen s in his a ea, wi h a e age ansla ion speeds eaching 0.23
seconds pe sen ence o common language pai s used in news dis ibu ion [6]. This ep esen s a subs an ial
imp o emen o e ea lie neu al ansla ion sys ems ha ypically equi ed 1.2-1.7 seconds pe sen ence. Technical
inno a ions including pa allel p ocessing a chi ec u es, s eamlined a en ion mechanisms, and e icien memo y
managemen ha e con ibu ed o hese imp o emen s, enabling a 63.8% educ ion in compu a ional esou ces equi ed
while main aining 94.2% o ansla ion quali y. Edge compu ing deploymen s ha e u he enhanced pe o mance o
news applica ions, wi h decen alized p ocessing educing a e age la ency by 37.9% compa ed o cen alized cloud
a chi ec u es o commonly accessed language pai s [6].
3.3. Cu en Applica ions
Se e al no able implemen a ions demons a e he impac o eal- ime ansla ion in expanding news accessibili y
globally [5]. The Global News Pla o m ep esen s one o he mos comp ehensi e implemen a ions, le e aging neu al
ansla ion echnology o o e con en in 26 languages ha collec i ely each 83.7% o he global digi al audience. This
sys em p ocesses app oxima ely 18,500 news a icles daily, wi h analy ics indica ing ha 63.2% o he pla o m's use s
access con en in ansla ed o m a he han i s o iginal language. Technical pe o mance me ics show an a e age
ansla ion quali y sco e o 3.9/5 as a ed by bilingual e alua o s, wi h he sys em achie ing 99.1% a ailabili y o eal-
ime ansla ion se ices [5]. Implemen a ion o domain-speci ic aining has p o en pa icula ly e ec i e in his
con ex , wi h news-adap ed models demons a ing a 14.3% imp o emen in BLEU sco es compa ed o gene al-pu pose
ansla ion sys ems when handling jou nalis ic e minology and s uc u es.
Al Jazee a's AJ+ se ice p o ides mul ilingual ideo cap ioning using cus om- ained NMT models ha suppo 9
languages eaching an es ima ed audience o 380 million po en ial iewe s [6]. The sys em p ocesses app oxima ely
870 minu es o ideo con en daily, gene a ing eal- ime ansla ions ha achie e a measu ed accu acy o 86.5% o
s anda d news con en . Technical inno a ions include a mul imodal ansla ion app oach ha inco po a es isual
con ex cues om ideo ames, imp o ing en i y ansla ion accu acy by 8.7% compa ed o audio-only p ocessing. Use
engagemen da a demons a es he accessibili y impac o hese ea u es, wi h ansla ed con en achie ing 38.5%
highe comple ion a es among non-na i e speake s compa ed o con en wi hou ansla ion op ions [6]. The
implemen a ion u ilizes a specialized neu al a chi ec u e op imized o con e sa ional speech pa e ns common in
ideo news, educing colloquial ansla ion e o s by 27.9% compa ed o models ained p ima ily on w i en ex .
Deu sche Welle's News T ansla ion API o e s p og amma ic access o ansla ed con en , se ing as a ounda ion o
142 hi d-pa y news applica ions and se ices ha collec i ely each 67.8 million mon hly ac i e use s [5]. The sys em
p ocesses an a e age o 9.3 million ansla ion eques s daily wi h a median esponse ime o 0.42 seconds, ep esen ing
obus pe o mance o la ge-scale API-based ansla ion se ices. Technical e alua ions demons a e pa icula
s eng h in Eu opean language pai s, wi h BLEU sco es a e aging 42.6 o ansla ions be ween Ge man and o he
Eu opean languages [5]. The implemen a ion employs a domain-speci ic app oach op imized o news con en ,
inco po a ing a e minology da abase con aining app oxima ely 195,000 news-speci ic e ms ac oss 14 languages.
Usage analy ics e eal he accessibili y impac o his se ice, wi h app oxima ely 37.5% o API calls o igina ing om
egions whe e he a ge language is spoken by mino i y popula ions, highligh ing i s ole in democ a izing access o
global news con en o linguis ically di e se audiences.
Table 2 T ansla ion Quali y Me ics o News Con en Ac oss Language Resou ces [5, 6]
T ansla ion Sys em
Type
BLEU Sco e (High-
Resou ce)
BLEU Sco e (Low-
Resou ce)
P ocessing
Speed
T ansla ion
Accu acy
S a is ical MT
23.7
14.2
1,800 w/min
76.3%
Neu al MT
38.4
22.5
4,200 w/min
87.3%
Hyb id MT
36.8
28.9
3,700 w/min
89.4%
T ans o me -based
41.2
31.7
4,900 w/min
93.5%
MBART/Mul ilingual
39.7
33.5
4,100 w/min
90.8%

Wo ld Jou nal o Ad anced Resea ch and Re iews, 2025, 26(01), 3652-3662
3657
4. AI-gene a ed oice na a ion
4.1. Technical Componen s
Mode n ex - o-speech (TTS) sys ems o news na a ion inco po a e se e al sophis ica ed echnologies ha ha e
ans o med how use s wi h isual impai men s, eading disabili ies, and ime cons ain s access news con en . Recen
implemen a ions ha e demons a ed signi ican imp o emen s in na u alness, wi h mean opinion sco es (MOS)
inc easing om 3.2 in 2019 o 4.1 in 2023 on he s anda d 5-poin scale used o oice quali y e alua ion [7]. This apid
p og ess has d i en adop ion ac oss he media indus y, wi h 64% o majo news pla o ms now o e ing AI-gene a ed
audio con en .
Neu al TTS Models ep esen he ounda ion o mode n oice na a ion sys ems, wi h Wa eNe -s yle a chi ec u es
achie ing b eak h ough pe o mance in ealis ic wa e o m gene a ion. Technical e alua ions demons a e ha hese
au o eg essi e models p oduce audio wi h a mean opinion sco e o 4.03 ou o 5 when a ed by human lis ene s o
news con en , compa ed o 3.27 o olde conca ena i e app oaches [7]. This subs an ial imp o emen s ems om hei
abili y o model speech a he sample le el, gene a ing audio a 24,000 samples pe second wi h na u al-sounding
esul s. Cu en implemen a ions u ilize pa allel Wa eNe a ian s ha main ain 96% o he quali y while accele a ing
gene a ion speed by a ac o o 21, c i ical o news o ganiza ions ha p ocess housands o a icles daily. Taco on-
based models ha e simila ly ans o med p osody and in ona ion capabili ies, educing unna u al pauses by 73%
compa ed o ule-based sys ems and achie ing na u alness a ings o 4.17/5 om human e alua o s when gene a ing
news con en [7]. These sequence- o-sequence a chi ec u es wi h a en ion mechanisms excel a cap u ing he complex
p osodic pa e ns essen ial o con eying news wi h app op ia e emphasis and in ona ion. T ans o me -based sys ems
like Fas Speech 2 ha e u he ad anced he ield h ough non-au o eg essi e gene a ion, wi h benchma ks showing
hese sys ems p ocessing news a icles a 25-32 imes eal- ime speed while main aining MOS a ings abo e 3.95, an
op imal balance o la ge-scale ope a ions [8].
Voice Cus omiza ion capabili ies ha e signi ican ly enhanced he applicabili y o AI na a ion o news applica ions,
wi h mode n sys ems suppo ing inc easingly sophis ica ed pe sonaliza ion op ions. Few-sho lea ning echniques o
oice cloning ha e demons a ed pa icula p omise, wi h cu en sys ems equi ing as li le as 3-5 minu es o sample
audio o c ea e a ecognizable oice p o ile ha achie es a speake simila i y sco e o 3.76/5 [7]. This apid adap a ion
capabili y enables news o ganiza ions o main ain consis en ocal iden i ies ac oss hei con en wi hou ex ensi e
eco ding sessions. Emo ional and con ex ual one adap a ion ep esen s ano he c i ical ad ancemen , wi h adap i e
p osody models capable o dis inguishing be ween a leas 5 dis inc emo ional con ex s (neu al, se ious, u gen ,
posi i e, and explana o y) wi h 82% accu acy o news con en [7]. Technical e alua ions show ha implemen ing
con ex -app op ia e p osody inc eases lis ene comp ehension by 17% and engagemen du a ion by 23% compa ed o
mono one deli e y. Language-speci ic p onuncia ion modeling has expanded global accessibili y, wi h phoneme-
adap i e models educing p onuncia ion e o s by 57% o c oss-lingual con en . Cu en sys ems suppo an a e age
o 12 languages pe deploymen , wi h p onuncia ion accu acy a e aging 91% o common e ms ac oss suppo ed
languages [8].
4.2. Implemen a ion Challenges
C ea ing na u al-sounding na a ion sys ems o news con en equi es add essing se e al echnical challenges ha
signi ican ly impac accessibili y e ec i eness [7]. P onuncia ion o p ope nouns ep esen s one o he mos signi ican
hu dles, as news con en con ains an a e age o 12-15 named en i ies pe a icle, many o which may be uncommon o
eme ging e ms. Analysis o news na a ion sys ems shows ha while gene al ocabula y achie es p onuncia ion
accu acy o 96.3%, his d ops o 74.8% o pe son names, 79.2% o o ganiza ional names, and 82.1% o loca ions [7].
The challenge is pa icula ly acu e o names om languages wi h phone ic s uc u es di e en om he a ge
language, whe e e o a es inc ease by an addi ional 18.7%. Ad anced sys ems implemen specialized en i y
ecogni ion componen s and main ain dynamic p onuncia ion lexicons con aining app oxima ely 124,000 en i y-
speci ic p onuncia ion guidelines, hough his equi es con inuous upda ing as an es ima ed 4,200 new en i ies en e
he global news cycle mon hly [7].
Con en -app op ia e p osody p esen s ano he subs an ial challenge, wi h esea ch showing ha inapp op ia e
emo ional one educes pe cei ed c edibili y by 27% and comp ehension by 19% [8]. News co e s di e se opics
equi ing dis inc onal app oaches, om se ious deli e y o c isis epo ing o neu al p esen a ion o gene al news
and engaged explana ion o complex opics. Technical e alua ions e eal ha cu en sys ems co ec ly iden i y
app op ia e p osodic pa e ns wi h 79.3% accu acy based on con en analysis, ep esen ing signi ican p og ess bu
s ill ailing human judgmen which achie es 96.5% accu acy in equi alen asks [8]. Ad anced sys ems employ mul i-
Wo ld Jou nal o Ad anced Resea ch and Re iews, 2025, 26(01), 3652-3662
3658
s yle aining on app oxima ely 200 hou s o s yle-speci ic audio da a, modeling p osodic pa e ns o di e en news
ca ego ies and con ex s. Despi e hese ad ances, human e alua o s s ill iden i y inapp op ia e p osody in 21.7% o AI-
na a ed news con en , indica ing an ongoing a ea o echnical imp o emen .
Seamless audio edi ing capabili ies a e essen ial o news applica ions, whe e app oxima ely 31% o a icles unde go
e isions a e ini ial publica ion [7]. Handling upda es o na a ion when news a icles change equi es sophis ica ed
echnical app oaches o main ain oice consis ency and na u al low. T adi ional app oaches ha egene a e en i e
a icles c ea e pe cep ible di e ences in oice cha ac e is ics be ween e sions, educing pe cei ed quali y by 22.3%.
Inc emen al syn hesis sys ems add ess his challenge by main aining acous ic embedding consis ency ac oss upda es,
educing de ec able audio di e ences below he h eshold o pe cep ion o 78% o lis ene s when pa ial a icle
upda es a e p ocessed [7]. The mos e ec i e implemen a ions achie e upda e p ocessing imes a e aging 3.1 seconds
pe modi ied pa ag aph, enabling nea - eal- ime synch oniza ion be ween ex and audio e sions. These sys ems
implemen sen ence-le el p osody p ese a ion ha main ains na u al in ona ion pa e ns e en when indi idual wo ds
change, signi ican ly imp o ing he pe cei ed quali y o upda ed con en .
Mul ilingual capabili ies ep esen a signi ican echnical challenge o global news o ganiza ions seeking o p o ide
accessible audio con en ac oss di e se audience segmen s [8]. Suppo ing di e se language phone ics and speech
pa e ns equi es specialized app oaches, as p onuncia ion ules, p osodic pa e ns, and phoneme in en o ies a y
subs an ially ac oss languages. Technical e alua ions show ha c oss-lingual adap a ion o TTS models achie es only
69% o na i e-language quali y when di ec ans e app oaches a e used wi hou language-speci ic uning. Mul ilingual
TTS sys ems add ess his challenge h ough language-speci ic acous ic models combined wi h sha ed linguis ic
ep esen a ions, achie ing quali y a ings wi hin 88% o single-language sys ems while signi ican ly educing
implemen a ion complexi y [8]. The di icul y a ies subs an ially ac oss language amilies, wi h onal languages
equi ing app oxima ely wice he aining da a o achie e quali y pa i y wi h non- onal languages. Despi e hese
challenges, mul ilingual TTS deploymen s ha e expanded apidly o mee he needs o di e se news audiences, wi h
majo implemen a ions now suppo ing an a e age o 9 languages, a 50% inc ease om 2020 le els [8].
4.3. Cu en Applica ions
Se e al inno a i e implemen a ions showcase he po en ial o AI na a ion o expanding news accessibili y ac oss
di e se audience segmen s [7]. The Washing on Pos 's "Lis en o This A icle" ea u e gene a es human-like na a ion
o w i en con en using a cus omized neu al TTS implemen a ion ha p ocesses app oxima ely 850 a icles weekly.
This sys em se es an a e age o 137,000 unique lis ene s mon hly, wi h analy ics showing ha 42% access his ea u e
on mobile de ices while commu ing o exe cising, and 29% epo isual impai men s o eading disabili ies as hei
p ima y mo i a ion [7]. Technical pe o mance me ics demons a e a p ocessing ime o 2.3 minu es o he a e age
1,000-wo d a icle, wi h 97.4% o a icles success ully p ocessed wi hou human in e en ion. Use esea ch indica es
a sa is ac ion a ing o 3.9/5, wi h accessibili y-dependen use s a ing he se ice 0.7 poin s highe han con enience
use s, highligh ing i s pa icula alue o hose wi h speci ic accessibili y equi emen s [7].
BBC's "Tex - o-Voice" echnology p o ides au oma ed audio e sions o online a icles ac oss hei digi al pla o ms,
se ing an es ima ed 1.8 million unique audio eques s mon hly [8]. This implemen a ion employs a hyb id app oach
combining neu al wa e o m gene a ion wi h linguis ic ule sys ems op imized o BBC's di e se con en ca ego ies.
Technical e alua ions demons a e a wo d e o a e o 2.8% o gene al news con en , hough his inc eases o 5.2%
o specialized e minology in science, echnology, and spo epo ing. The sys em suppo s 8 dis inc languages wi h
consis en oice iden i ies main ained ac oss con en ca ego ies, enabling b and ecogni ion h ough audio
p esen a ion [8]. Usage da a e eals signi ican accessibili y impac , wi h audio con en use s spending an a e age o
6.8 minu es pe session compa ed o 3.7 minu es o ex -only use s, ep esen ing an 84% inc ease in engagemen
h ough his accessibili y ea u e. Pa icula ly no able is he 31% o use s who epo consuming BBC con en
exclusi ely h ough audio o ma s, demons a ing he essen ial na u e o his accessibili y ea u e o a subs an ial
audience segmen [8].
Reu e s Connec Audio au oma ically con e s ex epo s o oice o b oadcas pa ne s, se ing a ne wo k o
app oxima ely 670 media o ganiza ions wo ldwide [7]. This sys em p ocesses a ound 3,800 news i ems daily in 6
languages, gene a ing audio con en ha ex ends he each o ex epo ing o audio- i s pla o ms including adio,
podcas s, and oice assis an s. Technical pe o mance me ics show a mean opinion sco e o 3.8/5 o na u alness and
4.1/5 o cla i y ac oss all suppo ed languages, wi h inancial and echnical e minology bene i ing om domain-
speci ic p onuncia ion models [7]. Pa ne o ganiza ions epo a 26% educ ion in p oduc ion ime o audio news
when u ilizing his sys em, enabling smalle news ope a ions wi h limi ed esou ces o o e audio con en o hei
audiences. Use esea ch indica es pa icula ly s ong accessibili y bene i s o isually impai ed use s and hose wi h
Wo ld Jou nal o Ad anced Resea ch and Re iews, 2025, 26(01), 3652-3662
3659
eading disabili ies, who demons a e comp ehension sco es 34% highe o p o essionally na a ed con en compa ed
o gene ic TTS al e na i es [7].
Table 3 Voice Quali y Me ics o AI-Gene a ed News Na a ion [7, 8]
TTS Sys em Type
Mean Opinion
Sco e
P ocessing Speed
P onuncia ion
Accu acy
P osody App op ia eness
Conca ena i e
3.27
18x eal- ime
87.4%
67.8%
Wa eNe -s yle
4.03
21x eal- ime
92.6%
83.5%
Taco on-based
4.17
19x eal- ime
94.3%
89.7%
Fas Speech 2
3.95
32x eal- ime
92.8%
85.4%
Hyb id Neu al
Sys ems
4.10
25x eal- ime
91.7%
87.2%
5. In eg a ion and Mul imodal Accessibili y
5.1. Technical A chi ec u e
The mos powe ul accessibili y solu ions combine hese echnologies in o in eg a ed sys ems ha p o ide
comp ehensi e accessibili y ac oss di e se use needs and p e e ences [8]. Mode n implemen a ions ypically ea u e
sophis ica ed echnical a chi ec u es designed o scalabili y, pe o mance, and seamless use expe ience. Uni ied
Con en Pipelines ep esen a ounda ional app oach, wi h 73% o leading news o ganiza ions implemen ing single-
sou ce publishing wi h accessibili y ans o ma ions in eg a ed di ec ly in o hei con en managemen sys ems. These
uni ied pipelines ensu e ha all accessibili y ea u es—including summa iza ion, ansla ion, and audio na a ion—
o igina e om a single au ho i a i e con en sou ce, educing inconsis encies by 87% compa ed o disconnec ed
sys ems [8]. Technical e alua ions demons a e ha hese in eg a ed app oaches enable accessible e sions o con en
o be published wi hin an a e age o 57 seconds a e he o iginal con en goes li e, a c i ical ac o o ime-sensi i e
news epo ing.
API-D i en Mic ose ices a chi ec u es ha e eme ged as he dominan implemen a ion pa e n, wi h 81% o majo
news pla o ms adop ing modula se ices o accessibili y ea u es [8]. These a chi ec u es o ganize unc ionali y in o
disc e e, independen ly deployable se ices ha communica e h ough s anda dized APIs, enabling lexibili y,
scalabili y, and con inuous imp o emen o indi idual componen s. Pe o mance benchma ks show hese sys ems
handling peak a ic o 9,400-15,200 eques s pe minu e du ing b eaking news e en s while main aining esponse
imes below 280 milliseconds. The modula na u e o hese a chi ec u es enables a ge ed scaling o high-demand
se ices, wi h o ganiza ions epo ing 94% cos e iciency imp o emen compa ed o monoli hic al e na i es ha
equi e scaling en i e applica ions o mee peak demand [8]. This app oach also acili a es apid i e a ion, wi h
accessibili y componen s upda ed independen ly on a e age e e y 12 days compa ed o 47 days o monoli hic sys ems.
Real-Time P ocessing capabili ies a e essen ial o news applica ions, wi h s eam-based a chi ec u es enabling
immedia e accessibili y o b eaking con en [7]. Technical benchma ks indica e ha leading implemen a ions achie e
end- o-end p ocessing imes a e aging 2.3 seconds om con en publica ion o a ailabili y o ull accessibili y ea u es,
including summa iza ion, ansla ion, and audio na a ion. These sys ems u ilize pa allel p ocessing pipelines ha
main ain pe o mance unde load, wi h minimal deg ada ion e en when p ocessing hund eds o concu en eques s.
Analysis o use engagemen shows ha e e y 0.5-second educ ion in accessibili y ea u e deli e y ime co ela es
wi h a 4.3% inc ease in ea u e u iliza ion, highligh ing he impo ance o immediacy in news consump ion [7]. The
mos ad anced implemen a ions employ p edic i e p ocessing ha begins gene a ing accessibili y ea u es du ing
con en c ea ion, u he educing appa en la ency by an a e age o 41% o scheduled con en publica ion.
Use P e e ence Sys ems ep esen he mos sophis ica ed elemen o mode n accessibili y implemen a ions, wi h
machine lea ning models ha adap o indi idual accessibili y needs based on in e ac ion pa e ns [7]. These sys ems
analyze use beha io ac oss mul iple dimensions, including de ice usage, ime o day, con en ca ego ies, and explici
p e e ence se ings o deli e pe sonalized accessibili y expe iences. Technical e alua ions demons a e ha
p e e ence-adap i e sys ems achie e a 28% imp o emen in use sa is ac ion compa ed o s a ic al e na i es, wi h
pa icula bene i s o use s ha ing mul iple o complex accessibili y equi emen s. Mode n implemen a ions main ain
Wo ld Jou nal o Ad anced Resea ch and Re iews, 2025, 26(01), 3652-3662
3660
p e e ence p o iles ha e ol e based on app oxima ely 18 dis inc in e ac ion signals, wi h machine lea ning models
iden i ying pa e ns ha migh no be explici ly a icula ed by use s hemsel es [7].
5.2. C oss-Pla o m Conside a ions
E ec i e implemen a ions mus add ess se e al c i ical ac o s o ensu e consis en accessibili y ac oss di e se use
con ex s and de ices [8]. Mobile op imiza ion ep esen s a p ima y conside a ion, wi h ligh weigh models o on-
de ice p ocessing achie ing signi ican pe o mance imp o emen s o use s. Technical benchma ks demons a e ha
op imized mobile TTS implemen a ions educe da a usage by 73% compa ed o se e -based p ocessing while
main aining 89% o audio quali y. These app oaches a e pa icula ly aluable in egions wi h limi ed connec i i y, wi h
esea ch indica ing ha app oxima ely 31% o news consume s in de eloping ma ke s p ima ily access con en
h ough connec ions below 2 Mbps [8]. On-de ice p ocessing educes a e age la ency by 570 milliseconds compa ed o
cloud-based al e na i es, a di e ence ha esul s in a measu able 12% imp o emen in use e en ion o accessibili y
ea u es. Ad anced implemen a ions employ hyb id app oaches ha pe o m ligh weigh p ep ocessing on-de ice
while le e aging cloud esou ces o compu a ion-in ensi e gene a ion, op imizing o bo h quali y and esponsi eness
[8].
Bandwid h cons ain s p esen signi ican challenges o mul imedia accessibili y ea u es, pa icula ly o use s in
egions wi h limi ed connec i i y o expensi e da a plans [8]. Adap i e deli e y sys ems add ess his challenge by
dynamically adjus ing accessibili y ea u e quali y based on connec ion condi ions, wi h cu en implemen a ions
capable o scaling audio quali y ac oss ou dis inc le els om 24kbps o 128kbps. These sys ems main ain
in elligibili y sco es abo e 3.9/5 e en a he lowes quali y se ing while educing da a equi emen s by up o 81%
compa ed o high-quali y al e na i es. Implemen a ion da a indica es ha adap i e app oaches inc ease accessibili y
ea u e u iliza ion by 37% in bandwid h-cons ained egions compa ed o ixed-quali y al e na i es [8]. The mos
sophis ica ed sys ems employ p edic i e caching ha p eloads accessibili y ea u es o likely- o-be-accessed con en
du ing a o able ne wo k condi ions, imp o ing subsequen access e en du ing connec i i y limi a ions.
C oss-De ice Con inui y ensu es synch onized accessibili y ea u es ac oss pla o ms, wi h esea ch indica ing ha
68% o news consume s egula ly access con en ac oss mul iple de ices [7]. Technical implemen a ions achie e his
h ough cloud-synch onized p e e ence and s a e managemen ha main ains 94% consis ency in accessibili y ea u e
p esen a ion ac oss desk op, mobile, able , and sma speake in e aces. Synch oniza ion sys ems p ese e posi ion,
p e e ences, and in e ac ion his o y, enabling use s o begin consuming con en on one de ice and seamlessly con inue
on ano he wi hou dis up ion. Usage analy ics demons a e ha e ec i e c oss-de ice implemen a ions inc ease mul i-
session engagemen by 23% compa ed o de ice-speci ic app oaches [7]. This capabili y is pa icula ly aluable o
accessibili y-dependen use s, who epo 34% highe sa is ac ion wi h news pla o ms ha main ain consis en
accessibili y ea u es ac oss all access poin s.
P i acy P ese a ion h ough local p ocessing o use accessibili y p e e ences has eme ged as a c i ical conside a ion,
wi h 71% o news consume s exp essing conce n abou how hei accessibili y da a migh be used [8]. Leading
implemen a ions add ess his h ough edge compu ing app oaches ha main ain sensi i e p e e ence da a on use
de ices, wi h cloud sys ems ecei ing only anonymized and agg ega ed insigh s. Technical e alua ions demons a e
ha p i acy-p ese ing implemen a ions can main ain 91% o pe sonaliza ion e ec i eness while educing iden i iable
da a collec ion by 76%. These app oaches a e pa icula ly impo an o use s wi h disabili ies, who may ha e conce ns
abou how hei speci ic accessibili y equi emen s migh be used o iden i ica ion o a ge ing [8]. Implemen a ion
da a shows ha anspa en p i acy p ac ices inc ease accessibili y ea u e adop ion by 19%, highligh ing he
impo ance o use us in maximizing accessibili y impac .
5.3. Fu u e Di ec ions
Se e al eme ging echnologies p omise o u he enhance news accessibili y in signi ican ways. Mul imodal
Unde s anding sys ems ha in eg a e ex , audio, and isual con en a e showing ema kable p og ess, wi h ecen
implemen a ions demons a ing a 34% imp o emen in in o ma ion comp ehension o use s wi h disabili ies
compa ed o single-modali y app oaches [9]. These sys ems can p ocess mul imedia news con en wi h 91.2% seman ic
p ese a ion ac oss modali ies, add essing a c i ical need as app oxima ely 65% o digi al news now con ains mul iple
media o ma s. Pe sonalized Con en Adap a ion ep esen s ano he p omising di ec ion, wi h adap i e sys ems
showing a 27.3% inc ease in engagemen o use s wi h a ying cogni i e needs. Resea ch indica es ha models
analyzing jus 15-20 in e ac ion signals can e ec i ely adjus con en complexi y ac oss 3-5 dis inc le els while
main aining 94% o he o iginal meaning [10].