scieee Science in your language
[en] (orig)

Differential Privacy and Data Anonymization Techniques for Cloud-Based Services

Author: Anoop Srivastava; Dr. Shakeeb Khan
Publisher: Zenodo
DOI: 10.5281/zenodo.17093367
Source: https://zenodo.org/records/17093367/files/3-5-9.1.pdf
30
h ps:// esea ch endsjou nal.com
Online a : h ps:// esea ch endsjou nal.com ISSN No: 2584-282X
Indexed Jou nal Pee Re iewed Jou nal
INTERNATIONAL JOURNAL OF TRENDS IN EMERGING RESEARCH AND DEVELOPMENT
Volume 3; Issue 5; 2025; Page No. 30-33
Recei ed: 16-07-2025
Accep ed: 20-08-2025
Published: 10-09-2025
Di e en ial P i acy and Da a Anonymiza ion Techniques o Cloud-
Based Se ices
1Anoop S i as a a and 2D . Shakeeb Khan
1Resea ch Schola , Depa men o Compu e Applica ion, Maha aja Ag asen Himalayan Ga hwal Uni e si y, U a akhand,
India
2Assis an P o esso , Depa men o Compu e Applica ion, Maha aja Ag asen Himalayan Ga hwal Uni e si y, U a akhand,
India
DOI: h ps://doi.o g/10.5281/zenodo.17093367
Co esponding Au ho : Anoop S i as a a
Abs ac
Wi h he explosi e g ow h o cloud-based se ices, massi e amoun s o pe sonal and sensi i e da a a e collec ed, s o ed, and p ocessed on
dis ibu ed pla o ms. While cloud in as uc u es enable powe ul big da a analy ics, hey also aise p essing conce ns abou use p i acy
and da a p o ec ion. T adi ional secu i y mechanisms such as enc yp ion a e e ec i e agains unau ho ized access bu ail o p e en p i acy
leakage du ing legi ima e da a analysis. This pape explo es di e en ial p i acy, k-anonymi y, and l-di e si y as leading anonymiza ion
app oaches o sa egua d indi idual p i acy while main aining da a u ili y. Th ough compa a i e analysis, he pape highligh s s eng hs and
limi a ions o hese echniques and p oposes a scalable anonymiza ion amewo k ailo ed o cloud-based big da a en i onmen s. The
amewo k in eg a es di e en ial p i acy wi h classical anonymiza ion s a egies o achie e bo h obus p i acy gua an ees and e icien
pe o mance in la ge-scale, mul i- enan cloud sys ems.
Keywo ds: Cloud Compu ing, Da a P i acy, Di e en ial P i acy, k-Anonymi y, l-Di e si y, Da a Anonymiza ion, Big Da a Secu i y
In oduc ion
Cloud compu ing has ans o med he s o age and
p ocessing o da a by o e ing scalable, elas ic, and cos -
e icien solu ions. F om e-comme ce ansac ions o
heal hca e eco ds, as olumes o pe sonal in o ma ion a e
now hos ed in cloud in as uc u es. These da ase s enable
o ganiza ions o pe o m analy ics ha gene a e insigh s o
inno a ion, policy, and decision-making. Howe e , he
same da a also c ea es se ious p i acy isks.
T adi ional da a p o ec ion me hods such as enc yp ion,
i ewalls, and access con ol p e en unau ho ized access
bu canno ully add ess p i acy leaks du ing au ho ized
analy ics. Fo ins ance, e en anonymized da ase s can be e-
iden i ied using backg ound knowledge o linkage a acks,
as demons a ed in se e al high-p o ile p i acy b eaches.
This gap has led o he adop ion o p i acy-p ese ing da a
publishing echniques, mos no ably:
▪ k-Anonymi y: Ensu es each eco d is indis inguishable
om a leas k-1 o he s.
▪ l-Di e si y: Ex ends k-anonymi y by ensu ing sensi i e
a ibu es ha e di e se alues.
▪ Di e en ial P i acy (DP): Adds con olled s a is ical
noise o que y esul s, p o iding s ong ma hema ical
p i acy gua an ees.
This pape in es iga es hese h ee models in he con ex o
cloud-based se ices and p oposes a hyb id scalable
anonymiza ion amewo k designed o big da a in he
cloud.
Aims and Objec i es
The p ima y aim o his s udy is o examine p i acy-
In e na ional Jou nal o T ends in Eme ging Resea ch and De elopmen h ps:// esea ch endsjou nal.com
31
h ps:// esea ch endsjou nal.com
p ese ing anonymiza ion echniques in cloud-based
se ices and design a scalable hyb id amewo k ha
ensu es s ong p i acy while suppo ing meaning ul
analy ics.
Objec i es include
1. To analyze k-anonymi y, l-di e si y, and di e en ial
p i acy in e ms o s eng hs, weaknesses, and
applicabili y o cloud pla o ms.
2. To e alua e hei impac on da a u ili y, scalabili y, and
esis ance o p i acy a acks.
3. To design a compa a i e amewo k ha highligh s
ade-o s be ween anonymiza ion app oaches.
4. To p opose a scalable anonymiza ion amewo k o
cloud en i onmen s ha in eg a es di e en ial p i acy
wi h classical anonymiza ion s a egies.
Re iew o Li e a u e
▪ k-Anonymi y: In oduced by Sama a i & Sweeney
(1998) [9], k-anonymi y has been widely applied o
s uc u ed da a publishing. Howe e , s udies
(Machana ajjhala e al., 2006) [10] show i is ulne able
o homogenei y and backg ound knowledge a acks.
▪ l-Di e si y: P oposed as an imp o emen o e k-
anonymi y, l-di e si y ensu es sensi i e a ibu es
emain di e se wi hin anonymized g oups. Ye , i has
limi a ions agains skewness and simila i y a acks (Li
e al., 2007) [4].
▪ Di e en ial P i acy (DP): Fo malized by Dwo k
(2006) [12], DP in oduces andomized noise o que y
ou pu s, gua an eeing s ong ma hema ical p i acy
ega dless o ad e sa ial knowledge. Resea che s
highligh i s obus ness bu no e ade-o s in accu acy
and compu a ional cos .
▪ P i acy in Cloud En i onmen s: S udies by Subashini
& Ka i ha (2011) [11] emphasize ha cloud mul i-
enancy inc eases isks o c oss-use p i acy leakage.
Recen wo ks sugges hyb id me hods combining
anonymiza ion wi h DP o scalabili y in big da a
sys ems.
The li e a u e unde sco es ha while classical
anonymiza ion ensu es in e p e abili y, only di e en ial
p i acy p o ides p o able gua an ees. The e o e, a hyb id
app oach is necessa y o mode n cloud se ices.
Resea ch Me hodologies
This s udy adop s a quali a i e, compa a i e, and desc ip i e
design, combining li e a u e analysis wi h compa a i e
e alua ion o anonymiza ion models.
S eps
1. Tex ual Analysis: Re iew o amewo ks and models
(k-anonymi y, l-di e si y, DP) om academic pape s,
cloud secu i y whi e pape s, and indus y s anda ds.
2. Compa a i e F amewo k: C i e ia such as scalabili y,
p i acy assu ance, a ack esis ance, and da a u ili y a e
compa ed.
3. C i ical Re iew: Focus on challenges in mul i- enan
cloud and big da a con ex s (2010–2022).
4. In e p e i e Lens: Emphasis on concep ual in eg a ion
o anonymiza ion echniques in o a uni ied amewo k.
Table 1: Compa a i e Analysis o Anonymiza ion Techniques
C i e ia
k-Anonymi y
l-Di e si y
Di e en ial P i acy
P i acy Assu ance
Mode a e – p e en s di ec e-ID
S onge – p e en s homogenei y a acks
Ve y s ong – p o able gua an ees
Scalabili y
Limi ed – s uggles wi h la ge
da ase s
Mode a e – compu a ionally expensi e
High – scalable wi h e icien noise
addi ion
Da a U ili y
High – p ese es da ase s uc u e
Mode a e – may dis o sensi i e alues
Va iable – depends on noise magni ude
A ack Resis ance
Weak o backg ound knowledge
a acks
S onge bu weak o skewness a acks
S ong agains mos ad e sa ial
s a egies
Complexi y
Low – easy o implemen
Mode a e – a ibu e dis ibu ion equi ed
High – equi es s a is ical expe ise
Table 2: Applica ion in Cloud-Based Big Da a Se ices
Use Case
k-Anonymi y
l-Di e si y
Di e en ial P i acy
Heal hca e Da a
P o ec s pa ien iden i ie s
P ese es di e si y o diseases
P o ec s agg ega e s a is ics wi h noise
E-Comme ce T ansac ions
Masks use IDs
Ensu es pu chase di e si y
P o ec s shopping end analysis
Social Media Analy ics
G oups use demog aphics
Adds a ia ion in a ibu es
P o ec s insigh s om la ge use da ase s
Resul s and In e p e a ion
The analysis e eals:
1. k-Anonymi y is sui able o s uc u ed da ase s wi h
low p i acy isks, bu ails unde linkage o backg ound
knowledge a acks.
2. l-Di e si y imp o es obus ness, bu becomes
compu a ionally expensi e and less e ec i e o highly
skewed da a.
3. Di e en ial P i acy p o ides he s onges
ma hema ical p i acy gua an ees, making i highly
sui able o big da a analy ics in he cloud, hough a he
cos o educed accu acy.
4. Hyb id F amewo k P oposal: A wo-laye
anonymiza ion sys em ha applies:
▪ k-Anonymi y/l-Di e si y a he da ase le el o
s uc u al anonymiza ion.
▪ Di e en ial P i acy a he que y le el o p o ide
p o able gua an ees du ing da a analy ics.
In e na ional Jou nal o T ends in Eme ging Resea ch and De elopmen h ps:// esea ch endsjou nal.com
32
h ps:// esea ch endsjou nal.com
Table 3: Hyb id F amewo k Ad an ages
Fea u e
T adi ional Me hods (k/l)
Di e en ial P i acy
Hyb id Model
P i acy S eng h
Mode a e o S ong
Ve y S ong
Ve y S ong + De ense in Dep h
Da a U ili y
High
Va iable (depends on noise)
Balanced – p ese es s uc u e & adds DP noise
Scalabili y
Limi ed in la ge da ase s
High
High – op imized o cloud big da a
A ack Resis ance
Vulne able o ad anced a acks
S ong
S onge – mul iple laye s o de ense
Discussion and Conclusion
P i acy emains he co ne s one o us in cloud compu ing,
whe e sensi i e use da a is con inuously s o ed, sha ed, and
analyzed. The indings o his s udy sugges ha no single
anonymiza ion echnique can add ess all p i acy isks and
pe o mance equi emen s o mode n cloud pla o ms.
Ins ead, a laye ed app oach ha in eg a es adi ional
anonymiza ion echniques (k-anonymi y, l-di e si y) wi h
o mal p i acy models (di e en ial p i acy) is he mos
e ec i e s a egy.
Compa a i e Insigh s
▪ K-Anonymi y and L-Di e si y: These models a e
in ui i e, simple, and sui able o s uc u ed da ase s
such as heal h eco ds, census da a, o inancial logs.
They wo k by gene alizing o supp essing iden i ying
a ibu es, ensu ing ha indi idual eco ds canno be e-
iden i ied easily. Howe e , hey a e ulne able o
linkage and backg ound knowledge a acks, and hei
scalabili y diminishes in la ge, high-dimensional
da ase s common in cloud en i onmen s.
▪ Di e en ial P i acy (DP): DP o e s ma hema ical
gua an ees by in oducing con olled andom noise a
he que y o da ase le el. I p e en s a acke s om
in e ing he p esence o absence o a single indi idual
in a da ase , ega dless o hei auxilia y in o ma ion.
Al hough DP ensu es s onge p i acy, i ades o
accu acy, especially in scena ios equi ing ine-g ained
analy ics. Pe o mance o e head and pa ame e uning
(ε – p i acy budge ) emain key challenges.
P oposed Hyb id F amewo k
The s udy p oposes a hyb id anonymiza ion amewo k o
cloud-based se ices ha combines he s eng hs o bo h
models:
1. Da ase -Le el P o ec ion: Apply k-anonymi y/l-
di e si y o p ep ocess da a be o e uploading i o cloud
s o age. This educes he isk o e-iden i ica ion in aw
da ase s.
2. Que y-Le el P o ec ion: Apply di e en ial p i acy
mechanisms (Laplace o Gaussian noise injec ion)
du ing que y execu ion o da a analy ics. This p e en s
a acke s om exploi ing agg ega ed esul s.
This laye ed a chi ec u e enhances esilience agains
ad e sa ial a acks, main ains analy ical u ili y, and is
scalable o big da a wo kloads.
Table 4: Compa a i e S eng hs and Weaknesses
Technique
S eng hs
Weaknesses
Sui abili y in Cloud
K-Anonymi y
Easy o implemen , in ui i e, educes di ec
iden i ie s
Vulne able o homogenei y & linkage
a acks
Medium – s uc u ed da a only
L-Di e si y
S onge han k-anonymi y, p o ec s sensi i e
a ibu es
S ill ails agains skewness/backg ound
a acks
Medium – mode a e da ase s
Di e en ial P i acy
Fo mal p i acy gua an ees, esis s auxilia y
knowledge a acks
Accu acy ade-o , compu a ional
o e head
High – scalable big da a,
analy ics
Hyb id (P oposed)
Combines simplici y + ma hema ical s eng h
Inc eased design complexi y,
pe o mance cos
Ve y High – mul i- enan
cloud
Table 5: T ade-o s be ween P i acy, U ili y, and Scalabili y
C i e ion
K-Anonymi y
L-Di e si y
Di e en ial P i acy
Hyb id F amewo k
P i acy S eng h
Low–Medium
Medium
High
Ve y High
Da a U ili y
High
Medium–High
Medium
Medium–High
Scalabili y
Medium
Medium
High
High
Implemen a ion Cos
Low
Medium
High
High
Key Implica ions
▪ Fo Cloud P o ide s: Adop ing hyb id models can
inc ease use us , educe compliance isks (GDPR,
HIPAA), and imp o e ma ke compe i i eness.
▪ Fo Use s: Ensu es s onge p o ec ion o pe sonal da a
wi hou sac i icing oo much u ili y in analy ics.
▪ Fo Resea che s: Opens a enues o op imize hyb id
amewo ks wi h adap i e noise calib a ion, machine
lea ning-based anonymiza ion, and eal- ime policy
en o cemen .
Fu u e Di ec ions
1. P o o ype De elopmen : Build es beds o benchma k
hyb id anonymiza ion models in eal cloud pla o ms
(AWS, Azu e, GCP).
2. Adap i e P i acy Budge s: Explo e AI/ML me hods
o dynamically adjus di e en ial p i acy pa ame e s
(ε) based on que y sensi i i y.
3. In eg a ion wi h Blockchain: Use dis ibu ed ledge
echnology o enhance accoun abili y and ensu e
anspa en audi ing o anonymized da a.
In e na ional Jou nal o T ends in Eme ging Resea ch and De elopmen h ps:// esea ch endsjou nal.com
33
h ps:// esea ch endsjou nal.com
4. Policy and Compliance Mapping: Align hyb id
amewo ks wi h GDPR, HIPAA, and eme ging global
da a p o ec ion laws.
Conclusion
The s udy concludes ha while adi ional anonymiza ion
echniques (k-anonymi y and l-di e si y) p o ide
in e p e abili y and ease o use, hey a e no esilien enough
o mode n ad e sa ial scena ios. Di e en ial p i acy s ands
ou as he mos obus app oach bu comes wi h ade-o s in
u ili y and complexi y. The p oposed hyb id amewo k
balances hese dimensions by combining da ase -le el
anonymiza ion wi h que y-le el di e en ial p i acy
gua an ees, making i be e sui ed o scalable, secu e, and
p i acy-p ese ing cloud analy ics.
This wo k lays he ounda ion o nex -gene a ion p i acy-
awa e cloud se ices ha can mee he g owing demands o
big da a, AI, and egula o y compliance.
Re e ences
1. Dwo k C. Di e en ial p i acy in new se ings. In:
P oceedings o he 21s Annual ACM-SIAM
Symposium on Disc e e Algo i hms (SODA); c2011.
2. Fung BCM, Wang K, Chen R, Yu PS. P i acy-
p ese ing da a publishing: A su ey o ecen
de elopmen s. ACM Compu ing Su eys.
2010;42(4):1–53.
3. Machana ajjhala A, Ki e D, Geh ke J,
Venki asub amaniam M. l-di e si y: P i acy beyond k-
anonymi y. ACM T ansac ions on Knowledge
Disco e y om Da a. 2007;1(1):3.
4. Li N, Li T, Venka asub amanian S. -Closeness:
P i acy beyond k-anonymi y and l-di e si y. In:
P oceedings o he IEEE 23 d In e na ional Con e ence
on Da a Enginee ing (ICDE); c2007. p. 106–115.
5. Abadi M, Chu A, Good ellow I, McMahan HB,
Mi ono I, Talwa K, e al. Deep lea ning wi h
di e en ial p i acy. In: P oceedings o he 2016 ACM
SIGSAC Con e ence on Compu e and
Communica ions Secu i y (CCS). 2016. p. 308–318.
6. Xiao X, Tao Y. Ana omy: Simple and e ec i e p i acy
p ese a ion. In: P oceedings o he 32nd In e na ional
Con e ence on Ve y La ge Da a Bases (VLDB); c2006.
p. 139–150.
7. Go yczka S, Xiong L. A comp ehensi e compa ison o
mul ipa y secu e addi ions wi h di e en ial p i acy.
IEEE T ansac ions on Dependable and Secu e
Compu ing. 2015;14(5):463–477.
8. Zhang X, Yang LT, Chen Z, Li P. P i acy-p ese ing
machine lea ning in cloud en i onmen s: A su ey.
IEEE Access. 2019;7:170443–170460.
9. Sama a i P, Sweeney L. P o ec ing p i acy when
disclosing in o ma ion: k-anonymi y and i s
en o cemen h ough gene aliza ion and supp ession.
In: P oceedings o he IEEE Symposium on Secu i y
and P i acy; c1998. p. 384–393.
10. Machana ajjhala A, Ki e D, Geh ke J,
Venki asub amaniam M. l-di e si y: P i acy beyond k-
anonymi y. ACM T ansac ions on Knowledge
Disco e y om Da a. 2006;1(1):3.
11. Li N, Li T, Venka asub amanian S. -Closeness:
P i acy beyond k-anonymi y and l-di e si y. In:
P oceedings o he IEEE 23 d In e na ional Con e ence
on Da a Enginee ing (ICDE); c2007. p. 106–115.
12. Dwo k C. Di e en ial p i acy. In: P oceedings o he
33 d In e na ional Colloquium on Au oma a, Languages
and P og amming (ICALP); c2006. p. 1–12.
13. Subashini S, Ka i ha V. A su ey on secu i y issues in
se ice deli e y models o cloud compu ing. Jou nal o
Ne wo k and Compu e Applica ions. 2011;34(1):1–11.
C ea i e Commons (CC) License
This a icle is an open access a icle dis ibu ed unde
he e ms and condi ions o he C ea i e Commons
A ibu ion (CC BY 4.0) license. This license pe mi s
un es ic ed use, dis ibu ion, and ep oduc ion in any
medium, p o ided he o iginal au ho and sou ce a e
c edi ed.