(TL;DR) We propose RESFL, a federated learning framework that jointly optimizes privacy, group fairness, and detection utility via adversarial representation disentanglement and evidential uncertainty-guided aggregation — reducing membership-inference attack success by 37% and the equality-of-opportunity gap by 17% vs. FedAvg, while maintaining competitive mAP on FACET and CARLA.
Federated Learning (FL) has gained prominence in machine learning across critical domains, enabling collaborative model training without centralized data aggregation. However, FL frameworks that protect privacy often sacrifice fairness: differential privacy reduces data leakage but hides sensitive attributes needed for bias correction, worsening performance gaps across demographic groups. This work explores the trade-off between privacy and fairness in FL-based object detection and introduces RESFL, an integrated solution jointly optimizing both.
RESFL incorporates adversarial privacy disentanglement and uncertainty-guided fairness-aware aggregation. The adversarial component uses a gradient reversal layer to remove sensitive attributes from learned representations. The uncertainty-aware aggregation employs an evidential neural network to weight client updates adaptively, prioritizing contributions with lower fairness disparities and higher confidence. We validate on high-stakes autonomous vehicle scenarios (FACET, CARLA) and non-visual benchmarks (Adult, TweetEval), confirming RESFL as a domain-agnostic foundation for responsible federated optimization.
RESFL addresses two core challenges simultaneously: (i) preventing sensitive-attribute leakage during training, and (ii) mitigating bias in client updates. The framework integrates an adversarial privacy module and an evidential uncertainty head, both operating locally, with a fairness-aware server aggregation rule.
YOLOv8 backbone computes \(h_i = f(x;\,\theta_i)\) per client.
GRL drives \(I(H;S) \!\to\! 0\), pushing attribute inference toward chance level.
Dirichlet \(\alpha_0\) per group yields scale-invariant disparity score \(\mathrm{UFM}_i\).
Server: \(\omega_i \propto \exp(-\beta\,\mathrm{UFM}_i)\) weights low-disparity clients higher.
Each client replaces its softmax detection head with an evidential output layer that predicts a non-negative concentration vector \(\boldsymbol{\alpha} = (\alpha_1,\ldots,\alpha_C)\) parameterizing a Dirichlet over class probabilities. The total evidence \(\alpha_0 = \sum_c \alpha_c\) gives a closed-form epistemic variance:
Higher \(\alpha_0\) means lower uncertainty. Raw logits are passed through \(\alpha_c = 1 + \mathrm{softplus}(z_c)\) to ensure strict positivity. For group \(g\), the group-wise mean evidence aggregated over post-NMS detections \(\mathcal{P}_\tau(x)\) at threshold \(\tau\) is:
The Uncertainty Fairness Metric is the normalized inter-group epistemic gap:
UFM equals zero under perfect group parity and grows monotonically with disparity. It is scale-invariant and provably bounds the downstream fairness gaps \(|1-\mathrm{DI}|\) and \(\Delta\mathrm{EOP}\) (Appendix B, Thm. B.1 → Cor. B.5).
The feature extractor \(f(x;\theta)\) is augmented with an adversarial classifier \(A(h;\phi)\) predicting sensitive attribute \(s \in \{1,\ldots,K\}\) from representation \(h\). A Gradient Reversal Layer \(R_{\lambda}\) negates gradients by \(-\lambda_\mathrm{priv}\) during backpropagation, turning the local minimax into:
By Fano's inequality, as \(I(H;S) \to 0\) the minimum achievable attribute-inference error satisfies \(P_e \to 1 - 1/K\) — chance level. The coefficient \(\lambda_\mathrm{priv}\) directly tunes the privacy–utility trade-off along a piecewise-convex frontier (Appendix C).
Each client minimises the composite local objective balancing detection, privacy, and fairness:
After local updates, client \(i\) transmits \((\Delta\theta_i,\,\mathrm{UFM}_i)\) to the server. The server performs fairness-weighted global aggregation:
As \(\beta \to 0\) this recovers uniform FedAvg; larger \(\beta\) concentrates weight on clients with minimal inter-group uncertainty disparity. A deterministic gate (per-client validation mAP floor 0.30) blocks low-confidence clients from receiving high weight.
We evaluate in an autonomous vehicle context using FACET (32,000 real-world images, 50,000+ person instances annotated on the 10-level Monk Skin Tone scale, split into \(K=4\) IID clients) and CARLA (6,000 fine-tuning frames + 7,800 evaluation frames across 3 urban maps under 13 weather conditions). RESFL is implemented on a modified YOLOv8 backbone with an evidential Dirichlet head, trained for \(T=100\) communication rounds with \(\lambda_\mathrm{priv}=0.1\), \(\lambda_\mathrm{fair}=0.01\), \(\beta=2.0\), averaged over 3 seeds.
We evaluate under cloud, rain, and fog at intensities 0–100% across 3 urban CARLA maps. Fog is the hardest condition as it removes scene structure unevenly — objects with darker appearance disappear first, amplifying group disparities. RESFL's evidence flooring, temperature control, and vacuity masking slow the growth of disparity and attack success under all conditions.
Under non-IID client data (Dirichlet split, \(\alpha=0.5\), \(H_\mathrm{TV} \approx 0.33\)), RESFL maintains the best joint fairness–privacy profile with top mAP 0.538. Scaling to \(N=8\) clients under IID, RESFL achieves mAP 0.654 / fairness score 0.220 / privacy score 0.206, confirming linear computational scalability (communication cost \(\mathcal{O}(N|\theta|)\), matching FedAvg).
To confirm domain-agnosticism we evaluate on structured tabular data (Adult income, sensitive attribute: race) and text classification (TweetEval sentiment, sensitive attribute: gender) with \(K=4\) IID clients.
| Algorithm | Acc ↑ | Fair ↓ | Priv ↓ |
|---|---|---|---|
| FedAvg | 0.853 | 0.319 | 0.372 |
| FedAvg-DP | 0.706 | 0.302 | 0.222 |
| FairFed | 0.845 | 0.256 | 0.397 |
| PUFFLE | 0.829 | 0.295 | 0.285 |
| RESFL (ours) | 0.848 | 0.232 | 0.239 |
| Algorithm | Acc ↑ | Fair ↓ | Priv ↓ |
|---|---|---|---|
| FedAvg | 0.526 | 0.044 | 0.343 |
| FedAvg-DP | 0.372 | 0.042 | 0.195 |
| FairFed | 0.531 | 0.036 | 0.408 |
| PUFFLE | 0.496 | 0.044 | 0.285 |
| RESFL (ours) | 0.507 | 0.033 | 0.235 |
When sensitive labels are unavailable (regulated deployments), three label-free variants derive latent cohorts from model internals:
All three variants recover over 90% of the fairness and privacy improvements of labeled RESFL with less than 2% accuracy drop, confirming that uncertainty-guided aggregation captures latent heterogeneity without explicit demographic supervision.
If you find RESFL useful in your research, please cite:
@inproceedings{
wasif2026resfl,
title={{RESFL}: An Uncertainty-Aware Framework for Responsible Federated Learning by Balancing Privacy, Fairness and Utility},
author={Dawood Wasif and Terrence J Moore and Jin-Hee Cho},
booktitle={The Fourteenth International Conference on Learning Representations},
year={2026},
url={https://openreview.net/forum?id=Wfz7gpoDSl}
}