PEARL: Prototype-Enhanced Alignment for Label-Efficient Representation Learning with Deployment-Driven Insights from Digital Governance Communication Systems

Published in Preprint, 2026

We study deployed systems where new text inputs are routed by retrieving similar past cases, such as citizen messages in digital governance platforms. When this fails, the issue is often poorly aligned embedding neighborhoods rather than the language model itself. We propose PEARL (Prototype-Enhanced Aligned Representation Learning), a label-efficient method that softly aligns embeddings toward class prototypes to improve local neighborhood structure without changing dimensionality or requiring heavy retraining. Our approach bridges the gap between unsupervised post-processing and fully supervised projection, and yields strong gains precisely in the label-scarce settings where similarity-based systems are most brittle.

You can apply PEARL directly from PyPI:

pip install pearl-h

For full documentation and tutorials, visit the PEARL project page. The SSRN version is available here.

PEARL Overview
Figure. Neighborhood Geometry Illustration.

Citation as: Ruiyu ZHANG, Lin Nie, Wai-Fung Lam, Qihao Wang, and Xin Zhao. (2026). "PEARL: Prototype-Enhanced Alignment for Label-Efficient Representation Learning with Deployment-Driven Insights from Digital Governance Communication Systems." Preprint. arXiv:2601.17495. https://arxiv.org/abs/2601.17495