
Diffusion-Pretrained Dense and Contextual Embeddings
The pplx-embed family of multilingual embedding models applies multi-stage contrastive learning to a diffusion-pretrained backbone, targeting web-scale retrieval. Two variants are released: pplx-embed-v1 for standard retrieval tasks and pplx-embed-context-v1 for contextual embeddings. The latter performs strongly on the ConTEB benchmark, and both models perform well on several other retrieval benchmarks and on internal evaluations, which supports their use in large-scale search applications.
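
As a rough illustration of what a contrastive objective over embeddings looks like, the sketch below computes an InfoNCE-style loss with in-batch negatives. The summary does not state which loss, temperature, or similarity function pplx-embed uses, so the choice of InfoNCE, the cosine similarity, the `temperature` value, and the function names `cosine` and `info_nce` are all assumptions made for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce(queries, documents, temperature=0.05):
    """Mean InfoNCE loss with in-batch negatives (illustrative assumption).

    documents[i] is treated as the positive for queries[i]; every other
    document in the batch acts as a negative for that query.
    """
    losses = []
    for i, query in enumerate(queries):
        logits = [cosine(query, doc) / temperature for doc in documents]
        # Numerically stable log-sum-exp over all candidate documents.
        peak = max(logits)
        log_denominator = peak + math.log(sum(math.exp(x - peak) for x in logits))
        losses.append(log_denominator - logits[i])
    return sum(losses) / len(losses)


# Toy batch: each query points along one axis of a 2-D space.
queries = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[0.9, 0.1], [0.1, 0.9]]   # positives close to their queries
swapped = [[0.1, 0.9], [0.9, 0.1]]   # positives far from their queries

print(info_nce(queries, aligned) < info_nce(queries, swapped))
```

The loss is lower when each query sits closer to its paired document than to the other documents in the batch, which is the behaviour a retrieval-oriented contrastive stage is meant to encourage.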










