Open-source Contributions | Youngjoon Jang

Extended the cross-encoder training stack with classic learning-to-rank losses (Position-Aware ListMLELoss, RankNetLoss). [PR#6] [PR#7]
Introduced hardness-weighted contrastive learning to up-weight informative hard negatives. [PR#3667]
Implemented CachedSpladeLoss for gradient-cache-compatible, memory-efficient SPLADE training (4-16x larger batch sizes, 36-41% lower GPU memory usage). [PR#3670]
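
For readers unfamiliar with the learning-to-rank losses named above, here is a minimal sketch of the textbook RankNet objective in plain Python. It is not the code from the linked PRs: the function name `ranknet_loss` and its list-based signature are illustrative assumptions, and the real loss classes operate on batched tensors inside a training loop.

```python
import math
from itertools import combinations


def ranknet_loss(scores, labels):
    """Mean pairwise logistic loss over all document pairs with different labels.

    scores: model scores for the documents of one query (hypothetical input format)
    labels: graded relevance labels, higher means more relevant
    """
    losses = []
    for i, j in combinations(range(len(scores)), 2):
        if labels[i] == labels[j]:
            continue  # tied labels express no preference, so the pair is skipped
        # Orient the pair so that `hi` is the more relevant document.
        hi, lo = (i, j) if labels[i] > labels[j] else (j, i)
        diff = scores[hi] - scores[lo]
        # log(1 + exp(-diff)), written in a numerically stable form
        losses.append(max(-diff, 0.0) + math.log1p(math.exp(-abs(diff))))
    return sum(losses) / len(losses) if losses else 0.0
```

The loss shrinks as the more relevant document of each pair is scored further above the less relevant one, and it equals log 2 when the two scores are identical.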

Added a Korean retrieval benchmark task (AutoRAGRetrieval). [PR#1388]
Improved the stability of the OpenAI embedding wrapper with sentence trimming. [PR#1526]
Fixed NaN embeddings for Jasper models (float16 → bfloat16). [PR#2481]
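
The float16 → bfloat16 change in the last item can be illustrated with a toy example. The entry does not state the root cause, so this sketch assumes the common one: an intermediate value exceeds the float16 maximum (65504), becomes infinity, and a later division yields NaN. The helpers `to_float16`, `to_bfloat16`, and `self_ratio` are hypothetical and use only the standard library; bfloat16 is emulated by truncating a float32 to its top 16 bits.

```python
import math
import struct


def to_float16(x):
    """Round x to IEEE half precision; finite values beyond the range become +/-inf."""
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        # struct refuses to pack out-of-range values; hardware would produce inf.
        return math.copysign(math.inf, x)


def to_bfloat16(x):
    """Emulate bfloat16 by keeping the top 16 bits of the float32 encoding.

    This truncates instead of rounding, which is enough for the illustration.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFF0000))[0]


def self_ratio(value, cast):
    """Square `value`, store the result in the given format, then divide it by itself."""
    squared = cast(value * value)
    return squared / squared  # inf / inf is NaN


fp16_result = self_ratio(300.0, to_float16)   # 90000 overflows float16
bf16_result = self_ratio(300.0, to_bfloat16)  # 90000 fits the bfloat16 range
```

bfloat16 keeps the 8-bit exponent of float32, so it trades mantissa precision for range, and the same computation stays finite.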