Bridging the Gap: New Datasets Push Recommender Research Toward Real-World Scale
KDnuggets
JUNE 11, 2025
In recent years, several new datasets have been made public that aim to better reflect real-world usage patterns, spanning music, e-commerce, advertising, and beyond. One notable recent release is Yambda-5B, a 5-billion-event dataset contributed by Yandex, based on data from its music streaming service and now available via Hugging Face.