WebFAQ: Large-Scale Multilingual FAQ Datasets for Dense Retrieval and Cross-Lingual QA
I'd like to share a new contribution to multilingual ML research: WebFAQ introduces a collection of 2.7 million natural question-answer pairs from real website FAQs across 8 languages (English, German, French, Spanish, Italian, Portuguese, Dutch, a…