There is a newer version of the record available.

Published April 12, 2024 | Version v1
Dataset Open

LongEval 2024 Test Collection

Description

The collection consists of queries and documents provided by the Qwant search engine (https://www.qwant.com). The queries, which were issued by Qwant users, are based on selected trending topics. The documents in the collection were selected with respect to these queries using the Qwant click model. In addition to the documents selected using this model, the collection contains randomly selected documents from the Qwant index. All data were collected during June 2023 and August 2023. In total, the collection contains 1,925 test queries. The document set consists of 4,321,642 downloaded, cleaned, and filtered web pages. English translations of the web pages and queries will be added when available. The collection serves as the official test collection for the 2024 LongEval Information Retrieval Lab (https://clef-longeval.github.io/) organised at CLEF.

Files

LongEval 2024 Test Collection Readme.pdf

Files (11.2 GiB)

Checksum Size
md5:50c852b9682127ecbd2ffefbcfa902df 40.9 KiB
md5:3fd23c5db7dfc686b7158f370dcda7d9 11.2 GiB
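
The MD5 checksums listed for the files can be used to confirm that a download completed without corruption, which is worth doing for the 11.2 GiB archive. The following is a minimal sketch, not part of the official collection tooling: the helper names `md5_of_file` and `matches_listed_checksum` are illustrative, and any local file path passed to them is an assumption, since the record does not state the file names.

```python
import hashlib
from pathlib import Path


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of a file, reading it in fixed-size chunks.

    Chunked reading keeps memory use flat even for multi-gigabyte archives.
    """
    digest = hashlib.md5()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listed_checksum(path, listed):
    """Compare a local file against a checksum given as 'md5:<hex>' or plain hex."""
    # Drop an optional 'md5:' prefix and normalise case before comparing.
    expected = listed.split(":", 1)[-1].strip().lower()
    return md5_of_file(path) == expected
```

For example, calling `matches_listed_checksum(local_path, "md5:3fd23c5db7dfc686b7158f370dcda7d9")` with `local_path` pointing at the downloaded 11.2 GiB file (wherever it was saved) should return `True` if the download is intact.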