---
license: cc-by-sa-3.0
datasets:
- natural_questions
language:
- en
tags:
- colbert
- natural questions
- checkpoint
- text retrieval
metrics:
- type: NQ 10 Recall
- value: 71.1
- type: NQ 20 Recall
- value: 76.3
- type: NQ 50 Recall
- value: 80.4
- type: NQ 100 Recall
- value: 82.7
- type: NQ 10 MRR
- value: 52.1
- type: NQ 20 MRR
- value: 52.3
- type: NQ 50 MRR
- value: 52.5
- type: NQ 100 MRR
- value: 52.5
---

# ColBERT NQ Checkpoint

The ColBERT NQ Checkpoint is a trained model based on the ColBERT architecture, which uses a BERT encoder. It was trained on the Natural Questions (NQ) dataset for text retrieval.

| Model Detail | Description |
| ----------- | ----------- |
| Model Authors | ? |
| Date | Feb 7, 2023 |
| Version | Checkpoint |
| Type | Text retrieval |
| Paper or Other Resources | Base Model: [ColBERT](https://github.com/stanford-futuredata/ColBERT); Dataset: [Natural Questions](https://huggingface.co/datasets/natural_questions) |
| License | Other |
| Questions or Comments | [Community Tab](https://huggingface.co/Intel/ColBERT-NQ/discussions) and [Intel DevHub Discord](https://discord.gg/rv2Gp55UJQ) |

| Intended Use | Description |
| ----------- | ----------- |
| Primary intended uses | This model is designed for text retrieval: users submit a query and receive relevant passages from a corpus, in this case Wikipedia. It can be integrated into applications that need efficient and accurate retrieval of information for user queries. |
| Primary intended users | Researchers, developers, and organizations that need a text retrieval component for their systems or workflows, especially for retrieval from large, diverse corpora such as Wikipedia. |
| Out-of-scope uses | The model is not intended for tasks other than text retrieval, such as text generation, sentiment analysis, or other natural language processing tasks unrelated to retrieving relevant passages. |

# Evaluation

The ColBERT NQ Checkpoint was evaluated on the NQ dev set. The table below reports Recall and MRR at several cutoffs, where the cutoff is the number of passages retrieved per query:


| NQ top-k | Recall | MRR |
| --- | --- | --- |
| 10 | 71.1 | 52.0 |
| 20 | 76.3 | 52.3 |
| 50 | 80.4 | 52.5 |
| 100 | 82.7 | 52.5 |
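
For readers who want to compute comparable metrics on their own retrieval runs, the following is a minimal Python sketch of Recall@k and MRR@k. It is not the evaluation script used for the numbers above, and the card does not state the exact protocol. The sketch assumes the definition common in open-domain QA retrieval, where Recall@k is the fraction of queries with at least one relevant passage in the top k. The function names, query IDs, and passage IDs are all hypothetical.

```python
from typing import Dict, List, Set


def recall_at_k(ranked: Dict[str, List[str]],
                relevant: Dict[str, Set[str]],
                k: int) -> float:
    """Fraction of queries with at least one relevant passage in the top k.

    Assumption: this "success at k" definition may differ from the one
    used to produce the reported numbers.
    """
    hits = sum(
        1
        for qid, passages in ranked.items()
        if any(pid in relevant.get(qid, set()) for pid in passages[:k])
    )
    return hits / len(ranked)


def mrr_at_k(ranked: Dict[str, List[str]],
             relevant: Dict[str, Set[str]],
             k: int) -> float:
    """Mean reciprocal rank of the first relevant passage within the top k.

    A query with no relevant passage in the top k contributes 0.
    """
    total = 0.0
    for qid, passages in ranked.items():
        gold = relevant.get(qid, set())
        for rank, pid in enumerate(passages[:k], start=1):
            if pid in gold:
                total += 1.0 / rank
                break
    return total / len(ranked)


# Hypothetical toy run: ranked passage IDs per query, best first.
ranked = {
    "q1": ["p3", "p1", "p7"],  # first relevant passage at rank 2
    "q2": ["p9", "p4", "p2"],  # no relevant passage retrieved
}
relevant = {"q1": {"p1"}, "q2": {"p5"}}

print(recall_at_k(ranked, relevant, k=3))  # 0.5
print(mrr_at_k(ranked, relevant, k=3))     # 0.25
```

Under these definitions, MRR changes little as k grows because a first hit at a deep rank adds only a small reciprocal, while Recall keeps rising. The table shows the same pattern.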