Workshop on Multimodal Augmented Generation via MultimodAl
https://nlp.jhu.edu/magmar/
Retrieval centering on topics including (but not limited to):
- Document retrieval
- Multimodal retrieval
- Retrieval-augmented generation (RAG)
- Multimodal RAG
The leaderboard will be up by the end of the week. Please feel free to reach out with any questions!
The leaderboard will be up by the end of the week. Please feel free to reach out with any questions!
* SigLIP features
* Whisper ASR transcripts
* PaddleOCR output
* ICDAR OCR output (Etter et al., 2023)
* Florence video captions (test only)
The train queries/judgments are also available, along with the test queries.
* SigLIP features
* Whisper ASR transcripts
* PaddleOCR output
* ICDAR OCR output (Etter et al., 2023)
* Florence video captions (test only)
The train queries/judgments are also available, along with the test queries.