Maik Fröbe
banner
maik-froebe.bsky.social
Maik Fröbe
@maik-froebe.bsky.social
PhD-Student in the webis.de group. Interested in IR and NLP.
Reposted by Maik Fröbe
We just released "German Commons", the largest openly-licensed German text dataset for LLM training: 154B tokens with clear usage rights for research and commercial use.

huggingface.co/datasets/coral-nlp/german-commons
coral-nlp/german-commons · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
October 27, 2025 at 12:45 PM
Reposted by Maik Fröbe
Dutch-Belgian Information Retrieval workshop #dir3025 today in Nijmegen! Opening by the great Harrie Oosterhuis

https://informagus.nl/dir2025/schedule
October 27, 2025 at 10:07 AM
Reposted by Maik Fröbe
Check out the slides from our SCAI'2025 #convsearch workshop collocated with @ijcai.org #IJCAI2025 on LLMs, retrieval & QA, recommendations, negotiations, evaluation and transparency

scai.info/scai-2025

@patuchen.bsky.social @maik-froebe.bsky.social @tuetschek.bsky.social @mila-quebec.bsky.social
SCAI 2025
Online Event on Search-Oriented Conversational AI.
scai.info
September 9, 2025 at 10:34 AM
Reposted by Maik Fröbe
🌟Really excited to share the fourth Strategic Workshop on Information Retrieval (SWIRL) report published in SIGIR Forum!

Paper 👉🏻 www.johannetrippas.com/papers/tripp...

More info 👉🏻 sites.google.com/view/swirl20...

#SWIRL2025 #SIGIR2026 #IR #GenAI #Research #CHIIR2026
September 2, 2025 at 12:38 PM
Reposted by Maik Fröbe
Some exciting news! 🤗 After 3 amazing years at TREC, the Tip-of-the-Tongue (ToT) shared task will be a core task at NTCIR-19 in 2026. The new track will focus on tip-of-the-tongue information needs in English and East Asian languages.

More details coming soon. See you all in Tokyo next year!
an aerial view of tokyo at night with lots of lights
ALT: an aerial view of tokyo at night with lots of lights
media.tenor.com
September 1, 2025 at 4:12 PM
Reposted by Maik Fröbe
Hello TREC-ToTers! 👋🏽

📆 Good news! We are extending the run submission deadline by 2 weeks. Please submit your runs by **September 10 (Wednesday)** and spread the word.

More info: trec-tot.github.io/guidelines
#TREC2025 #TRECToT #TREC2025ToT
August 25, 2025 at 11:31 AM
Reposted by Maik Fröbe
Gentle reminder 📢
All run submissions for the Tip-of-the-Tongue (ToT) Track are due next week Wednesday (Aug 27).

More info: trec-tot.github.io/guidelines
#TREC2025 #TRECToT #TREC2025ToT
August 19, 2025 at 4:45 PM
Here are some impressions from our ReNeuIR workshop on "Reaching Efficiency in Neural IR" that we had yesterday at #SIGIR2025.
July 18, 2025 at 8:41 AM
Reposted by Maik Fröbe
Happy to share that our paper "The Viability of Crowdsourcing for RAG Evaluation" received the Best Paper Honourable Mention at #SIGIR2025! Very grateful to the community for recognizing our work on improving RAG evaluation.

 📄 webis.de/publications...
July 16, 2025 at 9:04 PM
Now @fschlatt.bsky.social presents "TITE: Token-Independent Text Encoder for Information Retrieval" at #SIGIR2025

Paper: webis.de/publications...
July 16, 2025 at 9:08 AM
Reposted by Maik Fröbe
Want to know how to make bi-encoders more than 3x faster with a new backbone encoder model? Check out our talk on the Token-Independent Text Encoder (TITE) #SIGIR2025 in the efficiency track. It pools vectors within the model to improve efficiency dl.acm.org/doi/10.1145/...
July 16, 2025 at 7:28 AM
To Eun Kim just presented the work on "Tip of the Tongue Query Elicitation for Simulated Evaluation" at #SIGIR2025. The approach will be used in the #TREC2025 Tip-of-the-Tongue track, and we had some sweets at the poster :)

The paper is available online: dl.acm.org/doi/10.1145/...
July 15, 2025 at 2:30 PM
Lukas Gienapp presents "The Viability of Crowdsourcing for RAG Evaluation" at #SIGIR2025

The paper is available at: webis.de/publications...
July 15, 2025 at 1:53 PM
Reposted by Maik Fröbe
@mrparryparry.bsky.social presenting our work on reproducing TREC DL 2019 judgements and the implications for evaluating modern ranking models on modern collections. Paper: arxiv.org/abs/2502.20937
Variations in Relevance Judgments and the Shelf Life of Test Collections
The fundamental property of Cranfield-style evaluations, that system rankings are stable even when assessors disagree on individual relevance decisions, was validated on traditional test collections. ...
arxiv.org
July 14, 2025 at 2:49 PM
Here are some of the statistics that I found very interesting from the #SIGIR2025 opening session. (Over 1000 attendees!)
July 14, 2025 at 9:38 AM
Reposted by Maik Fröbe
Hello TREC-ToTers!

We have released the test queries for the TREC 2025 Tip-of-the-Tongue (TREC-ToT) Track. Please see the guidelines for more information: trec-tot.github.io/guidelines. Run submission deadline will tentatively be in August. #TREC2025 #TRECToT #TREC2025ToT

Please spread the word!
July 13, 2025 at 4:47 PM
Reposted by Maik Fröbe
Thank you Carlos for the shout-out of Lightning IR in the LSR tutorial at #SIGIR2025

If you want to fine your own LSR models, check out our framework at github.com/webis-de/lig...
July 13, 2025 at 2:42 PM
Reposted by Maik Fröbe
Never seen our editor in chief, Djoerd Hiemstra, more happy than today, holding a copy of the first issue of #irrj
July 1, 2025 at 3:26 PM
Do not forget to participate in the #TREC2025 Tip-of-the-Tongue (ToT) Track :)

The corpus and baselines (with run files) are now available and easily accessible via the ir_datasets API and the HuggingFace Datasets API.

More details are available at: trec-tot.github.io/guidelines
June 27, 2025 at 2:46 PM
The deadline for submissions to the ReNeuIR workshop at #SIGIR2025 is extended to June 10 😸

Details: reneuir.org

#ReNeuIr2025 #SIGIR25
ReNeuIR’25
Workshop on Reaching Efficiency in Neural Information Retrieval
reneuir.org
May 21, 2025 at 5:31 PM
Reposted by Maik Fröbe
Hello TREC-ToTers! 👋🏽

Excited to announce the release of TREC 2025 Tip-of-the-Tongue (TREC-ToT) Track guidelines: trec-tot.github.io/guidelines. We will release test queries in July and run submission deadline will be in August. #TREC2025 #TRECToT #TREC2025ToT

Please register to participate:
TREC 2025 Tip-of-the-Tongue (ToT) Track
Tip of the tongue: The phenomenon of failing to retrieve something from memory, combined with partial recall and the feeling that retrieval is imminent.
trec-tot.github.io
May 9, 2025 at 9:02 PM
Today I had the pleasure to talk about child-safe search at #ECIR2025. We created an cranfield-style evaluation dataset to contrast relevance with harm in web search scenarios.

Details: webis.de/publications...
April 10, 2025 at 3:14 PM
The Workshop on Open Web Search just finished #WOWS2025 #ECIR2025.

It was a very cool experience with many interesting talks. Lets hope we can do it again next year at #ECIR2026 in Delft :)
April 10, 2025 at 3:05 PM
The Workshop on Open Web Search at #ECIR2025 just starts with a keynote by @claclarke.bsky.social on Annotative Indexing. #WOWS25 #WOWS2025 #ECIR25
April 10, 2025 at 7:16 AM
Reposted by Maik Fröbe
Honored to receive the best short paper award and best paper honourable mention award at #ECIR2025. Thank you to all co-authors @maik-froebe.bsky.social, @hscells.bsky.social, Shengyao Zhuang, @bevankoopman.bsky.social, Guido Zuccon, Benno Stein, @martin-potthast.com, @matthias-hagen.bsky.social 🥳
April 9, 2025 at 12:37 PM