Lj Miranda
banner
ljvmiranda.bsky.social
Lj Miranda
@ljvmiranda.bsky.social
PhD student at the University of Cambridge

https://ljvmiranda921.github.io
Finally, I want to thank the folks from HuggingFace for helping draft the official blog post (special shoutout to @clefourrier , @vanstriendaniel, @nathanhabib1011) and @Cohere_Labs for the research credits. :)
August 20, 2025 at 8:40 PM
Evals are often the first step, we hope FilBench paves the way for language-specific adaptation especially for Philippine languages! I've written some of my thoughts here:

ljvmiranda921.github.io/projects/20...
August 20, 2025 at 8:40 PM
Here's the link to the paper and leaderboard:

📜 Paper: arxiv.org/abs/2508.03523
📊 Leaderboard: ud-filipino-filbench-leaderboard.hf.space/
August 20, 2025 at 8:40 PM
This collaboration is exciting, it felt like assembling the Avengers of Filipino NLP. @acocodes and Conner are great collaborators, and I was happy to team-up with @jcblaisecruz and @josephimperial_, who are working on Filipino NLP for longer than I did!
August 20, 2025 at 8:40 PM
I was also part of a large-scale @seacrowd.bsky.social collaboration on building a vision-language dataset tailored for Southeast Asian Languages :) Also at ACL Main - aclanthology.org/2025.acl-lo...

July 29 Hall 4/5 10:30-12:00

#ACL2025 #ACL2025NLP
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
Samuel Cahyawijaya, Holy Lovenia, Joel Ruben Antony Moniz, Tack Hwa Wong, Mohammad Rifqi Farhansyah, Thant Thiri Maung, Frederikus Hudi, David Anugraha, Muhammad Ravi Shulthan Habibi, Muhammad Reza Qorib, Amit Agarwal, Joseph Marvin Imperial, Hitesh Laxmichand Patel, Vicky Feliren, Bahrul Ilmi Nasution, Manuel Antonio Rufino, Genta Indra Winata, Rian Adam Rajagede, Carlos Rafael Catalan, Mohamed Fazli Mohamed Imam, Priyaranjan Pattnayak, Salsabila Zahirah Pranida, Kevin Pratama, Yeshil Bangera, Adisai Na-Thalang, Patricia Nicole Monderin, Yueqi Song, Christian Simon, Lynnette Hui Xian Ng, Richardy Lobo Sapan, Taki Hasan Rafi, Bin Wang, Supryadi, Kanyakorn Veerakanjana, Piyalitt Ittichaiwong, Matthew Theodore Roque, Karissa Vincentio, Takdanai Kreangphet, Phakphum Artkaew, Kadek Hendrawan Palgunadi, Yanzhi Yu, Rochana Prih Hastuti, William Nixon, Mithil Bangera, Adrian Xuan Wei Lim, Aye Hninn Khine, Hanif Muhammad Zhafran, Teddy Ferdinan, Audra Aurora Izzani, Ayushman Singh, Evan Evan,
aclanthology.org
July 24, 2025 at 12:56 PM
3️⃣ The UD-NewsCrawl Treebank: Reflections and Challenges from a Large-scale Tagalog Syntactic Annotation Project (Main) -
aclanthology.org/2025.acl-lo...

July 29 Hall 4/5 10:30-12:00

Collab with folks from UP Diliman

#ACL2025 #ACL2025NLP
July 24, 2025 at 12:56 PM
2️⃣ M-RewardBench: Evaluating Reward Models in Multilingual Settings (Main) - aclanthology.org/2025.acl-lo...

July 28 Hall 4/5 11:00-12:30

Collab with folks from @cohereforai.bsky.social

#ACL2025 #ACL2025NLP
July 24, 2025 at 12:56 PM
1️⃣ Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback (Main) - aclanthology.org/2025.acl-lo...

7/29 Hall 4/5 10:30-12:00

My project here at @ai2.bsky.social!

#ACL2025NLP
July 24, 2025 at 12:56 PM