Lightnews — Scholar-powered news

Lj Miranda

@ljvmiranda.bsky.social

490 followers 110 following 20 posts

PhD student at the University of Cambridge

https://ljvmiranda921.github.io

Lj Miranda

@ljvmiranda.bsky.social

Finally, I want to thank the folks from HuggingFace for helping draft the official blog post (special shoutout to @clefourrier , @vanstriendaniel, @nathanhabib1011) and @Cohere_Labs for the research credits. :)

August 20, 2025 at 8:40 PM

Lj Miranda

@ljvmiranda.bsky.social

Evals are often the first step, and we hope FilBench paves the way for language-specific adaptation, especially for Philippine languages! I've written some of my thoughts here:

ljvmiranda921.github.io/projects/20...

August 20, 2025 at 8:40 PM

Lj Miranda

@ljvmiranda.bsky.social

Here's the link to the paper and leaderboard:

📜 Paper: arxiv.org/abs/2508.03523
📊 Leaderboard: ud-filipino-filbench-leaderboard.hf.space/

August 20, 2025 at 8:40 PM

Lj Miranda

@ljvmiranda.bsky.social

This collaboration was exciting: it felt like assembling the Avengers of Filipino NLP. @acocodes and Conner are great collaborators, and I was happy to team up with @jcblaisecruz and @josephimperial_, who have been working on Filipino NLP for longer than I have!

August 20, 2025 at 8:40 PM

Lj Miranda

@ljvmiranda.bsky.social

I was also part of a large-scale @seacrowd.bsky.social collaboration on building a vision-language dataset tailored for Southeast Asian languages :) Also at ACL Main - aclanthology.org/2025.acl-lo...

July 29 Hall 4/5 10:30-12:00

#ACL2025 #ACL2025NLP

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Samuel Cahyawijaya, Holy Lovenia, Joel Ruben Antony Moniz, Tack Hwa Wong, Mohammad Rifqi Farhansyah, Thant Thiri Maung, Frederikus Hudi, David Anugraha, Muhammad Ravi Shulthan Habibi, Muhammad Reza Qorib, Amit Agarwal, Joseph Marvin Imperial, Hitesh Laxmichand Patel, Vicky Feliren, Bahrul Ilmi Nasution, Manuel Antonio Rufino, Genta Indra Winata, Rian Adam Rajagede, Carlos Rafael Catalan, Mohamed Fazli Mohamed Imam, Priyaranjan Pattnayak, Salsabila Zahirah Pranida, Kevin Pratama, Yeshil Bangera, Adisai Na-Thalang, Patricia Nicole Monderin, Yueqi Song, Christian Simon, Lynnette Hui Xian Ng, Richardy Lobo Sapan, Taki Hasan Rafi, Bin Wang, Supryadi, Kanyakorn Veerakanjana, Piyalitt Ittichaiwong, Matthew Theodore Roque, Karissa Vincentio, Takdanai Kreangphet, Phakphum Artkaew, Kadek Hendrawan Palgunadi, Yanzhi Yu, Rochana Prih Hastuti, William Nixon, Mithil Bangera, Adrian Xuan Wei Lim, Aye Hninn Khine, Hanif Muhammad Zhafran, Teddy Ferdinan, Audra Aurora Izzani, Ayushman Singh, Evan Evan,

July 24, 2025 at 12:56 PM

Lj Miranda

@ljvmiranda.bsky.social

3️⃣ The UD-NewsCrawl Treebank: Reflections and Challenges from a Large-scale Tagalog Syntactic Annotation Project (Main) -
aclanthology.org/2025.acl-lo...

July 29 Hall 4/5 10:30-12:00

Collab with folks from UP Diliman

#ACL2025 #ACL2025NLP

July 24, 2025 at 12:56 PM

Lj Miranda

@ljvmiranda.bsky.social

2️⃣ M-RewardBench: Evaluating Reward Models in Multilingual Settings (Main) - aclanthology.org/2025.acl-lo...

July 28 Hall 4/5 11:00-12:30

Collab with folks from @cohereforai.bsky.social

#ACL2025 #ACL2025NLP

July 24, 2025 at 12:56 PM

Lj Miranda

@ljvmiranda.bsky.social

1️⃣ Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback (Main) - aclanthology.org/2025.acl-lo...

July 29 Hall 4/5 10:30-12:00

My project here at @ai2.bsky.social!

#ACL2025NLP

July 24, 2025 at 12:56 PM
