Siyan Sylvia Li ✨
@siyan-li-sparkle.bsky.social
2nd year PhD @columbianlp • Prev @stanfordnlp @GeorgiaTech • Weird Little Guy Academic • NLP, Dialogue Systems, Education • Caffeine Gremlin 🩷💜💙
Should I point out in my meta review that 2/3 reviewers on the paper used AI and therefore I don't take those reviews seriously?
October 1, 2025 at 1:48 PM
🎉 Excited to announce that the 4th HCI+NLP workshop will be co-located with @emnlpmeeting.bsky.social in Suzhou, China! 🌍📍 Join us to explore the intersection of human-computer interaction and NLP. 🧵

1/
July 7, 2025 at 3:41 PM
Just learned that my paper will be the first in the session at NAACL (9:00 am to 9:15 am); will people even show up? 😭😭😭
April 5, 2025 at 4:57 PM
📢 We are releasing an Accented Speech ASR dataset!! This dataset is collected from native Mandarin speakers practicing English on my advisor Zhou Yu's platform. There are 3081 audio clips here, plus high-quality, human-verified transcripts.
huggingface.co/datasets/syl...
sylviali/EDEN_ASR_Data · Datasets at Hugging Face
December 2, 2024 at 4:04 PM
Reposted by Siyan Sylvia Li ✨
I am also looking for new positions, so if you are looking for someone with deep expertise in reinforcement learning, robotics, and deep learning in general, hit me up!
November 25, 2024 at 8:31 PM
Reposted by Siyan Sylvia Li ✨
Rare personal tweet:
Subletting our furnished apartment in Brooklyn for the spring at a significant discount. It's quite nice, in a fun location, and under price. Email me if you are interested and I will send pictures.
November 25, 2024 at 8:39 PM
Reposted by Siyan Sylvia Li ✨
Okay genius idea to improve quality of #nlp #arr reviews. Literally give gold stars to the best reviewers, visible on OpenReview next to your anonymous ID during the review process.

Here’s why it would work, and why you should RT this fab idea:
November 24, 2024 at 9:01 PM
Reposted by Siyan Sylvia Li ✨
I created a collection with good models for dataset curation

- NSFW classifiers
- PII classifiers
- blazing fast embeddings by model2vec
- quality classifier
- educational value classifier
- domain classifier

Collection: huggingface.co/collections/...
Models for dataset curation - a Dataset-Tools Collection
November 22, 2024 at 12:57 PM
Reposted by Siyan Sylvia Li ✨
For those who missed this post on the-network-that-is-not-to-be-named, I made public my "secrets" for writing a good CVPR paper (or any scientific paper). I've compiled these tips over many years. It's long but hopefully it helps people write better papers. perceiving-systems.blog/en/post/writ...
Writing a good scientific paper
perceiving-systems.blog
November 20, 2024 at 10:18 AM
Reposted by Siyan Sylvia Li ✨
I propose that instead of "posts" or "skeets" (ew) we refer to bluesky posts as BS

"I BS'd that"
"Oh I saw some BS earlier about that"
"You should make that a BS"
November 20, 2024 at 7:47 AM