Startup admin: stevetownsend0.bsky.social
Appeals: admin@pef-moderation.org
Follow for PSAs
Committed to no use of !hide !warn etc outside illegality
1. Subjective content moderation by platform is doomed to fail and should be confined to legality (e.g. remove CSAM) and authenticity - expand verification, remove bots and fakes.
Focus on takedown, not subjective post-level decisions.
Trying to do more is a costly distraction
For the most part, they label things sensibly and understandably, but basically anyone can create their own labeler and label everyone however they want."
h/t @nightlythievery.bsky.social
Subjective - ymmv, esp. note our pro-Israel labeling
Via @FordJohnathan5.x
For the most part, they label things sensibly and understandably, but basically anyone can create their own labeler and label everyone however they want."
h/t @nightlythievery.bsky.social
Subjective - ymmv, esp. note our pro-Israel labeling
whtwnd.com/bnewbold.net...
whtwnd.com/bnewbold.net...
prizemna srackoidna ruZka propaganda.
vy viete co mate robit. report, block etc...
Revisited two frequently-used lazy slurs, both of which are auto-labeled as abusive/disinfo by this service.
Bluesky live post/account counts for:
ZioN*zi : 3912/927
PaliN*zi : 57/34
Revisited two frequently-used lazy slurs, both of which are auto-labeled as abusive/disinfo by this service.
Bluesky live post/account counts for:
ZioN*zi : 3912/927
PaliN*zi : 57/34
bsky.app/search?q=aip...
I did a manual review of all results but might have missed a couple all the same. Please let me know if so.
bsky.app/search?q=aip...
I did a manual review of all results but might have missed a couple all the same. Please let me know if so.
bsky.app/profile/capi...
Which imho is a correct labeling, after the Heitmann scandal.
Thanks, moderation team!
bsky.app/profile/capi...
Will relink as needed as this grows irregularly over time
github.com/SteveTownsen...
Will relink as needed as this grows irregularly over time
github.com/SteveTownsen...
Screenshot of a labeled account, taken from the platform's Ozone tool that we use to manually review the backlog.
This is the moderation event stream on the account, filtered for only reports.
Screenshot of a labeled account, taken from the platform's Ozone tool that we use to manually review the backlog.
This is the moderation event stream on the account, filtered for only reports.
Screenshot of a labeled account, taken from the platform's Ozone tool that we use to manually review the backlog.
This is the moderation event stream on the account, filtered for only reports.
Automated labeling has been cleaned up and new labels introduced to destigmatize the presence of occasional labels on an account. Some users may be misunderstanding the semantics.
Automated labeling has been cleaned up and new labels introduced to destigmatize the presence of occasional labels on an account. Some users may be misunderstanding the semantics.
@maryshine.bsky.social rules with websites are the tracked sites. "track = true" is needed as well - standalone field, not substring of field "actions", which is legacy format.
@maryshine.bsky.social rules with websites are the tracked sites. "track = true" is needed as well - standalone field, not substring of field "actions", which is legacy format.
Completed a pass through accounts with over 100 pending reports.
Almost all either were or are now labeled.
Almost all were for anti-Israel and petrostate disinfo spam.
Surprisingly little traditional russian nonsense. Needs a different DB query imo.
Completed a pass through accounts with over 100 pending reports.
Almost all either were or are now labeled.
Almost all were for anti-Israel and petrostate disinfo spam.
Surprisingly little traditional russian nonsense. Needs a different DB query imo.
1. antisemitism
2. auto-labeling of slurs
which are the major current points of contention among label recipients.
bsky.app/profile/pef-...
You maybe don't understand this community labeler is distinct from platform moderation. We cannot take content down, just label it for "the community".
"Your moderators" = me.
1. antisemitism
2. auto-labeling of slurs
which are the major current points of contention among label recipients.
bsky.app/profile/pef-...
Process is to google the person, then review and include their amplifying websites for auto-labeling: no good outlet uses them.
bsky.app/profile/stev...
The oil-producing Arab states spend vast amounts of money to whitewash their regimes, deflecting towards the west, especially Israel.
Here is Al Jazeera's 'west-friendly' brand AJ+ cheerily sampling Native American cuisine, juxtaposed with anti-US 'eugenics' conspiracism.
Process is to google the person, then review and include their amplifying websites for auto-labeling: no good outlet uses them.
bsky.app/profile/stev...
Two news ones today, in order to support existing useful rules and future ones.
All use of "fag*ot" and "tr*on" slurs will be labeled realtime soon.
Two news ones today, in order to support existing useful rules and future ones.
All use of "fag*ot" and "tr*on" slurs will be labeled realtime soon.
Reiterating commitment to clean up labeling semantics by end of April 2025, including revisiting all auto-labeled content to correct or remove existing labels.
The goal is to alert about content considered harmful without overly infuriating casual amplifiers.
Reiterating commitment to clean up labeling semantics by end of April 2025, including revisiting all auto-labeled content to correct or remove existing labels.
The goal is to alert about content considered harmful without overly infuriating casual amplifiers.
Try out @skylight.social — a video app built upon the same open network as Bluesky. You can login with your Bluesky account, and all of your followers seamlessly come with you.
Huge thanks to everyone who's supported us as we build this unbannable TikTok!
There's still so much more we want to build— your support makes it possible.
You all are the best 🥰
Join: skylight.social
Follow Skylight Squad:
@reedharmeyer.bsky.social
@buildwithtori.com
Try out @skylight.social — a video app built upon the same open network as Bluesky. You can login with your Bluesky account, and all of your followers seamlessly come with you.
We know there's russian nonsense circulating about Zelensky buying a platinum mine.
The new rule
(Zelensky AND (Northam OR platinum))
got flagged on this account, already auto-flagged for MAGA disinfo posts.
Account now labeled
We know there's russian nonsense circulating about Zelensky buying a platinum mine.
The new rule
(Zelensky AND (Northam OR platinum))
got flagged on this account, already auto-flagged for MAGA disinfo posts.
Account now labeled
This is driving urgent work to revise the labels and reprocess all labeled content so far.
This makes it look like @noupside.bsky.social is spreading disinfo. That is *totally* wrong.
Annoying for the labeled account, confusing for the reader.
This is driving urgent work to revise the labels and reprocess all labeled content so far.
This makes it look like @noupside.bsky.social is spreading disinfo. That is *totally* wrong.
Annoying for the labeled account, confusing for the reader.
First step towards clarifying labeling reasons. Inference should not be needed.
Labeling more abuse is a baby step to encourage civility in atproto-based social media.
Got a sync-compliant firehose? This will work for you.
First step towards clarifying labeling reasons. Inference should not be needed.
Labeling more abuse is a baby step to encourage civility in atproto-based social media.
Got a sync-compliant firehose? This will work for you.
I don't care about subscriber attrition over this.
I have no way to know who subscribes, and if only my feed is freed up from in-scope slurs, that's a win for me.
There are multiple appeal channels.
Redacted moderation history can be requested.
I don't care about subscriber attrition over this.
I have no way to know who subscribes, and if only my feed is freed up from in-scope slurs, that's a win for me.
There are multiple appeal channels.
Redacted moderation history can be requested.
'disinfo' was originally applied under the now-defunct policy of account labeling links to in-scope offsite content. On that basis, perhaps the relevant posts should get 'misinformation" instead. A full review is merited.
'disinfo' was originally applied under the now-defunct policy of account labeling links to in-scope offsite content. On that basis, perhaps the relevant posts should get 'misinformation" instead. A full review is merited.
Rolled out new labels to automatically identify objectively high usage of "facets" below.
Facets covered:
- offsite links (URLs)
- user-mentions (@blablabal.myfriends.xyz)
- hashtags
- the sum of the above in any post/profile
Rolled out new labels to automatically identify objectively high usage of "facets" below.
Facets covered:
- offsite links (URLs)
- user-mentions (@blablabal.myfriends.xyz)
- hashtags
- the sum of the above in any post/profile