How to Collect Content Moderation and Other User Data for Social Science Research (AT Protocol)
/2026-04/session/2-b/
Convener: Christine Galvagna
Participants who chose to record their names here:
- Jaz-Michael King (@iftas@mastodon.iftas.org)
- Mark Corbett Wilson (@mcorbettwilson)
- @juliakamin.bsky.social
Notes
-
An implementation of calls designed to collect and organize Mastodon data via its Application Program Interfaces (API), which can be found at https://docs.joinmastodon.org/
-
Please use the canonical form https://CRAN.R-project.org/package=rtoot to link to this page.
-
Some ATProto data services: https://www.microcosm.blue/ …he’s also working on a public archive of all data, called lightrail
-
A network data backfill tool: https://atproto.com/blog/introducing-tap
-
Old out of date clickhouse dataset: https://sql.clickhouse.com/?query_id=8YAFPZQXXCGD75842UKE2W
-
Labeler websocker and query endpoints: https://atproto.com/specs/label#label-distribution-endpoints
-
https://github.com/bluesky-social/atproto/blob/main/lexicons/com/atproto/label/defs.json
-
Composable Moderation https://bsky.social/about/blog/4-13-2023-moderation
-
websocat wss://mod.bsky.app/xrpc/com.atproto.label.subscribeLabels
-
https://docs.bsky.app/docs/advanced-guides/moderation#labeler-subscriptions
-
https://github.com/mrd0ll4r/bluesky_downloader#bluesky-labeler-logger
-
web socket - just for labels
-
post data - microcosm, tap, etc.
-
install web sokat - ask for it in R