Communalytic’s Bluesky Thread Collector retrieves replies to a given post, including replies to replies. This data collector is ideal for studying posts that have attracted high engagement on Bluesky.
The collector collects publicly available replies. It does not collect direct messages, private posts or posts visible only to signed-in users.
Before you start a new data collection, below are some critical details about this data collector and the Bluesky API: #
- To use this collector, you do not need to create a Bluesky account or apply for a separate API key.
- This collector only collects publicly available Bluesky posts, starting with the most recent and continuing until your Comunalytic’s account limit is reached. This number might be lower than your account limit because there might not be enough posts that match your search criteria.
- Visit the Bluesky Data Structure page to learn more about the types of data collected by Communalytic via this API.
Step 1 #
Click Collect Data in the left side panel (located under the Create section), then select the Thread button within the Bluesky group.


Step 2 #
Name your dataset and enter a direct URL to the first posts in the thread.

Step 3 #
Click the “Start Data Collection” button.
Step 4 #
To confirm that data collection is underway, you should be able to see your new dataset listed on the “My Datasets” page.
- Note: The progress bar will auto-update. It usually takes about 10-20 seconds to see the progress of the number of records collected. If you also elect to collect replies, progress will be slower because there is more data.
- Helpful Tip: At this stage, if you choose, you can safely close the browsers and come back later to check the progress.

Once the collector has retrieved the relevant posts, you will see the total number of records in the dataset and can start analyzing it using one of the built-in data analyzers or export it for archival purposes or for use with third-party software.
