Home

Awesome

Analysis of Country-"Withheld" Twitter Accounts

This repository contains data and analysis supporting the BuzzFeed News article, "An Inside Look At The Accounts Twitter Has Censored In Countries Around The World," published January 24, 2018. That article contains important contextual and methodological details. Please read it before continuing below.

Data

This repository contains the following three datasets:

Fields in the withheld-accounts.csv file

Field nameDescription
user_id_numfixThis is a copy of the user_id field, but prefixed with an ' so that it is not converted into scientific notation by spreadsheet programs.
user_idThis is the account's Twitter-assigned identification number.
screen_nameThis is the account's screen name, e.g., @screenname.
bioThis is the most recent profile bio that BuzzFeed News observed for this account.
withheld_categoryThis the account's most recent status, with regards to withholding, as categorized by BuzzFeed News in its most recent data collection. See below for possible values.
withheld_in_countriesThis is a +-separated list of two-letter abbreviations corresponding to the countries in which BuzzFeed News observed this account being withheld in its most recent data collection.
withheld_everThis is a list similar to withheld_in_countries, except for every country in which BuzzFeed News has observed this account being withheld at any point.
followers_countThis is the number of followers that this account had at the time of the most recent data collection.
following_countThis is the number of accounts this account followed at the time of the most recent data collection.
signup_dateThis is the date on which the account was created.
first_observedThis is the date on which BuzzFeed News first observed this account being withheld. See above for important caveats about this date.

withheld_category values in withheld-accounts.csv

The withheld_category field in the withheld-accounts.csv file can be one of the following four values:

Analysis

This repository contains two analysis notebooks, both written in Python:

Reproducibility

To reproduce the calculations, you'll need to do the following:

Feedback / Questions?

Contact Jeremy Singer-Vine at jeremy.singer-vine@buzzfeed.com.

Looking for more from BuzzFeed News? Click here for a list of our open-sourced projects, data, and code.