Awesome
<!-- badges: start --> <!-- badges: end -->Swedish polls
This README is a short description of the file Polls.csv and the RPackage to handle these polls. The file Polls.csv contains all polls conducted in Sweden regarding political sympathies. The file originated from Novus in 2013 but has been updated, and variables has been added since then.
Data description
The file contains the following variables.
Variable | Description |
---|---|
PublYearMonth | Month and year of publication |
Company | Company name at publication |
M - Fi | Poll results for the different parties |
Uncertain | Uncertain voters |
n | The number of observations (see below) |
PublDate | Date of publication |
collectPeriodFrom | Start date of data collection |
collectPeriodTo | End date of data collection |
approxPeriod | Indicator if the period is known or if it is an approximation of the period |
house | The latest company name (if the name has been changed) |
The value NA
follows the R standard and means missing value.
Definitions of the number of observations
Some polling institutes report the total sample size and the number of respondents. Here we use the number of actual respondents in the poll as the sample size.
Quality of the data
In the earlier data, the quality of the data is of less good quality (more NA
:s in sample size and collection period). Before 2000 only Sifo is currently added.
Data from 2008 are of better quality (i.e. less NA
).
Issue with Ipsos
The house Ipsos round their numbers from the mid-2010s. Sometimes these numbers sum up to 101. In these situations, the numbers have been normalized by multiplying the values by 100/101. To get the original values, round up the Ipsos values to the closest integer.
Issue with Sentio
Sentio usually reports two numbers, total respondents and party preferences. Here we use the number of party preferences, but other definitions might have been used in older data.
If you spot any faults or want to contribute...
Just fork the repository and feel free to send me a merge request with your suggested corrections and/or additions. You can also drop an issue in the repository.
Thanks to
Simon Sigurdhsson for collecting polls.
Hampus Joakim Nilsson for collecting polls.
Henrik Ekegren Oscarsson and the Swedish National Election Studies for contributing historical Sifo polls.
Leonora Uddhammar at Statistics Sweden for contributing historical information on Statistics Swedens poll (Partisympatiundersökningen).
Pelle Ahlin at Demoskop for contributing historical information on historical poll from Demoskop.
License
The data in the repository is released under CC0 (public domain) and code is released under MIT.
Sources
We are now starting to build up sources to the polls data since we sometimes need to go back to individual polls to double check them. URL and sources can be found in the Sources folder.
To use the data
If you want to download the raw CSV file, you can find the file here. To download it, click "Raw".
Below there are codes to read the data directly into different statistical software.
Using R
To download this file directly to R use the RPackage SwedishPolls
with the function get_polls()
:
remotes::install_github("MansMeg/SwedishPolls", subdir = "RPackage")
polls <- SwedishPolls::get_polls()
Using R, it is also simple to get data for the last elections in the same format as the polls.
data("elections", package = "SwedishPolls")
We can handle tracking polls (polls with overlapping collection periods, usually close to elections) with the following R function. This function returns the latest, non-overlapping tracking polls.
new_polls <- handle_tracking_polls(polls)
In 2019, Inizio and Demoskop merged, and from this point, the polls of Inizio were used by Demoskop. The following function creates a new house variable for Inizio and Demoskop based on the actual method used.
new_polls <- handle_demoskop_inizio_polls(polls)