Manifesto Corpus

The Manifesto Corpus is a free, digital, multilingual, and annotated collection of electoral programmes. It is based on the collection of the Manifesto Project, which is currently the largest collection of annotated electoral programmes.

Since the project Manifesto Research on Political Representation (MARPOR) took over the maintenance and updating of the Manifesto Project Dataset from the Comparative Manifestos Project, the collection and the coding process have been fully digitalised. The main advantage of digitalising the project's infrastructure is that the text data can now be distributed: the machine-readable electoral programmes and the codings of every single quasi-sentence.

The Manifesto Corpus contains three types of information:

  • the machine-readable electoral programmes,
  • the unitising into quasi-sentences and the codes according to the Manifesto Coding scheme,
  • document meta-data such as the party and the election date.

The party and election dates can be used to link the corpus information to the Manifesto Project Main Dataset.
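
The linking step can be sketched in a few lines of Python. This is only an illustration: the field names `party` and `date`, the helper `link_to_main_dataset`, and all values are assumptions made for the example, not entries from the actual datasets.

```python
def link_to_main_dataset(corpus_meta, main_rows):
    """Attach the matching Main Dataset row to each corpus document.

    Documents and rows are matched on the (party, date) pair.
    Field names are assumed for illustration.
    """
    # Index the Main Dataset rows by their (party, date) key.
    index = {(row["party"], row["date"]): row for row in main_rows}

    linked = []
    for doc in corpus_meta:
        match = index.get((doc["party"], doc["date"]))  # None if no match
        linked.append({**doc, "main": match})
    return linked


# Placeholder values only, not real dataset entries.
corpus_meta = [
    {"party": 11111, "date": 200909, "language": "german"},
    {"party": 22222, "date": 201005, "language": "english"},
]
main_rows = [
    {"party": 11111, "date": 200909, "pervote": 12.5},
]

linked = link_to_main_dataset(corpus_meta, main_rows)
for doc in linked:
    print(doc["party"], doc["date"], "matched" if doc["main"] else "no match")
```

Documents without a counterpart in the Main Dataset keep `None` under `main`, so unmatched cases stay visible instead of being silently dropped.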

Coverage

The corpus currently covers electoral programmes from more than 40 different countries in more than 30 languages. It contains more than 2000 machine-readable programmes. For more than 900 of these, unitising and codings are available as well. Together, these amount to around 800,000 coded quasi-sentences.

Access

The corpus is stored in an online database. It can be accessed in five different ways:

  • Explore online: Browse the corpus online in your browser by document or by keyword.
  • Download CSV documents: Download individual electoral programmes in .csv format. The files are encoded in UTF-8, so make sure to specify this encoding when importing them. You need to log in (or register) to be able to download documents.
  • Access using manifestoR: We offer an R package that facilitates downloading and processing the Manifesto Corpus. It allows bulk downloading of several documents at once and transforms the downloaded data into a corpus format. You need an API key to be able to download documents with manifestoR. Log in and create the key on your profile page.
  • Access using manifestata: We offer a Stata add-on that facilitates downloading and processing the Manifesto Corpus. It allows bulk downloading of several documents at once. You need an API key to be able to download documents with manifestata. Log in and create the key on your profile page.
  • Access via API: You are a programmer and would like to have direct access to our database? Our API returns all data in our database in a standardised JSON format. You need an API key to be able to use the API. Log in and create the key on your profile page.
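
For the CSV download option, the following minimal Python sketch shows what importing a file with an explicit UTF-8 encoding can look like. The helper `read_programme_csv`, the column names `text` and `cmp_code`, and the sample row are assumptions made for this example; check the header of the file you actually downloaded.

```python
import csv
import tempfile
from pathlib import Path


def read_programme_csv(path):
    """Read one programme CSV into a list of row dicts, decoding as UTF-8."""
    # Passing the encoding explicitly avoids platform-dependent defaults,
    # which can garble umlauts and other non-ASCII characters.
    with open(path, encoding="utf-8", newline="") as handle:
        return list(csv.DictReader(handle))


# Demo with a tiny stand-in file; header names and content are invented.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "sample.csv"
    sample.write_text(
        "text,cmp_code\nBessere Straßen für größere Städte.,000\n",
        encoding="utf-8",
    )
    rows = read_programme_csv(sample)

print(rows[0]["text"])
```

Values are returned as strings, so codes should be converted explicitly if a numeric type is needed.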

Versions and Replication

We regularly update, correct and extend the Manifesto Corpus. To ensure that analyses with the corpus can be reproduced later, we save and distribute older versions of the Manifesto Corpus. When using manifestoR, you can choose to download specific corpus versions. If you want to make sure that your work can be replicated later, note the version number you are working with.

Citation

When publishing work using the Manifesto Corpus, please reference it as follows (replacing the six X placeholders accordingly):

Lehmann, Pola / Matthieß, Theres / Merz, Nicolas / Regel, Sven / Werner, Annika (201X): Manifesto Corpus. Version: XXXX-X. Berlin: WZB Berlin Social Science Center.

Make sure to provide the exact version you used for your analyses to ensure the replicability of your work.