A Critical Guide to the PDB


Teresa K. Attwood, The GOBLET Foundation

2018

DOI: 10.7490/f1000research.1116056.1

Published: F1000Research Bioinformatics Education and Training Collection


Abstract

This Critical Guide in the Introduction to Bioinformatics series provides a brief outline of the Protein Data Bank – the PDB – the world’s primary repository of biological macromolecular structures. The rationale for creating the resource and the kinds of information it provides are discussed, and issues relating to its evolution and growth are explored.

Specifically, this Guide introduces the principal features of the PDB, the nature (and quality) of its contents and how these may be interrogated. On reading this Guide, users will be able to:

  • explain some of the ways in which knowledge of protein structures is useful;
  • identify the constituent databases of the wwPDB;
  • explain key features of the RCSB PDB in terms of its data distribution, growth and redundancy statistics;
  • search the PDB using simple and advanced keywords and full sequences, and analyse differences between them; and
  • explain various structural quality criteria, and infer the quality of individual PDB entries.