3D modelling service
Sequence Unique database used by WHAT IF
The PDB structures stored in the WHAT IF relational database are a
representative set of sequence-unique structures generated from the
X-ray protein PDB files available at a certain moment.
The procedure used to generate this database is similar to the PDB select algorithm, but rather than focusing on
maximum size of the subset, our algorithm focuses on getting
representative structures of the highest available quality. For the
selection an empirical quality value is defined: a composite score
depending on the Resolution and the R-factor (published funny enough in:
Verification of protein structures: Side-chain planarity.
R.W.W. Hooft, C.Sander and G.Vriend, J. Appl. Cryst. (1996) 29, 714-716. ).
However, we use a sequence identy percentage cutoff of 30%, respectively 90%, and the
resolution and R-factor criteria are as indicated below.
Each structure is identified by the 4-letter PDB identifier, plus
(if applicable) a one-letter chain name. Structures are ordered by
decreasing 'quality value' as described in the article cited above.
The data
R-factor<0.25 and Resolution<2.5
R-factor<0.21 and Resolution<2.1
R-factor<0.21 and Resolution<2.0
- List of 253 PDB chains, created from the PDB on November 12 1996
- List of 259 PDB chains, created from the PDB on December 11 1996
- List of 274 PDB chains, created from the PDB on February 10 1997
- List of 279 PDB chains, created from the PDB on February 18 1997
- List of 296 PDB chains, created from the PDB on April 10 1997
- List of 303 PDB chains, created from the PDB on April 25 1997
- List of 309 PDB chains, created from the PDB on May 27 1997
- List of 312 PDB chains, created from the PDB on July 23 1997
- List of 368 PDB chains, created from the PDB on April 17 1998
- List of 387 PDB chains, created from the PDB on August 02 1998
- List of 432 PDB chains, created from the PDB on January 05 1999
- List of 462 PDB chains, created from the PDB on March 06 1999
- List of 511 PDB chains, created from the PDB on June 22 1999
R-factor<0.20 and Resolution<1.9
R-factor<0.18 and Resolution<1.8
R-factor<0.19 and Resolution<1.7
R-factor<0.19 and Resolution<1.6
R-factor<0.19 and Resolution<1.5
Tables for 90% identity cutoff
R-factor<0.25 and Resolution<2.5 at <90% sequence identity
R-factor<0.21 and Resolution<2.1 at <90% sequence identity
R-factor<0.21 and Resolution<2.0 at <90% sequence identity
R-factor<0.20 and Resolution<1.9 at <90% sequence identity
R-factor<0.18 and Resolution<1.8 at <90% sequence identity
R-factor<0.19 and Resolution<1.7 at <90% sequence identity
R-factor<0.19 and Resolution<1.5 at <90% sequence identity
Last modified June 22 1999
(C) G.V. 11_April-1998