Logo     3D modelling service     EMBL

Sequence Unique database used by WHAT IF

The PDB structures stored in the WHAT IF relational database are a representative set of sequence-unique structures generated from the X-ray protein PDB files available at a certain moment.

The procedure used to generate this database is similar to the PDB select algorithm, but rather than focusing on maximum size of the subset, our algorithm focuses on getting representative structures of the highest available quality. For the selection an empirical quality value is defined: a composite score depending on the Resolution and the R-factor (published funny enough in: Verification of protein structures: Side-chain planarity. R.W.W. Hooft, C.Sander and G.Vriend, J. Appl. Cryst. (1996) 29, 714-716. ). However, we use a sequence identy percentage cutoff of 30%, respectively 90%, and the resolution and R-factor criteria are as indicated below.

Each structure is identified by the 4-letter PDB identifier, plus (if applicable) a one-letter chain name. Structures are ordered by decreasing 'quality value' as described in the article cited above.

The data

R-factor<0.25 and Resolution<2.5

R-factor<0.21 and Resolution<2.1

R-factor<0.21 and Resolution<2.0

R-factor<0.20 and Resolution<1.9

R-factor<0.18 and Resolution<1.8

R-factor<0.19 and Resolution<1.7

R-factor<0.19 and Resolution<1.6

R-factor<0.19 and Resolution<1.5

Tables for 90% identity cutoff

R-factor<0.25 and Resolution<2.5 at <90% sequence identity

R-factor<0.21 and Resolution<2.1 at <90% sequence identity

R-factor<0.21 and Resolution<2.0 at <90% sequence identity

R-factor<0.20 and Resolution<1.9 at <90% sequence identity

R-factor<0.18 and Resolution<1.8 at <90% sequence identity

R-factor<0.19 and Resolution<1.7 at <90% sequence identity

R-factor<0.19 and Resolution<1.5 at <90% sequence identity

Last modified June 22 1999


EMBL Home Sequence Analysis Service Bork icon Gibson icon Vriend icon EBI Homepage BIOTECH validation server GPCRDB database Science information Databases Compute facilities
(C) G.V. 11_April-1998