The MGH-PGA Proteomic Database is based on NCBI's protein database, with additional information about the coding DNA sequences. For more detailed information about your protein, please refer to NCBI's protein search.

Our database was constructed by parsing the human and mouse GenPept release from NCBI. Very short peptide sequences (fewer than 23 amino acids) were excluded. Most immunoglobulin and T cell receptor variable region genes were also excluded to reduce redundancy. The corresponding DNA coding regions were then retrieved from GenBank via the CDS feature and included in our database.
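
As a rough illustration of the filtering and retrieval steps described above, here is a minimal Python sketch. It is not the code used to build the database. The function names, the keyword pattern for immunoglobulin and T cell receptor variable regions, and the handling of only simple "ACCESSION:start..end" locations are all assumptions made for the example. The actual exclusion rules are not described here and were evidently more selective, since only most of those genes were removed.

```python
import re

# Sequences shorter than this many residues are excluded, per the text above.
MIN_LENGTH = 23

# Hypothetical keyword pattern; the real exclusion criteria are not documented.
VARIABLE_REGION = re.compile(
    r"(immunoglobulin|T[- ]cell receptor).*variable", re.IGNORECASE
)


def keep_record(description, sequence):
    """Return True if a protein record passes both illustrative filters."""
    if len(sequence) < MIN_LENGTH:
        return False
    if VARIABLE_REGION.search(description):
        return False
    return True


def parse_coded_by(coded_by):
    """Parse a simple 'ACCESSION:start..end' CDS location.

    Returns (accession, start, end), or None for anything more complex
    (for example join(...) or complement(...)), which this sketch skips.
    """
    match = re.fullmatch(r"([A-Za-z_0-9.]+):(\d+)\.\.(\d+)", coded_by)
    if match is None:
        return None
    return match.group(1), int(match.group(2)), int(match.group(3))


def extract_cds(nucleotide, start, end):
    """Slice the coding region; GenBank coordinates are 1-based and inclusive."""
    return nucleotide[start - 1:end]
```

In a GenPept record, the CDS feature carries a coded_by qualifier that points back to the nucleotide record, which is the value parse_coded_by expects. Retrieving the nucleotide sequence itself is left out of the sketch.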

With our database, you can: