Protein

The Protein database is a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB. Protein sequences are the fundamental determinants of biological structure and function.

Data and Resources

Field Value
accessLevel public
bureauCode {009:25}
catalog_@context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
catalog_@id https://healthdata.gov/data.json
catalog_conformsTo https://project-open-data.cio.gov/v1.1/schema
catalog_describedBy https://project-open-data.cio.gov/v1.1/schema/catalog.json
identifier https://datadiscovery.nlm.nih.gov/api/views/2xvy-u8hi
issued 2021-08-26
landingPage https://www.ncbi.nlm.nih.gov/protein
license http://opendefinition.org/licenses/odc-odbl/
modified 2025-06-18
programCode {009:041}
publisher National Library of Medicine
resource-type Dataset
source_datajson_identifier true
source_hash acee8563d02ec5832a73cbf14501aa4eaba45de59d351980eb7fe4672112ce34
source_schema_version 1.1
theme {Biology}
Groups
  • AmeriGEOSS
  • National Provider
  • North America
Tags
  • AmeriGEO
  • AmeriGEOSS
  • CKAN
  • GEO
  • GEOSS
  • National
  • North America
  • United States
  • api
  • dataset
  • protein
isopen False
license_id other-license-specified
license_title other-license-specified
maintainer National Library of Medicine
maintainer_email custserv@nlm.nih.gov
metadata_created 2025-09-24T04:16:05.082411
metadata_modified 2025-09-24T04:16:05.082420
notes The Protein database is a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB. Protein sequences are the fundamental determinants of biological structure and function.
num_resources 1
num_tags 11
title Protein