Functional annotation for 15 diverse arthropod genomes

We present the annotation results of 15 arthropod proteomes using an open source, open access and containerized pipeline for genome-scale functional annotation of insect proteomes and apply it to a diverse range of arthropod species. You can find more information about the pipeline at our readthedocs site. The files for each genome include GOanna, InterproScan and KOBAS predictions. Arthropod genomes selected for this study and their assembly and annotation statistics.

Apis Mellifera (honey bee) Drosophila melanogaster (fruit fly) Tribolium castaneum (red flour beetle) Latrodectus hesperus (Western black widow spider) Limnephilus lunatus (caddisfly) Oncopeltus fasciatus (Large milkweed bug) Homalodisca vitripennis (Glassy-winged sharpshooter) Eurytemora affinis (calanoid copepod) Agrilus planipennis (emerald ash borer) Copidosoma floridanum (parasitoid wasp) Athalia rosae (turnip sawfly) Ceratitis capitata (Mediterranean fruit fly) Cimex lectularius (Cimicidae bed bug) Varroa destructor(parasitic mite) Diaphorina citri (Asian citrus psyllid)

Data and Resources

Field Value
accessLevel public
bureauCode {005:18}
catalog_@context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
catalog_conformsTo https://project-open-data.cio.gov/v1.1/schema
catalog_describedBy https://project-open-data.cio.gov/v1.1/schema/catalog.json
identifier 7a6a05b8-8652-4a93-b05d-38f13e239952
license https://creativecommons.org/publicdomain/zero/1.0/
modified 2021-07-07
old-spatial {"type":"Polygon","coordinates":[[[-125.33203125,30.654452824401],[-125.33203125,48.848450835898],[-74.35546875,48.848450835898],[-74.35546875,30.654452824401]]]}
programCode {005:040}
publisher Agricultural Research Service
resource-type Dataset
source_datajson_identifier true
source_hash 52ec1a1d4af601fd488bd09221c97b02bfa424c6
source_schema_version 1.1
spatial {"type":"Polygon","coordinates":[[[-125.33203125,30.654452824401],[-125.33203125,48.848450835898],[-74.35546875,48.848450835898],[-74.35546875,30.654452824401]]]}
Groups
  • AmeriGEOSS
  • National Provider
  • North America
Tags
  • amerigeo
  • amerigeoss
  • ckan
  • functional-annotation
  • gene-annotation
  • gene-ontology
  • geo
  • geoss
  • national
  • north-america
  • pathways
  • united-states
isopen True
license_id cc-zero
license_title Creative Commons CCZero
license_url http://www.opendefinition.org/licenses/cc-zero
maintainer Saha, Surya
maintainer_email ss2489@cornell.edu
metadata_created 2025-11-20T01:26:49.970238
metadata_modified 2025-11-20T01:26:49.970243
notes <p>We present the annotation results of 15 arthropod proteomes using an open source, open access and containerized pipeline for genome-scale functional annotation of insect proteomes and apply it to a diverse range of arthropod species. You can find more information about the pipeline at our <a href="https://agbase-docs.readthedocs.io/en/latest/agbase/workflow.html">readthedocs </a> site. The files for each genome include GOanna, InterproScan and KOBAS predictions.</p> <p><i><strong> Arthropod genomes selected for this study and their assembly and annotation statistics.</strong></i></p> <ol> <li><i>Apis Mellifera</i> (honey bee)</li> <li><i>Drosophila melanogaster</i> (fruit fly)</li> <li><i>Tribolium castaneum</i> (red flour beetle)</li> <li><i>Latrodectus hesperus</i> (Western black widow spider)</li> <li><i>Limnephilus lunatus</i> (caddisfly)</li> <li><i>Oncopeltus fasciatus</i> (Large milkweed bug)</li> <li><i>Homalodisca vitripennis</i> (Glassy-winged sharpshooter)</li> <li><i>Eurytemora affinis</i> (calanoid copepod)</li> <li><i>Agrilus planipennis</i> (emerald ash borer)</li> <li><i>Copidosoma floridanum</i> (parasitoid wasp)</li> <li><i>Athalia rosae</i> (turnip sawfly)</li> <li><i>Ceratitis capitata</i> (Mediterranean fruit fly)</li> <li><i>Cimex lectularius</i> (Cimicidae bed bug)</li> <li><i>Varroa destructor</i>(parasitic mite)</li> <li><i>Diaphorina citri</i> (Asian citrus psyllid)</li> </ol>
num_resources 14
num_tags 12
title Functional annotation for 15 diverse arthropod genomes