Accessing and distributing EMBL data using CORBA (common object request broker architecture)

Background: The EMBL Nucleotide Sequence Database is a comprehensive database of DNA and RNA sequences and related information traditionally made available in flat-file format. Queries through tools such as SRS (Sequence Retrieval System) also return data in flat-file format. Flat files have a number of shortcomings, however, and the resources therefore currently lack a flexible environment to meet individual researchers' needs. The Object Management Group's common object request broker architecture (CORBA) is an industry standard that provides platform-independent programming interfaces and models for portable distributed object-oriented computing applications. Its independence from programming languages, computing platforms and network protocols makes it attractive for developing new applications for querying and distributing biological data.

      Results:
      A CORBA infrastructure developed by EMBL-EBI provides an efficient means of accessing and distributing EMBL data. The EMBL object model is defined such that it provides a basis for specifying interfaces in interface definition language (IDL) and thus for developing the CORBA servers. The mapping from the object model to the relational schema in the underlying Oracle database uses the facilities provided by PersistenceTM, an object/relational tool. The techniques of developing loaders and 'live object caching' with persistent objects achieve a smart live object cache where objects are created on demand. The objects are managed by an evictor pattern mechanism.


      Conclusions:
      The CORBA interfaces to the EMBL database address some of the problems of traditional flat-file formats and provide an efficient means for accessing and distributing EMBL data. CORBA also provides a flexible environment for users to develop their applications by building clients to our CORBA servers, which can be integrated into existing systems.

Data and Resources

Official Government Data SourceHTML
Visit the original government dataset for complete information,...
Explore
- Preview
- Download

Field	Value
accessLevel	public
bureauCode	{009:25}
catalog_@context	https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
catalog_@id	https://healthdata.gov/data.json
catalog_conformsTo	https://project-open-data.cio.gov/v1.1/schema
catalog_describedBy	https://project-open-data.cio.gov/v1.1/schema/catalog.json
identifier	https://healthdata.gov/api/views/6exx-7mb2
issued	2025-07-13
landingPage	https://healthdata.gov/d/6exx-7mb2
modified	2025-09-06
programCode	{009:033}
publisher	National Institutes of Health
resource-type	Dataset
source_datajson_identifier	true
source_hash	ec64970ac923f53f1d5840fac67410aeb85a36bda2ab63e8d60a5c25bf5e4b8e
source_schema_version	1.1
theme	{NIH}
Groups	AmeriGEOSS National Provider North America
Tags	AmeriGEO AmeriGEOSS CKAN GEO GEOSS National North America United States bioinformatics dna-rna nih nucleotide-sequences sequence-database
isopen	False
license_id	notspecified
license_title	License not specified
maintainer	NIH
maintainer_email	info@nih.gov
metadata_created	2025-09-23T17:44:51.908196
metadata_modified	2025-09-23T17:44:51.908202
notes	Background: The EMBL Nucleotide Sequence Database is a comprehensive database of DNA and RNA sequences and related information traditionally made available in flat-file format. Queries through tools such as SRS (Sequence Retrieval System) also return data in flat-file format. Flat files have a number of shortcomings, however, and the resources therefore currently lack a flexible environment to meet individual researchers' needs. The Object Management Group's common object request broker architecture (CORBA) is an industry standard that provides platform-independent programming interfaces and models for portable distributed object-oriented computing applications. Its independence from programming languages, computing platforms and network protocols makes it attractive for developing new applications for querying and distributing biological data. Results: A CORBA infrastructure developed by EMBL-EBI provides an efficient means of accessing and distributing EMBL data. The EMBL object model is defined such that it provides a basis for specifying interfaces in interface definition language (IDL) and thus for developing the CORBA servers. The mapping from the object model to the relational schema in the underlying Oracle database uses the facilities provided by PersistenceTM, an object/relational tool. The techniques of developing loaders and 'live object caching' with persistent objects achieve a smart live object cache where objects are created on demand. The objects are managed by an evictor pattern mechanism. Conclusions: The CORBA interfaces to the EMBL database address some of the problems of traditional flat-file formats and provide an efficient means for accessing and distributing EMBL data. CORBA also provides a flexible environment for users to develop their applications by building clients to our CORBA servers, which can be integrated into existing systems.
num_resources	1
num_tags	13
title	Accessing and distributing EMBL data using CORBA (common object request broker architecture)