Patent Application Publication Full Text (2001 - Present)

Contains the full text of each patent application publication (non-provisional utility and plant) published weekly (Thursdays) from March 15, 2001 to Present (includes tables, genetic sequence data and "in-line" mathematical expressions; excludes images/drawings). The file formats are eXtensible Markup Language (XML) in accordance with the U.S. Patent Application Version 1.5; 1.6; 4.0 International Common Element (ICE); 4.1 ICE; 4.2 ICE; 4.3 ICE and 4.4 ICE Document Type Definitions (DTDs). Because of the concatenation of the individual XML documents, these files will not parse successfully or open/display by default in Internet Explorer. They also will not import into MS Excel. Each XML document within the file should have one start tag and one end tag. Concatenation creates a file that contains 5,000 plus start/end tag combinations. If you take one document out of the Patent Application Publication Full Text file and place it in a directory with the correct DTD and then double click that individual document, Internet Explorer will parse/open the document successfully. NOTE: You may receive a warning about Active X controls. NOTE: All Patent Application Publication Full Text files will open successfully in MS Word; NotePad; WordPad; and TextPad. http://patents.reedtech.com/parbft.php Documentation: http://www.uspto.gov/products/xml-resources.jsp Approx. 5,000 patent application publications per week. Approx. 89 MB per weekly zipfile. References to the following external files are present, but the external files themselves are not present: - Mega Sequence Listing data files - Mathematica Notebook (NB) files - CambridgeSoft Corp. ChemDraw (CDX) and MDL Information Systems (MOL) files - Drawings, mathematical expressions, and chemical structures image (TIFF) files

Data and Resources

Field Value
accessLevel public
accrualPeriodicity R/P1W
bureauCode {006:51}
catalog_@context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
catalog_conformsTo https://project-open-data.cio.gov/v1.1/schema
catalog_describedBy https://project-open-data.cio.gov/v1.1/schema/catalog.json
dataQuality true
describedBy http://www.uspto.gov/products/cis/updates/patents_xml.jsp
describedByType application/xml
identifier EIP-5450P-OL
issued 2001-03-15
landingPage http://patents.reedtech.com/parbft.php
license http://creativecommons.org/publicdomain/mark/1.0
modified 2015-03-19
programCode {006:070}
publisher US Patent and Trademark Office, Department of Commerce
references {http://www.uspto.gov/products/cis/updates/patents_xml.jsp}
resource-type Dataset
source_datajson_identifier true
source_hash 801113018e3ab6ba0de1d2813d7e92fb15e9de98
source_schema_version 1.1
temporal 2001-03-15/2015
theme {"Patent Application Publication"}
Groups
  • AmeriGEOSS
  • National Provider
  • North America
Tags
  • amerigeo
  • amerigeoss
  • application
  • chemical-structures
  • ckan
  • complex-work-units
  • full
  • genetic-sequence-data
  • geo
  • geoss
  • mathematical-expressions
  • national
  • north-america
  • patent
  • publication
  • tables
  • text
  • thursday
  • united-states
  • uspto
  • xml
isopen False
license_id other-license-specified
license_title other-license-specified
maintainer Christopher Leithiser
maintainer_email Chris.Leithiser@uspto.gov
metadata_created 2025-11-21T08:45:11.067448
metadata_modified 2025-11-21T08:45:11.067452
notes Contains the full text of each patent application publication (non-provisional utility and plant) published weekly (Thursdays) from March 15, 2001 to Present (includes tables, genetic sequence data and "in-line" mathematical expressions; excludes images/drawings). The file formats are eXtensible Markup Language (XML) in accordance with the U.S. Patent Application Version 1.5; 1.6; 4.0 International Common Element (ICE); 4.1 ICE; 4.2 ICE; 4.3 ICE and 4.4 ICE Document Type Definitions (DTDs). Because of the concatenation of the individual XML documents, these files will not parse successfully or open/display by default in Internet Explorer. They also will not import into MS Excel. Each XML document within the file should have one start tag and one end tag. Concatenation creates a file that contains 5,000 plus start/end tag combinations. If you take one document out of the Patent Application Publication Full Text file and place it in a directory with the correct DTD and then double click that individual document, Internet Explorer will parse/open the document successfully. NOTE: You may receive a warning about Active X controls. NOTE: All Patent Application Publication Full Text files will open successfully in MS Word; NotePad; WordPad; and TextPad. http://patents.reedtech.com/parbft.php Documentation: http://www.uspto.gov/products/xml-resources.jsp Approx. 5,000 patent application publications per week. Approx. 89 MB per weekly zipfile. References to the following external files are present, but the external files themselves are not present: - Mega Sequence Listing data files - Mathematica Notebook (NB) files - CambridgeSoft Corp. ChemDraw (CDX) and MDL Information Systems (MOL) files - Drawings, mathematical expressions, and chemical structures image (TIFF) files
num_resources 1
num_tags 21
title Patent Application Publication Full Text (2001 - Present)