ESAT: Environmental Source Apportionment Toolkit Python package

The Environmental Source Apportionment Toolkit (ESAT) is an open-source software package that provides API and CLI functionality to create source apportionment workflows specifically targeting environmental datasets. Source apportionment in environment science is the process of mathematically estimating the profiles and contributions of multiple sources in some dataset, and in the case of ESAT, while considering data uncertainty. There are many potential use cases for source apportionment in environmental science research, such as in the fields of air quality, water quality and potentially many others.

The ESAT toolkit is written in Python and Rust, and uses common packages such as numpy, scipy and pandas for data processing. The source apportionment algorithms provided in ESAT include two variants of non-negative matrix factorization (NMF), both of which have been written in Rust and contained within the python package. A collection of data processing and visualization features are included for data and model analytics. The ESAT package includes a synthetic data generator and comparison tools to evaluate ESAT model outputs.

Data and Resources

Field Value
accessLevel public
bureauCode {020:00}
catalog_conformsTo https://project-open-data.cio.gov/v1.1/schema
describedBy https://pasteur.epa.gov/uploads/10.23719/1531870/documents/LICENSE.txt
describedByType text/plain
identifier https://doi.org/10.23719/1531870
license https://pasteur.epa.gov/license/sciencehub-license.html
modified 2024-11-19
programCode {020:000}
publisher U.S. EPA Office of Research and Development (ORD)
publisher_hierarchy U.S. Government > U.S. Environmental Protection Agency > U.S. EPA Office of Research and Development (ORD)
resource-type Dataset
source_datajson_identifier true
source_hash 41986ff11e6ed73c4430bafb3255a12e95d57c4084b6d0ffad294936ecd508af
source_schema_version 1.1
Groups
  • AmeriGEOSS
  • National Provider
  • North America
Tags
  • AmeriGEO
  • AmeriGEOSS
  • CKAN
  • GEO
  • GEOSS
  • National
  • North America
  • United States
  • matrix-factorization
  • open-source
  • pmf
  • positive-matrix-factorization-pmf
  • python
  • source-apportionment
isopen False
license_id other-license-specified
license_title other-license-specified
maintainer Deron Smith
maintainer_email smith.deron@epa.gov
metadata_created 2025-09-24T04:53:27.620949
metadata_modified 2025-09-24T04:53:27.620958
notes The Environmental Source Apportionment Toolkit (ESAT) is an open-source software package that provides API and CLI functionality to create source apportionment workflows specifically targeting environmental datasets. Source apportionment in environment science is the process of mathematically estimating the profiles and contributions of multiple sources in some dataset, and in the case of ESAT, while considering data uncertainty. There are many potential use cases for source apportionment in environmental science research, such as in the fields of air quality, water quality and potentially many others. The ESAT toolkit is written in Python and Rust, and uses common packages such as numpy, scipy and pandas for data processing. The source apportionment algorithms provided in ESAT include two variants of non-negative matrix factorization (NMF), both of which have been written in Rust and contained within the python package. A collection of data processing and visualization features are included for data and model analytics. The ESAT package includes a synthetic data generator and comparison tools to evaluate ESAT model outputs.
num_resources 2
num_tags 14
title ESAT: Environmental Source Apportionment Toolkit Python package