A Scalable Local Algorithm for Distributed Multivariate Regression

This paper offers a local distributed algorithm for multivariate regression in large peer-to-peer environments. The algorithm can be used for distributed inferencing, data compaction, data modeling and classification tasks in many emerging peer-to-peer applications for bioinformatics, astronomy, social networking, sensor networks and web mining. Computing a global regression model from data available at the different peer-nodes using a traditional centralized algorithm for regression can be very costly and impractical because of the large number of data sources, the asynchronous nature of the peer-to-peer networks, and dynamic nature of the data/network. This paper proposes a two-step approach to deal with this problem. First, it offers an efficient local distributed algorithm that monitors the quality of the current regression model. If the model is outdated, it uses this algorithm as a feedback mechanism for rebuilding the model. The local nature of the monitoring algorithm guarantees low monitoring cost. Experimental results presented in this paper strongly support the theoretical claims.

Data e Risorse

Campo Valore
accessLevel public
accrualPeriodicity irregular
bureauCode {026:00}
catalog_@context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
catalog_@id https://data.nasa.gov/data.json
catalog_conformsTo https://project-open-data.cio.gov/v1.1/schema
catalog_describedBy https://project-open-data.cio.gov/v1.1/schema/catalog.json
identifier DASHLINK_179
issued 2010-09-22
landingPage https://c3.nasa.gov/dashlink/resources/179/
modified 2020-01-29
programCode {026:029}
publisher Dashlink
resource-type Dataset
source_datajson_identifier true
source_hash 746fe6ca5649cecc6048ee5fcaaeca2593264c59
source_schema_version 1.1
Gruppi
  • AmeriGEOSS
  • National Provider
  • North America
Tag
  • amerigeo
  • amerigeoss
  • ames
  • ckan
  • dashlink
  • geo
  • geoss
  • nasa
  • national
  • north-america
  • united-states
isopen False
license_id notspecified
license_title License not specified
maintainer Kanishka Bhaduri
maintainer_email kanishka.bhaduri-1@nasa.gov
metadata_created 2025-11-22T22:06:20.787908
metadata_modified 2025-11-22T22:06:20.787912
notes This paper offers a local distributed algorithm for multivariate regression in large peer-to-peer environments. The algorithm can be used for distributed inferencing, data compaction, data modeling and classification tasks in many emerging peer-to-peer applications for bioinformatics, astronomy, social networking, sensor networks and web mining. Computing a global regression model from data available at the different peer-nodes using a traditional centralized algorithm for regression can be very costly and impractical because of the large number of data sources, the asynchronous nature of the peer-to-peer networks, and dynamic nature of the data/network. This paper proposes a two-step approach to deal with this problem. First, it offers an efficient local distributed algorithm that monitors the quality of the current regression model. If the model is outdated, it uses this algorithm as a feedback mechanism for rebuilding the model. The local nature of the monitoring algorithm guarantees low monitoring cost. Experimental results presented in this paper strongly support the theoretical claims.
num_resources 1
num_tags 11
title A Scalable Local Algorithm for Distributed Multivariate Regression