2024-11-102015978-1-4673-7278-72379-770310.1109/BigDataCongress.2015.272-s2.0-84959540956http://dx.doi.org/10.1109/BigDataCongress.2015.27https://hdl.handle.net/20.500.14288/15962We propose optimal and efficient distributed online learning strategies for Big Data applications. Here, we consider the optimal state estimation over distributed network of autonomous data sources. The autonomous data sources can generate and process data locally irrespective of any centralized control unit. We seek to enhance the learning rate through the distributed control of those autonomous data sources. We emphasize that although this problem attracted significant attention and extensively studied in different fields including services computing and machine learning disciplines, all the well-known strategies achieve suboptimal online learning performance in the mean square error sense. To this end, we introduce the oracle algorithm as the optimal distributed online learning strategy. We also propose the optimal and efficient distributed online learning algorithm that reduces the communication load tremendously, i.e., requires the undirected disclosure of only a single scalar. Finally, we demonstrate the significant performance gains due to the proposed strategies with respect to the state-of-the-art approaches.Computer scienceTheory methodsEngineeringElectrical electronic engineeringOptimal and efficient distributed online learning for big dataConference proceeding380443700017N/A1929