*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************
Title: Towards Deep Learning System and Algorithm Co-design
Date: Monday, September 20th, 2021
Time: 3:30-5:30pm (ET)
Location: https://bluejeans.com/349424919/2655
Yanzhao Wu
Ph.D. Student
School of Computer Science
Georgia Institute of Technology
Committee
———————
Dr. Ling Liu (Advisor, School of Computer Science, Georgia Institute of Technology)
Dr. Calton Pu (Co-Advisor, School of Computer Science, Georgia Institute of Technology)
Dr. Greg Eisenhauer (School of Computer Science, Georgia Institute of Technology)
Dr. Shamkant Navathe (School of Computer Science, Georgia Institute of Technology)
Dr. Lakshmish Ramaswamy (Department of Computer Science, University of Georgia)
Abstract
———————
Big data powered deep learning (DL) systems and applications have blossomed in recent years. In addition to the demands for more accurate deep neural network (DNN) models, we also witness the growing interests in deploying model inference and model learning to the edge of the Internet, where data are generated, demanding deep learning system and algorithm co-design for performance optimization of deep learning systems and deep learning as a service. This dissertation research takes a holistic approach to promote the deep learning system and algorithm co-design with three original contributions.
First, we developed a methodical approach to configuration management of deep learning frameworks by exploring the intrinsic correlations between system-level parameters and algorithm specific hyperparameters and how different combinations may impact the performance of deep learning models. The core system parameters include configurations of CPU, memory, GPU, parallel processing and multi-thread management, and the DNN algorithm specific hyperparameters, including learning rate policies, loss functions and optimizers, batch size, and so forth. For example, we characterized the CPU/GPU resource usage patterns under different configurations and different DL frameworks to obtain an in-depth understanding of how varying batch sizes and learning rate policies may impact the model performance. We also provide a set of metrics for evaluating and selecting learning rate policies, including the classification confidence, variance, cost, and robustness. Two benchmarking tools, GTDLBench and LRBench, are made publicly available.
Second, we develop a systematic framework for creating ensembles of failure independent models by leveraging system and algorithm co-design for prediction fusion through diversity based hierarchical ensemble optimizations. We introduce a hierarchical diversity concept to capture diversity through high failure independence and low negative correlation. We develop focal-model based ensemble diversity metrics to compose high quality ensembles with complimentary member models, which effectively boosts the overall accuracy of ensemble learning. We develop ensemble selection algorithms based on a suite of ensemble pruning strategies, which select ensemble teams of high diversity, and remove low diversity ensembles. Our formal analysis and empirical results demonstrate the effectiveness of our system and algorithm co-design for high diversity ensemble learning. Our EmsembleBench tool has been used in adversarial learning for improving robustness under single task learners such as image classifiers and multi-task learners such as real time video object detection.
Last but not least, we have leveraged the system and algorithm co-design through a suite of optimization techniques to enable DNN inference and DNN learning at the edge. For example, edge video analytics is a core component for many real time deep learning systems, such as autonomous driving, video surveillance, Internet of smart cameras. Edge server load surge and Wi-Fi network bandwidth saturation can further aggravate the mismatch between incoming video streaming rate in #frames per second (FPS) and the detection processing performance, which often results in random frame dropping. We explore both detection model parallel execution approach and utility-aware data reduction techniques. The former explores multi-model multi-device detection parallelism for fast object detection at the edge to meet the runtime performance requirements. The latter leverages adaptive importance sampling techniques to improve both the throughput and accuracy of edge video analytics.
In this proposal exam, I will present the design and development of diversity based ensemble algorithms, the implementation of EnsembleBench, and the extensive evaluation of our ensemble methods.