- 28
- 19 782
Douglas Thain
Приєднався 15 сер 2016
Research highlights and public lectures on distributed systems and scientific computing by Prof. Douglas Thain at the University of Notre Dame. www.nd.edu/~dthain
Autoscaling High-Throughput Workloads on Container Orchestrators
Charles Zheng (University of Notre Dame) presents an introduction to his research paper presented at IEEE Cluster 2020:
Citation:
Chao Zheng, Nathaniel Kremer-Herman, Tim Shaffer, and Douglas Thain, "Autoscaling High-Throughput Workloads on Container Orchestrators", IEEE Conference on Cluster Computing (CLUSTER 2020), September 2020.
Full Paper:
Abstract:
High-throughput computing~(HTC) workloads seek to complete as many jobs as possible over a long period of time. Such workloads require efficient execution of many parallel jobs and can occupy a large number of resources for a long time. As a result, full utilization is the normal state of an HTC facility. The widespread use of container orchestrators eases the deployment of HTC frameworks across different platforms, which also provides an opportunity to scale up HTC workloads with almost infinite resources on the public cloud. However, the autoscaling mechanisms of container orchestrators are primarily designed to support latency-sensitive microservices, and result in unexpected behavior when presented with HTC workloads. In this paper, we design a feedback autoscaler, High Throughput Autoscaler (HTA), that leverages the unique characteristics of the HTC workload to autoscales the resource pools used by HTC workloads on container orchestrators. HTA takes into account a reference input, the real-time status of the jobs' queue, as well as two feedback inputs, resource consumption of jobs, and the resource initialization time of the container orchestrator. We implement HTA using the Makeflow workload manager, Work Queue job scheduler, and the Kubernetes cluster manager. We evaluate its performance on both CPU-bound and IO-bound workloads. The evaluation results show that, by using HTA, we improve resource utilization by 5.6X with a slight increase in execution time (about 15%) for a CPU-bound workload, and shorten the workload execution time by up to 3.65X for an IO-bound workload.
Citation:
Chao Zheng, Nathaniel Kremer-Herman, Tim Shaffer, and Douglas Thain, "Autoscaling High-Throughput Workloads on Container Orchestrators", IEEE Conference on Cluster Computing (CLUSTER 2020), September 2020.
Full Paper:
Abstract:
High-throughput computing~(HTC) workloads seek to complete as many jobs as possible over a long period of time. Such workloads require efficient execution of many parallel jobs and can occupy a large number of resources for a long time. As a result, full utilization is the normal state of an HTC facility. The widespread use of container orchestrators eases the deployment of HTC frameworks across different platforms, which also provides an opportunity to scale up HTC workloads with almost infinite resources on the public cloud. However, the autoscaling mechanisms of container orchestrators are primarily designed to support latency-sensitive microservices, and result in unexpected behavior when presented with HTC workloads. In this paper, we design a feedback autoscaler, High Throughput Autoscaler (HTA), that leverages the unique characteristics of the HTC workload to autoscales the resource pools used by HTC workloads on container orchestrators. HTA takes into account a reference input, the real-time status of the jobs' queue, as well as two feedback inputs, resource consumption of jobs, and the resource initialization time of the container orchestrator. We implement HTA using the Makeflow workload manager, Work Queue job scheduler, and the Kubernetes cluster manager. We evaluate its performance on both CPU-bound and IO-bound workloads. The evaluation results show that, by using HTA, we improve resource utilization by 5.6X with a slight increase in execution time (about 15%) for a CPU-bound workload, and shorten the workload execution time by up to 3.65X for an IO-bound workload.
Переглядів: 107
Відео
Log Discovery for Troubleshooting Open Distributed Systems with TLQ
Переглядів 944 роки тому
Nate Kremer-Herman (University of Notre Dame) presents an introduction to his research paper presented at PEARC 2020: Citation: Nathaniel Kremer-Herman and Douglas Thain, "Log Discovery for Troubleshooting Open Distributed Systems with TLQ", Practice and Experience of Advanced Research Computing (PEARC), July, 2020. Full Paper: ccl.cse.nd.edu/research/papers/tlq-pearc-2020.pdf Abstract: Trouble...
Solving the Container Explosion Problem for Distributed High Throughput Computing
Переглядів 894 роки тому
Tim Shaffer (University of Notre Dame) presents an introduction to his research paper presented at IPDPS 2020: Tim Shaffer, Nicholas Hazekamp, Jakob Blomer, and Douglas Thain, "Solving the Container Explosion Problem for Distributed High Throughput Computing" International Parallel and Distributed Processing Symposium, May, 2020. ccl.cse.nd.edu/research/papers/explosion-ipdps20.pdf Abstract: Co...
Studying Aging in Dynamic Protein-Protein Networks
Переглядів 716 років тому
Summer project by Jon Genty in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Software to Process Malaria Genetic Data for QTL Analysis in a Shared Parent Genetic Cross
Переглядів 436 років тому
Summer project by Anne Freeman in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Classifying Marshland Plant Species by Processing Light Reflectance in Satellite Images
Переглядів 756 років тому
Summer project by Aidan Draper in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Predictive Analytics of Student Performance from Assignment Clickstream Data
Переглядів 1666 років тому
Summer project by Eric Gronda in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Bipartite Matching as a Graph Benchmark
Переглядів 586 років тому
Summer project by Jeremy Speth in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
FLoRIn - Flexible Learning-Free Reconstruction of Neuronal Circuits
Переглядів 446 років тому
Summer project by Kevin Choy in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Exhaustive Heterogeneous Graphlet Counting for Network Alignment
Переглядів 1056 років тому
Summer project by Kendrea Beers in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Designing Diagnostic Genetic Markers for R Pomonella
Переглядів 1406 років тому
Summer project by Jacob Gersfeld in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Lobster: Harnessing Opportunistic Clusters with a Workflow Management Tool for CMS Data Analysis
Переглядів 616 років тому
Summer project by Diego Fernandez in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Deep Learning for Particle Physics: Optimizing Neural Network Structure
Переглядів 3276 років тому
Summer project by Khaya Klanot in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Archiving Workflows in Cloud Based Storage
Переглядів 526 років тому
Summer project by Nicholas Pottiger in the 2018 Data Intensive Scientific Computing Summer REU Program at the University of Notre Dame. disc.crc.nd.edu
Compilers: Code Generation
Переглядів 13 тис.7 років тому
An overview of Chapter 11 in "Introduction to Compilers and Language Design" by Prof. Douglas Thain in the spring 2017 semester of CSE 40243 at the University of Notre Dame.
Scaling up Data Intensive Bioinformatics to Attack Malaria Parasites
Переглядів 3507 років тому
Scaling up Data Intensive Bioinformatics to Attack Malaria Parasites
Using Neural Networks to Identify Particle Collisions
Переглядів 4297 років тому
Using Neural Networks to Identify Particle Collisions
Computational Analysis of Climate Change on Ground Water Models
Переглядів 947 років тому
Computational Analysis of Climate Change on Ground Water Models
Temporal Analysis of Complex Networks
Переглядів 2,8 тис.7 років тому
Temporal Analysis of Complex Networks
Understanding Speciation through Genomic Data Analysis
Переглядів 1797 років тому
Understanding Speciation through Genomic Data Analysis
Analyzing Large Scale Astronomic Data
Переглядів 1007 років тому
Analyzing Large Scale Astronomic Data
Long Tail Analysis in CMS Data Analysis Workloads
Переглядів 1047 років тому
Long Tail Analysis in CMS Data Analysis Workloads
Oh wow!I get to see the man behind the compiler book..Such an amazing book and easy to follow!! Love it 👌🏽👌🏽
Great content, Professor! Have you uploaded any additional videos for this class? If so, where can we find them?
Great content, Professor! Have you uploaded any additional videos for this class? If so, where can we find them?
This is gold.
Best lecture on YT on code generation!
Hi. Can you please provide the image for Basekernel? I'd like to try it out myself.
Thanks for the video!
Thank you
Good stuff bro, thanks!
very good video i would watch a few more...i will take look at the git hub. very useful info.
Professor, Could you please upload other chapter videos?
Is there any actual advantage in making the VM itself use registers? Lua changed their stack for a register bank but I remember that being that advantageous
Sounds exciting!
Great stuff .. it's surprisingly hard to find good lectures on compiler design, so this is a real treasure. Thanks! (keep them coming)
The funny thing about this is this video has very few visitors yet its very enriching. Just 1 347 views (at the time this comment was written). While a video showing girls twerking has over 70,839,329 views. This behavior just goes to show you the kind of people that live in this world.
Yeah to be fair compiler design is a pretty niche thing even if you’re into programming, but everyone likes butts
Haha why do u know the exact number of views on that video. Sounds like you have been studying it closely 🧐🧐
we live in a society😔
I am in CS and I would never look at this voluntarily
Awesome, very good lecture. Thank you.
Hey Pete, really great video you made! Really informative and fun! Glad you had a productive and mind-opening summer. Call home soon!
This is awesome!