About the Trainer
Zubair Nabi has degrees in Engineering and Computer Science from NU-FAST, LUMS, and Cambridge. He is currently a part of the faculty at ITU, Punjab. Zubair's research interests broadly include datacenter networks, data intensive computing, and cloud computing. On and off, he also likes to dabble in pervasive computing, p2p systems, VANETs, and the semantic web. In addition, he is also an open-source buff and has contributed to a number of open source projects. Most notably, over the course of multiple Google Summers of Code, he has been involved in the development of a global Internet connectivity monitor that leverages crowd-sourcing of desktop and mobile devices to act as a watch-dog against Internet censorship. He is currently working on MR+, a radically new and improved implementation of Google's MapReduce paradigm, and Polyversal TCP, an evolutionary version of TCP that embraces network heterogeneity to allow transport customization.
He strongly believes that the future of computing lies in Cloud Computing and Big Data; the power of both can be leveraged to enable unprecedented data analytics to improve the human condition. Using this workshop as a conduit, he aims to equip practitioners from the local software industry with the right tools to ride the Cloud Computing and Big Data wave.
Workshop Plan |
|||
Day 1 | The Cloud | ||
Topic 1: | Introduction to Big Data and Cloud Computing | ||
Topic 2: | Cloud Computing paradigms and models | ||
Topic 3: | Introduction to Large-scale Distributed Systems | ||
Lab 1: | Amazon EC2 primer | ||
Day 2 | MapReduce | ||
Topic 1: | Introduction to Parallelization | ||
Topic 2: | MapReduce: Theory and Implementation | ||
Topic 3: | Designing MapReduce applications | ||
Lab 2: | Writing a MapReduce application | ||
Day 3 | Beyond MapReduce | ||
Topic 1: | Shortcomings in the MapReduce paradigm | ||
Topic 2: | Alternative architectures: Dryad and CIEL | ||
Topic 3: | Introduction to MR+ | ||
Lab 3: | Writing an application using CIEL | ||
Day 4 | Distributed Storage | ||
Topic 1: | Storage paradigms and taxonomy of datasets | ||
Topic 2: | Distributed filesystems | ||
Topic 3: | RDBMS, NoSQL, and NewSQL storage | ||
Lab 4: | Writing an application on top of distributed storage | ||
Day 5 | Data Center Stack | ||
Topic 1: | Programming environments for the Cloud | ||
Topic 2: | Operating System stacks and virtualization | ||
Topic 3: | Introduction to Data Center design and applications | ||
Lab 5: | Interconnecting a Data Center using mininet | ||