Cloud Computing Workshop

Neighborhood for Emerging-World Technologies-LUMS, the Information Technology University (ITU-Punjab) and the Punjab Information Technology Board (PITB) organized the first ever hands-on cloud computing workshop in Pakistan on 17th – 21st April, 2013 at Arfa Software Technology Park. This workshop had an aim to train professionals from the local software industry on the design and implementation of large-scale distributed systems and cloud computing technologies.

About the Trainer

Zubair Nabi has degrees in Engineering and Computer Science from NU-FAST, LUMS, and Cambridge. He is currently a part of the faculty at ITU, Punjab. Zubair's research interests broadly include datacenter networks, data intensive computing, and cloud computing. On and off, he also likes to dabble in pervasive computing, p2p systems, VANETs, and the semantic web. In addition, he is also an open-source buff and has contributed to a number of open source projects. Most notably, over the course of multiple Google Summers of Code, he has been involved in the development of a global Internet connectivity monitor that leverages crowd-sourcing of desktop and mobile devices to act as a watch-dog against Internet censorship. He is currently working on MR+, a radically new and improved implementation of Google's MapReduce paradigm, and Polyversal TCP, an evolutionary version of TCP that embraces network heterogeneity to allow transport customization.

He strongly believes that the future of computing lies in Cloud Computing and Big Data; the power of both can be leveraged to enable unprecedented data analytics to improve the human condition. Using this workshop as a conduit, he aims to equip practitioners from the local software industry with the right tools to ride the Cloud Computing and Big Data wave.

Workshop Plan

Day 1   The Cloud
Topic 1:   Introduction to Big Data and Cloud Computing

Topic 2:   Cloud Computing paradigms and models
Topic 3:   Introduction to Large-scale Distributed Systems

Lab 1:   Amazon EC2 primer
 
Day 2   MapReduce
Topic 1:   Introduction to Parallelization
Topic 2:   MapReduce: Theory and Implementation
Topic 3:   Designing MapReduce applications
Lab 2:   Writing a MapReduce application
     
Day 3   Beyond MapReduce
Topic 1:   Shortcomings in the MapReduce paradigm
Topic 2:   Alternative architectures: Dryad and CIEL
Topic 3:   Introduction to MR+
Lab 3:   Writing an application using CIEL
     
Day 4   Distributed Storage
Topic 1:   Storage paradigms and taxonomy of datasets  
Topic 2:   Distributed filesystems
Topic 3:   RDBMS, NoSQL, and NewSQL storage
Lab 4:   Writing an application on top of distributed storage
     
Day 5   Data Center Stack
Topic 1:   Programming environments for the Cloud
Topic 2:   Operating System stacks and virtualization
Topic 3:   Introduction to Data Center design and applications
Lab 5:   Interconnecting a Data Center using mininet