Skip to content

Details

6:00 PM- 6:30 PM: drinks, mingling

6:30 PM - 8:30PM: Hands-on: Data Science at Scale with HAWQ and MADlib and Hadoop

In this Meetup we’ll learn about Apache HAWQ (http://hawq.incubator.apache.org/), the elastic, parallel processing query engine that operates on all your data directly within Hadoop. We’ll also learn about Apache MADlib (http://madlib.incubator.apache.org/), the big data machine-learning library that provides commonly used data science algorithms capable of leveraging the parallel processing capabilities of HAWQ.

The main part of this event will be a guided hands-on where we use Apache Zeppelin as the notebook to perform a data science investigation of our data in Hadoop by invoking MADlib functions in Python, R, and directly with SQL.

Feel free to come watch the extended demonstration. If you want to play-along with your own sandbox, please bring a system that meets these minimum requirements. The software will be distributed by a USB drive:

· VirtualBox 4.2 or later, or VMWare 5.0 or later installed Pre-downloaded Sandbox VM with HAWQ

· 15 GBs free disk space

This meetup will be at new location @ WEWORK MARKET ST.

1601 Market Street Philadelphia PA 19103 (19th floor)

About our sponsor:

WeWork is a community for creators. We transform buildings into

beautiful, collaborative workspaces and provide the infrastructure, services,

events and technology so our members can focus on doing what they love.

WeWork currently has 111 locations in 29 cities across the world with over

70,000 members. Book a tour at wework.com now!

Sponsors

Sponsor logo
Cloudera
We deliver an enterprise data cloud for any data from the Edge to AI.
Sponsor logo
WeWork
Location .
Sponsor logo
MeetMe.com
conference area and food
Sponsor logo
AmerisourceBergen Corporation
AmerisourceBergen is hosting the Apache Spark and Zeppelin workshop
Sponsor logo
EPAM
food, hosting space
Sponsor logo
pivotal
speaker

Members are also interested in