Wednesday, October 28 • 12:05pm - 12:45pm
OpenStack and Hadoop 101 - Getting Your Big Data Cloud Done Right

The OpenStack and Hadoop ecosystems have both seen rapid growth and adoption over the last few years. Project Sahara was established in the OpenStack community to drive the overall “Data processing on OpenStack” theme, while Hadoop-focused companies such as Cloudera and Hortonworks have been offering their own visions of management frameworks and deployment models around Hadoop.

At Mirantis, we have noticed that many people quickly become confused by the different options for adopting Hadoop. In this talk, we’ll try to address part of this confusion and set the stage for deeper conversations about how OpenStack can help an organization adopt Hadoop. As always, we’ll keep things as vendor-neutral as possible.

Specifically, we’ll talk about the following:

  • An overview and roadmap of the Hadoop ecosystem -- components like YARN, Hive, HBase, and others that constitute a working Big Data solution

  • Management frameworks and deployment models offered by different vendors such as Cloudera and Hortonworks

  • Typical Hadoop logical and physical architecture deployment

  • Architecting OpenStack for Hadoop workloads, including:

    • Picking the right hardware and sizing it properly

    • Doing Storage right (HDFS, Ceph, Swift, direct block device mapping?)

    • Doing Compute right (KVM or Baremetal? Making scheduling work)

    • Doing Networking right (Just Neutron, or do we need full-featured SDN?)

    • Leveraging extras that OpenStack has to offer (multi-tenancy, NUMA, CPU pinning)

By attending this presentation, you will gain a solid, real-world understanding of the benefits of running Hadoop/Big Data workloads on OpenStack and of the ways to build out a working solution.

Sergey Lukjanov

Principal Software Engineer, Mirantis
Sergey is the Project Technical Lead of the OpenStack Data Processing program ("Sahara", formerly "Savanna") and has been involved in the project since its first days. His main responsibilities include architecture design and community-related work in Sahara. He is also a top contributor to and reviewer of Sahara, and he oversees all Launchpad and Gerrit activity. Sergey is experienced in Big Data projects and technologies (Hadoop, HDFS, Cassandra, Twitter...

Trevor McKay

Senior Software Engineer, Red Hat, Inc.
Trevor McKay is a Sahara core team member and has been working on Sahara since mid-2013. He is one of the designers of Sahara's Elastic Data Processing (EDP) facilities and continues to be a primary contributor to EDP. Trevor has broad experience in distributed computing, user interface development, client-server applications, and control systems.

Dmitriy Novakovskiy

Solutions Architect, Mirantis
Dmitriy Novakovskiy, Solutions Architect at Mirantis, is responsible for shaping technical engagements with new customers and partner prospects. Dmitriy has been with Mirantis for 2.5 years, enjoying life on the bleeding edge of OpenStack cloud technology.

Wednesday October 28, 2015 12:05pm - 12:45pm