Wednesday, October 28 • 2:00pm - 2:40pm
Data Lake on OpenStack - Petabyte Scale!


Got lots of data that you want to make use of? Not so easy to set up and maintain an environment for it, eh? Symantec’s data lake is a large-scale example of marrying OpenStack platform technologies with big data technologies such as Hadoop, Hive, Storm, Kafka, and Spark. This talk will cover what Symantec has done to let our various teams easily leverage our many petabytes of security data to better protect our customers against threats such as APTs, identity thieves, and malicious web sites.

We leverage our OpenStack cloud to create multiple analytics clusters, ranging in size from multiple petabytes down to just a few VMs. We use various OpenStack services through a CloudBreak plug-in. Other technologies we use to set up and operate these clusters include Ambari, Puppet, a home-grown synthetic transaction system, Zabbix, and Dasher.


David T. Lin

Senior Director, Cloud Platform Engineering, Symantec
Cloud Security

Wednesday October 28, 2015 2:00pm - 2:40pm JST
