Loading…
This event has ended. View the official site or create your own event → Check it out
This event has ended. Create your own
ABOUT THE SUMMIT
Welcome to Apache Cassandra Summit Europe 2013. Join us and hundreds of your database peers, and take a deep dive into Apache Cassandra, the massively scalable NoSQL database that powers global businesses like Netflix, Adobe, Thomson Reuters and eBay.

REGISTER NOW > 
View analytic

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Wednesday, October 16
 

8:00am

Registration and Breakfast
Registration opens for Workshop attendees. Breakfast is served.

*IMPORTANT: Bringing a laptop is completely optional. You are equally welcome to sit back and experience the activities being demonstrated by the instructor.

If you'd like to follow along with the hands-on activities during the workshop on a computer, you'll want to bring a 64-bit laptop that has 8GB of RAM and supports VMware.

More specifically, to use the Cassandra virtual machine image that we will be distributing on USB drives at the start of the session, you'll need the VMware player for your operating system:


Wednesday October 16, 2013 8:00am - 9:00am
Barbican Centre Silk Street, London EC2Y 8DS, UK

9:00am

Creating an Application in Cassandra
Attendees will learn how to create an application in Cassandra.

*IMPORTANT: Bringing a laptop is completely optional. You are equally welcome to sit back and experience the activities being demonstrated by the instructor.

If you'd like to follow along with the hands-on activities during the workshop on a computer, you'll want to bring a 64-bit laptop that has 8GB of RAM and supports VMware.

More specifically, to use the Cassandra virtual machine image that we will be distributing on USB drives at the start of the session, you'll need the VMware player for your operating system:


Wednesday October 16, 2013 9:00am - 12:00pm
Barbican Centre Silk Street, London EC2Y 8DS, UK

12:00pm

Lunch
Wednesday October 16, 2013 12:00pm - 1:00pm
Barbican Centre Silk Street, London EC2Y 8DS, UK

1:00pm

Apache Cassandra Developer Starter Kit
Students will learn about the Apache Cassandra Developer Starter Kit.

*IMPORTANT: Bringing a laptop is completely optional. You are equally welcome to sit back and experience the activities being demonstrated by the instructor.

If you'd like to follow along with the hands-on activities during the workshop on a computer, you'll want to bring a 64-bit laptop that has 8GB of RAM and supports VMware.

More specifically, to use the Cassandra virtual machine image that we will be distributing on USB drives at the start of the session, you'll need the VMware player for your operating system:


Wednesday October 16, 2013 1:00pm - 6:00pm
Barbican Centre Silk Street, London EC2Y 8DS, UK

6:30pm

Cassandra Summit EU Meetup — In Association with Big Data London

This month we have a very special meetup to coincide with theCassandra Summit and in association with the Big Data Londongroup. We have a stellar line up of Aaron Morton, Tim Moreton, Patrick McFadin, Matt Kennedy and Al Tobey who will take part in a panel discussion around the topic of "Cassandra in the Real World".

Refreshments will be provided, sponsored by Acunu, Datastax and Fusion.io.

RSVP: http://www.meetup.com/Cassandra-London/events/142497992/

Wednesday October 16, 2013 6:30pm - 9:00pm
Barbican Centre Silk Street, London EC2Y 8DS, UK
 
Thursday, October 17
 

7:45am

Registration & Breakfast
Thursday October 17, 2013 7:45am - 8:40am
Conservatory Room

8:40am

Keynote
Keynote Presentation by Jonathan Ellis

Speakers
avatar for Jonathan Ellis

Jonathan Ellis

Apache Cassandra Chair & CTO + Co-Founder, DataStax
Jonathan is CTO and co-founder at DataStax. Prior to DataStax, Jonathan worked extensively with Apache Cassandra while employed at Racksace. Prior to Rackspace, Jonathan built a multi-petabyte, scalable storage system based on Reed-Solomon encoding for backup provider Mozy.


Thursday October 17, 2013 8:40am - 9:40am
Auditorium Barbican Centre, Silk Street, London EC2Y 8DS, UK

9:50am

Real-Time BI with Cassandra

Cassandra is a superb platform for building scalable real-time analytics applications. And the highest value applications will be those that put simple visualizations of critical KPIs in the hands of business users.

 But there's a tension between your development team crafting complex, denormalized data models in CQL and your business users constantly evolving their understanding of the data and their analytic needs.

 Acunu Analytics helps to resolve that gap by using Cassandra to continuously maintain OLAP cubes that support instant, SQL-like queries and offering those up through a BI dashboarding tool. 

 In this talk, I'm going to explore these concepts, and dig into how we've automated the process of going from a stream of JSON events to a live dashboard of the indicators business users need.


Speakers
avatar for Tim Moreton

Tim Moreton

Founder & CTO, Acunu
As a co-founder and CTO of Acunu Tim was instrumental in delivering Acunu Analytics, the leading Cassandra-based operational intelligence platform. Tim was previously a senior member of the technical team at Tideway (now BMC), where he led the creation of solutions for managing data centers at Fortune 500 clients. Previously he was CEO of a consultancy delivering data solutions for the aviation sector. He holds a PhD in Computer Science from... Read More →


Thursday October 17, 2013 9:50am - 10:30am
Auditorium Barbican Centre, Silk Street, London EC2Y 8DS, UK

9:50am

Delivering Christmas Gifts in France Since 2012
Every year more and more people buy their Christmas gifts online and that gifts are delivered by the postal service of France "La Poste". At the end of the infrastructure a (not so) little MySQL was struggling for survival against that overwhelming load. Then, in 2011, MySQL hit its limit... Come hear the true story of La Poste switching its parcel management, PHP-based application, from MySQL to Cassandra in 3 weeks. You'll be taught about the details of the project constraints, how to use Cassandra from PHP, the migration plan, how to manage resilience testing, deploy your Cassandra with Puppet, and all the wonderful knowledge we accumulate through this project.

Speakers
avatar for Matthieu Nantern

Matthieu Nantern

Software Engineer, Xebia
Matthieu works at Xebia with focus on Big Data and automated deployment. During the last couple years he has dealt with high-performance applications for the postal service of France. He has replaced MySQL by Cassandra for a critical service with 30% grow by year.


Thursday October 17, 2013 9:50am - 10:30am
Frobisher 5 & 6 Barbican Centre, Silk Street, London EC2Y 8DS, UK

9:50am

Blueflood: Simple Metrics Processing
Rackspace needed a metrics system that could ingest 30 million signals generated from the Cloud Monitoring system. It had to offer custom data retention levels and still be able to offer graphs to customers in real-time. Gary and his team created a distributed system of shared-nothing nodes on top of Cassandra that split the responsibilities of: ingesting data, processing rollups, servicing data points for reads. Depending on the need, nodes can be easily reconfigured to support all or some of those functions. In this session you you will learn about techniques for scheduling rollups and still maintaining numerical accuracy, how to handled non-numerical data points, how to utilize open-source technology (Apache Cassandra, Scribe, Thrift, and Node.js) to deliver results relatively quickly and much more.

Speakers
avatar for Gary Dusbabek

Gary Dusbabek

Apache Cassandra Committer and Systems Architect, Rackspace Hosting
An Apache Cassandra committer and PMC member, Gary Dusbabek is a life-long programmer specializing in distributed systems. His past experience includes working with large-scale text and image indexes in the newspaper industry and high-volume advertisement booking software. Recent work at Rackspace includes working on Cassandra full-time and being a founding member of the Cloud Monitoring team. Gary currently works on the Rackspace Service... Read More →


Thursday October 17, 2013 9:50am - 10:30am
Frobisher 2 & 3 Barbican Centre, Silk Street, London EC2Y 8DS, UK

9:50am

Throwdown: CQL3 Binary Protocol vs CQL Thrift vs Thrift API

There are a number of different ways to interact with Cassandra: CQL3 and its binary protocol being the latest one, and even Netflix's Astynax with its token awareness. In this session we will give a run down on the performance of the different client options based on real world benchmarks. This will allow you to choose the best option for your use case and highlight a compelling customer example.


Speakers
avatar for Michael Masterson

Michael Masterson

Director, Strategic Business Development, Compuware APM
Results driven technology executive with 12 years of experience in Social and Enterprise Software. I currently lead business development and strategy for Compuware APM with a focus on emerging markets in Big Data and Cloud. Previously I held product management, strategy, and engineering roles at IBM. I have a passion for product, mindset for the forest and the tree, perseverance to build channel ecosystems, and excel at the... Read More →


Thursday October 17, 2013 9:50am - 10:40am
Frobisher 4 Barbican Centre, Silk Street, London EC2Y 8DS, UK

10:30am

Morning Break
Morning Break

Thursday October 17, 2013 10:30am - 11:00am
Barbican Centre Silk Street, London EC2Y 8DS, UK

11:00am

Leveraging the Power of Cassandra: Operational Reporting and Interactive Analysis
Speakers
avatar for Ernesto Ongaro

Ernesto Ongaro

Senior Sales Engineer, Jaspersoft


Thursday October 17, 2013 11:00am - 11:40am
Frobisher 4 Barbican Centre, Silk Street, London EC2Y 8DS, UK

11:00am

Leveraging the Power of Cassandra: Operational Reporting and Interactive Analysis
TBA

Speakers
avatar for Neil Barry

Neil Barry

Manager of Northern Europe, Jaspersoft
As Manager Northern Europe at Jaspersoft, Neil Barry is responsible for driving both direct and channel sales and business development throughout this region. Jaspersoft is the market leader in Open Source Business Intelligence, the most world’s most widely used BI software, with more than 14.5 million total downloads worldwide and more than 14,000 commercial customers in over 100 countries. Prior to Jaspersoft, Neil had... Read More →


Thursday October 17, 2013 11:00am - 11:40am
Frobisher 4 Barbican Centre, Silk Street, London EC2Y 8DS, UK

11:00am

The Cassandra Experience at Orange
At Orange, Jean Armel has helped develop an open source tool for the migration of data to Cassandra; Jean and his team were in need of the NoSQL solution Apache Cassandra in order to sustain the growth of requests and volume of data required by their application PnS. In this session, Jean Armel will start out with an overview of the Orange application PnS and dive into why they chose Apache Cassandra how they did their data migration without any interruption of service. Jean Armel will also show how his application behaves after the migration

Speakers
avatar for Jean Armel Luce

Jean Armel Luce

Senior Software Engineer/Cassandra Admin, Orange
Jean Armel is a Senior Software Engineer at Orange, with more than 20 years of software development in various environments. During the last couple years, he has had to deal with some applications using large databases that require scalability and high availability. A few years ago, Jean had learned about NoSQL and did a large study about a few NoSQL databases (Cassandra, MongoDB, HBase, Hypertable, Riak, …). This study focused on... Read More →


Thursday October 17, 2013 11:00am - 11:40am
Frobisher 5 & 6 Barbican Centre, Silk Street, London EC2Y 8DS, UK

11:00am

No Whistling Required: Cabs, Cassandra, and Hailo
Hailo has leveraged Cassandra to build one of the most successful startups in European history. This presentations looks at how Hailo grew from a simple MySQL-backed infrastructure to a resilient Cassandra-backed system running in three data centres globally. Topics covered include: the process of migration, experience running multi-DC on AWS, common data modeling patterns and security implications for achieving PCI compliance.

Speakers
avatar for Dave Gardner

Dave Gardner

Architect, Hailo
Dave Gardner is part of the services team at Hailo - the taxi app company. He has helped build the Hailo platform that brings together taxi drivers and passengers in 6 cities across the globe and counting, focussing on availability and resilience. Dave has worked with Apache Cassandra since 2010 and is the founder of the Cassandra London user group.


Thursday October 17, 2013 11:00am - 11:40am
Frobisher 2 & 3 Barbican Centre, Silk Street, London EC2Y 8DS, UK

11:00am

The State of CQL
Since its inception, the Cassandra Query Language (CQL) has grown and matured, resulting in the 3rd version of the language (CQL3) being finalized in Cassandra 1.2 and further improved in Cassandra 2.0. Compared to the legacy Thrift API, CQL3 aims at providing an API that is higher level, more user friendly, but still fully assumes the distributed nature of Cassandra and it's storage engine. This talk will present CQL3, describing the reasoning and goals behind the language as well as the language itself. We will also touch on CQL's relationship with Thrift and will present the CQL binary protocol that has been introduced in Cassandra 1.2. We will wrap up by discussing the future of CQL.

Speakers
avatar for Sylvain Lebresne

Sylvain Lebresne

Software Engineer, DataStax
Sylvain works at DataStax on the Apache Cassandra project. Project for which he is a core contributor, committer and PMC member. He has, amongst other things, played a prominent role in the development of CQL3 and its binary protocol. In a former life, he obtained a Ph.D. in Computer Science from Paris 7's University.


Thursday October 17, 2013 11:00am - 11:40am
Auditorium Barbican Centre, Silk Street, London EC2Y 8DS, UK

11:45am

Apache Cassandra 2.0 — Data Model on Fire

Functional data models are great, but how can you squeeze out more  performance and make them awesome! Let's talk through some example Cassandra 2.0 models, go through the tuning steps and understand the tradeoffs. Many time's just a simple understanding of the underlying Cassandra 2.0 internals can make all the difference. I've helped some of the biggest companies in the world do this  and I can help you. Do you feel the need for Cassandra 2.0 speed?


Speakers
avatar for Patrick McFadin

Patrick McFadin

Chief Evangelist, DataStax
Prior to working for DataStax, Patrick was the Chief Architect at Hobsons, an education services company. His responsibilities included ensuring product availability and scaling for all higher education products. Prior to this position, he was the Director of Engineering at Hobsons which he came to after they acquired his company, Link-11 Systems, a software services company. While at Link-11 Systems, he built the first widely popular CRM... Read More →


Thursday October 17, 2013 11:45am - 12:25pm
Auditorium Barbican Centre, Silk Street, London EC2Y 8DS, UK

11:45am

One Million Books: Adventures in Discoverability with Cassandra and Solr
For any venture, storing your data is just the first step in making sense of it. How do you make your system discoverable? How do you tune your relevancy to accommodate real-time updates? In this session, we explore pairing Cassandra with Solr using Datastax Enterprise Search, and look at different search paradigms to help your users find patterns in your data.

Speakers
avatar for Patricia Gorla

Patricia Gorla

Systems Engineer, Opensource Connections
Patricia is a systems engineer with OpenSource Connections. Starting with Python application development, Patricia moved to data analysis after becoming fascinated with machine learning. From there, she has worked on many full-stack data projects: gathering and scrubbing the data, running analyses, and developing custom visualizations to lay out the information. She is passionate about information retrieval, and loves tackling the challenges... Read More →


Thursday October 17, 2013 11:45am - 12:25pm
Frobisher 5 & 6 Barbican Centre, Silk Street, London EC2Y 8DS, UK

11:45am

Being Closer to Cassandra
Odnoklassniki uses cassandra for its business data, which doesn't fit into RAM. This data is typically fast growing, frequently accessed by our users and must be always available, because it constitute our primary business as a social network. The way we use cassandra is somewhat unusual - we don't use thrift or netty based native protocol to communicate with cassandra nodes remotely. Instead, we co-locate cassandra nodes in the same JVM with business service logic, exposing not generic data manipulation, but business level interface remotely. This way, we avoid extra network roundtrips within a single business transaction and use internal calls to Cassandra classes to get information faster. Also, this helps us to create many small hacks on Cassandra's internals, making huge gains on efficiency and ease of distributed server development.

Speakers
avatar for Oleg Anastasyev

Oleg Anastasyev

Lead Platform Developer, Odnoklassniki.ru
Oleg Anastasyev started his career in computer programming in 1995. He developed banking, telecom, public transportation software as well as software for the government of Latvia.  Oleg is a leading developer at Odnoklassniki.ru since 2007. His primary responsibilities as a Platform Team member are development of architectures and solutions for highly loaded as well as big data services, solving performance and availability problems... Read More →


Thursday October 17, 2013 11:45am - 12:25pm
Frobisher 4 Barbican Centre, Silk Street, London EC2Y 8DS, UK

11:45am

Top-K Queries in Realtime with Cassandra and Intravert
Performing ranking queries to find the most relevant documents, most popular urls, etc on huge datasets is trivial —if you're willing to wait a while for the answers. For those with less time to waste, this session describes techniques for performing such queries efficiently. We'll describe the ranking queries problem, outline the Cassandra CQL3 data structures and code that can be used to solve it and describe the trade-offs available. We describe intravert, an innovative server-side programming solution for Cassandra, and show how it can be used to reduce network usage and improve performance by filtering data closer to source.

Speakers
JH

Jonathan Halliday

Core Developer, JBoss
Jonathan Halliday is a core developer at JBoss, where he builds open source solutions for big-data analytics.
RV

Rui Vieira

Postgrad Student, Newcastle University
Rui Vieira is a postgraduate student at Newcastle University, researching the adaptation of statistical algorithms to modern nosql execution environments.


Thursday October 17, 2013 11:45am - 12:25pm
Frobisher 2 & 3 Barbican Centre, Silk Street, London EC2Y 8DS, UK

12:30pm

Lunch
Lunch

Thursday October 17, 2013 12:30pm - 1:45pm
Barbican Centre Silk Street, London EC2Y 8DS, UK

1:45pm

Blending Cassandra Data Into The Mix

Traditionally, data is delivered to business analytics tools through a relational database. However, there are cases where that can be inconvenient, for example when the volume of data is just too high or when you can’t wait until the database tables are updated.

This presentation by Pentaho Kettle founder Matt Casters will demonstrate a solution of data 'Blending', which allows a data integration user to create a transformation capable of delivering data directly to Pentaho - and other - business analytics tools. Matt will demonstrate taking data from Cassandra, and blending it with other data from both SQL and NoSQL sources, and then visualizing that data. Matt will explain how it becomes possible to create a virtual “database” with “tables” where the data actually comes from a transformation step.


Speakers
avatar for Matt Casters

Matt Casters

Chief Architect, PDI/Kettle Project Founder, Pentaho
I have been an independent BI consultant for many years and implemented numerous data warehouses and BI solutions for large companies. For the last 7 years I have been very busy writing an ETL tool called Kettle. This tool was open sourced in December 2005 and acquired by Pentaho Open Source BI early in 2006. As such I’m now Chief Data Integration for Pentaho mainly doing lead development for Kettle a.k.a. Pentaho Data Integration.


Thursday October 17, 2013 1:45pm - 2:25pm
Frobisher 4 Barbican Centre, Silk Street, London EC2Y 8DS, UK

1:45pm

Cassandra Adoption at Sky and Live Data Centre Migration
How to bring up a new data center and take down the old one with zero downtime, using Apache Cassandra.

Speakers
avatar for Paul Makkar

Paul Makkar

DevOps, Sky (UK)
Devops/DBA/SysAdmin


Thursday October 17, 2013 1:45pm - 2:25pm
Frobisher 5 & 6 Barbican Centre, Silk Street, London EC2Y 8DS, UK

1:45pm

Cassandra on Flash: Performance & Efficiency Lessons Learned
Flash Memory technology, deployed as server-side PCIe or solid state disks (SSDs), is emerging as a critical tool for performance and efficiency in data centers of all scales. This presentation will discuss how the use of Flash impacts Cassandra deployments in terms of configuration, DRAM requirements and performance expectations. Ideas on leveraging C*'s cutting-edge data-center awareness to blend flash and disk storage nodes for cost and workload efficiency will also be shared. Flash media itself will be examined from a physical perspective to understand endurance issues. Data on write amplification under bulk-load and operational workload conditions will be presented to explain the impact to Flash of C*'s Log Structured Merge Tree architecture and the associated compactions. Finally, we will examine strategies to make Cassandra more Flash-aware using both conventional techniques as well as emerging Non-volatile memory (NVM) programming capabilities. Lessons learned from real-world customer deployments will be shared to complete this presentation.

Speakers
avatar for Matt Kennedy

Matt Kennedy

Solution Architect: Big Data, Fusion.io
Matt Kennedy is a Solutions Architect at Fusion-IO. He works with customers to apply the performance benefits of low-latency server-side Flash technology to NoSQL databases, Hadoop clusters and other Big Data technologies. Matt has been a Cassandra user and occasional contributor since version 0.7 and is a co-organizer of the Cassandra meetup group in the Washington DC area.


Thursday October 17, 2013 1:45pm - 2:25pm
Auditorium Barbican Centre, Silk Street, London EC2Y 8DS, UK

1:45pm

Cassandra Made Simple with CQL Drivers and DevCenter

Is Cassandra too complex for newcomers? Besides the many improvements that have made their way into C* to dramatically simplify it and the finalized Cassandra Query Language, the SQL-like query language that should look familiar, new drivers have been created with a modern and efficient API. Last, but not least, DataStax is introducing *today* DevCenter 1.0, an IDE whose goal is to simplify developer's workflow even further and to improve their productivity. This presentation will show you how putting to work the new drivers and DevCenter can make your daily life with Cassandra nicer!


Speakers
avatar for Michaël Figuiere

Michaël Figuiere

Software Engineer, DataStax
Michael is an engineer and a developer advocate at DataStax where he actively works to improve Cassandra. At ease with both Enterprise Java and lower level technologies, he specializes in distributed architectures and topics such as NoSQL, search engines, and data processing. He often speaks about NoSQL in conferences and User Groups and loves to write about his favorite topics.
avatar for Alex Popescu

Alex Popescu

Senior Product Manager, DataStax
Building on my experience as a software developer that loves first hand feedback from his users, in the last few years my role involved the product management of a large rich media content platform for senior technical people and also a new NoSQL database. According to my superiors, I've been very successful in aligning the business strategy with the requirements of the end users, prioritizing and delivering timely results together with my... Read More →


Thursday October 17, 2013 1:45pm - 2:25pm
Frobisher 2 & 3 Barbican Centre, Silk Street, London EC2Y 8DS, UK

2:30pm

Capitalizing on Data in Telecommunications: The Cassandra Way
Telecommunications mediation is the process of handling Call Data Records (CDR’s) coming through specialized hardware switches that are connected to the infrastructure of the telecommunications company. The standard industry requirements of these processes are the archiving of original raw data for auditing purposes, decoding and persisting of the extracted Call Data Records for applying rating, billing, reporting and feeding processed data to other sub systems specific to each telecoms platform. The number of CDR's and their size has exploded since mobility got an enormous boost in recent years. A new and non-traditional approach is needed to achieve these requirements. By combining QAFE-based architecture platform with the hardware power of the cutting edge Oracle Exalogic and the scaleable, flexible and highly performing DataStax enterprise, we achieved astonishing results. The solution not only allowed us to achieve industry standard requirements but also opened up the door for new requirements, realizing crucial functionality for the industry. Specifically the ability to perform analytics, reporting and KPI’s in a time period spanning several years. In the presentation, there will be focus on the combination of a private cloud solution (public clouds are not an option here due to legislation) and DataStax enterprise. A working benchmark can show performance numbers of the real time processing and the storage of big amounts of CDR’s from several sources.

Speakers
avatar for Rokesh Jankie

Rokesh Jankie

CTO, QAFE Inc.
Rokesh is the founder and Chief Technical Officer of QAFE. Rokesh obtained his MA degree in Computer Science at Leiden University, The Netherlands in 1998. His field of specialization was Algorithms and NP-complete problems. As scheduling problems can be NP-complete, this is the area he decided to tackle. After completing his study, Rokesh worked for the University of Leiden, ORTEC consultants, and Ponte Vecchio before joining Qualogy. It is at... Read More →
avatar for Hallo Khaznadar

Hallo Khaznadar

CAO (Chief Architecture Officer), QAFE Inc.
Hallo Khaznadar is Chief Architecture Officer of QAFE and joined Qualogy/Qafe in May 2010. After obtaining an MSc in computer engineering from the Baghdad University in 1997, Hallo became a researcher and developer of the off-chip hardware/software (PSD chip) at the Technical University of Delft in The Netherlands. Combining his hardware and software knowledge enabled Hallo to be integrally involved in the design and implementation of various... Read More →


Thursday October 17, 2013 2:30pm - 3:10pm
Frobisher 4 Barbican Centre, Silk Street, London EC2Y 8DS, UK

2:30pm

Building a CQL Driver
I’m not a database driver expert, I’m just a Ruby developer who really likes Cassandra, but I’ve written a CQL driver. It wasn’t hard, and it feels really good to now have an understanding of how my applications talk to Cassandra. To know what happens when I connect, prepare a statement, or how my data is encoded on its way to the database. When my CQL collections got big and I got weird results back, I could use my knowledge to quickly debug the issue. I’d like to give you an overview of the new native protocol from the driver perspective, maybe it can help you understand Cassandra the way it has helped me.

Speakers
avatar for Theo Hultberg

Theo Hultberg

Chief Architect, Burt
Theo Hultberg is Chief Architect at Burt, a Gothenburg based big data startup that wants to revolutionize the advertising industry by showing that there's more to ad analytics than clicks. He’s the co-organizer of his local Ruby, Scala, and distributed systems user groups, but his only claim to fame is that he has more reputation on Stack Overflow than Jeff and Joel combined.


Thursday October 17, 2013 2:30pm - 3:10pm
Frobisher 2 & 3 Barbican Centre, Silk Street, London EC2Y 8DS, UK

2:30pm

Playlists at Spotify - Using Cassandra to Store Version Controlled Objects
All systems at Spotify have to deal with huge amounts of data. Playlists in particular is a unique challenge. We need to store more than one billion playlists, and make them accessible for not only the playlist owner but also subscribers. Furthermore, we need to handle concurrent changes to collaborative playlists and offline scenarios. The devised solution treats every playlist as a versioned object. We use Cassandra to store these objects in an efficient way, allowing fast read- and write queries. The road there was not pain free however. I will talk about the data model we ended up using, and lessons learned along the way.

Speakers
avatar for Jimmy Mardell

Jimmy Mardell

Senior Software Engineer, Spotify
Jimmy has worked at Spotify for three years as a Software Engineer. He is currently working full time on the eco-system around Cassandra clusters at Spotify.


Thursday October 17, 2013 2:30pm - 3:10pm
Auditorium Barbican Centre, Silk Street, London EC2Y 8DS, UK

2:30pm

Stump the Experts with Al Tobey & Patrick McFadin
It's time to play "Stump the Experts", with Al Tobey, Open Source Mechanic at DataStax, and Patrick McFadin, Chief Evanglist at DataStax. Bring your urgent Cassandra questions to this session and have our expert panel answer them for you. Best question that stumps our experts receives a full Raspberry Pi kit.

Speakers
avatar for Patrick McFadin

Patrick McFadin

Chief Evangelist, DataStax
Prior to working for DataStax, Patrick was the Chief Architect at Hobsons, an education services company. His responsibilities included ensuring product availability and scaling for all higher education products. Prior to this position, he was the Director of Engineering at Hobsons which he came to after they acquired his company, Link-11 Systems, a software services company. While at Link-11 Systems, he built the first widely popular CRM... Read More →
avatar for Al Tobey

Al Tobey

Open Source Advocate, DataStax


Thursday October 17, 2013 2:30pm - 3:10pm
Frobisher 5 & 6 Barbican Centre, Silk Street, London EC2Y 8DS, UK

3:15pm

Afternoon Break
Afternoon Break

Thursday October 17, 2013 3:15pm - 3:45pm
Barbican Centre Silk Street, London EC2Y 8DS, UK

3:45pm

Using Cassandra in a Telco Storage System
Data volume grows and in telecommunication area it is painful to support and scale RDBMS systems. This presentation shows how we switched from SQL to NoSQL. This will be an overview of aspects: * how we model schema for call data using NoSQL vs SQL. * what hardware architecture we use * NoSQL vs SQL insert-select performance * how we store graph data using C*.

Speakers
avatar for Pavel Pontryagin

Pavel Pontryagin

Senior Engineer, Peter-Service
Pavel Pontryagin is a developer, analyst and DB administer; he in the telecommunications department at Peter-Service.


Thursday October 17, 2013 3:45pm - 4:25pm
Frobisher 4 Barbican Centre, Silk Street, London EC2Y 8DS, UK

3:45pm

Cassandra Internals
From the microsecond your request hits an Apache Cassandra node there are many code paths, threads and machines involved in storing or fetching your data. This talk will step through the common operations and highlight the code responsible. Apache Cassandra solves many interesting problems to provide a scalable, distributed, fault tolerant database. Cluster wide operations track node membership, direct requests and implement consistency guarantees. At the node level, the Log Structured storage engine provides high performance reads and writes. All of this is implemented in a Java code base that has greatly matured over the past few years. This talk will step through read and write requests, automatic processes and manual maintenance tasks. I'll discuss the general approach to solving the problem and drill down to the code responsible for implementation. Existing Cassandra users, those wanting to contribute to the project and people interested in Dynamo based systems will all benefit from this tour of the code base.

Speakers
avatar for Aaron Morton

Aaron Morton

Co-Founder & Principal Consultant, The Last Pickle
Aaron Morton is the Co Founder & Principal Consultant at The Last Pickle (thelastpickle.com). A professional services company that works with clients to deliver and improve Apache Cassandra based solutions. He's based in New Zealand, is an Apache Cassandra Committer and a DataStax MVP for Apache Cassandra.


Thursday October 17, 2013 3:45pm - 4:25pm
Auditorium Barbican Centre, Silk Street, London EC2Y 8DS, UK

3:45pm

From Simple CQL to Time-Series Event Tracking and Aggregation Using Cassandra and Hadoop
FINN.no's is a classifieds website and Norway's busiest website. This session will go through various product development where c* has shown to be the best choice, focusing on our primary c* use-case: our in-house tracking solution that's collects raw time-series data in c* and aggregates minute-by-minute it using hadoop into various new datasets from advert-centric statistics to user-centric behavioural analysis. I'll cover the final technical design chosen after a number of development iterations touching on technologies: scribe, thrift, kafka, hadoop, pig, mahout; the hurdles faced along the way, and the throughput and performance of today's systems.

Speakers
avatar for Mick Semb Wever

Mick Semb Wever

Programmer, FINN.no
Programmer at FINN.no, Norway's largest classifieds website, working on core platform systems. Also a committer for Apache Tiles.


Thursday October 17, 2013 3:45pm - 4:25pm
Frobisher 2 & 3 Barbican Centre, Silk Street, London EC2Y 8DS, UK

3:45pm

Hardware Agnostic: Cassandra 1.2 on Raspberry Pi
The raspberry Pi is a credit-card sized $25 ARM based linux box designed to teach children the basics of programming. The machine comes with a 700MHz ARM and 512Mb of memory and boots off a SD card, not much power for running the likes of a Cassandra cluster. This presentation will discuss the problems of getting Cassandra up and running on the Pi and will answer the all important question: Why on Earth would you want to do this!?

Speakers
avatar for Andy Cobley

Andy Cobley

Lecturer, University of Dundee
Andy Cobley is a lecturer at the University of Dundee in Big Data, Business Intelligence and Internet programming.


Thursday October 17, 2013 3:45pm - 4:25pm
Frobisher 5 & 6 Barbican Centre, Silk Street, London EC2Y 8DS, UK

4:30pm

Mixing Batch and Real-Time: Cassandra with Shark
Everything Cassandra does is designed for a real-time workload of high volume inserts and frequent small queries. Cassandra has Hadoop and Hive integration, but performing long running ad-hoc queries with these tools is difficult without impacting real-time performance or requires duplicate clusters. This talk will explain how I'm integrating Cassandra with Shark, a drop-in Hive replacement developed by Berkeley's AmpLab. It's designed to give fine grained control over all resource usage so you can safely run arbitrary ad-hoc queries on your existing cluster with controlled and predictable impact.

Speakers
avatar for Richard Low

Richard Low

Analytics Tech Lead, SwiftKey
Richard is responsible for the analytics infrastructure at SwiftKey. Previously, he worked at Acunu where he led the Cassandra and analytics team.


Thursday October 17, 2013 4:30pm - 5:10pm
Frobisher 5 & 6 Barbican Centre, Silk Street, London EC2Y 8DS, UK

4:30pm

Practice Makes Perfect: Extreme Cassandra Optimization
Ooyala has been using Apache Cassandra since version 0.4. Our data ingest volume has exploded since 0.4 and Cassandra has scaled along with us. Al will cover many topics from an operational perspective on how to manage, tune, and scale Cassandra in a production environment.

Speakers
avatar for Al Tobey

Al Tobey

Open Source Advocate, DataStax


Thursday October 17, 2013 4:30pm - 5:10pm
Auditorium Barbican Centre, Silk Street, London EC2Y 8DS, UK

4:30pm

Denormalizing Your Data: A Java Library to Support Structured Data in Cassandra
In this session you'll learn about the design and implementation of a new open source general-purpose Java library that supports storing structured data in Cassandra. Instead of mapping the data to multiple tables like an ORM would or embedding data using serialization, this approach decomposes structured data of arbitrary complexity into separate columns of simple values, allowing the data to be retrieved or updated in parts using hierarchical paths. Implementations are included for Cassandra using both the Thrift and CQL3 APIs. In addition, Eric's experiences are shared regarding the challenges of using CQL3 vs. Thrift for schema-less data.

Speakers
avatar for Eric Zoerner

Eric Zoerner

Senior Software Developer, eBuddy
Eric is a developer at eBuddy working on the XMS messaging platform with focus on data management and user discovery using Cassandra as the database. He previously worked for GemStone systems and authored the OQL query engine for GemFire data fabric.


Thursday October 17, 2013 4:30pm - 5:10pm
Frobisher 2 & 3 Barbican Centre, Silk Street, London EC2Y 8DS, UK

5:15pm

Evening Reception, '14 Cassandra MVPs Announced, Lightning Talks & Closing
Evening Reception, Lightning Talks & Wrap-Up

Thursday October 17, 2013 5:15pm - 6:15pm
Auditorium Barbican Centre, Silk Street, London EC2Y 8DS, UK
 
Friday, October 18
 

8:00am

Registration and Breakfast
Registration opens for Workshop attendees. Breakfast is served.

*IMPORTANT: Bringing a laptop is completely optional. You are equally welcome to sit back and experience the activities being demonstrated by the instructor.

If you'd like to follow along with the hands-on activities during the workshop on a computer, you'll want to bring a 64-bit laptop that has 8GB of RAM and supports VMware.

More specifically, to use the Cassandra virtual machine image that we will be distributing on USB drives at the start of the session, you'll need the VMware player for your operating system:


Friday October 18, 2013 8:00am - 9:00am
Barbican Centre Silk Street, London EC2Y 8DS, UK

9:00am

Apache Cassandra Developer Starter Kit
Workshop attendees will learn about the Apache Cassandra Developer Starter Kit.

*IMPORTANT: Bringing a laptop is completely optional. You are equally welcome to sit back and experience the activities being demonstrated by the instructor.

If you'd like to follow along with the hands-on activities during the workshop on a computer, you'll want to bring a 64-bit laptop that has 8GB of RAM and supports VMware.

More specifically, to use the Cassandra virtual machine image that we will be distributing on USB drives at the start of the session, you'll need the VMware player for your operating system:


Friday October 18, 2013 9:00am - 12:00pm
Barbican Centre Silk Street, London EC2Y 8DS, UK

12:00pm

Lunch
Friday October 18, 2013 12:00pm - 1:00pm
Barbican Centre Silk Street, London EC2Y 8DS, UK

1:00pm

Creating an Application in Cassandra
Workshop attendees will learn how to create an application in Cassandra.

*IMPORTANT: Bringing a laptop is completely optional. You are equally welcome to sit back and experience the activities being demonstrated by the instructor.

If you'd like to follow along with the hands-on activities during the workshop on a computer, you'll want to bring a 64-bit laptop that has 8GB of RAM and supports VMware.

More specifically, to use the Cassandra virtual machine image that we will be distributing on USB drives at the start of the session, you'll need the VMware player for your operating system:


Friday October 18, 2013 1:00pm - 5:00pm
Barbican Centre Silk Street, London EC2Y 8DS, UK