one day, many perspectives, millions of new ideas

4. Data Science Day – Registration & Schedule

I am happy to announce that the registration for our 4th Data Science Day (DSDay) on “Data Driven Product Development” just started!


UPDATE: We are booked out! Pls. send me an email (klaas.bollhoefer(at) if you want to be put on the waiting list!

Our schedule this time (really awesome!!!!):

9:00 Doors open

9:45 – 10:00
Klaas Bollhoefer, The unbelievable Machine Company & Thomas Kwiatkowski, ImmobilienScout24: “Welcome!”

10:00 – 10:30 
Prof. Gitta Kutyniok, TU Ber
lin: “Tackling the Data Deluge with Mathematics”
Mathematics has developed various approaches to deal with the data deluge we face these days. In this talk, we will present some of these novel methodologies, for instance, compressed sensing and discuss their impact and future potential.

10:30 – 11:00
Jean Paul Schmetz, Chief Scientist of Hubert Burda Media: “Open Data Startups”

11:00 – 11:30
Jan-Arne Schwarz, ImmobilienScout24: “MarketNavigator: Real Estate Market Data via the Web.”

11:30 – 12:00 Coffee Break

12:00 – 12:30
Sebastian Küpers, The Otherland Group: “A different perspective: creating products out of data”
Sebastian is the co-founder of The Otherland Group. His vision is to make your world a smarter place with the help of the ‘context engine’ he develops. He will talk about the challenging task to understand a users context and the semantic of data to build great products out of data.

12:30 – 1:00
Karlheinz Pischke, Data Direct Networks (DDN): “Modernizing Hadoop Architecture For Superior Scalability, Efficiency & Productive Throughput”

In this presentation,  the presenter will discuss the limitations of a traditional Apache™ Hadoop® architectures that are today built on commodity compute with Direct Attached Storage [DAS]. The presentation reviews the design imperatives of DataDirect Networks’ hScaler Apache Hadoop appliance architecture and how the appliance has been engineered to eliminate the limitations that plague today’s purely commodity approaches.  Specific focus will be paid to DDN’s Storage Fusion Architecture and a unique approach to balancing high-throughput external storage with RDMA-enabled compute resources to drive up to 700% performance gains.  Finally, the presenter will review a case study at PayPal, where high-throughput storage is used to accelerate Hadoop-based real-time fraud detection.

1:00 – 1:30 Lunch Break

1:30 – 2:00
Felix Marczinowksi, Blue Yonder: “Predictive analytics in retail on the large scale”
How do two dozen physicists help you shopping online? Using a large online and mail order retailer as an example, I show the versatility of our approach, the challenges we faced and the solutions we implemented to deliver daily sales forecasts for millions of items.

2:00 – 2:30
Jörg Blumtritt, Datarella: “In the future we won’t be advertising to humans any more.”
With the rise of wearable computers like Google Glass or smart agents with near human language interfaces like Siri the Internet can hardly be viewed as mere “virtual reality” anymore, it becomes more and more “augmented reality” – like an additional dimension of our body. Machine-to-machine communication is of rising influence for our lives: “the Internet of Things” of Self-driving cars, smart-metering, etc. connects our appliences with the Net’s data. Many services and devices act as gatekeepers for information: recommandation through engines or our social media connections build a filter that literarily surround us like a bubble. Advertising is of course particularily vulnerable by these filters. Thus classic media placements will less likely be able to contact us. Instead of using advertising psychology, advertising have to hack into our personal agents’ algorithms. This will be very similar to today’s search engine optimization. Promoting new goods would be trying to get into the high ranks of as many avatars’ preferences as possible. Of course, continuous business would only be sustained, if the product would be judged satisfying by our avatar when taken into consideration.

2:30 – 2:45
Klaas Bollhöfer, The unbelievable Machine Company: “Wrap-up & Workshop planning”

2:45 – 3:15 Coffee Break

3:15 – 5:00 Workshop Sessions

5:00 – 5:30
Special talk by Michael Hausenblas, MapR: “All you need to know about Apache Drill”

Apache Drill – – is a distributed system for interactive ad-hoc analysis of large-scale datasets, inspired by Google’s Dremel technology. It is a design goal to scale to 10,000 servers or more and to be able to process Petabytes of data and trillions of records in seconds. In the talk we focus on Apache Drill’s architecture and extensibility points, including query languages and back-ends as well as a live demo of the system.

From 5:30 – open end
Get-together with beer, pizza & lots of time to meet, share ideas & talk about the day!

A big thanks to our sponsors:

ImmobilienScout24, Exasol, DDN, MapR & The unbelievable Machine Company GmbH (*um)

Enough food & drinks available all day. Talks and presentations are in english!

I am also very happy that DSDay is part of Big Data Week 2013 – “an international festival of data”! We managed to have several other events in Berlin that week – really, really great girls & guys! See all details here:



Post comment

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s


This entry was posted on March 28, 2013 by .

Partnering with:

Big Data Week

Organized by:

Zalando AG

Sponsors & supporters:


Media partner (so far)

%d bloggers like this: