Big Data Days 2020

24-26 ноября

Онлайн

Доклады

Dr. Christoph Zimmermann

European Open Source Thought Leader

Germany, Redis Labs

Talk

Redis: a Multi-Model DB for IoT and Beyond

An overview of Redis, an open-source multi-model NoSQL DB as a foundation for Big Data projects in the Internet of Things ecosystem and beyond.

Read more…

NoSQL
IoT
Redis

Mandy Chessell

IBM Distinguished Engineer, Master Inventor

UK, ODPi TSC & ODPi Egeria & IBM

Talk

Graph Processing for Open Metadata and Governance

Learn how ODPi Egeria uses its distributed virtual graph to connect metadata about an enterprise’s data and IT services from many different tools and then apply governance across this landscape.

Read more…

Metadata
Governance
Open Source

Valdas Maksimavicius

IT Architect Specializing in Data Analytics and Cloud Computing

Lithuania, Cognizant

Talk

Data Governance From an Engineering Perspective

In this talk, we will split data governance into into “bite-size” pieces. What are the tools, processes, and skill needed to build a compliant data platform?

Read more…

Metadata
Data Governance
Compliance

Timothy J Spann

Principal Field Engineer

USA, Cloudera

Talk

Introduction to FLaNK Stack

Introducing the FLaNK stack which combines Apache Flink, Apache NiFi, Apache Kafka and Apache Kudu to build fast applications for IoT, AI, rapid ingest.

Read more…

NiFi
Streaming
Kafka
Kudu
Flink

Ricardo Ferreira

Developer Advocate

USA, Elastic

Talk

Best Practices for Building Streaming Data Architectures

This talk will introduce the main building blocks for a streaming data architecture and how they can be put together to address business problems.

Read more…

Streaming Analytics
Kafka
Kinesis
Pulsar

Robin Moffatt

Senior Developer Advocate

UK, Confluent

Talk

Kafka as a Platform: the Ecosystem from the Ground Up

In this talk, we’ll look at the entire streaming platform provided by Apache Kafka and the Confluent community components. Starting with a lonely key-value pair, we’ll build up topics, partitioning, replication, and low-level Producer and Consumer APIs.

Read more…

Kafka
Streaming

Robin Moffatt

Senior Developer Advocate

UK, Confluent

Talk

Kafka as a Platform: the Ecosystem from the Ground Up

This talk will discuss the key design concepts within Kafka Connect and the pros and cons of standalone vs distributed deployment modes. We’ll do a live demo of building pipelines with Kafka Connect for streaming data in from databases, and out to targets including Elasticsearch.

Read more…

Kafka Connect
Data Integration
CDC

Audrey Lobo-Pulo

Founding Director

Australia, Phoensight

Talk

In the Shallow with AI

In this session we’ll briefly look at:
* The role of objective and subjective data, and how these influence data-driven insights;
* ‘Trans-contextual’ information and the implications for AI; 

Read more…

AI
AI ethics
Human-in-the-loop
socio-technology

Alex Sanginov

Data Monetization Evangelist

USA, ServiceNow

Talk

5 Pillars of User-Centric Analytics

Join the session to learn more and how to apply the successful analytics strategy to your own organization(s).

Read more…

Analytics Strategy
Analytics Workflows
AI

Einat Orr

CEO and Co-Founder of Data Lake Management Platform

Tel Aviv, Treeverse

Talk

Data Versioning – What Does it Mean?

In this talk we will go over the difference between these solutions by clustering them according to 4 main use cases:
1. Collaboration over data: enabling teams to collaborate over data over time, while contributing to the data evolution.
2. Managing ML pipelines: allowing pipeline management of ML projects, from model creation to production.

Read more…

Data Lake
Data Versioning

Carlos Manuel Duclos-Vergara

Software Engineer

Norway, Schibsted

Talk

Processing Billions of Events a Day Using Kafka and Kafka Streams

Designing a system to cope with loads of billions of events is harder than it seems. In this talk the presenter will go through the most common use cases and pitfalls and provide tips and good practices about how to design systems to avoid them.

Read more…

Kafka
Streaming
Schibsted

James Serra

Big Data/Data Warehouse Evangelist

USA, Microsoft

Talk

Azure Synapse Analytics Overview

James will talk about the new products and features that make up Azure Synapse Analytics and how it fits in a modern data warehouse, as well as provide demonstrations.

Read more…

Data Lake
Azure Synapse Analytics

Łukasz Osipiuk

Software Engineer

Poland, Starburstdata

Talk

Interactive BI Analytics with Presto

During the talk Lukasz and Karol will give a quick introduction to Presto in general, as well as some of its advanced features. They will show a demo, how it works in practice.

Read more…

Real Time BI
Data Federation
Delta Lake
Presto

Karol Sobczak

Software Engineer, Founding Member

Poland, Starburstdata

Łukasz Osipiuk

Software Engineer

Poland, Starburstdata

Karol Sobczak

Software Engineer, Founding Member

Poland, Starburstdata

Talk

Interactive BI Analytics with Presto

During the talk Lukasz and Karol will give a quick introduction to Presto in general, as well as some of its advanced features. They will show a demo, how it works in practice.

Read more…

Real Time BI
Data Federation
Delta Lake
Presto

« Hазад