Big Data Days 2021

Online Edition

September 28-30

Online

Talks

Andrea Spina

CTO

Italy, Radicalbit

Talk

Development of a Kafka-Powered Advanced Stream Commerce Platform

Since the GoLive platform itself is built on top of Kafka, we will also highlight the advantages of using the same streaming platform for asynchronous communication between microservices and for real-time WebSocket messaging.

MLOps
Streaming
Kafka

Emily Gorcenski

Head of Data

Germany, ThoughtWorks

Talk

Using Service Level Objective Theory to Design Great Data Products

Drawing on Service Level Objective theory, we'll explore how to intentionally design effective, governable data products and how to move them into a state of automated data governance.

Reliability Engineering
Data Mesh
AI

Lidor Gerstel

DevOps / Cloud Architect

Israel, Centerity

Talk

Real Time Streaming Data from AWS MSK Kafka to Cloudera

This session covers a real use case the speaker implemented at a large medical company, using open-source tools to move data incrementally and in real time from a relational database to Cloudera. It includes a live demonstration of streaming events from Kafka and data from RDS to Cloudera with StreamSets Data Collector.

Hadoop
Databases
ETL
NoSQL
Scala

Timothy J Spann

Developer Advocate

US, StreamNative

Talk

Real-Time Streaming in Any and All Clouds, Hybrid and Beyond

Today, data is being generated by devices and containers living at the edge of networks, clouds and data centers. We need to run business logic, analytics and deep learning at scale and as events arrive.

Streaming
Flink
Pulsar
NiFi

Wojciech Gawroński

Cloud Architect / Co-founder

Poland, Pattern Match

Talk

Real-Time Streaming in Any and All Clouds, Hybrid and Beyond

Today, data is being generated by devices and containers living at the edge of networks, clouds and data centers. We need to run business logic, analytics and deep learning at scale and as events arrive.

Streaming
Flink
Kafka
NiFi

Юлия Рубцова

Solutions Architect

Russia, Data Monsters

Talk

Data Quality or Data Quantity: Which Matters More?

It may seem that in today's world you can simply load unstructured data into a neural network and it will learn everything it needs on its own. But is that really so? Can you get by with a couple of terabytes of data, or is less but better the way to go? We will look at tasks where different approaches to data apply.

Exploratory Data Analysis
Data Quality
Data Labeling

Jameel Nabbo

Cybersecurity Researcher

The Netherlands

Talk

Neural Networks on the Source Code

This research shows how machine learning and neural networks can be applied to the source code itself to find security flaws without building or executing it (that is, on non-compiled code).

ML on Source Code
Static Code Analysis
Compilers

Julien Genovese

Data Scientist

Italy, Data Reply

Talk

Graph Data Science: from Theory to Application

Using this theory, we tackle a range of social and interaction problems such as fraud detection, shortest-path search, and link prediction.

Graph Data Science
MLlib

Lukas Vileikis

Technical Evangelist

Lithuania, Severalnines

Talk

The Importance of Performance in Open Source Databases

In this talk we will go through the reasons why monitoring the performance of your open source databases is so important. Attendees will learn how to keep their open source databases running smoothly without compromising on security, performance, or availability.

Databases
MySQL
Performance
Security

Oliver Gindele

Chief Innovation Officer

Sweden, Datatonic

Talk

ML in Production – Serverless and Painless

In this session, Oliver will walk through some of the best serverless options for operationalizing ML pipelines within the TensorFlow ecosystem and on Google Cloud Platform, based on actual case studies.

MLOps
Serverless
Containers
TensorFlow
