Big Data Days 2019

 October 8-10   Moscow

Workshops List

No1

Ernestas Sysojevas

DATA MINER, Lithuania

Learn how Apache Hadoop addresses the limitations of traditional computing, helps businesses overcome real challenges, and powers new types of big data analytics. This workshop introduces Apache Hadoop ecosystem and outlines how to prepare the data center and manage Hadoop in production.
There are many components working together in the Apache Hadoop stack. By understanding how each functions, you gain more insight into Hadoop’s functionality in your own IT environment. We will go beyond the motivation for Apache Hadoop and will dissect the Hadoop Distributed File System (HDFS), MapReduce, and the general topology of a Hadoop cluster.

No1

No2

Valentina Djordjevic

Things Solver, Serbia

This workshop is dedicated to the machine learning techniques that can be used for anomaly detection. The session is organized in three phases, where each phase is more advanced and demanding than the previous one.
Phase 1 includes theoretical anomaly detection introduction and covering basic anomaly detection techniques like z-score and smoothed z-score for anomaly detection.
Phase 2 includes more advanced machine learning algorithms, able to work with multivariate datasets, like Isolation Forest and Elliptic Envelope.
Phase 3 includes using the Autoencoder neural network to detect anomalies in huge multivariate datasets.

No2

No3

There is an emerging need for efficiently building event-driven applications in the current microservices era, but the traditional message processing systems are falling behind as they are inflexible in adapting to the cloud. In this session, I will present Siddhi, which is a 100% open source stream processing system that provides an efficient way of implementing event-driven cloud-native applications that can run natively on Kubernetes, and integrate to various systems such as NATS, Kafka, email, and MongoDB. I will also do a hands-on session to showcase how you can build streaming data integration, streaming analytics, and machine learning based adaptive intelligence applications efficiently within minutes.

No3

No4

Mikhail Petrov

Счетная палата Российской Федерации, Russia

Many companies now are asking the questions “what is digital transformation?” “do we need it?” “if we need – how to approach it, how to start?”
Company’s culture of working with data is one of the most important success factors.
How to change the culture and to step on the transformation path? It is necessary to identify correctly the key steps, recruit a team of changes, find “agents of transformation”, identify “quick wins”, support all things with the motivation and training plan, determine the sources of data, technologies and products.

No4

No5

Pavlov Dmitriy

Arenadata, Russia

Andrey Konyaev

Arenadata, Russia

The last few years in the sphere of data warehouses (DWH) are best described by one phrase: the game has changed. In contrast to the mono-vendor solutions of the past, the modern data landscape is not represented by a single silver bullet system, or even several systems from one vendor. A business that wants to gain a competitive advantage from the available data is forced to use dozens, if not hundreds, of various components and systems, each effectively solves its narrow task.

At the same time, there is a growing tendency to abandon vendor lock-in solutions – now companies are increasingly choosing open source solutions. This allows them to diversify the risks of contractors and vendors, simultaneously opening the door to accumulate internal expertise on technologies with their subsequent independent support.

Another trend is also becoming more noticeable – more and more companies are choosing clouds instead of their own capacities for infrastructure. A few years ago these were mostly private installations, now there is an advantage in using public ones. Each cloud provider carries its own virtualization technologies, networks and other specifics.

No5