Big Data Days 2019

 8-10 октября   Москва

Мастер-классы

1

Эрнестас Сысоевас

DATA MINER, Литва

Узнайте, как Apache Hadoop справляется с ограничениями традиционных вычислений, помогает организациям решать реальные задачи и поддерживает новые типы аналитики больших данных. Мастер-класс посвящен введению в экосистему Apache Hadoop, а также созданию центра обработки данных и управлению Hadoop в production.
В стеке Apache Hadoop взаимодействует целый ряд компонентов. Поняв принцип действия каждого из них, вы получите лучшее представление о функциях Hadoop в вашей информационной среде. Мы выйдем за рамки мотивации Apache Hadoop и проанализируем распределенную файловую систему Hadoop (HDFS), MapReduce и общую топологию кластера Hadoop.

1

2

Valentina Djordjevic

Things Solver, Serbia

This workshop is dedicated to the machine learning techniques that can be used for anomaly detection. The session is organized in three phases, where each phase is more advanced and demanding than the previous one.
Phase 1 includes theoretical anomaly detection introduction and covering basic anomaly detection techniques like z-score and smoothed z-score for anomaly detection.
Phase 2 includes more advanced machine learning algorithms, able to work with multivariate datasets, like Isolation Forest and Elliptic Envelope.
Phase 3 includes using the Autoencoder neural network to detect anomalies in huge multivariate datasets.
The materials can be expanded or reduced in regards to the requirements, and can be a subject of adjustment.

2

3

Sriskandarajah Suhothayan

WSO2, Sri Lanka

There is an emerging need for efficiently building event-driven applications in the current microservices era, but the traditional message processing systems are falling behind as they are inflexible in adapting to the cloud. In this session, I will present Siddhi, which is a 100% open source stream processing system that provides an efficient way of implementing event-driven cloud-native applications that can run natively on Kubernetes, and integrate to various systems such as NATS, Kafka, email, and MongoDB. I will also do a hands-on session to showcase how you can build streaming data integration, streaming analytics, and machine learning based adaptive intelligence applications efficiently within minutes.

3