Apache Spark with Scala (Barcelona) (1st edition)

Ticket sales have ended

Thank you very much for registering! We will contact you a few days before the course starts with instructions for setting up your laptop and for following the course (use of the GitHub repository, etc.). // The Habla Computing Team

Event information

Location

Espais Barcelona

C/ Legalitat 60-62

08024 Barcelona

Spain

Refund policy

Refunds are available up to 1 day before the event.

The Eventbrite fee is non-refundable.

Event description

Summary

This course offers an in-depth introduction to distributed programming with Apache Spark and Scala. It focuses on the fundamentals of Spark's computational model, explained through interactive examples. It also shows how to analyze program performance with the Spark UI and how to optimize programs through practical exercises.

The course will be taught in English.

Audience

  • Programmers with basic Scala knowledge interested in learning big data technologies.
  • Big data programmers interested in improving their skills; no previous Scala experience needed.

Goals

By the end of the course, attendees should be able to:

  • Understand Spark's computational model
  • Develop testable and modular ETL programs with Spark
  • Identify and resolve common problems in Spark, with a special focus on performance

Syllabus

  • Computational model I: Introduction
    • Transformations and actions
    • Distributed computing
    • Narrow vs wide transformations
    • Joins
    • Caching
  • Computational model II: Execution plan
    • Jobs, stages and tasks
    • Cluster managers: YARN, Standalone, Mesos
    • Driver and executors
    • Spark UI
  • APIs
    • Spark languages: Spark SQL, RDDs, ML, GraphX
    • Dataset: Statically typed
    • DataFrame: Dynamically typed
    • Datasets vs DataFrames
  • Reading and writing in Spark
    • Files: JSON, Parquet
    • Databases: JDBC, NoSQL
  • Patterns and antipatterns
    • How to avoid memory issues
    • Serialization issues
    • Caching issues
    • Tasks that never finish
    • Workflow patterns (ETL)
    • Testing patterns

Organization

  • Explanations during sessions will alternate between slides and live coding, with a preference for the latter. Small exercises and challenges will be proposed after each block of content and solved in the classroom.
  • After each session, attendees will be encouraged to complete a number of additional exercises on GitHub. Habla Computing trainers will provide feedback via code reviews to those who complete this coursework in due time. A chat room will also be made available to offer guidance and foster collaboration among course attendees.

About the instructors

The instructors are Alfonso Roa Redondo and Mikel San Vicente Maeztu. Alfonso and Mikel are functional programming architects at Habla Computing. They have been using Scala for more than 5 years in multiple domains such as fintech, predictive marketing, retail, logistics, and civil engineering. They also have extensive consulting experience with Akka, Spark, and other technologies in the Scala ecosystem. Alfonso is a co-organizer of Madrid's Scala Meetup (ScalaMad).

More information at this link:

https://www.hablapps.com/#spark-101-en

Can't attend this course but want to keep in touch? Join our newsletter:

http://eepurl.com/dgGDwX
