Spark in Action

Spark in Action, Second Edition, by Jean-Georges Perrin. Available in PDF, EPUB, and Mobi formats for reading on a Kindle device, PC, phone, or tablet.

Spark in Action, Second Edition
Author: Jean-Georges Perrin
Publisher: Manning Publications
ISBN: 1617295523

Summary
The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, Second Edition, you’ll learn to take advantage of Spark’s core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning. Spark skills are a hot commodity in enterprises worldwide, and with Spark’s powerful and flexible Java APIs, you can reap all the benefits without first learning Scala or Hadoop. Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the technology
Analyzing enterprise data starts by reading, filtering, and merging files and streams from many sources. The Spark data processing engine handles this varied volume like a champ, delivering speeds 100 times faster than Hadoop systems. Thanks to SQL support, an intuitive interface, and a straightforward multilanguage API, you can use Spark without learning a complex new ecosystem.

About the book
Spark in Action, Second Edition, teaches you to create end-to-end analytics applications. In this entirely new book, you’ll learn from interesting Java-based examples, including a complete data pipeline for processing NASA satellite data. And you’ll discover Java, Python, and Scala code samples hosted on GitHub that you can explore and adapt, plus appendixes that give you a cheat sheet for installing tools and understanding Spark-specific terms.

What's inside
Writing Spark applications in Java
Spark application architecture
Ingestion through files, databases, streaming, and Elasticsearch
Querying distributed datasets with Spark SQL

About the reader
This book does not assume previous experience with Spark, Scala, or Hadoop.

About the author
Jean-Georges Perrin is an experienced data and software architect. He is France’s first IBM Champion and has been honored for 12 consecutive years.

Table of Contents

PART 1 - THE THEORY CRIPPLED BY AWESOME EXAMPLES
1 So, what is Spark, anyway?
2 Architecture and flow
3 The majestic role of the dataframe
4 Fundamentally lazy
5 Building a simple app for deployment
6 Deploying your simple app

PART 2 - INGESTION
7 Ingestion from files
8 Ingestion from databases
9 Advanced ingestion: finding data sources and building your own
10 Ingestion through structured streaming

PART 3 - TRANSFORMING YOUR DATA
11 Working with SQL
12 Transforming your data
13 Transforming entire documents
14 Extending transformations with user-defined functions
15 Aggregating your data

PART 4 - GOING FURTHER
16 Cache and checkpoint: Enhancing Spark’s performances
17 Exporting data and building full data pipelines
18 Exploring deployment

Spark in Action, Second Edition
Language: en
Pages: 576
Authors: Jean-Georges Perrin
Categories: Computers
Type: BOOK - Published: 2020-06-02 - Publisher: Manning Publications
Spark in Action
Language: en
Pages: 450
Authors: Petar Zecevic, Marko Bonaci
Categories: Computers
Type: BOOK - Published: 2016-08-28 - Publisher: Manning Publications
Working with big data can be complex and challenging, in part because of the multiple analysis frameworks and tools required. Apache Spark is a big data processing framework perfect for analyzing near-real-time streams and discovering historical patterns in batched data sets. But Spark goes much further than other frameworks. By
스파크를 다루는 기술 (The Art of Working with Spark): Spark in Action
Language: ko
Pages: 608
Authors: Petar Zecevic, Marko Bonaci
Categories: Computers
Type: BOOK - Published: 2018-06-27 - Publisher: (주)도서출판길벗
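Covers Spark's vast subject matter evenly and in depth! A thorough yet friendly guide to using Spark! This book covers all of the key material needed to understand and use Spark. Part 1 introduces Spark and its rich API, and Part 2 examines the components that make up Spark: Spark SQL, Spark Streaming, Spark MLlib, and Spark GraphX. Part 3 takes up the Spark standalone cluster, Hadoop's YARN cluster, and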
Потоковая обработка данных. Конвейер реального времени (Streaming Data Processing: The Real-Time Pipeline)
Language: ru
Pages:
Authors: Andrew G. Psaltis
Categories: Computers
Type: BOOK - Published: 2019-10-03 - Publisher: Litres
This idea-packed book will teach you to think about interacting effectively with fast data streams. It strikes an ideal balance between the big picture and implementation details. Through substantive examples and practical exercises, you will learn about designing applications that read, analyze, share, and store streaming data. Along the way, you will understand the role played by
Spark GraphX in Action
Language: en
Pages: 225
Authors: Michael Malak, Robin East
Categories: Computers
Type: BOOK - Published: 2016-05-01 - Publisher: Manning Publications
While graphs are often the most natural way to represent the connections among data, the complexity of large graphs makes them conceptually difficult and computationally expensive to explore, query, and analyze. GraphX, a powerful graph processing API for the Apache Spark analytics engine, makes it possible to efficiently explore and
Викинь мотлох із життя! Мистецтво прибирання, яке змінить вас назавжди (Throw the Clutter Out of Your Life! The Art of Tidying That Will Change You Forever)
Language: uk
Pages:
Authors: Marie Kondo
Categories: House & Home
Type: BOOK - Published: 2016-04-25 - Publisher: Family Leisure Club
Translated into 35 languages. More than 67 weeks at No. 1 on The New York Times bestseller list. More than 3,000,000 copies sold. Tired of ordinary tidying? With the KonMari method, you will tidy up once and for all! It does not come down to a set of rules for sorting and storing things. It is a guide to a way of thinking,
Disruptive Analytics
Language: en
Pages: 262
Authors: Thomas W. Dinsmore
Categories: Computers
Type: BOOK - Published: 2016-08-27 - Publisher: Apress
Learn all you need to know about seven key innovations disrupting business analytics today. These innovations—the open source business model, cloud analytics, the Hadoop ecosystem, Spark and in-memory analytics, streaming analytics, Deep Learning, and self-service analytics—are radically changing how businesses use data for competitive advantage. Taken together, they are disrupting
Apache Spark Deep Learning Cookbook
Language: en
Pages: 474
Authors: Ahmed Sherif, Amrith Ravindra
Categories: Computers
Type: BOOK - Published: 2018-07-13 - Publisher: Packt Publishing Ltd
A solution-based guide to put your deep learning models into production with the power of Apache Spark Key Features Discover practical recipes for distributed deep learning with Apache Spark Learn to use libraries such as Keras and TensorFlow Solve problems in order to train your deep learning models on Apache
Народжені бігати. Рух до безмежних можливостей (Born to Run: Movement Toward Limitless Possibilities)
Language: uk
Pages: 328
Authors: Christopher McDougall
Categories: Psychology
Type: BOOK - Published: 2019-04-26 - Publisher: Наш формат
Seeking to revive the art of movement, the author sets out from Harvard's science laboratories for North America, to the Tarahumara Indian people, whose members cover even an 80-kilometer marathon without rest. WHO THE BOOK IS FOR: A book for the widest range of readers, for everyone interested in sports, the psychology of running, ethnography, and adventure stories.
Spark GraphX in Action
Language: en
Pages: 280
Authors: Michael Malak, Robin East
Categories: Computer network protocols
Type: BOOK - Published: 2016 - Publisher:
Spark GraphX in Action starts out with an overview of Apache Spark and the GraphX graph processing API. This example-based tutorial then teaches you how to configure GraphX and how to use it interactively. Along the way, you'll collect practical techniques for enhancing applications and applying machine learning algorithms to