Описание

Hadoop is mostly written in Java, but that doesn’t exclude the use of other programming languages with this distributed storage and processing framework, particularly Python. With this concise book, you’ll learn how to use Python with the Hadoop Distributed File System (HDFS), MapReduce, the Apache Pig platform and Pig Latin script, and the Apache Spark cluster-computing framework.

Authors Zachary Radtka and Donald Miner from the data science firm Miner & Kasch take you through the basic concepts behind Hadoop, MapReduce, Pig, and Spark. Then, through multiple examples and use cases, you’ll learn how to work with these technologies by applying various Python tools.

Use the Python library Snakebite to access HDFS programmatically from within Python applications
Write MapReduce jobs in Python with mrjob, the Python MapReduce library
Extend Pig Latin with user-defined functions (UDFs) in Python
Use the Spark Python API (PySpark) to write Spark programs with Python
Learn how to use the Luigi Python workflow scheduler to manage MapReduce jobs and Pig scripts

Zachary Radtka, a platform engineer at Miner & Kasch, has extensive experience creating custom analytics that run on petabyte-scale data sets.

Похожее

Бренд

O'Reilly Media, Inc.

O'Reilly Media spreads the knowledge of innovators through its books, online services, magazines, research, and conferences. Since 1978, O'Reilly has been a chronicler and catalyst of leading-edge development, homing in on the technology trends that really matter and galvanizing their adoption by amplifying "faint signals" from the alpha geeks who are creating the future. An active participant in the technology community, the company has a long history of advocacy, meme-making, and evangelism.

Рифко | RifCo™

Hadoop with Python

Описание

Похожее

Бренд

O'Reilly Media, Inc.

Похожее

Добавить комментарий Отменить ответ

Рифко | RifCo™

Описание

Похожее

Бренд

O'Reilly Media, Inc.

Похожие товары

Машинное обучение с использованием библиотеки Н2О

Машинное обучение (PDF)

Построение систем машинного обучения на языке Python

Глубокое обучение на Python

Похожее

Добавить комментарий Отменить ответ