Course

Hands-On PySpark for Big Data Analysis

Packt

Updated Jan 20, 2020

Data is an incredible asset, especially when there are lots of it. Exploratory data analysis, business intelligence, and machine learning all depend on processing and analyzing Big Data at scale.

How do you go from working on prototypes on your local machine, to handling messy data in production and at scale?

This is a practical, hands-on course that shows you how to use Spark and its Python API to create performant analytics with large-scale data. Don't reinvent the wheel and wow your clients by building robust and responsible applications on Big Data.

Target Audience

This course is for developers, Data Scientists, Business Analysts or anyone who needs to reliably analyze large amounts of messy real-world data. Whether you’re tasked with creating your company's business intelligence function or creating great data platforms for your machine learning models or looking to use code to magnify the impact of your business, this course is for you. The only prerequisite is that you are familiar with basic Python and a desire to seek insight from Big Data.

Business Outcomes

Work with large amounts of data with agility using distributed datasets and in-memory caching
Source data from all popular data hosting platforms, including HDFS, Hive, JSON, and S3
Deploy Big Data analytics to production using PySpark’s easy to use API

Related learning

Big Data Analytics Using Apache SparkCourse ⋅ 840 mins

Big Data Processing using Apache SparkCourse ⋅ 60 mins

Azure Fundamentals: Big Data AnalyticsInteractive ⋅ 43 mins

Implement a Data Warehouse with Azure Synapse AnalyticsLink ⋅ 173 mins

Explore more technology skills

IT Software

Web Design and Development

Data & Analytics

Design and Animation

Gaming and Games Development

Devops, Networking and Security

Programming and Web Development

Computer Science and Engineering

Packt

GLOBAL

Packt is an exciting global IT content provider in the web development and emerging technology space. Founded in 2004 in Birmingham, UK, Packt's mission is to help the world put software to work in new ways, through the delivery of effective learning and information services to IT professionals. Packt’s content is developed for IT professionals, web developers, students and IT hobbyists who are looking to upskill or re-skill. With content that's been designed to be very hands on, as opposed to theory based, there is a clear focus on 'learning by doing’ - giving learners something to show at the end of each course! Through our partnership with Packt, Go1 Content Hub customers will now have access to the very latest AI, machine learning, data science and web development online courses.