Course Details
Scaling Data Analysis with Python and Dask Training Course
Category: DATA ANALYSIS TRAINING
About This Course
Dask is a flexible, high-performance Python library for parallel computing. It scales and accelerates big data processing alongside other Python data science libraries such as pandas, NumPy, and scikit-learn.

This instructor-led, live training (online or onsite) is aimed at data scientists and software engineers who wish to use Dask with the Python ecosystem to process and analyze large datasets at scale.

By the end of this training, participants will be able to:
- Set up the environment needed to build big data processing workflows with Dask and Python.
- Explore the features, libraries, tools, and APIs available in Dask.
- Understand how Dask accelerates parallel computing in Python.
- Scale the Python ecosystem (NumPy, SciPy, and pandas) using Dask.
- Optimize the Dask environment to maintain high performance when handling large datasets.

Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.

Course Customization Options
- To request customized training for this course, please contact us.
- This course is available as onsite live training in the USA or as online live training.
Duration: 14 hours
Venue: Online/Onsite