This 50-hour course covers Big Data, PySpark, AWS, Scala, and web scraping. The primary focus is PySpark, the Python API for Apache Spark, which you will use to learn data analysis from the ground up. You'll work through end-to-end PySpark workflows, from cleaning data to building features and implementing machine learning models. The course combines practical explanations with live coding in PySpark, covering streaming data processing, machine learning applications, batch data handling, ETL pipelines, and both full-load and ongoing replication. You will also learn web scraping with Selenium and Scrapy, including the use of CSS selectors. A basic understanding of HTML tags, Python, SQL, and Node.js is required, but no prior knowledge of data scraping or Scala is needed.
Live Online
Price: £597.00
In-person