
Data Processing with Spark
Batch ETL processing on Airbnb London listings using PySpark DataFrame API: type conversion, regex cleaning of price fields, filtering and aggregations (max/unique, hosts/year). Focus on clear transformation steps and reproducible notebook/scripts.