Apache Spark Skills
Apache Spark commands for distributed data processing, SQL queries, streaming, and large-scale analytics.
3 skills
Spark Performance Tuning
Advanced
Optimize Spark job performance — partitioning strategies, caching, broadcast joins, shuffle optimization, adaptive query execution, and diagnosing bottlenecks with Spark UI.
sparkperformancetuningoptimization+1
Works with:claude-code, cursor, copilot +3
View Skill
Spark SQL from the Command Line
Intermediate
Run SQL queries with spark-sql CLI — querying Parquet, CSV, and JSON files directly, creating temporary views, using Hive metastore, and building interactive data analysis workflows.
sparksqlcommandline+3
Works with:claude-code, cursor, copilot +3
View Skill
spark-submit Configuration Guide
Beginner
Configure spark-submit for optimal job execution — resource allocation, deploy modes, packages, configuration properties, and submitting applications to YARN, Kubernetes, and standalone clusters.
sparkspark-submitconfigurationguide+3
Works with:claude-code, cursor, copilot +3
View Skill