Apache Spark
Submit Commands
Submit Spark applications to clusters with spark-submit. Configure deploy modes, resource allocation, dependencies, and cluster-manager-specific options so jobs get the resources they need.
9 commands
Pro Tips
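Use '--deploy-mode cluster' for production so the driver runs inside the cluster, close to the executors, and the job does not depend on the submitting machine's network link.
Pass Maven coordinates (groupId:artifactId:version, comma-separated) to '--packages' so spark-submit downloads them and their transitive dependencies automatically.
Use '--conf spark.dynamicAllocation.enabled=true' to scale the number of executors with the workload. Dynamic allocation also needs the external shuffle service ('spark.shuffle.service.enabled=true') or shuffle tracking ('spark.dynamicAllocation.shuffleTracking.enabled=true', Spark 3.0+).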
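The three tips above can be combined in a single spark-submit invocation. The following is a minimal sketch that only assembles and prints the command line, so it runs without a Spark installation. The helper name build_submit_command, the application path, the main class, and the choice of YARN as master are illustrative assumptions, not part of the original tips; the Kafka connector coordinate is just one example of a Maven coordinate.

```python
import shlex


def build_submit_command(app, main_class=None, packages=(),
                         dynamic_allocation=True, master="yarn"):
    """Assemble a spark-submit argument list. Nothing is executed here.

    'master="yarn"' is an assumption; use the master URL of your cluster.
    """
    cmd = ["spark-submit", "--master", master, "--deploy-mode", "cluster"]
    if main_class:
        cmd += ["--class", main_class]
    if packages:
        # --packages takes one comma-separated list of Maven coordinates.
        cmd += ["--packages", ",".join(packages)]
    if dynamic_allocation:
        cmd += [
            "--conf", "spark.dynamicAllocation.enabled=true",
            # Shuffle tracking (Spark 3.0+) is one way to satisfy the
            # shuffle requirement of dynamic allocation.
            "--conf", "spark.dynamicAllocation.shuffleTracking.enabled=true",
        ]
    cmd.append(app)
    return cmd


command = build_submit_command(
    "hdfs:///apps/example-job.jar",          # hypothetical application jar
    main_class="com.example.ExampleJob",     # hypothetical main class
    packages=["org.apache.spark:spark-sql-kafka-0-10_2.12:3.5.0"],
)
print(shlex.join(command))
```

To actually submit, the resulting list could be passed to subprocess.run on a machine where spark-submit is on the PATH.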
Common Mistakes
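In client mode, the driver runs on the machine that invokes spark-submit, so executors must be able to connect back to it. Make sure the cluster nodes can reach the driver host and its ports.
Don't set '--executor-memory' higher than the memory available on a node minus the executor memory overhead ('spark.executor.memoryOverhead', by default the larger of 384 MiB and 10% of executor memory for JVM jobs). Otherwise the cluster manager cannot place the executor.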
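The memory rule above comes down to simple arithmetic: executor memory plus overhead has to fit in what the node (or container) can offer. The sketch below assumes Spark's default overhead for JVM executors, the larger of 384 MiB and 10% of executor memory. The function names and the available_mb figure are hypothetical; the memory a node really offers depends on what the operating system and cluster manager reserve.

```python
MIN_OVERHEAD_MB = 384      # Spark's default minimum overhead
OVERHEAD_FACTOR = 0.10     # default factor for JVM executors (assumed here)


def overhead_mb(executor_memory_mb):
    """Default overhead: the larger of 384 MiB and 10% of executor memory."""
    return max(MIN_OVERHEAD_MB, int(executor_memory_mb * OVERHEAD_FACTOR))


def fits(executor_memory_mb, available_mb):
    """True if executor memory plus overhead fits in the available memory."""
    return executor_memory_mb + overhead_mb(executor_memory_mb) <= available_mb


def largest_executor_memory_mb(available_mb):
    """Largest --executor-memory value (in MiB) that still fits."""
    candidate = available_mb - MIN_OVERHEAD_MB
    while candidate > 0 and not fits(candidate, available_mb):
        candidate -= 1
    return max(candidate, 0)


available_mb = 16384  # hypothetical: memory one node can give to an executor
heap_mb = largest_executor_memory_mb(available_mb)
print(f"--executor-memory {heap_mb}m (overhead {overhead_mb(heap_mb)} MiB)")
```

If 'spark.executor.memoryOverhead' is set explicitly, or the job uses PySpark or off-heap memory, the overhead differs and the calculation should be adjusted accordingly.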