Recent commits
[SPARK-56135][DOCS] Add option to use venv in PySpark docs
[SPARK-55875][UI] Switch SQL tab query listing to client-side DataTables
[SPARK-56128][K8S] Use Java 21-jre instead of 21 image in K8s Dockerfile
Apache Spark - A unified analytics engine for large-scale data processing
Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R (Deprecated), and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, pandas API on Spark for pandas workloads, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for stream processing.
[SPARK-56135][DOCS] Add option to use venv in PySpark docs
[SPARK-55875][UI] Switch SQL tab query listing to client-side DataTables
[SPARK-56128][K8S] Use Java 21-jre instead of 21 image in K8s Dockerfile
Spark is a unified analytics engine for large-scale data processing. It provides high-level APIs in Scala, Java, Python, and R (Deprecated), and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, pandas API on Spark for pandas workloads, MLlib for machine learning, GraphX for graph processing, and Structured Streaming for stream processing.
[SPARK-56135][DOCS] Add option to use venv in PySpark docs
[SPARK-55875][UI] Switch SQL tab query listing to client-side DataTables
[SPARK-56128][K8S] Use Java 21-jre instead of 21 image in K8s Dockerfile