Dark Mode

Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

Apache Spark

Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

Here are 9,632 public repositories matching this topic...

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Mar 20, 2024
  • Python

flink learning blog. http://www.54tianzhisheng.cn/ Han Flink Ru Men , Gai Nian , Yuan Li , Shi Zhan , Xing Neng Diao You , Yuan Ma Jie Xi Deng Nei Rong . She Ji Flink Connector, Metrics, Library, DataStream API, Table API & SQL Deng Nei Rong De Xue Xi An Li ,Huan You Flink Luo Di Ying Yong De Da Xing Xiang Mu An Li (PVUV, Ri Zhi Cun Chu , Bai Yi Shu Ju Shi Shi Qu Zhong , Jian Kong Gao Jing )Fen Xiang . Huan Ying Da Jia Zhi Chi Wo De Zhuan Lan <>

  • Updated Mar 12, 2025
  • Java

Suite of tools for deploying and training deep learning models using the JVM. Highlights include model import for keras, tensorflow, and onnx/pytorch, a modular and tiny c++ library for running math code and a java based math library on top of the core c++ library. Also includes samediff: a pytorch/tensorflow like library for running deep learn...

  • Updated Feb 20, 2026
  • Java

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

  • Updated Feb 23, 2026
  • Jupyter Notebook

Created by Matei Zaharia

Released May 26, 2014

Followers
436 followers
Repository
apache/spark
Website
github.com/topics/spark
Wikipedia
Wikipedia

Related topics

hadoop scala