Tools related to big data and machine learning
Apache Superset (incubating) is a modern, enterprise-ready business…
Metabase is an open source business intelligence tool. It lets you ask…
This is the Helm chart for the Spark-on-Kubernetes Operator. Spark Operator is…
Zeppelin, a web-based notebook that enables interactive data analytics. You can…
Dask.distributed. Dask.distributed is a lightweight library for distributed…
JasperReports Server is a stand-alone and embeddable reporting server. It…
Dask is a flexible parallel computing library for analytics. See documentation…
Spring Cloud Data Flow is a toolkit for building data integration and real-time…
Kubed is a data visualization DSL embedded within the Kotlin programming…
TensorFlow Serving is a flexible, high-performance serving system for machine…
Horovod is a distributed training framework for TensorFlow, and it's provided…
Hadoop is a framework for running large scale distributed applications. The…
TensorFlow is an open source software library for high-performance numerical…
TensorFlowâ„¢ is an open source software library for high-performance numerical…
Pachyderm is a language-agnostic and cloud infrastructure-agnostic large-scale…
Spark is a fast and general cluster computing system for Big Data. It provides…
Kanister is a framework that enables application-level data management on…
The sole purpose of Parse was to demystify the process of backend development.…
Tell us about a new Kubernetes application
Never miss a thing! Sign up for our newsletter to stay updated.
Discover and learn about everything Kubernetes