zuston's repos on GitHub
CSS · 12 人关注
lizi
The tool is to fetch github discussion as blog post and deploy to vercel or github page
Rust · 6 人关注
legacy-riffle
Rust based Apache Uniffle shuffle server (riffle)
Go · 2 人关注
AtcalMq
the message queue for ane trace big data, which serves for the maching learning prediction
Rust · 2 人关注
curvine
High performance distributed cache system. Built by Rust.
Java · 1 人关注
dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
0 人关注
advisor
Open-source implementation of Google Vizier for hyper parameters tuning
Java · 0 人关注
alluxio
Alluxio, formerly Tachyon, Unify Data at Memory Speed
Jupyter Notebook · 0 人关注
analytics-zoo
Distributed Tensorflow, Keras and BigDL on Apache Spark
0 人关注
angel
A Flexible and Powerful Parameter Server for large-scale machine learning
JavaScript · 0 人关注
AtCal
Ane Trace Calculate (AtCal), Jobs for large data analysis
Java · 0 人关注
BeanMapper
choose bean mapping framework
Rust · 0 人关注
blaze
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
0 人关注
butterfree
A tool for building feature stores.
0 人关注
byteps
A high performance and generic framework for distributed DNN training
Java · 0 人关注
ByteTCC
ByteTCC Transaction Manager旨在提供一个兼容JTA的基于TCC机制的分布式事务管理器。
0 人关注
caelus
Set of Kubernetes solutions for reusing idle resources of nodes by running extra batch jobs
0 人关注
candle
Minimalist ML framework for Rust
0 人关注
ceresdb
CeresDB is a high-performance, distributed, cloud native time-series database.
0 人关注
CLIC
旨在提供一个跨平台计算框架来统一异构软件系统
0 人关注
CloudShuffleService
Cloud Shuffle Service(CSS) is a general purpose remote shuffle solution for compute engines, including Spark/Flink/MapReduce.
0 人关注
comment
Only for blog comment
0 人关注
custom-op
Guide for building custom op for TensorFlow
0 人关注
datafusion
Apache DataFusion SQL Query Engine
Rust · 0 人关注
datafusion-distributed
Repo for donation of distributed DataFusion prototype - repo name will change
0 人关注
datafusion-postgres
Serving any JSON/CSN/Parquet/Arrow files like Postgres tables with Datafusion
Scala · 0 人关注
deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
0 人关注
direct-spark-sql
a hyper-optimized single-node(local) version of spark sql engine, which's fundamental data structure is scala Iterator rather than RDD.
Python · 0 人关注
estimator
TensorFlow Estimator
0 人关注
fedb
FEDB is a NewSQL optimised for Realtime Inference and Decisioning applications
Java · 0 人关注
Firestorm
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers
Java · 0 人关注
flink
Apache Flink
0 人关注
flink-native-k8s-operator
Flink native Kubernetes Operator is a java based control plane for running Apache Flink native application on Kubernetes.
0 人关注
flink-recommandSystem-demo
:helicopter::rocket:基于Flink实现的商品实时推荐系统。flink统计商品热度,放入redis缓存,分析日志信息,将画像标签和实时记录放入Hbase。在用户发起推荐请求后,根据用户画像重排序热度榜,并结合协同过滤和标签两个推荐模块为新生成的榜单的每一个产品添加关联产品,最后返回新的用户列表。
Java · 0 人关注
flinkx
基于flink的分布式同步工具
0 人关注
fluss
Apache Fluss is a streaming storage built for real-time analytics.
0 人关注
fluss-rust
Rust Client for Apache Fluss (Incubating)
0 人关注
fory
A blazingly fast multi-language serialization framework powered by JIT and zero-copy.
0 人关注
genie
Distributed Big Data Orchestration Service
Go · 0 人关注
go-jd
京东自动登录,在线商品自动下单
Go · 0 人关注
GoLab
go experiment
Java · 0 人关注
griffin
Mirror of Apache griffin
0 人关注
GrokkingStreamingSystems
The source code for this book: Grokking Streaming Systems: Real-time Event Processing (https://www.manning.com/books/grokking-streaming-systems).
Go · 0 人关注
guery
Distributed SQL query engine written in Go for big data
Java · 0 人关注
hadoop
Mirror of Apache Hadoop
0 人关注
hazelcast
Open-source distributed computation and storage platform
Rust · 0 人关注
hdrs
HDFS Native Client in Rust via HDFS C API libhdfs
Python · 0 人关注
hearthbreaker
A Hearthstone: Heroes of WarCraft Simulator for the purposes of Machine Learning and Data Mining
Java · 0 人关注
hive
Apache Hive
0 人关注
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
0 人关注
incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
0 人关注
incubator-hudi
Upserts, Deletes And Incremental Processing on Big Data.
Java · 0 人关注
incubator-uniffle
Uniffle is a high performance, general purpose Remote Shuffle Service.
0 人关注
InjectGUI
macOS Integrated Injection Framework (GUI version)
C++ · 0 人关注
io
Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO
JavaScript · 0 人关注
IQL
An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)
0 人关注
iraft
another raft protocol implementation by go programming language, just for learning raft
0 人关注
jmx_exporter
A process for exposing JMX Beans via HTTP for Prometheus consumption
Go · 0 人关注
juicefs
A distributed POSIX file system built on top of Redis and S3.
Java · 0 人关注
jvm-sandbox
Real - time non-invasive AOP framework container based on JVM
0 人关注
katib
Repository for hyperparameter tuning
0 人关注
koordinator
QoS based scheduling system for hybrid orchestration workloads on Kubernetes, bringing workloads the best layout and status.
0 人关注
kruise
Automated management of large-scale applications on Kubernetes (project under CNCF)
0 人关注
KungFu
KungFu: Making Training in Distributed Machine Learning Adaptive
PHP · 0 人关注
lazyphp-webDemo
the school work is based on the framework of lazyphp3
Java · 0 人关注
LinkedMatrix
Hadoop Task about LinkedMatrix from neo4j
Python · 0 人关注
LMCache
Redis for LLMs
0 人关注
logforth
A versatile and extensible logging implementation.