Stars
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
This is my Smart-home Installation repository
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A high-throughput and memory-efficient inference and serving engine for LLMs
ποΈ AI generated subtitles and segmented chapters for podcasts
π Awesome Data Catalogs and Observability Platforms.
Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
A collection of RBIR projects and posts for anyone interested in joining this journey.
A cloud native embedded storage engine built on object storage.
Limbo is a project to build the modern evolution of SQLite.
The native Rust implementation for Apache Hudi, with Python API bindings.
Learn Rust dark magics by implementing an expression framework in database systems
Let's build an OLAP database from scratch! π§ UNDER CONSTRUCTION π§
Techniques and numbers for estimating system's performance from first-principles
A DataFusion-powered Serverless S3 Proxy.
Chronos: Pretrained Models for Probabilistic Time Series Forecasting
tracking papers, datasets, and models of "large language model (LLM) for time series"
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Extensible SQL Lexer and Parser for Rust
Continuous profiling for analysis of CPU and memory usage, down to the line number and throughout time. Saving infrastructure cost, improving performance, and increasing reliability.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
1οΈβ£πποΈ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
Projects for an undergraduate OS course
π Tech blogs & talks by companies that run Apache Flink in production
What are the differences between the transaction isolation levels in databases? This is a suite of test cases which differentiate isolation levels.