@stanford-futuredata

Future Data Systems

We are a CS research group building data-intensive systems

Popular repositories

  1. ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    Python 3.1k 388

  2. macrobase Public

    MacroBase: A Search Engine for Fast Data

    Java 661 126

  3. ARES Public

    Automated Evaluation of RAG Systems

    Python 486 53

  4. noscope Public

    Accelerating network inference over video

    Python 437 122

  5. sparser Public

    Sparser: Raw Filtering for Faster Analytics over Raw Data

    C 432 55

  6. dawn-bench-entries Public

    DAWNBench: An End-to-End Deep Learning Benchmark and Competition

    Python 262 74

Repositories

Showing 10 of 69 repositories
  • ColBERT Public

    ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

    Python 3,082 MIT 388 77 17 Updated Nov 18, 2024
  • ARES Public

    Automated Evaluation of RAG Systems

    Python 486 Apache-2.0 53 10 2 Updated Nov 4, 2024
  • FrugalGPT Public

    FrugalGPT: better quality and lower cost for LLM applications

    Jupyter Notebook 185 Apache-2.0 21 3 0 Updated Sep 19, 2024
  • stk Public
    Python 88 Apache-2.0 20 2 1 Updated Aug 26, 2024
  • gavel Public

    Code for "Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads", which appeared at OSDI 2020

    Jupyter Notebook 125 MIT 31 8 2 Updated Jul 25, 2024
  • InQuest Public

    Accelerating Aggregation Queries on Unstructured Streams of Data

    Python 7 2 1 0 Updated Apr 18, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 33 2,444 0 2 Updated Jan 19, 2024
  • tasti Public

    Semantic Indexes for Machine Learning-based Queries over Unstructured Data (SIGMOD 2022)

    Python 15 5 0 0 Updated Jan 17, 2024
  • omg Public
    Python 20 Apache-2.0 3 0 0 Updated Sep 20, 2023
  • abae Public

    Accelerating Approximate Aggregation Queries with Expensive Predicates (VLDB 21)

    Python 3 1 0 0 Updated Sep 20, 2023