The Big Match (incubator)

A very slow but cheap text search engine that guarantees no timeouts. Powered by Lucene queries.

Introduction

Someday you'll need a slow but cheap, timeout-free text search engine. Elasticsearch is one of the best tools on the market for fast search, but that speed comes at a cost. Big-Match is slow but cheap.

High level architecture

Example

Indexer

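import com.github.bigmatch.indexer.input.BigMatchTimeSeriesInputJob
import org.apache.hadoop.fs.Path    // assumption: Path is the Hadoop filesystem Path
import org.apache.spark.sql.Dataset
import org.joda.time.DateTime       // assumption: the original does not show which DateTime is used

// Indexable, the `spark` session and the Encoder needed by `.as[Tweet]` are assumed
// to be provided by Big-Match; their imports are not shown in the original.
case class Tweet(text: String, timestamp: Long, date: String) extends Indexable

// Indexing job that loads the tweets for a time window.
class TweetScheduler extends BigMatchTimeSeriesInputJob[Tweet] {
  // NOTE: the paths end in .csv but are read with the Parquet reader,
  // so the files must actually contain Parquet data.
  def input(start: DateTime, end: DateTime): Dataset[Tweet] =
    spark.read.parquet(s"/user/big-match/${start}.csv", s"/user/big-match/${end}.csv")
      .as[Tweet]

  def output: Path = new Path("/user/big-match/output")
}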

Description

Stack:

The core of Big-Match relies on:

Apache Spark
Apache Solr
