Tantivy is a lightweight full-text search engine

Hamza Musa

11 May 2022 — 1 min read

Tantivy is a full-text search engine library written in Rust.

It is closer to Apache Lucene than to Elasticsearch or Apache Solr, in the sense that it is not an off-the-shelf search engine server but a crate that can be used to build one.

Tantivy is, in fact, strongly inspired by Lucene's design.

If you are looking for an alternative to Elasticsearch or Apache Solr, check out Quickwit, a search engine built on top of Tantivy.

Features

Full-text search
Configurable tokenizer (stemming available for 17 Latin languages), with third-party support for Chinese (tantivy-jieba and cang-jie), Japanese (lindera, Vaporetto, and tantivy-tokenizer-tiny-segmenter), and Korean (lindera + lindera-ko-dic-builder)
Fast (check out the 🐎 ✨ benchmark ✨ 🐎)
Tiny startup time (<10ms), perfect for command-line tools
BM25 scoring (the same as Lucene)
Natural query language (e.g. (michael AND jackson) OR "king of pop")
Phrase query search (e.g. "michael jackson")
Incremental indexing
Multithreaded indexing (indexing English Wikipedia reportedly takes < 3 minutes on a desktop machine)
Mmap directory
SIMD integer compression when the platform/CPU includes the SSE2 instruction set
Single valued and multivalued u64, i64, and f64 fast fields (equivalent of doc values in Lucene)
&[u8] fast fields
Text, i64, u64, f64, dates, and hierarchical facet fields
LZ4 compressed document store
Range queries
Faceted search
Configurable indexing (optional term frequency and position indexing)
JSON Field
Aggregation Collector: range buckets, average, and stats metrics
LogMergePolicy with deletes
Searcher Warmer API
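
To make the "BM25 scoring" item above more concrete, here is a minimal sketch of the textbook BM25 formula in plain Python. It does not use Tantivy or reproduce its internals: the whitespace tokenizer, the function names, the sample documents, and the parameter values k1 = 1.2 and b = 0.75 are all assumptions chosen for illustration.

```python
import math
from collections import Counter

# Commonly used BM25 defaults; assumed here for illustration only.
K1 = 1.2
B = 0.75


def tokenize(text):
    """Hypothetical tokenizer: lowercase and split on whitespace."""
    return text.lower().split()


def bm25_scores(query, docs, k1=K1, b=B):
    """Return one BM25 score per document for the given query.

    Assumes `docs` is non-empty and contains at least one token overall.
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(tokens) for tokens in tokenized) / n_docs

    # Document frequency: in how many documents each term appears.
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        term_freq = Counter(tokens)
        # Length normalization: longer documents are penalized when b > 0.
        norm = k1 * (1 - b + b * len(tokens) / avg_len)
        score = 0.0
        for term in tokenize(query):
            tf = term_freq.get(term, 0)
            if tf == 0:
                continue
            # Rare terms get a higher inverse document frequency.
            idf = math.log(1 + (n_docs - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5))
            score += idf * tf * (k1 + 1) / (tf + norm)
        scores.append(score)
    return scores


docs = [
    "michael jackson king of pop",
    "michael jordan played basketball",
    "the history of pop music",
]
scores = bm25_scores("michael jackson", docs)
```

With these sample documents, the first one matches both query terms and ranks highest, the second matches only "michael", and the third matches nothing and scores zero. A real engine computes the same kind of score from an inverted index rather than by scanning every document.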

Supported programming languages

Rust
Python
Ruby

License

Tantivy is released under the MIT License.

Resources

https://github.com/quickwit-oss/tantivy
