Tantivy: An Open-Source Full-Text Search Engine Library

Hazem Abbas

Nov 22, 2022

Tantivy is a full-text search engine library written in the Rust programming language.

It is closer to Apache Lucene than to Elasticsearch or Apache Solr, in the sense that it is not an off-the-shelf search engine server but a crate that can be used to build one.

Features

Full-text search
Configurable tokenizer (stemming available for 17 Latin languages), with third-party support for Chinese (tantivy-jieba and cang-jie), Japanese (lindera, Vaporetto, and tantivy-tokenizer-tiny-segmenter), and Korean (lindera + lindera-ko-dic-builder)
Fast (check out the 🐎 ✨ benchmark ✨ 🐎)
Tiny startup time (<10ms), perfect for command-line tools
BM25 scoring (the same as Lucene)
Natural query language (e.g. (michael AND jackson) OR "king of pop")
Phrase query search (e.g. "michael jackson")
Incremental indexing
Multithreaded indexing (according to the project's README, indexing English Wikipedia takes under 3 minutes on the author's desktop)
Mmap directory
SIMD integer compression when the platform/CPU includes the SSE2 instruction set
Single-valued and multivalued u64, i64, and f64 fast fields (the equivalent of doc values in Lucene)
&[u8] fast fields
Text, i64, u64, f64, dates, and hierarchical facet fields
LZ4 compressed document store
Range queries
Faceted search
Configurable indexing (optional term frequency and position indexing)
JSON Field
Aggregation Collector: range buckets, average, and stats metrics
LogMergePolicy with deletes
Searcher Warmer API
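
To make the "BM25 scoring" item above more concrete, here is a minimal sketch of the textbook BM25 formula for a single query term, written in plain Rust with no dependencies. It is not Tantivy's code: the function names, the parameter names, and the constants K1 = 1.2 and B = 0.75 are assumptions chosen for illustration, and real implementations such as Tantivy and Lucene may differ in details like constant factors or how document length is stored.

```rust
// Illustrative BM25 sketch. All names and constants here are assumptions,
// not taken from Tantivy's source code.

/// Term-frequency saturation parameter (a commonly used default).
const K1: f64 = 1.2;
/// Document-length normalization parameter (a commonly used default).
const B: f64 = 0.75;

/// Inverse document frequency: terms that appear in fewer documents weigh more.
/// `doc_count` is the number of documents in the index,
/// `doc_freq` is the number of documents containing the term.
fn idf(doc_count: u64, doc_freq: u64) -> f64 {
    let n = doc_count as f64;
    let df = doc_freq as f64;
    (1.0 + (n - df + 0.5) / (df + 0.5)).ln()
}

/// BM25 score contribution of one query term for one document.
/// `term_freq` is how often the term occurs in the document,
/// `doc_len` is the document length in tokens,
/// `avg_doc_len` is the average document length across the index.
fn bm25_term_score(
    term_freq: u32,
    doc_len: u32,
    avg_doc_len: f64,
    doc_count: u64,
    doc_freq: u64,
) -> f64 {
    if term_freq == 0 {
        return 0.0;
    }
    let tf = term_freq as f64;
    // Longer-than-average documents are penalized, shorter ones are boosted.
    let length_norm = K1 * (1.0 - B + B * doc_len as f64 / avg_doc_len);
    idf(doc_count, doc_freq) * tf * (K1 + 1.0) / (tf + length_norm)
}
```

For a multi-term query, the per-term scores are summed for each document. With this sketch, calling bm25_term_score(1, 100, 100.0, 1000, 10) returns a higher score than bm25_term_score(1, 100, 100.0, 1000, 500), because a term found in 10 of 1,000 documents is more selective than one found in 500.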

License

The project is released under the MIT License.

Resources

Source code
