Tantivy: Open source Full-Text Search Engine

Tantivy: Open source Full-Text Search Engine
Photo by Mark Eder / Unsplash

Tantivy is a full-text search engine library written in the Rust programming language.

It is closer to Apache Lucene than to Elasticsearch or Apache Solr in the sense it is not an off-the-shelf search engine server, but rather a crate that can be used to build such a search engine.

Features

  • Full-text search
  • Configurable tokenizer (stemming available for 17 Latin languages with third party support for Chinese (tantivy-jieba and cang-jie), Japanese (lindera, Vaporetto, and tantivy-tokenizer-tiny-segmenter) and Korean (lindera + lindera-ko-dic-builder)
  • Fast (check out the 🐎 ✨ benchmark ✨ 🐎)
  • Tiny startup time (<10ms), perfect for command-line tools
  • BM25 scoring (the same as Lucene)
  • Natural query language (e.g. (michael AND jackson) OR "king of pop")
  • Phrase queries search (e.g. "michael jackson")
  • Incremental indexing
  • Multithreaded indexing (indexing English Wikipedia takes < 3 minutes on my desktop)
  • Mmap directory
  • SIMD integer compression when the platform/CPU includes the SSE2 instruction set
  • Single valued and multivalued u64, i64, and f64 fast fields (equivalent of doc values in Lucene)
  • &[u8] fast fields
  • Text, i64, u64, f64, dates, and hierarchical facet fields
  • LZ4 compressed document store
  • Range queries
  • Faceted search
  • Configurable indexing (optional term frequency and position indexing)
  • JSON Field
  • Aggregation Collector: range buckets, average, and stats metrics
  • LogMergePolicy with deletes
  • Searcher Warmer API

License

The project is released under the MIT Language.

Resources

Read more

How AI-Powered Documentation Is Reducing Administrative Burden in Healthcare

How AI-Powered Documentation Is Reducing Administrative Burden in Healthcare

Healthcare organizations continue to face growing administrative demands as patient volumes increase and regulatory requirements become more complex. This challenge affects healthcare providers across many specialties and locations. For instance, the Colorado Behavioral Health Administration (BHA) laws and rules establish the regulatory framework for behavioral health providers. These rules cover

By Hazem Abbas