Go to file
Damir Jelić 46be799248 Merge remote-tracking branch 'origin/search_api_simplification' 2020-01-05 13:52:35 +01:00
src Merge remote-tracking branch 'origin/search_api_simplification' 2020-01-05 13:52:35 +01:00
tantivy CR: Adding trailing lines and removing some trailing spaces. 2019-08-30 07:37:20 +09:00
tests Merge remote-tracking branch 'origin/search_api_simplification' 2020-01-05 13:52:35 +01:00
.gitignore Fixing build. 2019-08-01 17:22:40 +09:00
.travis.yml travis: Add the cargo bin path to the PATH. 2019-08-08 15:14:45 +02:00
Cargo.toml Cargo.toml: Update the dependency versions. 2020-01-05 12:40:07 +01:00
LICENSE Initial python bindings implementation. 2019-06-04 11:09:58 +02:00
MANIFEST.in Initial python bindings implementation. 2019-06-04 11:09:58 +02:00
Makefile Initial python bindings implementation. 2019-06-04 11:09:58 +02:00
README.md Update README.md 2019-10-06 21:59:08 +01:00
pyproject.toml Initial python bindings implementation. 2019-06-04 11:09:58 +02:00
rust-toolchain rust-toolchain: Update the toolchain version. 2020-01-05 12:40:37 +01:00
rustfmt.toml Initial python bindings implementation. 2019-06-04 11:09:58 +02:00
tox.ini tantivy: Add a tox.ini file. 2019-08-08 14:22:23 +02:00

README.md

Build Status License: MIT

tantivy-py

Python bindings for tantivy.

Installation

The bindings can be installed using setuptools:

python3 setup.py install --user

Note that this requires setuptools-rust to be installed. Another thing to note is that the bindings are using PyO3, which requires rust nightly and only supports python3.

Usage

tantivy-py has a similar API to tantivy. To create a index first a schema needs to be built. After that documents can be added to the index and a reader can be created to search the index.

import tantivy

# Declaring our schema.
schema_builder = tantivy.SchemaBuilder()
schema_builder.add_text_field("title", stored=True)
schema_builder.add_text_field("body", stored=True)
schema = schema_builder.build()

# Creating our index (in memory, but filesystem is available too)
index = tantivy.Index(schema)


# Adding one document.
writer = index.writer()
writer.add_document(tantivy.Document(
    title=["The Old Man and the Sea"],
    body=["""He was an old man who fished alone in a skiff in the Gulf Stream and he had gone eighty-four days now without taking a fish."""],
))
# ... and committing
writer.commit()


# Reload the index to ensure it points to the last commit.
index.reload()
searcher = index.searcher()
query = index.parse_query("fish days", ["title", "body"])
top_docs = tantivy.TopDocs(3)

(best_score, best_doc_address) = searcher.search(query, top_docs)[0]
best_doc = searcher.doc(best_doc_address) 
assert best_doc["title"] == ["The Old Man and the Sea"]
print(best_doc)