How-to Guides
Installation
tantivy-py can be installed from PyPI using pip:
pip install tantivy
If no binary wheel is available for your operating system, the bindings will be built from source. This means that Rust must be installed before the build can succeed.
Note that the bindings use PyO3, which only supports Python 3.
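One way to confirm that the install succeeded is to ask Python for the version of the installed distribution. The following is a minimal sketch that uses only the standard library. The helper name installed_version is hypothetical, and the distribution name tantivy is taken from the pip command above.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # "tantivy" is the distribution name used in the pip command above.
    version = installed_version("tantivy")
    if version is None:
        print("tantivy is not installed in this environment")
    else:
        print(f"tantivy {version} is installed")
```

If the script reports that the package is missing, check that pip and the Python interpreter running the script belong to the same environment.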
Set up a development environment to work on tantivy-py itself
Setting up a development environment can be done in a virtual environment using nox, or using local packages using the provided Makefile.
For the nox setup, install the virtual environment and build the bindings using:
python3 -m pip install nox
nox
For the Makefile-based setup, run:
make
Running the tests is done using:
make test
The nox test session passes pytest arguments through. For example, to run only the tests that include "simple_search" in the test name, and only on Python 3.11:
nox -s test-3.11 -- -k simple_search
Doctests
Doctests are automatically enabled for all docstrings in the tantivy module. Here is a very basic introduction. Consider the following hypothetical Rust struct:
/// Tantivy's Document is the object that can be indexed and then searched for.
///
/// Documents are fundamentally a collection of unordered tuples
/// (field_name, value). In this list, one field may appear more than once.
///
/// Example:
/// >>> doc = tantivy.Document()
/// >>> doc.add_text("title", "The Old Man and the Sea")
/// >>> doc.add_text("body", ("He was an old man who fished alone in a "
/// ... "skiff in the Gulf Stream and he had gone "
/// ... "eighty-four days now without taking a fish."))
/// >>> doc
/// Document(body=[He was an ],title=[The Old Ma])
///
#[pyclass(module = "tantivy")]
#[derive(Clone, Default, PartialEq)]
pub(crate) struct Document {
    pub(crate) field_values: BTreeMap<String, Vec<Value>>,
}
When the tests are executed, pytest will automatically search all the docstrings for >>> and ... and execute the code in the docstring. The output of the code is compared to the text that follows the code. If the output matches, the test passes. If the output does not match, the test fails.
In the above example, a Tantivy document object is created, and then its representation is displayed. This representation, and indeed any output that typing the same code into an interactive Python session would produce, is compared to the text that follows. This is how doctests work.
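The same compare-the-output mechanism can be tried without building the bindings. The sketch below uses Python's standard doctest module directly rather than pytest, and runs it on a hypothetical shorten function that is not part of tantivy. The helper run_doctests is likewise only an illustration, not how the project's test suite is wired up.

```python
import doctest


def shorten(text, width=10):
    """Return the first `width` characters of `text`.

    >>> shorten("The Old Man and the Sea")
    'The Old Ma'
    >>> shorten("He was an old man who fished alone")
    'He was an '
    """
    return text[:width]


def run_doctests(obj):
    """Run the examples in obj's docstring; return (failed, attempted)."""
    parser = doctest.DocTestParser()
    # Collect every ">>>" example together with the expected output below it.
    test = parser.get_doctest(
        obj.__doc__, {obj.__name__: obj}, obj.__name__, None, 0
    )
    runner = doctest.DocTestRunner(verbose=False)
    # Execute each example and compare its actual output with the expected text.
    results = runner.run(test)
    return results.failed, results.attempted


if __name__ == "__main__":
    failed, attempted = run_doctests(shorten)
    print(f"{attempted} examples run, {failed} failed")
```

Editing either expected line in the docstring so that it no longer matches the real output makes that example count as failed, which is the same kind of mismatch a doctest run reports.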
Doctests are a great way to ensure that the documentation is accurate and up to date, so they are encouraged on every public interface that users will interact with. However, doctests are not suitable for coverage testing and other more advanced testing methods, so you must judge when to use them.
Working on tantivy-py documentation
Please be aware that this documentation is structured using the Diátaxis framework. In very simple terms, this framework will suggest the correct location for different kinds of documentation. Please make sure you gain a basic understanding of the goals of the framework before making large pull requests with new documentation.
This documentation uses the MkDocs framework. This package is specified as an optional dependency in the pyproject.toml file. To install all optional dev dependencies into your virtual env, run the following command:
pip install .[dev]
The MkDocs documentation itself is comprehensive. MkDocs provides some additional context and help around writing with Markdown.
If all you want to do is make a few edits right away, the documentation content is in the /docs directory and consists of Markdown files, which can be edited with any text editor.
The most efficient way to work is to run a MkDocs livereload server in the background. This will launch a local web server on your dev machine, serve the docs (by default at http://localhost:8000), and automatically reload the page after you save any changes to the documentation files.