Commit Graph

59 Commits

Author SHA1 Message Date
Chris Tam
cd8dd29097
Expose is_valid_field_name() to bindings (#151) 2023-11-14 13:44:32 +01:00
Chris Tam
2040463d52
Support lenient parser (#114) 2023-10-03 09:52:22 +02:00
Chris Tam
11f8bc4611
Update to tantivy v0.21.0 (#132) 2023-09-28 10:17:15 +02:00
Stone
4af7d7c45f
Adding delete_all_documents method for IndexWriter (#133) 2023-09-27 23:42:38 +02:00
Caleb Hattingh
f164b0ef4a
Snippet generator (fixes #36 and #63) (#122)
Co-authored-by: Justin Greene <justin.greene@intouchsol.com>
Co-authored-by: Paul Masurel <paul@quickwit.io>
Co-authored-by: Pratyush Mittal <pratyushmittal@gmail.com>
Co-authored-by: mukeshsahnis <er.mks89@gmail.com>
2023-09-13 12:03:39 +02:00
Adam Reichold
76512f859b
Fine-tune handling of GIL for search (#124) 2023-09-11 17:58:17 +02:00
Caleb Hattingh
91a422b49e
Increase default writer heap (fixes #118) (#121) 2023-09-10 22:57:57 +02:00
Chris Tam
05dde2d232
Support pickling of some objects (#97) 2023-08-26 14:13:29 +02:00
Chris Tam
9f932aeebe
Add support for booleans in schemas and docs (#105) 2023-08-16 00:39:28 +02:00
Chris Tam
bbcd1f2aef
Raise errors instead of unwrapping in document (#106) 2023-08-05 20:55:56 +02:00
Chris Tam
50809a186d
Clean up document.rs (#101) 2023-08-04 17:19:39 +02:00
Chris Tam
8b33e00c58
Support copy, deepcopy, eq on types (#99) 2023-08-04 09:23:31 +02:00
Chris Tam
35ed22e6d5
Expose IndexWriter::wait_merging_threads() (#100) 2023-07-22 21:57:30 +02:00
Chris Tam
b377f570ef
Add schema validation to PyDict -> Document (#88)
* Add schema validation to PyDict -> Document

* Address comments

* Add documentation about new functionality
2023-07-22 00:13:03 +02:00
Sidhant Arora
a266f41974
feat: tantivy_0.20.1_upgrade (#82)
* Added api changes from tantivy-0.20.1

* lint fix

* Increase test writer heap to 10_000_000

* Revert test back to original check

* Update src/searcher.rs

Co-authored-by: Cameron <561860+wallies@users.noreply.github.com>

---------

Co-authored-by: Caleb Hattingh <caleb.hattingh@gmail.com>
Co-authored-by: Cameron <561860+wallies@users.noreply.github.com>
2023-07-21 09:44:47 +10:00
Caleb Hattingh
8356af5410
Fix lint 2023-03-26 15:06:30 +02:00
Caleb Hattingh
2f65cc65ff
Include check for bytes in extract_value, fixes #72 2023-03-26 15:03:31 +02:00
Sidhant29
3971325326
updated add_float_field signature 2023-03-18 14:45:18 +10:00
Sidhant Arora
4ffdffe4d6
Merge branch 'master' into add_float_support 2023-03-16 15:11:30 +10:00
Phill Mell-Davies
164adc87e1
Tantivy 0.19.2 (#67)
* Adding __init__.py file to the tantivy folder to make maturin happy

Add Cargo.lock to the repo

Set the git-fetch-with-cli cargo flag so that we can override fetch settings

Renaming .cargo/config to .cargo/config.toml

Adding github-quiq-sh cargo registry

Point dependencies at our github-quiq-sh registry

Trying to resolve this build issue, pointing pyo3-build-config at our github-quiq-sh registry

SER-21487: Enable support for all standard Tantivy languages plus Chinese + Japanese in tantivy-py

SER-21487: Use uname rather than UNAME in the Makefile

SER-21487: Fix document date handling

SER-23013: Upgrade Tantivy and other dependencies

* Upgrade to Tantivy 0.19.1

* Apply rustfmt and fix bug when fast option = None

* Upgrade to tantivy-0.19.2

* Standardize around using 'cargo fmt' rather than 'rustfmt'

* Reverting to old style dependencies

* Linting with clippy

* Switching out hashmap for defining tokenizers for an array, and adding test for Spanish indexing

* Use cargo fmt instead of rustfmt on the Lint ci step
2023-02-14 13:20:59 +00:00
Sidhant29
94bfb0600c
Fixed typo 2023-01-17 11:01:43 +10:00
Sidhant29
caf52766be
Added add_float() function for Document 2023-01-17 10:48:18 +10:00
Sidhant29
75f2f0296d
Added float support 2023-01-17 10:43:39 +10:00
Shaobo
7efd477e0b Bump tantivy version 0.17.0 2022-04-27 11:41:08 +08:00
messense
a9e75e5c16
Update to pyo3 0.15 (#40) 2022-01-17 10:16:08 +09:00
huishan
338ac950f1
Bump tantivy version 0.16.0 (#34)
* Bump version 0.14
* Bump version 0.15
* Bump version 0.16
2022-01-03 22:51:13 +09:00
Moriyoshi Koizumi
57f01aac7a Release GIL on searcher acquisition. 2021-05-01 15:37:33 +09:00
Paul Masurel
c86f0fc1d6 Avoid truncating utf-8 strings in the middle of a codepoint in
`Document::__repr__`

Closes #24
2020-10-01 22:36:28 +09:00
Damir Jelić
7af1b68a45 tantivy: Merge our imports. 2020-09-20 10:19:29 +02:00
Damir Jelić
2b77bb4b0c index: Fix the formatting. 2020-09-19 15:31:48 +02:00
Damir Jelić
785e37fb06 query: Upgrade to Tantivy 0.13.1 so we don't need the query hack anymore. 2020-09-19 15:17:07 +02:00
Damir Jelić
e91726e010 tantivy-py: Upgrade PyO3.
This removes our nightly requirement but sadly it adds a requirement for
things that are kept inside a Python class to be Send. Luckily for us
almost everything in Tantivy is Send, except for the Query trait.

This patch works around this by keeping the parser and query string
inside our python Query object. This sadly means that we are going to
parse the query string twice.
2020-09-19 11:48:25 +02:00
sourcepirate
20989628e4 updated tests for offset query 2020-09-07 20:06:16 +02:00
sourcepirate
42d29b5e94 changed the args position 2020-09-07 20:06:16 +02:00
sourcepirate
e7b3e3527a formated the searcher.rs file 2020-09-05 23:58:59 +08:00
sourcepirate
89e1cabc58 support for offset to do record pagination 2020-09-05 23:51:54 +08:00
Damir Jelić
dc5f21562d index: Format the file. 2020-04-19 13:43:22 +02:00
Damir Jelić
b75f9d75b6 tantivy-py: Upgrade to PyO3 0.9. 2020-04-19 13:35:14 +02:00
Damir Jelić
094f8974ea searcher: Add support to search and order the results by a field. 2020-04-19 12:26:56 +02:00
Damir Jelić
32b6b67af4 README: Update the docs for the new simpler search API. 2020-01-05 17:22:58 +01:00
Damir Jelić
044b65321e index: Remove a redundant clone. 2020-01-05 16:00:02 +01:00
Damir Jelić
5c590ff157 Format the repo. 2020-01-05 15:59:43 +01:00
Damir Jelić
a5ac1a5463 searcher: Remove the unused schema. 2020-01-05 14:08:57 +01:00
Damir Jelić
46be799248 Merge remote-tracking branch 'origin/search_api_simplification' 2020-01-05 13:52:35 +01:00
Damir Jelić
19219316a8 index: Fix some lint issues. 2020-01-05 12:39:10 +01:00
Damir Jelić
f8e39a7b7f searcher: Remove the ability to order the search result.
Ordering the search result by a field requires the field to be set up to
support this at the index creation time. If it wasn't properly set up,
such a search would crash the Python interpreter.

Until a search returns an error that we can convert to a Python
exception this feature will unlikely be supported.
2019-12-17 23:17:44 +01:00
Damir Jelić
cfa15a001d searcher: Use a search result struct. 2019-12-17 20:50:26 +01:00
Damir Jelić
d46417c220 searcher: Allow the search to be sorted by an unsigned field. 2019-12-17 20:50:26 +01:00
Damir Jelić
fbea6fe633 searcher: Remove the collector concept.
This patch removes the TopDocs collector class and adds a limit
argument on the search method.
2019-12-17 20:50:26 +01:00
Paul Masurel
41cdad4157 Using tantivy 0.11 2019-12-17 23:23:53 +09:00