Commit Graph

62 Commits (adfdae0e64d2e8551a65532f697e8a855c497491)

Author SHA1 Message Date
Chris Tam adfdae0e64
Support passing dict to add JSON field (#158) 2023-11-21 20:44:21 +01:00
Caleb Hattingh 5d3d2790df
doc: enable doctests (#156) 2023-11-20 11:44:32 +10:00
Chris Tam eeaad34a98
Better support bytes, IPs, and JSON (#152) 2023-11-19 12:08:33 +01:00
Chris Tam cd8dd29097
Expose `is_valid_field_name()` to bindings (#151) 2023-11-14 13:44:32 +01:00
Chris Tam 2040463d52
Support lenient parser (#114) 2023-10-03 09:52:22 +02:00
Chris Tam 11f8bc4611
Update to tantivy v0.21.0 (#132) 2023-09-28 10:17:15 +02:00
Stone 4af7d7c45f
Adding delete_all_documents method for IndexWriter (#133) 2023-09-27 23:42:38 +02:00
Caleb Hattingh f164b0ef4a
Snippet generator (fixes #36 and #63) (#122)
Co-authored-by: Justin Greene <justin.greene@intouchsol.com>
Co-authored-by: Paul Masurel <paul@quickwit.io>
Co-authored-by: Pratyush Mittal <pratyushmittal@gmail.com>
Co-authored-by: mukeshsahnis <er.mks89@gmail.com>
2023-09-13 12:03:39 +02:00
Adam Reichold 76512f859b
Fine-tune handling of GIL for search (#124) 2023-09-11 17:58:17 +02:00
Caleb Hattingh 91a422b49e
Increase default writer heap (fixes #118) (#121) 2023-09-10 22:57:57 +02:00
Chris Tam 05dde2d232
Support pickling of some objects (#97) 2023-08-26 14:13:29 +02:00
Chris Tam 9f932aeebe
Add support for booleans in schemas and docs (#105) 2023-08-16 00:39:28 +02:00
Chris Tam bbcd1f2aef
Raise errors instead of unwrapping in document (#106) 2023-08-05 20:55:56 +02:00
Chris Tam 50809a186d
Clean up document.rs (#101) 2023-08-04 17:19:39 +02:00
Chris Tam 8b33e00c58
Support copy, deepcopy, eq on types (#99) 2023-08-04 09:23:31 +02:00
Chris Tam 35ed22e6d5
Expose IndexWriter::wait_merging_threads() (#100) 2023-07-22 21:57:30 +02:00
Chris Tam b377f570ef
Add schema validation to PyDict -> Document (#88)
* Add schema validation to PyDict -> Document

* Address comments

* Add documentation about new functionality
2023-07-22 00:13:03 +02:00
Sidhant Arora a266f41974
feat: tantivy_0.20.1_upgrade (#82)
* Added api changes from tantivy-0.20.1

* lint fix

* Increase test writer heap to 10_000_000

* Revert test back to original check

* Update src/searcher.rs

Co-authored-by: Cameron <561860+wallies@users.noreply.github.com>

---------

Co-authored-by: Caleb Hattingh <caleb.hattingh@gmail.com>
Co-authored-by: Cameron <561860+wallies@users.noreply.github.com>
2023-07-21 09:44:47 +10:00
Caleb Hattingh 8356af5410
Fix lint 2023-03-26 15:06:30 +02:00
Caleb Hattingh 2f65cc65ff
Include check for bytes in extract_value, fixes #72 2023-03-26 15:03:31 +02:00
Sidhant29 3971325326
updated add_float_field signature 2023-03-18 14:45:18 +10:00
Sidhant Arora 4ffdffe4d6
Merge branch 'master' into add_float_support 2023-03-16 15:11:30 +10:00
Phill Mell-Davies 164adc87e1
Tantivy 0.19.2 (#67)
* Adding __init__.py file to the tantivy folder to make maturin happy

Add Cargo.lock to the repo

Set the git-fetch-with-cli cargo flag so that we can override fetch settings

Renaming .cargo/config to .cargo/config.toml

Adding github-quiq-sh cargo registry

Point dependencies at our github-quiq-sh registry

Trying to resolve this build issue, pointing pyo3-build-config at our github-quiq-sh registry

SER-21487: Enable support for all standard Tantivy languages plus Chinese + Japanese in tantivy-py

SER-21487: Use uname rather than UNAME in the Makefile

SER-21487: Fix document date handling

SER-23013: Upgrade Tantivy and other dependencies

* Upgrade to Tantivy 0.19.1

* Apply rustfmt and fix bug when fast option = None

* Upgrade to tantivy-0.19.2

* Standardize around using 'cargo fmt' rather than 'rustfmt'

* Reverting to old style dependencies

* Linting with clippy

* Switching out hashmap for defining tokenizers for an array, and adding test for Spanish indexing

* Use cargo fmt instead of rustfmt on the Lint ci step
2023-02-14 13:20:59 +00:00
Sidhant29 94bfb0600c
Fixed typo 2023-01-17 11:01:43 +10:00
Sidhant29 caf52766be
Added add_float() function for Document 2023-01-17 10:48:18 +10:00
Sidhant29 75f2f0296d
Added float support 2023-01-17 10:43:39 +10:00
Shaobo 7efd477e0b Bump tantivy version 0.17.0 2022-04-27 11:41:08 +08:00
messense a9e75e5c16
Update to pyo3 0.15 (#40) 2022-01-17 10:16:08 +09:00
huishan 338ac950f1
Bump tantivy version 0.16.0 (#34)
* Bump version 0.14
* Bump version 0.15
* Bump version 0.16
2022-01-03 22:51:13 +09:00
Moriyoshi Koizumi 57f01aac7a Release GIL on searcher acquisition. 2021-05-01 15:37:33 +09:00
Paul Masurel c86f0fc1d6 Avoid truncating utf-8 strings in the middle of a codepoint in
`Document::__repr__`

Closes #24
2020-10-01 22:36:28 +09:00
Damir Jelić 7af1b68a45 tantivy: Merge our imports. 2020-09-20 10:19:29 +02:00
Damir Jelić 2b77bb4b0c index: Fix the formatting. 2020-09-19 15:31:48 +02:00
Damir Jelić 785e37fb06 query: Upgrade to Tantivy 0.13.1 so we don't need the query hack anymore. 2020-09-19 15:17:07 +02:00
Damir Jelić e91726e010 tantivy-py: Upgrade PyO3.
This removes our nightly requirement but sadly it adds a requirement for
things that are kept inside a Python class to be Send. Luckily for us
almost everything in Tantivy is Send, except for the Query trait.

This patch works around this by keeping the parser and query string
inside our python Query object. This sadly means that we are going to
parse the query string twice.
2020-09-19 11:48:25 +02:00
sourcepirate 20989628e4 updated tests for offset query 2020-09-07 20:06:16 +02:00
sourcepirate 42d29b5e94 changed the args position 2020-09-07 20:06:16 +02:00
sourcepirate e7b3e3527a formated the searcher.rs file 2020-09-05 23:58:59 +08:00
sourcepirate 89e1cabc58 support for offset to do record pagination 2020-09-05 23:51:54 +08:00
Damir Jelić dc5f21562d index: Format the file. 2020-04-19 13:43:22 +02:00
Damir Jelić b75f9d75b6 tantivy-py: Upgrade to PyO3 0.9. 2020-04-19 13:35:14 +02:00
Damir Jelić 094f8974ea searcher: Add support to search and order the results by a field. 2020-04-19 12:26:56 +02:00
Damir Jelić 32b6b67af4 README: Update the docs for the new simpler search API. 2020-01-05 17:22:58 +01:00
Damir Jelić 044b65321e index: Remove a redundant clone. 2020-01-05 16:00:02 +01:00
Damir Jelić 5c590ff157 Format the repo. 2020-01-05 15:59:43 +01:00
Damir Jelić a5ac1a5463 searcher: Remove the unused schema. 2020-01-05 14:08:57 +01:00
Damir Jelić 46be799248 Merge remote-tracking branch 'origin/search_api_simplification' 2020-01-05 13:52:35 +01:00
Damir Jelić 19219316a8 index: Fix some lint issues. 2020-01-05 12:39:10 +01:00
Damir Jelić f8e39a7b7f searcher: Remove the ability to order the search result.
Ordering the search result by a field requires the field to be set up to
support this at the index creation time. If it wasn't properly set up,
such a search would crash the Python interpreter.

Until a search returns an error that we can convert to a Python
exception this feature will unlikely be supported.
2019-12-17 23:17:44 +01:00
Damir Jelić cfa15a001d searcher: Use a search result struct. 2019-12-17 20:50:26 +01:00