Starbeamrainbowlabs
1abcd96699
Remove stray debug statement
2019-12-23 22:02:41 +00:00
Starbeamrainbowlabs
456f749ffe
Bugfix: Squash bug in new array_simple search optimisation
2019-12-23 21:58:23 +00:00
Starbeamrainbowlabs
e4eee4e281
Fix comment typo
2019-12-15 22:38:44 +00:00
Starbeamrainbowlabs
6d675fc783
Bugfix: Add missing apostrophes in stop words
2019-12-15 20:21:05 +00:00
Starbeamrainbowlabs
6f4b1a62e9
Fix + weighted word support on stas-parse action
2019-12-15 20:03:04 +00:00
Starbeamrainbowlabs
c80f26962e
Refactor stas_split to be more fasterererer
...
Informal testing shows that it's gone from taking ~18% of the total time
to ~4% of the total time :D
2019-12-15 17:56:56 +00:00
Starbeamrainbowlabs
843f0f7ee9
Update comment
2019-12-10 01:13:51 +00:00
Starbeamrainbowlabs
d53f0ed85a
Remove search::transliterate, as it has a hgue performance overhead.
...
Use search::$literator->transliterate() directly instead.
2019-12-08 21:04:59 +00:00
Starbeamrainbowlabs
8156055b5c
Improve search index write & lookup performance by implementing new arr_simple system
...
By serialising and deserialising lists of numbers with implode &
explode, we can further cut down on the json_* calls which are
reeeeeally slow.
2019-12-06 23:40:28 +00:00
Starbeamrainbowlabs
aea1255f10
Bugfix: Return correct type in StorageBox::delete()
2019-09-21 21:05:14 +01:00
Starbeamrainbowlabs
157c6dabdd
If it's a list of strings, then it should be sorted correctly.
2019-09-03 18:16:01 +01:00
Starbeamrainbowlabs
f160a82063
Add note to search page linking to query syntax on help page
2019-08-24 20:47:41 +01:00
Starbeamrainbowlabs
da5b3a5df8
Do some documentation work, and add missing help sections
2019-08-24 19:56:14 +01:00
Starbeamrainbowlabs
e773e36de5
Tweak stas-parse action output
2019-08-23 01:29:11 +01:00
Starbeamrainbowlabs
632375417d
Add apiDoc comment
2019-08-23 01:27:35 +01:00
Starbeamrainbowlabs
e6ba31df23
Add debug stas-parse action
2019-08-23 01:24:17 +01:00
Starbeamrainbowlabs
276b4c808f
Add STAS parsing to query-searchindex output
2019-08-23 00:51:39 +01:00
Starbeamrainbowlabs
4d51ae924e
Fix intitle: & intags: syntax - game, set, match.
2019-08-22 22:23:30 +01:00
Starbeamrainbowlabs
9505e0653e
Fix some mroe odd bugs in the new search system
2019-08-22 22:11:09 +01:00
Starbeamrainbowlabs
edf1be5801
Fix a *huge* number of bugs in the new search system, but it's not ready just yet
2019-08-22 21:38:17 +01:00
Starbeamrainbowlabs
e08e775d98
Finish refactoring invindex_query
2019-08-22 17:43:14 +01:00
Starbeamrainbowlabs
b93dd3d9cc
Start refactoring query_invindex & rename it to invindex_query
...
....but of course it's not finished yet. We're doing well, but there are
a few thorny issues to go.
Mainly: We need to seriously optimise ids::getpagename(), 'cause we'll
need it a *lot* when we get to implementing the size, before, and after
colon : directives.
2019-08-18 21:25:48 +01:00
Starbeamrainbowlabs
ce6df06817
Start refactoring the search system to use a new key-value store backend
...
....but it's not finished yet.
It should improve performance significantly when it's done & optimised,
as we won't have to load the entire search index into memory & decode it
just to perform a single query.
2019-08-18 18:52:29 +01:00
Starbeamrainbowlabs
38badd3c1f
[search] Add StorageBox.php as an extra data file
...
It's time to refactor the search system to use an SQLite-backed
key-value data store. It's just a shame that something designed for this
like LevelDB / RocksDB doesn't have a PHP package that we can use :-/
We can always switch later, I suppose.
2019-08-17 20:47:51 +01:00
Starbeamrainbowlabs
7088990027
Minor code formatting
2019-08-17 01:19:04 +01:00
Starbeamrainbowlabs
5609506def
minor formatting
2019-08-16 01:14:38 +01:00
Starbeamrainbowlabs
127270ff89
Bugfix: Correct search query performance metrics
2019-08-15 23:46:23 +01:00
Starbeamrainbowlabs
ddc36bf48e
Remove commented code
2019-08-15 23:17:33 +01:00
Starbeamrainbowlabs
0a5ba3ff59
Improve search invindex alteration performance
...
This will be especially noticable when using invindex-rebuild
2019-08-15 23:06:06 +01:00
Starbeamrainbowlabs
50efd4bb49
Bump versions
2019-05-06 23:48:34 +01:00
Starbeamrainbowlabs
c177b66b42
Bugfix: Don't throw a warning if the search index doesn't exist yet
2019-05-06 20:22:36 +01:00
Starbeamrainbowlabs
a3330829cb
Bump module versions & go over documentation comments
2019-02-10 23:18:34 +00:00
Starbeamrainbowlabs
5b670f5981
Refactor method names in page renderer
2019-01-27 22:56:51 +00:00
Starbeamrainbowlabs
c7d7de3d7e
Don't include semicolons in greedy internal links
2018-09-29 23:40:23 +01:00
Starbeamrainbowlabs
39098ac0fb
Display an ellipsis at the beginning of a search context if it doesn't start at the beginning of a page
2018-09-29 13:32:17 +01:00
Starbeamrainbowlabs
24775724d1
Bugfix: Correctly calculate the end offset of search context snippets
2018-09-29 13:27:17 +01:00
Starbeamrainbowlabs
284b404946
Typos in comments
2018-09-12 21:27:51 +01:00
Starbeamrainbowlabs
31d555f482
Bump version of search module
2018-07-01 12:14:06 +01:00
Starbeamrainbowlabs
1f6f780177
Restyle matching tags in search results
2018-06-30 11:46:07 +01:00
Starbeamrainbowlabs
8955d6d131
Save the character offset, not the token offset in the inverted index
2018-06-30 11:19:38 +01:00
Starbeamrainbowlabs
cdee30c286
Add $capture_offsets option to tokenize().
...
TODO: Utilise this in the indexer & update the changelog mentioning that
_all_ inverted indexes will need to be rebuilt
2018-06-30 00:08:57 +01:00
Starbeamrainbowlabs
8403ffd5c3
Bugfix: Increment $i when we hit a stop word when indexing.
...
There's also another bug here - in that the offsets generated contain
are the index in the array of tokens, when we need it to be the index in
the source text!
2018-06-29 23:51:10 +01:00
Starbeamrainbowlabs
9d7a21e993
Format the index action nicely
2018-06-29 12:08:38 +01:00
Starbeamrainbowlabs
19e49777b2
Search System; Don't bother getting a page's id if we don't need to
2018-06-26 14:28:11 +01:00
Starbeamrainbowlabs
3d3b6c491a
Seriously optimise the search system via some profiling.
2018-06-26 14:15:19 +01:00
Starbeamrainbowlabs
67648199d7
Add search time header for analysis purposes
2018-06-26 00:11:01 +01:00
Starbeamrainbowlabs
75b6b6c55f
Optimise the search context extractor, but evediently there's more work to be done.
2018-06-26 00:06:20 +01:00
Starbeamrainbowlabs
93494b6729
Transliterate in the suggest-pages action too
2018-06-25 23:03:00 +01:00
Starbeamrainbowlabs
49b91aa6f9
Search: Transliterate characters so you don't have to remember the diacritics when searching
2018-06-25 22:53:53 +01:00
Starbeamrainbowlabs
d1a10207d1
Made rebuilding search idnex progress bar fill up completely when done
2018-04-07 13:47:39 +01:00